Robust parametric classification and variable selection by a minimum distance criterion

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

We investigate a robust penalized logistic regression algorithm based on a minimum distance criterion. Influential outliers are often associated with the explosion of parameter vector estimates, but in the context of standard logistic regression, the bias due to outliers always causes the parameter vector to implode, that is, shrink toward the zero vector. Thus, using LASSO-like penalties to perform variable selection in the presence of outliers can result in missed detections of relevant covariates.We show that by choosing a minimum distance criterion together with an elastic net penalty, we can simultaneously find a parsimonious model and avoid estimation implosion even in the presence of many outliers in the important small n large p situation. Minimizing the penalized minimum distance criterion is a challenging problem due to its nonconvexity. To meet the challenge, we develop a simple and efficientMM(majorization-minimization) algorithm that can be adapted gracefully to the small n large p context. Performance of our algorithm is evaluated on simulated and real datasets. This article has supplementary materials available online.

Original languageEnglish (US)
Pages (from-to)111-128
Number of pages18
JournalJournal of Computational and Graphical Statistics
Volume23
Issue number1
DOIs
StatePublished - 2014
Externally publishedYes

Keywords

  • Elastic net
  • Implosion breakdown
  • LASSO
  • Logistic regression
  • Majorization-minimization
  • Robust estimation

Fingerprint

Dive into the research topics of 'Robust parametric classification and variable selection by a minimum distance criterion'. Together they form a unique fingerprint.

Cite this