Robust nonparametric regression via sparsity control with application to load curve data cleansing

Gonzalo Mateos, Georgios B. Giannakis

Research output: Contribution to journalArticlepeer-review

41 Scopus citations

Abstract

Nonparametric methods are widely applicable to statistical inference problems, since they rely on a few modeling assumptions. In this context, the fresh look advocated here permeates benefits from variable selection and compressive sampling, to robustify nonparametric regression against outliersthat is, data markedly deviating from the postulated models. A variational counterpart to least-trimmed squares regression is shown closely related to an ℓ 0-(pseudo)norm-regularized estimator, that encourages sparsity in a vector explicitly modeling the outliers. This connection suggests efficient solvers based on convex relaxation, which lead naturally to a variational M-type estimator equivalent to the least-absolute shrinkage and selection operator (Lasso). Outliers are identified by judiciously tuning regularization parameters, which amounts to controlling the sparsity of the outlier vector along the whole robustification path of Lasso solutions. Reduced bias and enhanced generalization capability are attractive features of an improved estimator obtained after replacing the ℓ 0-(pseudo)norm with a nonconvex surrogate. The novel robust spline-based smoother is adopted to cleanse load curve data, a key task aiding operational decisions in the envisioned smart grid system. Computer simulations and tests on real load curve data corroborate the effectiveness of the novel sparsity-controlling robust estimators.

Original languageEnglish (US)
Article number6112694
Pages (from-to)1571-1584
Number of pages14
JournalIEEE Transactions on Signal Processing
Volume60
Issue number4
DOIs
StatePublished - Apr 2012

Bibliographical note

Funding Information:
Manuscript received April 03, 2011; revised August 10, 2011 and November 17, 2011; accepted December 20, 2011. Date of publication December 26, 2011; date of current version March 06, 2012. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Raviv Raich. This work was supported by MURI Grant (AFOSR FA9550-10-1-0567). This paper appeared in part in the Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Prague, Czech Republic, May 22–27, 2011.

Keywords

  • Lasso
  • load curve cleansing
  • nonparametric regression
  • outlier rejection
  • sparsity
  • splines

Fingerprint

Dive into the research topics of 'Robust nonparametric regression via sparsity control with application to load curve data cleansing'. Together they form a unique fingerprint.

Cite this