Abstract
Estimating a high-dimensional sparse covariance matrix from a limited number of samples is a fundamental task in contemporary data analysis. Most proposals to date, however, are not robust to outliers or heavy tails. Toward bridging this gap, in this work we consider estimating a sparse shape matrix from n samples following a possibly heavy-tailed elliptical distribution. We propose estimators based on thresholding either Tyler's M-estimator or its regularized variant. We prove that in the joint limit as the dimension p and the sample size n tend to infinity with p/n → γ > 0, our estimators are minimax rate optimal. Results on simulated data support our theoretical analysis.
Original language | English (US) |
---|---|
Pages (from-to) | 86-110 |
Number of pages | 25 |
Journal | Annals of Statistics |
Volume | 48 |
Issue number | 1 |
DOIs | |
State | Published - 2020 |
Bibliographical note
Funding Information:This work was supported by NSF Grants DMS-14-18386, DMS-18-21266 and GRFP-00039202, UMN Doctoral Dissertation Fellowship and the Feinberg Foundation Visiting Faculty Program Fellowship of the Weizmann Institute of Science.
Publisher Copyright:
© Institute of Mathematical Statistics, 2020
Keywords
- Elliptical distribution
- Matrix estimation
- Sparsity
- Spectral norm
- Thresholding
- Tyler's M-estimator