Cross-Fitted Residual Regression for High-Dimensional Heteroscedasticity Pursuit

Le Zhou, Hui Zou

Research output: Contribution to journal › Article › peer-review

Abstract

There is a vast amount of work on high-dimensional regression. The common starting point for the existing theoretical work is to assume the data generating model is a homoscedastic linear regression model with some sparsity structure. In reality, the homoscedasticity assumption is often violated, and hence understanding the heteroscedasticity of the data is of critical importance. In this article, we systematically study the estimation of a high-dimensional heteroscedastic regression model. In particular, the emphasis is on how to detect and estimate the heteroscedasticity effects reliably and efficiently. To this end, we propose a cross-fitted residual regression approach, prove that the resulting estimator is selection consistent for heteroscedasticity effects, and establish its rates of convergence. Our estimator has tuning parameters that must be determined from the data in practice. We propose a novel high-dimensional BIC for tuning parameter selection and establish its consistency. This is the first high-dimensional BIC result under heteroscedasticity. The theoretical analysis is more involved because it must handle heteroscedasticity, and we develop a couple of interesting new concentration inequalities that are of independent interest.
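The abstract does not spell out the procedure, so the following is only a generic sketch of what a cross-fitted residual regression can look like, not the authors' estimator. It assumes a two-fold split, an l1-penalized (lasso) mean fit on one fold, squared out-of-fold residuals on the other fold, and a second lasso of those squared residuals on the covariates. The function names, the squared-residual response, the penalty levels, and the toy data generator are all hypothetical choices made for illustration.

```python
import math
import random


def soft_threshold(z, lam):
    """Soft-thresholding operator used by the lasso coordinate update."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_cd(X, y, lam, n_iter=100):
    """Lasso by coordinate descent for (1/2n)||y - b0 - Xb||^2 + lam*||b||_1."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]  # centered columns
    resid = [yi - y_mean for yi in y]                            # residual at beta = 0
    col_sq = [sum(r[j] ** 2 for r in Xc) / n for j in range(p)]
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of column j with the partial residual (excluding j).
            rho = sum(Xc[i][j] * resid[i] for i in range(n)) / n + col_sq[j] * beta[j]
            new = soft_threshold(rho, lam) / col_sq[j]
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= delta * Xc[i][j]
                beta[j] = new
    intercept = y_mean - sum(b * m for b, m in zip(beta, x_mean))
    return intercept, beta


def cross_fitted_variance_effects(X, y, lam_mean, lam_var, seed=0):
    """Hypothetical sketch: out-of-fold squared residuals regressed on X."""
    n = len(X)
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[: n // 2], idx[n // 2:]]
    sq_resid = [0.0] * n
    for k in (0, 1):
        train, hold = folds[1 - k], folds[k]
        # Mean model is fitted on one fold only ...
        b0, b = lasso_cd([X[i] for i in train], [y[i] for i in train], lam_mean)
        # ... and residuals are formed on the other fold.
        for i in hold:
            fitted = b0 + sum(bj * xj for bj, xj in zip(b, X[i]))
            sq_resid[i] = (y[i] - fitted) ** 2
    # Second-stage lasso: which covariates drive the squared residuals?
    return lasso_cd(X, sq_resid, lam_var)


def simulate_toy(n, p, seed=0):
    """Toy data (assumption): mean depends on x0, noise variance on x1."""
    rng = random.Random(seed)
    X = [[rng.uniform(-1.0, 1.0) for _ in range(p)] for _ in range(n)]
    y = [2.0 * row[0] + math.sqrt(0.1 + 2.0 * (row[1] + 1.0)) * rng.gauss(0.0, 1.0)
         for row in X]
    return X, y
```

Calling `cross_fitted_variance_effects(X, y, lam_mean=0.05, lam_var=0.2)` on data from `simulate_toy` returns an intercept and one coefficient per covariate for the squared-residual regression. Fitting the mean on one fold and forming residuals on the other keeps the residuals from being computed on the same observations that chose the mean model, which is the usual motivation for cross-fitting. In practice the two penalty levels would be chosen from the data, for example by an information criterion such as the high-dimensional BIC the abstract mentions.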

Original language: English (US)
Pages (from-to): 1056-1065
Number of pages: 10
Journal: Journal of the American Statistical Association
Volume: 118
Issue number: 542
State: Published - 2023

Bibliographical note

Funding Information:
This work is supported in part by NSF DMS 1915842 and 2015120. We thank the editor, the AE, and the referees for their helpful comments and suggestions.

Publisher Copyright:
© 2021 American Statistical Association.

Keywords

  • HBIC
  • Heteroscedasticity
  • High dimension
  • Model selection criterion
  • Sparsity
