High-Dimensional Censored Regression via the Penalized Tobit Likelihood

Tate Jacobson, Hui Zou

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

High-dimensional regression and regression with a left-censored response are each well-studied topics. In spite of this, few methods have been proposed which deal with both of these complications simultaneously. The Tobit model—long the standard method for censored regression in economics—has not been adapted for high-dimensional regression at all. To fill this gap and bring up-to-date techniques from high-dimensional statistics to the field of high-dimensional left-censored regression, we propose several penalized Tobit models. We develop a fast algorithm which combines quadratic majorization with coordinate descent to compute the penalized Tobit solution path. Theoretically, we analyze the Tobit lasso and Tobit with a folded concave penalty, bounding the (Formula presented.) estimation loss for the former and proving that a local linear approximation estimator for the latter possesses the strong oracle property. Through an extensive simulation study, we find that our penalized Tobit models provide more accurate predictions and parameter estimates than other methods on high-dimensional left-censored data. We use a penalized Tobit model to analyze high-dimensional left-censored HIV viral load data from the AIDS Clinical Trials Group and identify potential drug resistance mutations in the HIV genome. A supplementary file contains intermediate theoretical results and technical proofs.

Original languageEnglish (US)
Pages (from-to)286-297
Number of pages12
JournalJournal of Business and Economic Statistics
Volume42
Issue number1
DOIs
StatePublished - 2024

Bibliographical note

Publisher Copyright:
© 2023 American Statistical Association.

Keywords

  • Censored regression
  • Coordinate descent
  • Folded concave penalty
  • High dimensions
  • Strong oracle property
  • Tobit model

Fingerprint

Dive into the research topics of 'High-Dimensional Censored Regression via the Penalized Tobit Likelihood'. Together they form a unique fingerprint.

Cite this