PRINCIPAL COMPONENTS ANALYSIS FOR RIGHT CENSORED DATA

Benjamin W. Langworthy, Jianwen Cai, Robert W. Corty, Michael R. Kosorok, Jason P. Fine

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Principal components analysis (PCA) is a common dimension-reduction tool that transforms a set of variables into a linearly uncorrelated set of variables. Standard PCA estimators involve either the eigendecomposition of the estimated covariance matrix or a singular value decomposition of the centered data. However, for right-censored failure time data, estimating the principal components in this way is not straightforward because not all failure times are observed. Standard estimators for the covariance or correlation matrix should not be used in this case, because they require strong assumptions on the form of the joint distribution and on the marginal distributions beyond the final observation time. We present a novel, nonparametric estimator for the covariance of multivariate right-censored failure time data based on the counting processes and corresponding martingales defined by the failure times. We prove that these estimators are consistent and converge to a Gaussian process when properly standardized. We further show that these covariance estimates can be used to estimate a PCA for the martingales and counting processes for the different failure times. The corresponding estimates of the principal directions are consistent and asymptotically normal. We apply this method to data from a clinical trial of patients with pancreatic cancer, and recover a medically valid low-dimensional representation of adverse events.

Original languageEnglish (US)
Pages (from-to)1985-2016
Number of pages32
JournalStatistica Sinica
Volume33
Issue number3
DOIs
StatePublished - Jul 2023
Externally publishedYes

Bibliographical note

Funding Information:
The authors thank Dr. Yu Cheng from the University of Pittsburgh for supplying the code to compute the Dabrowska bivariate survival function estimator.

Publisher Copyright:
© 2023 Institute of Statistical Science. All rights reserved.

Keywords

  • Competing risks
  • multivariate survival analysis
  • principal components analysis

Fingerprint

Dive into the research topics of 'PRINCIPAL COMPONENTS ANALYSIS FOR RIGHT CENSORED DATA'. Together they form a unique fingerprint.

Cite this