Abstract
Principal components analysis (PCA) is a common dimension-reduction tool that transforms a set of variables into a linearly uncorrelated set of variables. Standard PCA estimators involve either the eigendecomposition of the estimated covariance matrix or a singular value decomposition of the centered data. However, for right-censored failure time data, estimating the principal components in this way is not straightforward because not all failure times are observed. Standard estimators for the covariance or correlation matrix should not be used in this case, because they require strong assumptions on the form of the joint distribution and on the marginal distributions beyond the final observation time. We present a novel, nonparametric estimator for the covariance of multivariate right-censored failure time data based on the counting processes and corresponding martingales defined by the failure times. We prove that these estimators are consistent and converge to a Gaussian process when properly standardized. We further show that these covariance estimates can be used to estimate a PCA for the martingales and counting processes for the different failure times. The corresponding estimates of the principal directions are consistent and asymptotically normal. We apply this method to data from a clinical trial of patients with pancreatic cancer, and recover a medically valid low-dimensional representation of adverse events.
Original language | English (US) |
---|---|
Pages (from-to) | 1985-2016 |
Number of pages | 32 |
Journal | Statistica Sinica |
Volume | 33 |
Issue number | 3 |
DOIs | |
State | Published - Jul 2023 |
Externally published | Yes |
Bibliographical note
Funding Information:The authors thank Dr. Yu Cheng from the University of Pittsburgh for supplying the code to compute the Dabrowska bivariate survival function estimator.
Publisher Copyright:
© 2023 Institute of Statistical Science. All rights reserved.
Keywords
- Competing risks
- multivariate survival analysis
- principal components analysis