Abstract
We propose an inverse probability of censoring weighted (IPCW) bagging (bootstrap aggregation) pre-processing that enables the application of any machine learning procedure for classification to be used to predict the cause-specific cumulative incidence, properly accounting for right-censored observations and competing risks. We consider the IPCW area under the time-dependent ROC curve (IPCW-AUC) as a performance evaluation metric. We also suggest a procedure to optimally stack predictions from any set of IPCW bagged methods. We illustrate our proposed method in the Swedish InfCareHIV register by predicting individuals for whom treatment will not maintain an undetectable viral load for at least 2 years following initial suppression. The R package stackBagg that implements our proposed method is available on Github.
Original language | English (US) |
---|---|
Pages (from-to) | 51-65 |
Number of pages | 15 |
Journal | Journal of the Royal Statistical Society. Series C: Applied Statistics |
Volume | 70 |
Issue number | 1 |
DOIs | |
State | Published - Nov 22 2020 |
Bibliographical note
Publisher Copyright:© 2020 The Authors. Journal of the Royal Statistical Society: Series C (Applied Statistics) published by John Wiley & Sons Ltd
Keywords
- HIV treatment
- competing risk
- cumulative incidence
- ensemble learning
- inverse probability of censoring weighting
- machine learning
- stacking