TY - JOUR
T1 - Consensus modeling
T2 - Safer transfer learning for small health systems
AU - Tourani, Roshan
AU - Murphree, Dennis H.
AU - Sheka, Adam
AU - Melton, Genevieve B.
AU - Kor, Daryl J.
AU - Simon, Gyorgy J.
N1 - Publisher Copyright:
© 2024
PY - 2024/8
Y1 - 2024/8
N2 - Predictive modeling is becoming an essential tool for clinical decision support, but health systems with smaller sample sizes may construct suboptimal or overly specific models. Models become over-specific when beside true physiological effects, they also incorporate potentially volatile site-specific artifacts. These artifacts can change suddenly and can render the model unsafe. To obtain safer models, health systems with inadequate sample sizes may adopt one of the following options. First, they can use a generic model, such as one purchased from a vendor, but often such a model is not sufficiently specific to the patient population and is thus suboptimal. Second, they can participate in a research network. Paradoxically though, sites with smaller datasets contribute correspondingly less to the joint model, again rendering the final model suboptimal. Lastly, they can use transfer learning, starting from a model trained on a large data set and updating this model to the local population. This strategy can also result in a model that is over-specific. In this paper we present the consensus modeling paradigm, which uses the help of a large site (source) to reach a consensus model at the small site (target). We evaluate the approach on predicting postoperative complications at two health systems with 9,044 and 38,045 patients (rare outcomes at about 1% positive rate), and conduct a simulation study to understand the performance of consensus modeling relative to the other three approaches as a function of the available training sample size at the target site. We found that consensus modeling exhibited the least over-specificity at either the source or target site and achieved the highest combined predictive performance.
AB - Predictive modeling is becoming an essential tool for clinical decision support, but health systems with smaller sample sizes may construct suboptimal or overly specific models. Models become over-specific when beside true physiological effects, they also incorporate potentially volatile site-specific artifacts. These artifacts can change suddenly and can render the model unsafe. To obtain safer models, health systems with inadequate sample sizes may adopt one of the following options. First, they can use a generic model, such as one purchased from a vendor, but often such a model is not sufficiently specific to the patient population and is thus suboptimal. Second, they can participate in a research network. Paradoxically though, sites with smaller datasets contribute correspondingly less to the joint model, again rendering the final model suboptimal. Lastly, they can use transfer learning, starting from a model trained on a large data set and updating this model to the local population. This strategy can also result in a model that is over-specific. In this paper we present the consensus modeling paradigm, which uses the help of a large site (source) to reach a consensus model at the small site (target). We evaluate the approach on predicting postoperative complications at two health systems with 9,044 and 38,045 patients (rare outcomes at about 1% positive rate), and conduct a simulation study to understand the performance of consensus modeling relative to the other three approaches as a function of the available training sample size at the target site. We found that consensus modeling exhibited the least over-specificity at either the source or target site and achieved the highest combined predictive performance.
KW - Hospital acquired infection
KW - Machine learning
KW - Predictive modeling
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85195052127&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85195052127&partnerID=8YFLogxK
U2 - 10.1016/j.artmed.2024.102899
DO - 10.1016/j.artmed.2024.102899
M3 - Article
C2 - 38843692
AN - SCOPUS:85195052127
SN - 0933-3657
VL - 154
JO - Artificial Intelligence in Medicine
JF - Artificial Intelligence in Medicine
M1 - 102899
ER -