Natural Language Processing to Adjudicate Heart Failure Hospitalizations in Global Clinical Trials

Pablo M. Marti-Castellote, Christopher Reeder, Brian Claggett, Pulkit Singh, Emily S. Lau, Shaan Khurshid, Puneet Batra, Steven A. Lubitz, Mahnaz Maddah, Orly Vardeny, Eldrin F. Lewis, Marc Pfeffer, Pardeep Jhund, Akshay S. Desai, John J.V. McMurray, Patrick T. Ellinor, Jennifer E. Ho, Scott D. Solomon, Jonathan W. Cunningham

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

BACKGROUND: Medical record review by a physician clinical events committee is the gold standard for identifying cardiovascular outcomes in clinical trials, but is labor-intensive and poorly reproducible. Automated outcome adjudication by artificial intelligence (AI) could enable larger and less expensive clinical trials but has not been validated in global studies. METHODS: We developed a novel model for automated AI-based heart failure adjudication (Heart Failure Natural Language Processing) using hospitalizations from 3 international clinical outcomes trials. This model was tested on potential heart failure hospitalizations from the DELIVER trial (Dapagliflozin Evaluation to Improve the Lives of Patients With Preserved Ejection Fraction Heart Failure), a cardiovascular outcomes trial comparing dapagliflozin with placebo in 6063 patients with heart failure with mildly reduced or preserved ejection fraction. AI-based adjudications were compared with adjudications from a clinical events committee that followed Food and Drug Administration-based criteria. RESULTS: AI-based adjudication agreed with the clinical events committee in 83% of events. A strategy of human review for events that the AI model deemed uncertain (16%) would have achieved 91% agreement with the clinical events committee while reducing the adjudication workload by 84%. The estimated effect of dapagliflozin on heart failure hospitalization was nearly identical with AI-based adjudication (hazard ratio, 0.76 [95% CI, 0.66-0.88]) compared with clinical events committee adjudication (hazard ratio, 0.77 [95% CI, 0.67-0.89]). The AI model extracted symptoms, signs, and treatments of heart failure from each medical record in tabular format and quoted sentences documenting them. CONCLUSIONS: AI-based adjudication of clinical outcomes has the potential to improve the efficiency of global clinical trials while preserving accuracy and interpretability.

Original languageEnglish (US)
Article numbere012514
JournalCirculation: Heart Failure
Volume18
Issue number1
DOIs
StatePublished - Jan 1 2025

Bibliographical note

Publisher Copyright:
© 2024 American Heart Association, Inc.

Keywords

  • Natural Language Processing
  • United States Food and Drug Administration
  • artificial intelligence
  • dapagliflozin
  • heart failure

PubMed: MeSH publication types

  • Journal Article

Fingerprint

Dive into the research topics of 'Natural Language Processing to Adjudicate Heart Failure Hospitalizations in Global Clinical Trials'. Together they form a unique fingerprint.

Cite this