Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge

Gregory Holste, Yiliang Zhou, Song Wang, Ajay Jaiswal, Mingquan Lin, Sherry Zhuge, Yuzhe Yang, Dongkyun Kim, Trong Hieu Nguyen-Mau, Minh Triet Tran, Jaehyup Jeong, Wongi Park, Jongbin Ryu, Feng Hong, Arsh Verma, Yosuke Yamagishi, Changhyun Kim, Hyeryeong Seo, Myungjoo Kang, Leo Anthony CeliZhiyong Lu, Ronald M. Summers, George Shih, Zhangyang Wang, Yifan Peng

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Many real-world image recognition problems, such as diagnostic medical imaging exams, are “long-tailed” – there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification.

Original languageEnglish (US)
Article number103224
JournalMedical Image Analysis
Volume97
DOIs
StatePublished - Oct 2024
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2024 Elsevier B.V.

Keywords

  • Chest X-ray
  • Computer-aided diagnosis
  • Long-tailed learning

Fingerprint

Dive into the research topics of 'Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge'. Together they form a unique fingerprint.

Cite this