TY - JOUR
T1 - Application of Deep Learning Models to Improve Ulcerative Colitis Endoscopic Disease Activity Scoring under Multiple Scoring Systems
AU - Byrne, Michael F.
AU - Panaccione, Remo
AU - East, James E.
AU - Iacucci, Marietta
AU - Parsa, Nasim
AU - Kalapala, Rakesh
AU - Reddy, Duvvur N.
AU - Ramesh Rughwani, Hardik
AU - Singh, Aniruddha P.
AU - Berry, Sameer K.
AU - Monsurate, Ryan
AU - Soudan, Florian
AU - Laage, Greta
AU - Cremonese, Enrico D.
AU - St-Denis, Ludovic
AU - Lemaître, Paul
AU - Nikfal, Shima
AU - Asselin, Jerome
AU - Henkel, Milagros L.
AU - Travis, Simon P.
N1 - Publisher Copyright:
© 2022 The Author(s). Published by Oxford University Press on behalf of European Crohn's and Colitis Organisation. All rights reserved.
PY - 2023/4/1
Y1 - 2023/4/1
N2 - Background and Aims: Lack of clinical validation and inter-observer variability are two limitations of endoscopic assessment and scoring of disease severity in patients with ulcerative colitis [UC]. We developed a deep learning [DL] model to improve, accelerate and automate UC detection, and predict the Mayo Endoscopic Subscore [MES] and the Ulcerative Colitis Endoscopic Index of Severity [UCEIS]. Methods: A total of 134 prospective videos [1550 030 frames] were collected and those with poor quality were excluded. The frames were labelled by experts based on MES and UCEIS scores. The scored frames were used to create a preprocessing pipeline and train multiple convolutional neural networks [CNNs] with proprietary algorithms in order to filter, detect and assess all frames. These frames served as the input for the DL model, with the output being continuous scores for MES and UCEIS [and its components]. A graphical user interface was developed to support both labelling video sections and displaying the predicted disease severity assessment by the artificial intelligence from endoscopic recordings. Results: Mean absolute error [MAE] and mean bias were used to evaluate the distance of the continuous model's predictions from ground truth, and its possible tendency to over/under-predict were excellent for MES and UCEIS. The quadratic weighted kappa used to compare the inter-rater agreement between experts' labels and the model's predictions showed strong agreement [0.87, 0.88 at frame-level, 0.88, 0.90 at section-level and 0.90, 0.78 at video-level, for MES and UCEIS, respectively]. Conclusions: We present the first fully automated tool that improves the accuracy of the MES and UCEIS, reduces the time between video collection and review, and improves subsequent quality assurance and scoring.
AB - Background and Aims: Lack of clinical validation and inter-observer variability are two limitations of endoscopic assessment and scoring of disease severity in patients with ulcerative colitis [UC]. We developed a deep learning [DL] model to improve, accelerate and automate UC detection, and predict the Mayo Endoscopic Subscore [MES] and the Ulcerative Colitis Endoscopic Index of Severity [UCEIS]. Methods: A total of 134 prospective videos [1550 030 frames] were collected and those with poor quality were excluded. The frames were labelled by experts based on MES and UCEIS scores. The scored frames were used to create a preprocessing pipeline and train multiple convolutional neural networks [CNNs] with proprietary algorithms in order to filter, detect and assess all frames. These frames served as the input for the DL model, with the output being continuous scores for MES and UCEIS [and its components]. A graphical user interface was developed to support both labelling video sections and displaying the predicted disease severity assessment by the artificial intelligence from endoscopic recordings. Results: Mean absolute error [MAE] and mean bias were used to evaluate the distance of the continuous model's predictions from ground truth, and its possible tendency to over/under-predict were excellent for MES and UCEIS. The quadratic weighted kappa used to compare the inter-rater agreement between experts' labels and the model's predictions showed strong agreement [0.87, 0.88 at frame-level, 0.88, 0.90 at section-level and 0.90, 0.78 at video-level, for MES and UCEIS, respectively]. Conclusions: We present the first fully automated tool that improves the accuracy of the MES and UCEIS, reduces the time between video collection and review, and improves subsequent quality assurance and scoring.
KW - deep learning
KW - Inflammatory bowel disease
KW - Mayo Endoscopic Subscore
KW - Ulcerative Colitis Endoscopic Index of Severity
UR - http://www.scopus.com/inward/record.url?scp=85153120450&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85153120450&partnerID=8YFLogxK
U2 - 10.1093/ecco-jcc/jjac152
DO - 10.1093/ecco-jcc/jjac152
M3 - Article
C2 - 36254822
AN - SCOPUS:85153120450
SN - 1873-9946
VL - 17
SP - 463
EP - 471
JO - Journal of Crohn's and Colitis
JF - Journal of Crohn's and Colitis
IS - 4
ER -