TY - JOUR
T1 - Can artificial intelligence replace endoscopists when assessing mucosal healing in ulcerative colitis? A systematic review and diagnostic test accuracy meta-analysis
AU - Rimondi, Alessandro
AU - Gottlieb, Klaus
AU - Despott, Edward J.
AU - Iacucci, Marietta
AU - Murino, Alberto
AU - Tontini, Gian Eugenio
N1 - Publisher Copyright:
© 2023
PY - 2024/7
Y1 - 2024/7
N2 - Backgrounds and Aims: Mucosal healing (MH) in inflammatory bowel diseases (IBD) is an important landmark for clinical decision making. Artificial intelligence systems (AI) that automatically deliver the grade of endoscopic inflammation may solve moderate interobserver agreement and the need of central reading in clinical trials. Methods: We performed a systematic review of EMBASE and MEDLINE databases up to 01/12/2022 following PRISMA and the Joanna Briggs Institute methodologies to answer the following question: “Can AI replace endoscopists when assessing MH in IBD?”. The research was restricted to ulcerative colitis (UC), and a diagnostic odds ratio (DOR) meta-analysis was performed. Risk of bias was evaluated with QUADAS-2 tool. Results: A total of 21 / 739 records were selected for full text evaluation, and 12 were included in the meta-analysis. Deep learning algorithms based on convolutional neural networks architecture achieved a satisfactory performance in evaluating MH on UC, with sensitivity, specificity, DOR and SROC of respectively 0.91(CI95 %:0.86–0.95);0.89(CI95 %:0.84–0.93);92.42(CI95 %:54.22–157.53) and 0.957 when evaluating fixed images (n = 8) and 0.86(CI95 %:0.75–0.93);0.91(CI95 %:0.87–0.94);70.86(CI95 %:24.63–203.86) and 0.941 when evaluating videos (n = 6). Moderate-high levels of heterogeneity were noted, limiting the quality of the evidence. Conclusions: AI systems showed high potential in detecting MH in UC with optimal diagnostic performance, although moderate-high heterogeneity of the data was noted. Standardised and shared AI training may reduce heterogeneity between systems.
AB - Backgrounds and Aims: Mucosal healing (MH) in inflammatory bowel diseases (IBD) is an important landmark for clinical decision making. Artificial intelligence systems (AI) that automatically deliver the grade of endoscopic inflammation may solve moderate interobserver agreement and the need of central reading in clinical trials. Methods: We performed a systematic review of EMBASE and MEDLINE databases up to 01/12/2022 following PRISMA and the Joanna Briggs Institute methodologies to answer the following question: “Can AI replace endoscopists when assessing MH in IBD?”. The research was restricted to ulcerative colitis (UC), and a diagnostic odds ratio (DOR) meta-analysis was performed. Risk of bias was evaluated with QUADAS-2 tool. Results: A total of 21 / 739 records were selected for full text evaluation, and 12 were included in the meta-analysis. Deep learning algorithms based on convolutional neural networks architecture achieved a satisfactory performance in evaluating MH on UC, with sensitivity, specificity, DOR and SROC of respectively 0.91(CI95 %:0.86–0.95);0.89(CI95 %:0.84–0.93);92.42(CI95 %:54.22–157.53) and 0.957 when evaluating fixed images (n = 8) and 0.86(CI95 %:0.75–0.93);0.91(CI95 %:0.87–0.94);70.86(CI95 %:24.63–203.86) and 0.941 when evaluating videos (n = 6). Moderate-high levels of heterogeneity were noted, limiting the quality of the evidence. Conclusions: AI systems showed high potential in detecting MH in UC with optimal diagnostic performance, although moderate-high heterogeneity of the data was noted. Standardised and shared AI training may reduce heterogeneity between systems.
KW - Artificial intelligence
KW - Endoscopy
KW - Inflammatory Bowel Disease
KW - Ulcerative Colitis
UR - https://www.scopus.com/pages/publications/85179112419
U2 - 10.1016/j.dld.2023.11.005
DO - 10.1016/j.dld.2023.11.005
M3 - Review article
C2 - 38057218
AN - SCOPUS:85179112419
SN - 1590-8658
VL - 56
SP - 1164
EP - 1172
JO - Digestive and Liver Disease
JF - Digestive and Liver Disease
IS - 7
ER -