TY - JOUR
T1 - Interobserver agreement between an artificial intelligence algorithm and colon capsule endoscopy readers on bowel-cleansing quality
AU - Schelde-Olesen, Benedicte
AU - Herp, Jürgen
AU - Braun, Jan-Matthias
AU - Koulaouzidis, Anastasios
AU - Bjørsum-Meyer, Thomas
AU - Møller, Lasse Kaalby
AU - Baatrup, Gunnar
AU - Nadimi, Esmaeil
AU - Deding, Ulrik
PY - 2023/6
Y1 - 2023/6
N2 - Background and AimsColon capsule endoscopy (CCE) faces substantial challenges, one of which is achieving adequate colon cleansing. Furthermore, the interobserver agreement on bowel-cleansing quality varies. To address this issue, we developed an artificial intelligence algorithm (AIA) to evaluate bowel-cleansing quality. The aim of this study was to estimate the interobserver agreement on bowel cleansing between a group of experienced CCE readers and an AIA and to examine whether percentiles of the overall bowel-cleansing quality are a suitable way of reporting the results generated by the AIA.MethodsBowel-cleansing quality in 842 CCE investigations was scored on both a 2- and 4-point grading scale for the entire colon and by segment by experienced CCE readers and the AIA. For the algorithm, a score was given based on the mean score, median, upper and lower quartiles, and second and 98th percentiles. The level of agreement was evaluated using Cohen’s κ.ResultsThe interobserver agreement between the CCE readers and AIA on bowel-cleansing quality was minimal to none for the overall bowel evaluation, by segment, and on the 2- and 4- point grading scale regardless of the threshold for the AIA score.ConclusionsWe found minimal agreement on evaluation of bowel-cleansing quality in CCE between CCE readers and the AIA. Mean or percentiles of the AIA grading did not seem suitable for AI-generated bowel-cleansing evaluation.
AB - Background and AimsColon capsule endoscopy (CCE) faces substantial challenges, one of which is achieving adequate colon cleansing. Furthermore, the interobserver agreement on bowel-cleansing quality varies. To address this issue, we developed an artificial intelligence algorithm (AIA) to evaluate bowel-cleansing quality. The aim of this study was to estimate the interobserver agreement on bowel cleansing between a group of experienced CCE readers and an AIA and to examine whether percentiles of the overall bowel-cleansing quality are a suitable way of reporting the results generated by the AIA.MethodsBowel-cleansing quality in 842 CCE investigations was scored on both a 2- and 4-point grading scale for the entire colon and by segment by experienced CCE readers and the AIA. For the algorithm, a score was given based on the mean score, median, upper and lower quartiles, and second and 98th percentiles. The level of agreement was evaluated using Cohen’s κ.ResultsThe interobserver agreement between the CCE readers and AIA on bowel-cleansing quality was minimal to none for the overall bowel evaluation, by segment, and on the 2- and 4- point grading scale regardless of the threshold for the AIA score.ConclusionsWe found minimal agreement on evaluation of bowel-cleansing quality in CCE between CCE readers and the AIA. Mean or percentiles of the AIA grading did not seem suitable for AI-generated bowel-cleansing evaluation.
U2 - 10.1016/j.igie.2023.04.006
DO - 10.1016/j.igie.2023.04.006
M3 - Journal article
SN - 2949-7086
VL - 2
SP - 148-153.e3
JO - iGIE
JF - iGIE
IS - 2
ER -