Abstract
Over the last few years, capsule endoscopy has been established as a fundamental device in the practicing gastroenterologist’s toolbox. Its utilization in diagnostic algorithms for suspected small bowel bleeding, Crohn’s disease, and small bowel tumors has been approved by several guidelines. The advent of double-balloon enteroscopy has significantly increased the therapeutic possibilities and release of multiple devices (single-balloon enteroscopy and spiral enteroscopy) aimed at improving the performance of small bowel enteroscopy. Recently, some important innovations have appeared in the small bowel endoscopy scene, providing further improvement to its evolution. Artificial intelligence in capsule endoscopy should increase diagnostic accuracy and reading efficiency, and the introduction of motorized spiral enteroscopy into clinical practice could also improve the therapeutic yield. This review focuses on the most recent studies on artificial-intelligence-assisted capsule endoscopy and motorized spiral enteroscopy.
The 21st century has marked the onset of a revolution in the treatment of small bowel diseases. In fact, evaluation of the small bowel is traditionally challenging for gastroenterologists because of its length and tortuous anatomy in the abdominal cavity. In particular, the introduction of capsule endoscopy (CE) and double-balloon enteroscopy has enabled evaluation of the entire gastrointestinal (GI) tract.1,2 CE has revolutionized small bowel imaging by providing a non-invasive method of examination of the mucosal surface. Given the increased detection of small bowel diseases by CE, the release of multiple device-assisted enteroscopes in the market has been crucial for histopathological sampling and endoscopic therapy in selected cases, thus avoiding the need for angiography and surgery.
Further technological advancements have occurred in recent years concerning small bowel endoscopy. In particular, some CE software now employs artificial intelligence (AI), and device-assisted enteroscopy (DAE) has revved up with power spiral motorization. This narrative review highlights the most recent studies of these two new techniques.
Since its introduction in 2000, the clinical use of CE has increased, and the indications for small bowel CE (SBCE) include suspected small bowel bleeding, assessment of Crohn's disease, screening for polyposis syndromes, celiac disease, and small bowel tumor investigation.1,3-5 However, because it captures and transmits approximately 60,000 images/person through the entire GI tract, SBCE results are time-consuming for physicians, requiring intense focus for an average time estimated between 30 and 120 minutes. It has been described that after just one capsule study examination, the accuracy of SBCE readers declines.6 To overcome such limitations, AI has recently been proposed for SBCE readers’ assistance, mainly as a first reader, with the aim of reducing workload while improving accuracy.7
AI allows computerized approaches to the resolution of cognitive problems. Owing to recent improvements in AI, particularly the implementation of machine learning (ML) technologies, in the last few decades, research on models for computer-aided diagnosis (CAD) and computer-assisted image analysis have become major areas of interest in medical imaging.8 ML is a subdomain of AI that provides systems that can independently learn and infer decisions.9 In supervised ML, learning is achieved through a training dataset used as a reference according to human expert indications. In supervised ML, learning is achieved through a training data-set used as reference according to human experts indications, for example a large database of normal and abnormal GI images, after which the model extracts image features and translates them to mathematical values, which are statistically elaborated with predictive models. In medical imaging, ML systems are able to recognize disease patterns and estimate the probability that one specific image belongs to different classes (e.g., normal vs. abnormal, malignant vs. benign).8,9 The performance of these methods can be affected by the reliability of experts’ first inputs, effectiveness of mathematical formulation translation, and difficulty of translating a large complex of normal and abnormal variations in medical imaging. Most recent technologies have led to a further subdomain of ML, namely deep learning (DL). DL includes ML models based on particular artificial neural network systems with complex multi-layer architecture.10 The most important type of DL technology is based on convolutional neural network (CNN) systems (Fig. 1), which are extremely useful in medical image analysis because they are structured to resemble the organization of the animal visual cortex. When properly trained with a large dataset of images, CNN systems are able to independently discover image features for further classification without requiring manually designed inputs; in fact, CNN systems are expected to be superior to hand-engineered features input.10,11
Recently, an increasing number of CNN-based CAD models have been proposed and validated for medical imaging. Many studies, both bio-computational and clinical, have shown encouraging results upon their application. CNN-based CAD systems were first employed for single abnormality detection, but new implementations are being conducted for SBCE multiclass lesion detection, localization, characterization, and even some attempts regarding entirely self-reporting capsules.9 Even if they are still in the experimental phase and have not yet been introduced in clinical practice, CNN-based CAD models will probably revolutionize SBCE readings.
The initial performance was promising, as many CNN-based systems have been trained for blood and angioectasia detection, always with excellent accuracy results, with some algorithms achieving a sensitivity of 98.8% to 100% and a specificity of 96% to 98.4%.12,13 Recently, other CNN-based systems have been developed for the detection of ulcers and erosions in both inflammatory bowel disease (IBD) and non-IBD patients. A recent meta-analysis by Soffer et al.11 evaluated CNN-based system accuracy, pooling together five studies on bleeding source detection and five studies on ulcer detection, providing a bleeding content/source pooled sensitivity of 98% (95% confidence interval [CI], 96%–99%) and pooled specificity of 99% (95% CI, 97%–99%). For ulcer detection, the pooled sensitivity was 95% (95% CI, 89%–98%), while the pooled specificity was 94% (95% CI, 90%–96%). Similar performance was described in another meta-analysis by Mohan et al.,14 including nine studies and aimed to assess CNN-based system performance in hemorrhage and/or ulceration detection, establishing an overall pooled accuracy of 95.4% (sensitivity 95.5%, specificity 95.8%, positive predictive value 95.8%, and negative predictive value 96.8%).
Thus, CNN-based CAD could be useful in IBD diagnosis and assessment, especially in Crohn's disease. Some algorithms have already shown good results in this context, proving their value in grading the severity of ulcerations and in differentiating strictures from normal mucosa or other ulcerative lesions.15,16 The first results have also been described for celiac small bowel changes and protruding lesions, such as tumors and polyps, suggesting a potentially increasing role for SBCE in celiac disease diagnosis and follow-up, and in familial polyposis screening.17,18
Today, the main challenge for AI-assisted CE is the development of CNN-based systems capable of detecting, classifying, and characterizing multiple types of lesions. Accuracy is proven to be higher when systems are trained independently for different types of abnormalities; thus, very large datasets with high-quality annotations are required for these systems.19
In 2019, Ding et al.20 conducted a large study, collecting data from 6,970 patients who underwent SBCE in 77 medical centers for two years; a CNN-based model was trained with a 158,235 image dataset from 1,970 patients with the aim of classifying them into normal, inflammation, polyps, bleeding, and other pathological findings. The CNN model was further validated in 5,000 patients and showed excellent performance compared with conventional gastroenterologist reading, with high sensitivity (99.8%–99.9%) and an error rate of only 3% in distinguishing abnormal from normal images.
Similarly, the RetinaNet AI system trained by Otani et al.21 provided high area under the receiver operative curve (AUC) values for detecting ulcers/erosions (0.966), vascular lesions (0.95), and tumors (0.95), with slightly inferior AUC values after external validation (0.928, 0.884, and 0.9, respectively).
Some studies have also reported algorithms able to recognize small bowel segments and/or localize lesions. The model trained in a study by Dimas et al.22 showed an average localization error of 2.70±1.62 cm. In addition, the first attempts at bowel cleanliness assessment and for entirely self-reporting SBCE have also been described.9,23 In particular, regarding small bowel cleanliness, Nam et al.24 developed a DL-based automation software for calculating its score; a scoring system based on mucosal visibility was trained for DL in the training set, and the performance of the trained software was subsequently evaluated in the validation set.
Despite these promising results, many relevant concerns about medico-legal issues are emerging: in clinical practice, a fully developed and potentially self-reporting CNN-based system entrusts a machine with the entire diagnostic procedure. Even if just used as first reader assistance, this would result in a huge number of images never being reviewed by any human expert. Such concerns can only be overcome with adequate evidence on the usefulness of AI in SBCE in terms of cost-effectiveness, time saved, and improved accuracy in lesion detection.
A multicenter retrospective study conducted by Aoki et al.25 compared the CE reading time of physicians, alone or after CNN-aided first screening, in 20 videos of the entire small bowel, each of which included 0 to 5 lesions of small bowel mucosal breaks. The mean reading time was significantly shorter when the capsule study was performed after CNN assessment, both in the expert readers group (3.1 vs. 12.2 min, p<0.001) and in the trainee readers group (5.2 vs. 20.7 min, p<0.001), without affecting the accuracy in abnormalities detection (i.e., erosions or ulcerations). In the previously mentioned meta-analysis, Mohan et al.14 established in nine studies an average SBCE reading time of 4.5 minutes using CNN-based systems.
A large study published in 2019 by Ding et al.20 compared the results of conventional analysis and multiclass-trained CNN-assisted analysis performed by 20 gastroenterologists. The model outperformed conventional analysis, providing not only a significantly shorter mean reading time (5.9±2.2 min vs. 96.6±22.5 min, p<0.001), but also higher sensitivity (99.9% vs. 74.6%) and negative predictive value (99.8% vs. 67.4%), where the total detection rate increased by 16.3%.
Therefore, CNN-based CAD is promising for being potentially more valuable than conventional physician SBCE readings. However, before being fully introduced in clinical practice, there are still other challenges to solve: most studies are conducted under experimental conditions and have a retrospective single-center study design11; in addition, models are mainly trained on a large number of selected still images (with risks of overfitting) and are also usually validated on image datasets, rather than using full-length videos. External validation of all CNN models is limited by the variability of devices, networks, and images, but also by the possibility of discordance in lesion characterization or in medical terminology between expert groups.9
The motorized spiral enteroscope (PowerSpiral; Olympus Medical, Tokyo, Japan) is a novel advancement in the field of enteroscopy. This enteroscope operates on the same principle as the manual spiral enteroscope, with a spiral overtube mounted on the insertion tube portion, converting the rotational energy into a linear force to pull the bowel onto the scope. This innovation is powered by the integrated electric motor located in the endoscope handle, which allows rotation of the spiral overtube. Owing to the automated rotation, the procedure may be performed faster by a single operator.
PowerSpiral enteroscope (PSE) is composed of three elements (Fig. 2). (1) A reusable endoscope, similar to a standard pediatric colonoscope, with a working length of 168 cm, an outer diameter of 12.8 mm, and a 3.2 mm caliber accessory channel with a separate water irrigating channel. Therefore, routine colonoscopic accessories can be used for therapeutic procedures. In the latest endoscopes being released, PSE is equipped with image enhancement technology, such as high-definition imaging and narrow-band imaging. (2) A 24 cm disposable and atraumatic spiral overtube, made of soft spiral fins with an outer diameter of 31 mm, was attached to the rotation coupler on the tip of the endoscope. (3) A control unit with a foot pedal and visual force gauge. The operator controlled the integrated electric motor for rotating the spiral overtube through a foot pedal. During the procedure, a visual force gauge was used to monitor the direction of overtube rotation and to control the amount of torque applied to the small bowel. When excessive resistance is detected, spiral rotation is stopped automatically to avoid perforation, as reported in 0.27% of cases with conventional spiral enteroscopy.26
The first in-human procedure using PSE was performed in November 2015 by Neuhaus et al.27 (Dusseldorf, Germany) on a 48-year-old patient with iron deficiency anemia. PSE was inserted approximately 250 cm distal to the ligament of Treitz within 20 minutes, and jejunal angioectasia was detected and treated with argon plasma coagulation. Iatrogenic mucosal trauma or delayed adverse events (AEs) were registered.
The first prospective study of PSE was published by Beyna et al. in 2021.28 A total of 132 patients with suspected or confirmed small bowel diseases underwent antegrade enteroscopy with PSE at two European tertiary referral endoscopic centers (Dusseldorf, Brussels). The technical success rate of PSE, defined as reaching at least the ligament of Treitz, was 97%. The overall diagnostic yield of PSE, which was the primary endpoint of this study, was 74.2%. Therapeutic procedures were performed in 68.2% of cases. The median depth of maximum insertion (DMI) was 450 cm beyond the ligament of Treitz, with a median insertion time of 25 minutes. The median duration of PSE enteroscopy was 54 minutes. Total antegrade enteroscopy of the cecum was performed in 14 patients (10.6%). The overall AE rate was 14.4%, with two patients experiencing major AEs (1.5%): a perforation in the terminal ileum treated with laparoscopic suturing and upper GI bleeding from a Mallory-Weiss lesion of the gastric cardia that required endoscopic treatment and hemotransfusions.
A case series of 14 novel motorized spiral enteroscopies was recently published by Prasad et al.29 in India. The indications for the procedure were suspected small bowel thickening or stricture in 10 patients and obscure GI bleeding in the four remaining patients. The diagnostic yield obtained was 92.8% (13/14), which was defined as reaching the target lesion or performing a total enteroscopy. The average duration was 61.1 minutes for anterograde enteroscopy and 90 minutes for retrograde enteroscopy. Panenteroscopy, confirmed on entering the cecum from the antegrade approach, was achieved in two patients, with procedure times of 90 and 105 minutes. The therapeutic procedures performed were argon plasma coagulation (two patients) and balloon dilation of the stricture (one patient). Three patients experienced mild postprocedural odynophagia due to cricopharynx abrasions, and one patient developed pancreatitis and was treated conservatively.
Beyna et al.30 performed a second prospective trial of 30 patients at two tertiary referral centers, in which attention was focused on the ability of motorized spiral enteroscopy to perform panenteroscopy using the antegrade approach along with a retrograde approach, if needed. Indeed, the primary outcome of this study was the total endoscopic small bowel visualization rate, achieved in 72.4% of the patients, of which only five (16.6%) procedures were completed using an antegrade-alone approach. The median procedure times required for the antegrade and retrograde approaches were 51 and 40 minutes, respectively. The overall diagnostic yield of PSE in this study was 80%.
A further retrospective study by Ramchandani et al.,31 including 61 patients from an Indian population, showed similar results, wherein the overall total enteroscopy rate was 60.6%, of which 31.1% was achieved using an antegrade-alone approach. In both case series, no major AEs were reported.
However, all of the aforementioned studies included patients without a history of abdominal surgery. A very recent paper from the Netherlands showed good results in a population of 170 patients, 34 of whom (20%) had a history of surgically altered anatomy (mostly partial small bowel resection/strictureplasty or bariatric Roux-en-Y gastric bypass).32 The overall diagnostic yield was 64.1%, the panenteroscopy rate (when intended) was 70.3%, and most importantly, only minor AEs were registered in 15.8% of procedures, without differences between patients with or without altered anatomy. Moreover, the results of a large prospective observational multicenter trial in a real-life setting were recently published,33 which included 54 out of 251 patients (enrolled for the “core” phase of the study) with previous abdominal surgery (21.5%), resulting in surgically altered GI anatomy in 25 (10%). The overall serious AE rate (“core” and “training” phase patients) was 2.3% (7/298), which did not increase in the subgroup of post-surgical patients (1.9%). Diagnostic and therapeutic yields were 83% and 60.2%, respectively. Panenteroscopy was performed in 51% of the patients who were initially planned for total enteroscopy. All the published studies mentioned above are summarized in Table 1.29-34
In addition, motorized spiral enteroscopy showed promising outcomes in the first clinical case of enteroscopy-assisted endoscopic retrograde cholangiopancreatography (ERCP) in a patient with altered anatomy.34 DAE-ERCP with balloon dilation of bilioenteric anastomotic strictures was successfully performed using a novel motorized spiral enteroscope in a 78-year-old man who underwent duodenum-preserving pancreatic head resection and Roux-en-Y reconstruction. No AEs were observed during or after the procedure.
The results of these previous studies suggest that the motorized spiral enteroscope is a promising and reliable technique, in line with other DAE techniques in terms of diagnostic yield, DMI, and AE rate. Motorized spiral enteroscopy potentially provides many advantages in performing deep enteroscopy, including decreased procedure time, increased total enteroscopy rates, and the possibility of performing a single-operator procedure. However, at present, only indirect comparisons with other DAE are possible because of the lack of randomized controlled trials.
The introduction of AI in SBCE software and the release in the market of power spiral enteroscopy have revolutionized the approach to small bowel pathologies. The small bowel, once considered the black box for gastroenterologists, can now be studied and approached endoscopically more easily and with more possibilities. A definitive introduction in clinical practice of CNN-based CE and PSE will require further evidence prospectively developed and validated in real-life scenarios.
REFERENCES
1. Iddan G, Meron G, Glukhovsky A, et al. Wireless capsule endoscopy. Nature. 2000; 405:417.
2. Yamamoto H, Sekine Y, Sato Y, et al. Total enteroscopy with a nonsurgical steerable double-balloon method. Gastrointest Endosc. 2001; 53:216–220.
3. Pennazio M, Spada C, Eliakim R, et al. Small-bowel capsule endoscopy and device-assisted enteroscopy for diagnosis and treatment of small-bowel disorders: European Society of Gastrointestinal Endoscopy (ESGE) Clinical Guideline. Endoscopy. 2015; 47:352–376.
4. Mussetto A, Fuccio L, Dari S, et al. MiroCam capsule for obscure gastrointestinal bleeding: a prospective, single centre experience. Dig Liver Dis. 2013; 45:124–128.
5. Kim ER. Roles of capsule endoscopy and device-assisted enteroscopy in the diagnosis and treatment of small-bowel tumors. Clin Endosc. 2020; 53:410–416.
6. Beg S, Card T, Sidhu R, et al. The impact of reader fatigue on the accuracy of capsule endoscopy interpretation. Dig Liver Dis. 2021; 53:1028–1033.
7. Piccirelli S, Milluzzo SM, Bizzotto A, et al. Small bowel capsule endoscopy and artificial intelligence: first or second reader? Best Pract Res Clin Gastroenterol. 2021; 52-53:101742.
8. Chan HP, Samala RK, Hadjiiski LM, et al. Deep learning in medical image analysis. Adv Exp Med Biol. 2020; 1213:3–21.
9. Dray X, Iakovidis D, Houdeville C, et al. Artificial intelligence in small bowel capsule endoscopy: current status, challenges and future promise. J Gastroenterol Hepatol. 2021; 36:12–19.
10. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015; 521:436–444.
11. Soffer S, Klang E, Shimon O, et al. Deep learning for wireless capsule endoscopy: a systematic review and meta-analysis. Gastrointest Endosc. 2020; 92:831–839.
12. Tsuboi A, Oka S, Aoyama K, et al. Artificial intelligence using a convolutional neural network for automatic detection of small-bowel angioectasia in capsule endoscopy images. Dig Endosc. 2020; 32:382–390.
13. Leenhardt R, Vasseur P, Li C, et al. A neural network algorithm for detection of GI angiectasia during small-bowel capsule endoscopy. Gastrointest Endosc. 2019; 89:189–194.
14. Mohan BP, Khan SR, Kassab LL, et al. High pooled performance of convolutional neural networks in computer-aided diagnosis of GI ulcers and/or hemorrhage on wireless capsule endoscopy images: a systematic review and meta-analysis. Gastrointest Endosc. 2021; 93:356–364.
15. Klang E, Barash Y, Margalit RY, et al. Deep learning algorithms for automated detection of Crohn’s disease ulcers by video capsule endoscopy. Gastrointest Endosc. 2020; 91:606–613.
16. Klang E, Grinman A, Soffer S, et al. Automated detection of Crohn’s disease intestinal strictures on capsule endoscopy images using deep neural networks. J Crohns Colitis. 2021; 15:749–756.
17. Vicnesh J, Wei JKE, Ciaccio EJ, et al. Automated diagnosis of celiac disease by video capsule endoscopy using DAISY Descriptors. J Med Syst. 2019; 43:157.
18. Saito H, Aoki T, Aoyama K, et al. Automatic detection and classification of protruding lesions in wireless capsule endoscopy images based on a deep convolutional neural network. Gastrointest Endosc. 2020; 92:144–151.
19. Trasolini R, Byrne MF. Artificial intelligence and deep learning for small bowel capsule endoscopy. Dig Endosc. 2021; 33:290–297.
20. Ding Z, Shi H, Zhang H, et al. Gastroenterologist-level identification of small-bowel diseases and normal variants by capsule endoscopy using a deep-learning model. Gastroenterology. 2019; 157:1044–1054.
21. Otani K, Nakada A, Kurose Y, et al. Automatic detection of different types of small-bowel lesions on capsule endoscopy images using a newly developed deep convolutional neural network. Endoscopy. 2020; 52:786–791.
22. Dimas G, Spyrou E, Iakovidis DK, et al. Intelligent visual localization of wireless capsule endoscopes enhanced by color information. Comput Biol Med. 2017; 89:429–440.
23. Leenhardt R, Souchaud M, Houist G, et al. A neural network-based algorithm for assessing the cleanliness of small bowel during capsule endoscopy. Endoscopy. 2021; 53:932–936.
24. Nam JH, Hwang Y, Oh DJ, et al. Development of a deep learning-based software for calculating cleansing score in small bowel capsule endoscopy. Sci Rep. 2021; 11:4417.
25. Aoki T, Yamada A, Aoyama K, et al. Clinical usefulness of a deep learning-based system as the first screening on small-bowel capsule endoscopy reading. Dig Endosc. 2020; 32:585–591.
26. Akerman PA, Cantero D. Severe complications of spiral enteroscopy in the first 1750 patients. Gastrointest Endosc. 2009; 69:PAB127.
27. Neuhaus H, Beyna T, Schneider M, et al. Novel motorized spiral enteroscopy: first clinical case. VideoGIE. 2016; 1:32–33.
28. Beyna T, Arvanitakis M, Schneider M, et al. Motorised spiral enteroscopy: first prospective clinical feasibility study. Gut. 2021; 70:261–267.
29. Prasad M, Prasad VG, Sangameswaran A, et al. A spiraling journey into the small bowel: a case series of novel motorized power spiral enteroscopies. VideoGIE. 2020; 5:591–596.
30. Beyna T, Arvanitakis M, Schneider M, et al. Total motorized spiral enteroscopy: first prospective clinical feasibility trial. Gastrointest Endosc. 2021; 93:1362–1370.
31. Ramchandani M, Rughwani H, Inavolu P, et al. Diagnostic yield and therapeutic impact of novel motorized spiral enteroscopy in small-bowel disorders: a single-center, real-world experience from a tertiary care hospital (with video). Gastrointest Endosc. 2021; 93:616–626.
32. Al-Toma A, Beaumont H, Koornstra JJ, et al. The performance and safety of motorized spiral enteroscopy, including in patients with surgically altered gastrointestinal anatomy: a multicenter prospective study. Endoscopy. 2022; Feb. 28. [Epub]. https://doi.org/10.1055/a-1783-4802.
33. Beyna T, Moreels T, Arvanitakis M, et al. Motorized spiral enteroscopy: results of an international, multicenter, prospective observational clinical study on patients with normal and altered gastrointestinal anatomy. Endoscopy. 2022; Apr. 21. [Epub]. https://doi.org/10.1055/a-1831-6215.
34. Beyna T, Schneider M, Höllerich J, et al. Motorized spiral enteroscopy-assisted ERCP after Roux-en-Y reconstructive surgery and bilioenteric anastomosis: first clinical case. VideoGIE. 2020; 5:311–313.
Table 1.
Study | Study design | No. of patients | Diagnostic yield (%) | Therapeutic yield (%) | Panenteroscopy rate (%) | Major adverse events (%) | Minor adverse events (%) |
---|---|---|---|---|---|---|---|
Beyna et al.,34 2020 | Prospective | 132 | 74.2 | 68.2 | 10.6 | 1.5 | 14.4 |
Prasad et al.,29 2020 | Case series | 14 | 92.8 | 35.7 | 35.7 | 28.5 | 42.8 |
Ramchandani et al.,31 2021 | Retrospective | 61 | 65.5 | 23.0 | 60.6 | 0 | 24.5 |
Beyna et al.,30 2021 | Prospective | 30 | 80.0 | 86.7 | 70.0 | 0 | 16.7 |
Al-Toma et al.,32 2022 | Prospective | 170 | 64.1 | 53.5 | 70.3 | 0 | 15.8 |
Beyna et al.,33 2022 | Prospective | 298 | 83.0 | 60.2 | 51.9 | 2.3 | 8.7 |