Integrating Deep Feature Extraction and Hybrid ResNet-DenseNet Model for Multi-Class Abnormality Detection in Endoscopic Images

Sagar, Aman, Mehta, Preeti, Shrivastva, Monika, Kumari, Suchi

arXiv.org Artificial Intelligence 

Gastrointestinal (GI) and liver diseases have become increasingly prevalent across the globe, largely due to factors such as industrialization, dietary shifts, and the widespread use of antibiotics. These diseases pose significant diagnostic and treatment challenges, emphasizing the need for advanced medical technologies. Video Capsule Endoscopy (VCE) is a key non-invasive tool for examining the GI tract, especially in diagnosing conditions related to the small intestine, such as Crohn's disease, Celiac disease, and GI cancer. Unlike traditional endoscopy, VCE involves a small, pill-sized camera that travels through the digestive tract, capturing detailed images without sedation or invasive procedures. This method offers a comprehensive view of areas that are difficult to reach using conventional endoscopy. Despite its advantages, VCE faces challenges in practical application. A typical VCE procedure can generate between 57,000 to 1,000,000 images for 6-8 hours, which gastroenterologists must review.