A novel network-level fused deep learning architecture with shallow neural network classifier for gastrointestinal cancer classification from wireless capsule endoscopy imagesopen access
- Authors
- Khan, Muhammad Attique; Shafiq, Usama; Hamza, Ameer; Mirza, Anwar M.; Baili, Jamel; Alhammadi, Dina Abdulaziz; Cho, Hee-Chan; Chang, Byoungchol
- Issue Date
- Mar-2025
- Publisher
- BioMed Central
- Keywords
- Gastrointestinal disease; Wireless capsule endoscopy; Deep learning; LSTM; Fusion; Optimization; Shallow machine learning
- Citation
- BMC Medical Informatics and Decision Making, v.25, no.1, pp 1 - 19
- Pages
- 19
- Indexed
- SCIE
SCOPUS
- Journal Title
- BMC Medical Informatics and Decision Making
- Volume
- 25
- Number
- 1
- Start Page
- 1
- End Page
- 19
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207305
- DOI
- 10.1186/s12911-025-02966-0
- ISSN
- 1472-6947
1472-6947
- Abstract
- Deep learning has significantly contributed to medical imaging and computer-aided diagnosis (CAD), providing accurate disease classification and diagnosis. However, challenges such as inter- and intra-class similarities, class imbalance, and computational inefficiencies due to numerous hyperparameters persist. This study aims to address these challenges by presenting a novel deep-learning framework for classifying and localizing gastrointestinal (GI) diseases from wireless capsule endoscopy (WCE) images. The proposed framework begins with dataset augmentation to enhance training robustness. Two novel architectures, Sparse Convolutional DenseNet201 with Self-Attention (SC-DSAN) and CNN-GRU, are fused at the network level using a depth concatenation layer, avoiding the computational costs of feature-level fusion. Bayesian Optimization (BO) is employed for dynamic hyperparameter tuning, and an Entropy-controlled Marine Predators Algorithm (EMPA) selects optimal features. These features are classified using a Shallow Wide Neural Network (SWNN) and traditional classifiers. Experimental evaluations on the Kvasir-V1 and Kvasir-V2 datasets demonstrate superior performance, achieving accuracies of 99.60% and 95.10%, respectively. The proposed framework offers improved accuracy, precision, and computational efficiency compared to state-of-the-art models. The proposed framework addresses key challenges in GI disease diagnosis, demonstrating its potential for accurate and efficient clinical applications. Future work will explore its adaptability to additional datasets and optimize its computational complexity for broader deployment.
- Files in This Item
-
Go to Link
- Appears in
Collections - ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.