A hybrid deep learning approach integrating CNN and transformer for lung cancer classification using CT scansopen access
- Authors
- Yousafzai, Samia Nawaz; Nasir, Inzamam Mashood; Mansour, Sahar; Negm, Noha; Alhashmi, Asma A.; Alharbi, Mohannad A.; Kim, Eunchan
- Issue Date
- Mar-2026
- Publisher
- NATURE PORTFOLIO
- Keywords
- Lung Cancer; CT Imaging; Deep Learning; Improved Swin Transformer; Hybrid Shifted Window; Convolutional Neural Network
- Citation
- SCIENTIFIC REPORTS, v.16, no.1, pp 1 - 18
- Pages
- 18
- Indexed
- SCIE
SCOPUS
- Journal Title
- SCIENTIFIC REPORTS
- Volume
- 16
- Number
- 1
- Start Page
- 1
- End Page
- 18
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212932
- DOI
- 10.1038/s41598-026-41161-7
- ISSN
- 2045-2322
- Abstract
- Lung cancer is an extremely fatal kind of cancer, resulting in the deaths of almost 7.6 million individuals annually around the globe. Nevertheless, a timely diagnosis is a crucial necessity for enhancing the likelihood of human survival. Regarding tumor identification, CT scans are normally used to identify affected areas. Nevertheless, CT imaging face significant problems such as poor visibility of tumor locations and high false negative rates. The small dataset size of medical imaging makes it challenging to capture local lesion features by iterative training, considering all input features equally. This work integrates Convolutional Neural Network (CNN) and Improved Swin Transformer (C-Swin), a deep learning model that extracts and integrates fine-grained local and global features. C-Swin has Transformer encoder and a CNN module. The CNN module extracts local features, whereas the Transformer module captures global features. The Transformer encoder uses a hybrid shifted window attention method to focus on a spatial region of the CT image, reducing background semantic information and improving local feature capture accuracy. The proposed method is validated using the publicly accessible Kaggle dataset namely IQ-OTH/NCCD with three classes. the proposed C-Swin model achieved average accuracy of 96.26%, precision of 97.48%, recall of 96.39% and F1-score of 97.42%. The numerical findings unequivocally demonstrate that our proposed method surpasses various existing methods with an increase in accuracy ranging from 2.31% to 6.81%. The C-Swin model is capable of extracting detailed local lesion features, resulting in improved classification performance.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 정보시스템학과 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.