DCNN models with post-hoc interpretability for the automated detection of glossitis and OSCC on the tongue

Lee, Yeon-hee; Jeon, Seonggwang; Jung, Junho; Auh, Q. Schick; Lee, Jae-seo; Chaurasia, Akhilanand; Noh, Yung Kyun

doi:10.1038/s41598-025-16760-5

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

DCNN models with post-hoc interpretability for the automated detection of glossitis and OSCC on the tongueopen accessDCNN models with post-hoc interpretability for the automated detection of glossitis and OSCC on the tongue

Other Titles: DCNN models with post-hoc interpretability for the automated detection of glossitis and OSCC on the tongue

Authors: Lee, Yeon-hee; Jeon, Seonggwang; Jung, Junho; Auh, Q. Schick; Lee, Jae-seo; Chaurasia, Akhilanand; Noh, Yung Kyun

Issue Date: Aug-2025

Publisher: Nature Publishing Group

Keywords: Tongue diagnosis; Glossitis; Oral squamous cell carcinoma; Deep learning; Convolutional neural network; Artificial intelligence; Interpretability

Citation: Scientific Reports, v.15, no.1, pp 1 - 18

Pages: 18

Indexed: SCIE
SCOPUS

Journal Title: Scientific Reports

Volume: 15

Number: 1

Start Page: 1

End Page: 18

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208793

DOI: 10.1038/s41598-025-16760-5

ISSN: 2045-2322
2045-2322

Abstract: This study aimed to develop and evaluate deep convolutional neural network (DCNN) models with Grad-CAM visualization for the automated classification with interpretability of tongue conditions—specifically glossitis and oral squamous cell carcinoma (OSCC)—using clinical tongue photographs, with a focus on their potential for early detection and telemedicine-based diagnostics. A total of 652 tongue images were categorized into normal control (n = 294), glossitis (n = 340), and OSCC (n = 17). Four pretrained DCNN architectures (VGG16, VGG19, ResNet50, ResNet152) were fine-tuned using transfer learning. Model interpretability was enhanced via Grad-CAM and sparsity analysis. Diagnostic performance was assessed using AUROC, with subgroup analysis by age, sex, and image segmentation strategy. For glossitis classification, VGG16 (AUROC = 0.8428, 95% CI 0.7757–0.9100) and VGG19 (AUROC = 0.8639, 95% CI 0.7988–0.9170) performed strongly, while the ensemble of VGG16 and VGG19 achieved the best result (AUROC = 0.8731, 95% CI 0.8072–0.9298). OSCC detection showed near-perfect performance across all models, with VGG19 and ResNet152 achieving AUROC = 1.0000 and VGG16 reaching AUROC = 0.9902 (95% CI 0.9707–1.0000). Diagnostic performance did not differ significantly by age (P = 0.3052) or sex (P = 0.4531), and whole-image classification outperformed patch-wise segmentation (P = 0.7440). DCNN models with Grad-CAM demonstrated robust performance in classifying glossitis and OSCC from tongue photographs with interpretability. The results highlight the potential of AI-driven tongue diagnosis as a valuable tool for remote healthcare, promoting early detection and expanding access to oral health services.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Noh, Yung Kyun photo

Noh, Yung Kyun: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE