Toward better ear disease diagnosis: A multi-modal multi-fusion model using endoscopic images of the tympanic membrane and pure-tone audiometry
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kim, Taewan | - |
dc.contributor.author | Kim, Sangyeop | - |
dc.contributor.author | Kim, Jaeyoung | - |
dc.contributor.author | Lee, Yeonjoon | - |
dc.contributor.author | Choi, June | - |
dc.date.accessioned | 2023-11-14T01:32:03Z | - |
dc.date.available | 2023-11-14T01:32:03Z | - |
dc.date.issued | 2023-10 | - |
dc.identifier.issn | 2169-3536 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/115434 | - |
dc.description.abstract | Chronic otitis media is characterized by recurrent infections, leading to serious complications, such as meningitis, facial palsy, and skull base osteomyelitis. Therefore, active treatment based on early diagnosis is essential. This study developed a multi-modal multi-fusion (MMMF) model that automatically diagnoses ear diseases by applying endoscopic images of the tympanic membrane (TM) and pure-tone audiometry (PTA) data to a deep learning model. The primary aim of the proposed MMMF model is to add "normal with hearing loss" as a category and to improve diagnostic accuracy for the four conventional ear disease states: normal, TM perforation, retraction, and cholesteatoma. To this end, the MMMF model was trained on 1,480 endoscopic images of the TM and PTA data to distinguish five ear disease states: normal, TM perforation, retraction, cholesteatoma, and normal (hearing loss). It employs a feature fusion strategy of cross-attention, concatenation, and gated multi-modal units in a multi-modal architecture encompassing a convolutional neural network (CNN) and multi-layer perceptron. We expanded the classification capability to include an additional category, normal (hearing loss), thereby enhancing the diagnostic performance of extant ear disease classification. The MMMF model demonstrated superior performance when implemented with EfficientNet-B7, achieving 92.9% accuracy and 90.9% recall, thereby outperforming the existing feature fusion methods. In addition, five-fold cross-validation experiments were conducted, in which the model consistently demonstrated robust performance when endoscopic images of the TM and PTA data were applied to the deep learning model across all datasets. The proposed MMMF model is the first to include a category of normal ear disease state with hearing loss. The developed model demonstrated superior performance compared to existing CNN models and feature fusion methods. Consequently, this study substantiates the utility of simultaneously applying PTA data and endoscopic images of the TM for the automated diagnosis of ear diseases in clinical settings and validates the usefulness of the multi-fusion method. | - |
dc.format.extent | 11 | - |
dc.language | English | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | Toward better ear disease diagnosis: A multi-modal multi-fusion model using endoscopic images of the tympanic membrane and pure-tone audiometry | - |
dc.type | Article | - |
dc.publisher.location | United States | - |
dc.identifier.doi | 10.1109/ACCESS.2023.3325346 | - |
dc.identifier.scopusid | 2-s2.0-85174827245 | - |
dc.identifier.wosid | 001092093800001 | - |
dc.identifier.bibliographicCitation | IEEE Access, v.11, pp 116721 - 116731 | - |
dc.citation.title | IEEE Access | - |
dc.citation.volume | 11 | - |
dc.citation.startPage | 116721 | - |
dc.citation.endPage | 116731 | - |
dc.type.docType | Article | - |
dc.description.isOpenAccess | Y | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.subject.keywordPlus | SENSORINEURAL HEARING-LOSS | - |
dc.subject.keywordPlus | ARTIFICIAL-INTELLIGENCE | - |
dc.subject.keywordPlus | OTITIS-MEDIA | - |
dc.subject.keywordAuthor | Artificial intelligence | - |
dc.subject.keywordAuthor | Auditory system | - |
dc.subject.keywordAuthor | Biomedical imaging | - |
dc.subject.keywordAuthor | Bones | - |
dc.subject.keywordAuthor | Classification algorithms | - |
dc.subject.keywordAuthor | Computer aided diagnosis | - |
dc.subject.keywordAuthor | Convolutional neural networks | - |
dc.subject.keywordAuthor | Data models | - |
dc.subject.keywordAuthor | Deep learning | - |
dc.subject.keywordAuthor | Diseases | - |
dc.subject.keywordAuthor | Ear | - |
dc.subject.keywordAuthor | Electronic medical records | - |
dc.subject.keywordAuthor | Media | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/10286540 | - |
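
The abstract names gated multi-modal units as one of the fusion strategies combining CNN image features with PTA features. The following is a minimal NumPy sketch of the general gated multi-modal unit idea (a learned gate mixes two modality-specific hidden vectors), not the authors' implementation. The function name, feature dimensions, and random weights are all hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def gated_multimodal_unit(img_feat, pta_feat, W_img, W_pta, W_gate):
    """Fuse two feature vectors with a learned element-wise gate.

    img_feat: (batch, d_img) image features, e.g. from a CNN backbone
    pta_feat: (batch, d_pta) audiometry features, e.g. from an MLP
    Returns a (batch, d_hid) fused representation.
    """
    h_img = np.tanh(img_feat @ W_img)   # modality-specific hidden vector
    h_pta = np.tanh(pta_feat @ W_pta)   # modality-specific hidden vector
    # The gate sees both modalities and decides, per hidden unit,
    # how much weight the image branch gets versus the PTA branch.
    joint = np.concatenate([img_feat, pta_feat], axis=-1)
    z = sigmoid(joint @ W_gate)
    return z * h_img + (1.0 - z) * h_pta


# Hypothetical sizes; the paper's actual dimensions are not given in this record.
rng = np.random.default_rng(0)
batch, d_img, d_pta, d_hid = 4, 32, 8, 16
img_feat = rng.normal(size=(batch, d_img))
pta_feat = rng.normal(size=(batch, d_pta))
W_img = rng.normal(scale=0.1, size=(d_img, d_hid))
W_pta = rng.normal(scale=0.1, size=(d_pta, d_hid))
W_gate = rng.normal(scale=0.1, size=(d_img + d_pta, d_hid))

fused = gated_multimodal_unit(img_feat, pta_feat, W_img, W_pta, W_gate)
print(fused.shape)
```

In a trained model the three weight matrices would be learned jointly with the classifier, and the fused vector would feed a final layer over the five ear disease states listed in the abstract.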