Deep Learning Methods with the Improved Attention for Explainable Image Recognition

Bai, Na; Joe, Inwhee

doi:10.1109/ACCESS.2024.3397323

Full metadata record

DC Field	Value	Language
dc.contributor.author	Bai, Na	-
dc.contributor.author	Joe, Inwhee	-
dc.date.accessioned	2025-12-11T01:00:16Z	-
dc.date.available	2025-12-11T01:00:16Z	-
dc.date.issued	2024-05	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209719	-
dc.description.abstract	This study presents deep learning research aimed at improving both the performance and the interpretability of image classification models. We explore ways to improve the model by combining attention mechanisms with convolutional neural networks. The model is trained in a supervised manner on garbage classification data from public datasets, and Grad-CAM (Gradient-weighted Class Activation Mapping) is used together with the SE (Squeeze-and-Excitation) channel attention mechanism to generate heat maps that make the model’s classification decisions easier to understand. The Grad-CAM heat maps visualize the areas the model focuses on during classification, which provides a way to explain its decisions and to better understand their basis across different image categories. The model is improved by adding attention modules to different stages of the ResNet50 (Residual Network-50) network, thereby improving the accuracy and performance of the network. We observe that within the same stage, the structure and required attention of each module are consistent, so only one attention module is added per stage to reduce the learning burden on the network and speed up learning. To simplify the attention computation, a global tensor is introduced to store the attention of each stage, thereby avoiding repeated calculations in each module. The experimental results show that, compared with traditional convolutional neural networks, the proposed method achieves better performance on garbage classification tasks. By combining the attention mechanism with heat map interpretation, our model improves classification accuracy. This is significant for image classification tasks in practical applications and helps advance research on interpretability in deep learning.	-
dc.format.extent	9	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Deep Learning Methods with the Improved Attention for Explainable Image Recognition	-
dc.type	Article	-
dc.publisher.location	United States	-
dc.identifier.doi	10.1109/ACCESS.2024.3397323	-
dc.identifier.scopusid	2-s2.0-85193017126	-
dc.identifier.wosid	001231446300001	-
dc.identifier.bibliographicCitation	IEEE Access, v.12, pp 70559 - 70567	-
dc.citation.title	IEEE Access	-
dc.citation.volume	12	-
dc.citation.startPage	70559	-
dc.citation.endPage	70567	-
dc.type.docType	Article in press	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Telecommunications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Telecommunications	-
dc.subject.keywordPlus	Cams	-
dc.subject.keywordPlus	Classification (of information)	-
dc.subject.keywordPlus	Convolution	-
dc.subject.keywordPlus	Decision making	-
dc.subject.keywordPlus	Deep neural networks	-
dc.subject.keywordPlus	Image enhancement	-
dc.subject.keywordPlus	Image recognition	-
dc.subject.keywordPlus	Job analysis	-
dc.subject.keywordPlus	Tensors	-
dc.subject.keywordAuthor	Attention mechanism	-
dc.subject.keywordAuthor	convolutional neural networks	-
dc.subject.keywordAuthor	deep learning	-
dc.subject.keywordAuthor	Grad-CAM	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10521527	-
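
The abstract describes SE (Squeeze-and-Excitation) channel attention computed once per ResNet50 stage and stored so that modules in the same stage can reuse it. The record itself contains no code, so the following is only a minimal pure-Python sketch of that idea under stated assumptions: the names `squeeze`, `se_weights`, `attend`, and `stage_attention`, the dictionary cache, and the toy weights are all hypothetical and are not taken from the paper.

```python
import math

def squeeze(feature_map):
    """Squeeze step: global average pooling, C x H x W -> one scalar per channel."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feature_map]

def dense(vec, weights, bias):
    """Fully connected layer; `weights` holds one row per output unit."""
    return [sum(w * v for w, v in zip(row, vec)) + b for row, b in zip(weights, bias)]

def se_weights(feature_map, w1, b1, w2, b2):
    """Excitation step: FC -> ReLU -> FC -> sigmoid, one weight in (0, 1) per channel."""
    z = squeeze(feature_map)
    hidden = [max(0.0, h) for h in dense(z, w1, b1)]
    return [1.0 / (1.0 + math.exp(-s)) for s in dense(hidden, w2, b2)]

def scale(feature_map, weights):
    """Rescale every channel by its attention weight."""
    return [[[x * a for x in row] for row in ch] for ch, a in zip(feature_map, weights)]

# Hypothetical stand-in for the "global tensor": attention is computed once per
# stage and reused by later modules in that stage. It would need to be cleared
# before each new input.
stage_attention = {}

def attend(stage, feature_map, params):
    if stage not in stage_attention:
        stage_attention[stage] = se_weights(feature_map, *params)
    return scale(feature_map, stage_attention[stage])

# Toy input: 2 channels of size 2 x 2, with a single hidden unit in the bottleneck.
fmap = [[[1, 2], [3, 4]], [[0, 0], [0, 0]]]
params = ([[1.0, 1.0]], [0.0], [[0.0], [1.0]], [0.0, 0.0])
out = attend("stage1", fmap, params)
```

A second call to `attend("stage1", ...)` with another module's feature map reuses the cached weights instead of recomputing them, which is one possible reading of how the stored attention avoids repeated calculations.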

Appears in Collections: Seoul College of Engineering > Seoul School of Computer Software > 1. Journal Articles
