Detailed Information


A Novel Attention-Based Parallel Blocks Deep Architecture for Human Action Recognition

Full metadata record
DC Field | Value | Language
dc.contributor.author | Jadoon, Yasir Khan | -
dc.contributor.author | Khalid, Yasir Noman | -
dc.contributor.author | Khan, Muhammad Attique | -
dc.contributor.author | Shin, Jungpil | -
dc.contributor.author | Alhayan, Fatimah | -
dc.contributor.author | Cho, Hee-Chan | -
dc.contributor.author | Chang, Byoungchol | -
dc.date.accessioned | 2025-09-08T08:30:25Z | -
dc.date.available | 2025-09-08T08:30:25Z | -
dc.date.issued | 2025-07 | -
dc.identifier.issn | 1526-1492 | -
dc.identifier.issn | 1526-1506 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208678 | -
dc.description.abstract | Real-time surveillance is attributed to recognizing the variety of actions performed by humans. Human Action Recognition (HAR) is a technique that recognizes human actions from a video stream. A range of variations in human actions makes it difficult to recognize with considerable accuracy. This paper presents a novel deep neural network architecture called Attention RB-Net for HAR using video frames. The input is provided to the model in the form of video frames. The proposed deep architecture is based on the unique structuring of residual blocks with several filter sizes. Features are extracted from each frame via several operations with specific parameters defined in the presented novel Attention-based Residual Bottleneck (Attention-RB) DCNN architecture. A fully connected layer receives an attention-based features matrix, and final classification is performed. Several hyperparameters of the proposed model are initialized using Bayesian Optimization (BO) and later utilized in the trained model for testing. In testing, features are extracted from the self-attention layer and passed to neural network classifiers for the final action classification. Two highly cited datasets, HMDB51 and UCF101, were used to validate the proposed architecture and obtained an average accuracy of 87.70% and 97.30%, respectively. The deep convolutional neural network (DCNN) architecture is compared with state-of-the-art (SOTA) methods, including pre-trained models, inside blocks, and recently published techniques, and performs better. | -
dc.format.extent | 22 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | Tech Science Press | -
dc.title | A Novel Attention-Based Parallel Blocks Deep Architecture for Human Action Recognition | -
dc.type | Article | -
dc.publisher.location | United States | -
dc.identifier.doi | 10.32604/cmes.2025.066984 | -
dc.identifier.scopusid | 2-s2.0-105012133613 | -
dc.identifier.wosid | 001546263500001 | -
dc.identifier.bibliographicCitation | CMES - Computer Modeling in Engineering and Sciences, v.144, no.1, pp 1143 - 1164 | -
dc.citation.title | CMES - Computer Modeling in Engineering and Sciences | -
dc.citation.volume | 144 | -
dc.citation.number | 1 | -
dc.citation.startPage | 1143 | -
dc.citation.endPage | 1164 | -
dc.type.docType | Article | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalResearchArea | Mathematics | -
dc.relation.journalWebOfScienceCategory | Engineering, Multidisciplinary | -
dc.relation.journalWebOfScienceCategory | Mathematics, Interdisciplinary Applications | -
dc.subject.keywordPlus | FEATURES | -
dc.subject.keywordPlus | LSTM | -
dc.subject.keywordAuthor | Human action recognition | -
dc.subject.keywordAuthor | self-attention | -
dc.subject.keywordAuthor | video streams | -
dc.subject.keywordAuthor | residual bottleneck | -
dc.subject.keywordAuthor | classification | -
dc.subject.keywordAuthor | neural networks | -
dc.identifier.url | https://www.techscience.com/CMES/v144n1/63294 | -
Appears in Collections: ETC > 1. Journal Articles


Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

CHANG, BYOUNGCHOL
Seoul Vice President (Seoul) (Seoul Center for Creative Convergence Education (Software Education Committee))