A Novel Attention-Based Parallel Blocks Deep Architecture for Human Action Recognition

Authors
Jadoon, Yasir Khan; Khalid, Yasir Noman; Khan, Muhammad Attique; Shin, Jungpil; Alhayan, Fatimah; Cho, Hee-Chan; Chang, Byoungchol
Issue Date
Jul-2025
Publisher
Tech Science Press
Keywords
Human action recognition; self-attention; video streams; residual bottleneck; classification; neural networks
Citation
CMES - Computer Modeling in Engineering and Sciences, v.144, no.1, pp 1143 - 1164
Pages
22
Indexed
SCIE
SCOPUS
Journal Title
CMES - Computer Modeling in Engineering and Sciences
Volume
144
Number
1
Start Page
1143
End Page
1164
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208678
DOI
10.32604/cmes.2025.066984
ISSN
1526-1492
1526-1506
Abstract
Real-time surveillance relies on recognizing the variety of actions that humans perform. Human Action Recognition (HAR) is a technique that recognizes human actions from a video stream. The wide range of variation in human actions makes them difficult to recognize with high accuracy. This paper presents a novel deep neural network architecture, called Attention RB-Net, for HAR using video frames. The input is provided to the model as video frames. The proposed deep architecture is based on a unique arrangement of residual blocks with several filter sizes. Features are extracted from each frame through several operations, with specific parameters, defined in the proposed Attention-based Residual Bottleneck (Attention-RB) deep convolutional neural network (DCNN) architecture. A fully connected layer receives an attention-based feature matrix and performs the final classification. Several hyperparameters of the proposed model are initialized using Bayesian Optimization (BO) and later used in the trained model for testing. In testing, features are extracted from the self-attention layer and passed to neural network classifiers for the final action classification. The proposed architecture was validated on two highly cited datasets, HMDB51 and UCF101, and obtained average accuracies of 87.70% and 97.30%, respectively. The DCNN architecture is compared with state-of-the-art (SOTA) methods, including pre-trained models, inside blocks, and recently published techniques, and performs better.
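The abstract mentions features taken from a self-attention layer. As a rough illustration only, the sketch below applies generic scaled dot-product self-attention to a few per-frame feature vectors. It is not the paper's Attention-RB layer: the identity query/key/value projections, the function names, and the toy input are all assumptions made for this example.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(features):
    """Generic scaled dot-product self-attention.

    Assumption: queries, keys, and values are the input features
    themselves (identity projections); a trained layer would learn
    separate projection matrices.
    """
    d = len(features[0])
    scale = math.sqrt(d)
    attended = []
    for q in features:
        # Similarity of this feature vector to every feature vector.
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in features]
        weights = softmax(scores)
        # Weighted average of all feature vectors.
        attended.append(
            [sum(w * v[j] for w, v in zip(weights, features)) for j in range(d)]
        )
    return attended


# Hypothetical per-frame feature vectors (three frames, three features each).
frames = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [1.0, 1.0, 0.0]]
attended = self_attention(frames)
```

Each output row is a weighted average of the input rows, so a downstream classifier sees every frame's features mixed with those of the frames it most resembles.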
Appears in Collections
ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

CHANG, BYOUNGCHOL
Seoul Vice President (Seoul) (Seoul Creative Convergence Education Center (Software Education Committee))