A Novel Attention-Based Parallel Blocks Deep Architecture for Human Action Recognition

Jadoon, Yasir Khan; Khalid, Yasir Noman; Khan, Muhammad Attique; Shin, Jungpil; Alhayan, Fatimah; Cho, Hee-Chan; Chang, Byoungchol

doi:10.32604/cmes.2025.066984

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

A Novel Attention-Based Parallel Blocks Deep Architecture for Human Action Recognition

Authors: Jadoon, Yasir Khan; Khalid, Yasir Noman; Khan, Muhammad Attique; Shin, Jungpil; Alhayan, Fatimah; Cho, Hee-Chan; Chang, Byoungchol

Issue Date: Jul-2025

Publisher: Tech Science Press

Keywords: Human action recognition; self-attention; video streams; residual bottleneck; classification; neural networks

Citation: CMES - Computer Modeling in Engineering and Sciences, v.144, no.1, pp 1143 - 1164

Pages: 22

Indexed: SCIE
SCOPUS

Journal Title: CMES - Computer Modeling in Engineering and Sciences

Volume: 144

Number: 1

Start Page: 1143

End Page: 1164

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208678

DOI: 10.32604/cmes.2025.066984

ISSN: 1526-1492
1526-1506

Abstract: Real-time surveillance is attributed to recognizing the variety of actions performed by humans. Human Action Recognition (HAR) is a technique that recognizes human actions from a video stream. A range of variations in human actions makes it difficult to recognize with considerable accuracy. This paper presents a novel deep neural network architecture called Attention RB-Net for HAR using video frames. The input is provided to the model in the form of video frames. The proposed deep architecture is based on the unique structuring of residual blocks with several filter sizes. Features are extracted from each frame via several operations with specific parameters defined in the presented novel Attention-based Residual Bottleneck (Attention-RB) DCNN architecture. A fully connected layer receives an attention-based features matrix, and final classification is performed. Several hyperparameters of the proposed model are initialized using Bayesian Optimization (BO) and later utilized in the trained model for testing. In testing, features are extracted from the self-attention layer and passed to neural network classifiers for the final action classification. Two highly cited datasets, HMDB51 and UCF101, were used to validate the proposed architecture and obtained an average accuracy of 87.70% and 97.30%, respectively. The deep convolutional neural network (DCNN) architecture is compared with state-of-the-art (SOTA) methods, including pre-trained models, inside blocks, and recently published techniques, and performs better.

Files in This Item: Go to Link

Appears in Collections: ETC > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher CHANG, BYOUNGCHOL photo

CHANG, BYOUNGCHOL: 서울 부총장(서울) (서울 창의융합교육원(소프트웨어교육위원회))

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE