Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

POSNet: a hybrid deep learning model for efficient person re-identification

Authors
Batool, E.Gillani, S.Naz, S.Bukhari, M.Maqsood, M.Yeo, S.-S.Rho, Seungmin
Issue Date
Aug-2023
Publisher
Springer
Keywords
Hybrid deep learning; Intra- and inter-class variations; Limited labeled data challenges; LSTM; Person re-identification; POSNet; Soft-pool-assisted attentions; Spatio-temporal feature learning
Citation
Journal of Supercomputing, v.79, no.12, pp 13090 - 13118
Pages
29
Journal Title
Journal of Supercomputing
Volume
79
Number
12
Start Page
13090
End Page
13118
URI
https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/69979
DOI
10.1007/s11227-023-05169-4
ISSN
0920-8542
1573-0484
Abstract
Person re-identification refers to the process of recognizing a person across several non-overlapping cameras. It is becoming increasingly important in computer vision for real-world surveillance applications. However, the deployment of person-re-identification systems as a surveillance system raises various challenges in their performance. These challenges include limited labeled data, occlusions conditions, human body postures, as well as inter- and intra-class variations. Such challenges deteriorate the effectiveness of person-re-identification systems and lead to the extraction of less discriminative features. Hence, to address these problems, we proposed a hybrid deep learning model, namely POSNet (pseudo-labeled omni-scale network) for efficient person re-identification. The proposed method is referred to as a hybrid because it combines label estimate with modified omni-scale feature learning, i.e., spatiotemporal-assisted omni-scale feature extraction to accomplish person re-identification. To further enhance omni-scale feature learning, we have proposed soft-pool-assisted attention mechanisms during spatial learning. More precisely, soft-pool preserves more important features, and that features are further emphasized by spatial and channel attention layers. Following on, this omni-scale with soft-pool attention learning extracts the spatial information from all frames of videos, and later on, the temporal learning is incorporated using the LSTM model. To handle limited labeled data problems, the proposed hybrid model first assigns pseudo-labels to the unlabeled data and adopts a progressive learning strategy to retrain the model on both labeled and unlabeled data with improved feature extraction, i.e., modified omni-scale feature learning. Moreover, the proposed POSNet model is validated on two large video-based person re-identification datasets, namely MARS and DukeMTMC-Video. It is observed from the research findings that the proposed POSNet outperformed the existing studies with the highest mAP and rank@1 score of 83.7 and 90.3%, respectively. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Business & Economics > Department of Industrial Security > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Rho, Seungmin photo

Rho, Seungmin
경영경제대학 (산업보안학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE