POSNet: a hybrid deep learning model for efficient person re-identification

Batool, E.; Gillani, S.; Naz, S.; Bukhari, M.; Maqsood, M.; Yeo, S.-S.; Rho, Seungmin

doi:10.1007/s11227-023-05169-4


Authors: Batool, E.; Gillani, S.; Naz, S.; Bukhari, M.; Maqsood, M.; Yeo, S.-S.; Rho, Seungmin

Issue Date: Aug-2023

Publisher: Springer

Keywords: Hybrid deep learning; Intra- and inter-class variations; Limited labeled data challenges; LSTM; Person re-identification; POSNet; Soft-pool-assisted attentions; Spatio-temporal feature learning

Citation: Journal of Supercomputing, v.79, no.12, pp 13090 - 13118

Pages: 29

Journal Title: Journal of Supercomputing

Volume: 79

Number: 12

Start Page: 13090

End Page: 13118

URI: https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/69979

DOI: 10.1007/s11227-023-05169-4

ISSN: 0920-8542; 1573-0484

Abstract: Person re-identification is the task of recognizing a person across several non-overlapping cameras, and it is becoming increasingly important in computer vision for real-world surveillance applications. However, deploying person re-identification systems for surveillance raises several performance challenges, including limited labeled data, occlusion, variation in human body posture, and inter- and intra-class variations. These challenges degrade the effectiveness of person re-identification systems and lead to the extraction of less discriminative features. To address these problems, we propose a hybrid deep learning model, POSNet (pseudo-labeled omni-scale network), for efficient person re-identification. The method is called hybrid because it combines label estimation with modified omni-scale feature learning, i.e., spatiotemporal-assisted omni-scale feature extraction. To further enhance omni-scale feature learning, we propose soft-pool-assisted attention mechanisms for spatial learning: soft-pool preserves the more important features, and those features are further emphasized by spatial and channel attention layers. This omni-scale learning with soft-pool attention extracts spatial information from all frames of a video, and temporal learning is then incorporated using an LSTM model. To handle limited labeled data, the hybrid model first assigns pseudo-labels to the unlabeled data and then adopts a progressive learning strategy to retrain the model on both labeled and unlabeled data with the improved feature extraction, i.e., modified omni-scale feature learning. The proposed POSNet model is validated on two large video-based person re-identification datasets, MARS and DukeMTMC-Video. The results show that POSNet outperforms existing studies, with the highest mAP and rank@1 scores of 83.7% and 90.3%, respectively. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
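
The abstract says that soft-pool "preserves more important features" but does not give a formula. As a rough illustration only, the sketch below assumes the commonly used soft-pooling definition, in which each pooling window is reduced to a softmax-weighted sum of its activations, so that larger activations dominate without the smaller ones being discarded. The function names `soft_pool_window` and `soft_pool_2d`, the non-overlapping 2 x 2 window, and the pure-Python list representation are assumptions made for this example; this is not the authors' implementation.

```python
import math
from typing import List


def soft_pool_window(values: List[float]) -> float:
    """Reduce one pooling window to a softmax-weighted sum of its activations."""
    m = max(values)  # subtract the max before exponentiating, for numerical stability
    weights = [math.exp(v - m) for v in values]
    total = sum(weights)
    return sum(w * v for w, v in zip(weights, values)) / total


def soft_pool_2d(fmap: List[List[float]], k: int = 2) -> List[List[float]]:
    """Apply non-overlapping k x k soft pooling to a 2D feature map (hypothetical helper)."""
    rows, cols = len(fmap), len(fmap[0])
    out = []
    for r in range(0, rows - k + 1, k):
        out_row = []
        for c in range(0, cols - k + 1, k):
            # Flatten the k x k window starting at (r, c).
            window = [fmap[r + i][c + j] for i in range(k) for j in range(k)]
            out_row.append(soft_pool_window(window))
        out.append(out_row)
    return out


if __name__ == "__main__":
    feature_map = [
        [0.0, 0.0, 1.0, 1.0],
        [0.0, 4.0, 1.0, 1.0],
        [2.0, 2.0, 0.5, 0.1],
        [2.0, 2.0, 0.3, 0.9],
    ]
    for row in soft_pool_2d(feature_map, k=2):
        print(row)
```

For a window with one strong activation, the pooled value lies between the window mean (what average pooling would return) and the window maximum (what max pooling would return), which is one way to read the abstract's claim about preserving the more important features.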


Appears in Collections: College of Business & Economics > Department of Industrial Security > 1. Journal Articles

Related Researcher: Rho, Seungmin, College of Business & Economics (Department of Industrial Security)
