Global and local feature communications with transformers for 3D human pose estimation

No, Changho; Lee, Minsik

doi:10.1038/s41598-025-91426-w

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Global and local feature communications with transformers for 3D human pose estimation

Full metadata record

DC Field	Value	Language
dc.contributor.author	No, Changho	-
dc.contributor.author	Lee, Minsik	-
dc.date.accessioned	2025-03-27T08:00:48Z	-
dc.date.available	2025-03-27T08:00:48Z	-
dc.date.issued	2025-02	-
dc.identifier.issn	2045-2322	-
dc.identifier.issn	2045-2322	-
dc.identifier.uri	https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/122316	-
dc.description.abstract	Recently, spatiotemporal Transformer structures have been widely applied to the problem of 3D human pose estimation, achieving state-of-the-art performance. Many of these approaches consider a single joint in a single frame as a token, and attention is applied over the tokens in either the same frame or the same trajectory. While this structure is effective for calculating correlations between individual joints, it is too restrictive in that global features such as frames or trajectories are not well communicated. In this paper, we propose GaLFormer to resolve this issue. GaLFormer is composed of local and global Transformer blocks, where the former is based on joint tokens as in the previous methods, while the latter, i.e., global mixing Transformer, mixes all joints existing in a specific range of frames to enforce an inductive bias for feature exchange. These two Transformer blocks are alternately repeated in the proposed method to calculate correlations between joints, shapes, and trajectories. Experiments show that our approach achieves superior or at least competitive performance compared to existing methods on the Human 3.6M, MPI-INF-3DHP, and HumanEva datasets. © The Author(s) 2025.	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Nature Research	-
dc.title	Global and local feature communications with transformers for 3D human pose estimation	-
dc.type	Article	-
dc.publisher.location	영국	-
dc.identifier.doi	10.1038/s41598-025-91426-w	-
dc.identifier.scopusid	2-s2.0-85218692682	-
dc.identifier.wosid	001433272600002	-
dc.identifier.bibliographicCitation	Scientific Reports, v.15, no.1	-
dc.citation.title	Scientific Reports	-
dc.citation.volume	15	-
dc.citation.number	1	-
dc.type.docType	Article	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Science & Technology - Other Topics	-
dc.relation.journalWebOfScienceCategory	Multidisciplinary Sciences	-
dc.identifier.url	https://www.nature.com/articles/s41598-025-91426-w	-

Files in This Item: Go to Link

Appears in Collections: COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Lee, Min sik photo

Lee, Min sik: ERICA 공학대학 (SCHOOL OF ELECTRICAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE