Global and local feature communications with transformers for 3D human pose estimation
DC Field | Value | Language |
---|---|---|
dc.contributor.author | No, Changho | - |
dc.contributor.author | Lee, Minsik | - |
dc.date.accessioned | 2025-03-27T08:00:48Z | - |
dc.date.available | 2025-03-27T08:00:48Z | - |
dc.date.issued | 2025-02 | - |
dc.identifier.issn | 2045-2322 | - |
dc.identifier.issn | 2045-2322 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/122316 | - |
dc.description.abstract | Recently, spatiotemporal Transformer structures have been widely applied to the problem of 3D human pose estimation, achieving state-of-the-art performance. Many of these approaches consider a single joint in a single frame as a token, and attention is applied over the tokens in either the same frame or the same trajectory. While this structure is effective for calculating correlations between individual joints, it is too restrictive in that global features such as frames or trajectories are not well communicated. In this paper, we propose GaLFormer to resolve this issue. GaLFormer is composed of local and global Transformer blocks, where the former is based on joint tokens as in the previous methods, while the latter, i.e., global mixing Transformer, mixes all joints existing in a specific range of frames to enforce an inductive bias for feature exchange. These two Transformer blocks are alternately repeated in the proposed method to calculate correlations between joints, shapes, and trajectories. Experiments show that our approach achieves superior or at least competitive performance compared to existing methods on the Human 3.6M, MPI-INF-3DHP, and HumanEva datasets. © The Author(s) 2025. | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Nature Research | - |
dc.title | Global and local feature communications with transformers for 3D human pose estimation | - |
dc.type | Article | - |
dc.publisher.location | 영국 | - |
dc.identifier.doi | 10.1038/s41598-025-91426-w | - |
dc.identifier.scopusid | 2-s2.0-85218692682 | - |
dc.identifier.wosid | 001433272600002 | - |
dc.identifier.bibliographicCitation | Scientific Reports, v.15, no.1 | - |
dc.citation.title | Scientific Reports | - |
dc.citation.volume | 15 | - |
dc.citation.number | 1 | - |
dc.type.docType | Article | - |
dc.description.isOpenAccess | Y | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Science & Technology - Other Topics | - |
dc.relation.journalWebOfScienceCategory | Multidisciplinary Sciences | - |
dc.identifier.url | https://www.nature.com/articles/s41598-025-91426-w | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.