Implementation experiments on convolutional neural network training using synthetic images for 3D pose estimation of an excavator on real images
- Authors
- Mahmood, Bilawal; Han, SangUk; Seo, Jongwon
- Issue Date
- Jan-2022
- Publisher
- Elsevier BV
- Keywords
- Excavator; 3D pose estimation; Synthetic dataset; Kinematic constraints; Perspective camera projection
- Citation
- Automation in Construction, v.133, pp 1 - 17
- Pages
- 17
- Indexed
- SCIE
SCOPUS
- Journal Title
- Automation in Construction
- Volume
- 133
- Start Page
- 1
- End Page
- 17
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/139900
- DOI
- 10.1016/j.autcon.2021.103996
- ISSN
- 0926-5805
1872-7891
- Abstract
- Remote and descriptive visualization of spatio-temporal information of excavator activities may increase awareness about jobsite hazards and operational performance in earthwork operations. One of the emerging approaches to collect this information is to extract the 3D pose of an excavator from the video frames using a convolutional neural network (CNN). However, this method requires labeling the training datasets, which are difficult to prepare because of conditions unsuitable for installing the motion capture sensors. This study investigates the performance of a CNN for estimating the 3D pose when trained on a synthetic dataset. In particular, a kinematic constraint is proposed to update the model parameters efficiently during training. The results show that the proposed method estimated the 3D poses of a real excavator with an average pose error of 9.63°. Hence, the proposed data augmentation method could help address the training data issues and improves the learning of real data complexity.
- Files in This Item
-
- Appears in
Collections - 서울 공과대학 > 서울 건설환경공학과 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.