Generating realistic training images from synthetic data for excavator pose estimation
- Authors
- Pham, Hieu T.T.L.; Han, SangUk
- Issue Date
- Nov-2024
- Publisher
- Elsevier BV
- Keywords
- 3D excavator pose estimation; Convolutional neural network; CycleGAN; Synthetic dataset; Vision transformer
- Citation
- Automation in Construction, v.167, pp 1 - 16
- Pages
- 16
- Indexed
- SCIE
SCOPUS
- Journal Title
- Automation in Construction
- Volume
- 167
- Start Page
- 1
- End Page
- 16
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/195286
- DOI
- 10.1016/j.autcon.2024.105718
- ISSN
- 0926-5805
1872-7891
- Abstract
- Computer vision-based 3D pose estimation for automated excavator operation monitoring requires numerous training images annotated with 3D pose labels. Owing to challenges in collecting such datasets in a field setting, using synthetic images from virtual environments has emerged recently. However, synthetic images lack the realism inherent in onsite images, potentially impacting pose estimation performance on real images. This paper thus proposes a generative model for generating realistic training excavator images with multiple backgrounds. The evaluation was conducted by comparing estimation models trained on synthetic images (Model #1), generated excavator images with single background (Model #2), and generated excavator images with multiple backgrounds (Model #3). Model #3 exhibited the lowest mean angular error of 5.96° on real data, implying its superiority in generalizing real patterns. The proposed model facilitates data acquisition for improving pose estimation without manual annotation, providing rich information on excavator movements for proactive safety and productivity management.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 건설환경공학과 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.