A Large-scale 3D Object Dataset for 6-DoF Pose Estimation

장재훈; 김준용; 김성흠

doi:10.5302/J.ICROS.2023.23.0141

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

A Large-scale 3D Object Dataset for 6-DoF Pose Estimation6자유도 자세 추정을 위한 대용량 3D 객체 데이터 구축

Other Titles: 6자유도 자세 추정을 위한 대용량 3D 객체 데이터 구축

Authors: 장재훈; 김준용; 김성흠

Issue Date: Dec-2023

Publisher: 제어·로봇·시스템학회

Keywords: large-scale object dataset; monocular 3D object detection; 6-DoF object pose estimation; .

Citation: 제어.로봇.시스템학회 논문지, v.29, no.12, pp 1008 - 1014

Pages: 7

Journal Title: 제어.로봇.시스템학회 논문지

Volume: 29

Number: 12

Start Page: 1008

End Page: 1014

URI: https://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/49059

DOI: 10.5302/J.ICROS.2023.23.0141

ISSN: 1976-5622
2233-4335

Abstract: Given the growing necessity of substantial human annotations in deep learning systems to enhance functionality and performance, it is imperative for researchers to scrutinize existing databases and develop their own datasets with custom labels, particularly for target applications such as object detection and pose estimation. This study introduces a large-scale 3D object dataset tailored for six degrees of freedom pose estimation in real-world scenarios. We describe the key features of our datasets available in the AI hub, emphasizing the expansive 3D object collection. Our methodology involves establishing a correspondence between eight points of an object cube in a 2D image, with the object’s pose determined using the conventional perspective-n-point (PnP) algorithm. To analyze the reprojection error, we employed a high-quality 3D mesh model and a binary mask of the target object in the RGB image. For database validation, all object categories were tested using a representative YOLO-like convolutional neural network architecture, such as real-time singleshot pose estimation. In addition, we conduct an in-depth analysis of the current database’s limitations. In the AI hub, we meticulously released all information regarding our new database, presenting it in a format consistent with our baseline database, LINEMOD. A comparative analysis against this baseline was conducted. To overcome the scalability concerns associated with unseen object categories, we explored an effective methodology that leverages vision and language knowledge distillation.

Files in This Item: Go to Link

Appears in Collections: ETC > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Kim, Seongheum photo

Kim, Seongheum: College of Information Technology (Department of Smart Systems Software)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :7,951,813; Today View :2,756

RSS_1.0 RSS_2.0 ATOM_1.0

Soongsil University Library 369 Sangdo-Ro, Dongjak-Gu, Seoul, Korea (06978)02-820-0733

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE