Voxel-Based Scene Representation for Camera Pose Estimation of a Single RGB Image
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Sangyoon | - |
dc.contributor.author | Hong, Hyunki | - |
dc.contributor.author | Eem, Changkyoung | - |
dc.date.accessioned | 2021-07-15T02:40:09Z | - |
dc.date.available | 2021-07-15T02:40:09Z | - |
dc.date.issued | 2020-12 | - |
dc.identifier.issn | 2076-3417 | - |
dc.identifier.issn | 2076-3417 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/47534 | - |
dc.description.abstract | Deep learning has been utilized in end-to-end camera pose estimation. To improve the performance, we introduce a camera pose estimation method based on a 2D-3D matching scheme with two convolutional neural networks (CNNs). The scene is divided into voxels, whose size and number are computed according to the scene volume and the number of 3D points. We extract inlier points from the 3D point set in a voxel using random sample consensus (RANSAC)-based plane fitting to obtain a set of interest points consisting of a major plane. These points are subsequently reprojected onto the image using the ground truth camera pose, following which a polygonal region is identified in each voxel using the convex hull. We designed a training dataset for 2D-3D matching, consisting of inlier 3D points, correspondence across image pairs, and the voxel regions in the image. We trained the hierarchical learning structure with two CNNs on the dataset architecture to detect the voxel regions and obtain the location/description of the interest points. Following successful 2D-3D matching, the camera pose was estimated using n-point pose solver in RANSAC. The experiment results show that our method can estimate the camera pose more precisely than previous end-to-end estimators. | - |
dc.format.extent | 15 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | MDPI | - |
dc.title | Voxel-Based Scene Representation for Camera Pose Estimation of a Single RGB Image | - |
dc.type | Article | - |
dc.identifier.doi | 10.3390/app10248866 | - |
dc.identifier.bibliographicCitation | APPLIED SCIENCES-BASEL, v.10, no.24, pp 1 - 15 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.wosid | 000602754300001 | - |
dc.identifier.scopusid | 2-s2.0-85098508645 | - |
dc.citation.endPage | 15 | - |
dc.citation.number | 24 | - |
dc.citation.startPage | 1 | - |
dc.citation.title | APPLIED SCIENCES-BASEL | - |
dc.citation.volume | 10 | - |
dc.type.docType | Article | - |
dc.publisher.location | 스위스 | - |
dc.subject.keywordAuthor | camera pose estimation | - |
dc.subject.keywordAuthor | deep learning | - |
dc.subject.keywordAuthor | convolutional neural network | - |
dc.subject.keywordAuthor | voxels | - |
dc.subject.keywordAuthor | interest points detection and description | - |
dc.relation.journalResearchArea | Chemistry | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Materials Science | - |
dc.relation.journalResearchArea | Physics | - |
dc.relation.journalWebOfScienceCategory | Chemistry, Multidisciplinary | - |
dc.relation.journalWebOfScienceCategory | Engineering, Multidisciplinary | - |
dc.relation.journalWebOfScienceCategory | Materials Science, Multidisciplinary | - |
dc.relation.journalWebOfScienceCategory | Physics, Applied | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
84, Heukseok-ro, Dongjak-gu, Seoul, Republic of Korea (06974)02-820-6194
COPYRIGHT 2019 Chung-Ang University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.