Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Boosting Monocular 3D Object Detection With Object-Centric Auxiliary Depth Supervision

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Youngseok-
dc.contributor.authorKim, Sanmin-
dc.contributor.authorSim, Sangmin-
dc.contributor.authorChoi, Jun Won-
dc.contributor.authorKum, Dongsuk-
dc.date.accessioned2023-05-03T10:21:10Z-
dc.date.available2023-05-03T10:21:10Z-
dc.date.created2023-01-05-
dc.date.issued2023-02-
dc.identifier.issn1524-9050-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/185118-
dc.description.abstractRecent advances in monocular 3D detection leverage a depth estimation network explicitly as an intermediate stage of the 3D detection network. Depth map approaches yield more accurate depth to objects than other methods thanks to the depth estimation network trained on a large-scale dataset. However, depth map approaches can be limited by the accuracy of the depth map, and sequentially using two separated networks for depth estimation and 3D detection significantly increases computation cost and inference time. In this work, we propose a method to boost the RGB image-based 3D detector by jointly training the detection network with a depth prediction loss analogous to the depth estimation task. In this way, our 3D detection network can be supervised by more depth supervision from raw LiDAR points, which does not require any human annotation cost, to estimate accurate depth without explicitly predicting the depth map. Our novel object-centric depth prediction loss focuses on depth around foreground objects, which is important for 3D object detection, to leverage pixel-wise depth supervision in an object-centric manner. Our depth regression model is further trained to predict the uncertainty of depth to represent the 3D confidence of objects. To effectively train the 3D detector with raw LiDAR points and to enable end-to-end training, we revisit the regression target of 3D objects and design a network architecture. Extensive experiments on KITTI and nuScenes benchmarks show that our method can significantly boost the monocular image-based 3D detector to outperform depth map approaches while maintaining the real-time inference speed.-
dc.language영어-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleBoosting Monocular 3D Object Detection With Object-Centric Auxiliary Depth Supervision-
dc.typeArticle-
dc.contributor.affiliatedAuthorChoi, Jun Won-
dc.identifier.doi10.1109/TITS.2022.3224082-
dc.identifier.scopusid2-s2.0-85144084129-
dc.identifier.wosid000912789500001-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, v.24, no.2, pp.1801 - 1813-
dc.relation.isPartOfIEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS-
dc.citation.titleIEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS-
dc.citation.volume24-
dc.citation.number2-
dc.citation.startPage1801-
dc.citation.endPage1813-
dc.type.rimsART-
dc.type.docTypeArticle; Early Access-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTransportation-
dc.relation.journalWebOfScienceCategoryEngineering, Civil-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTransportation Science & Technology-
dc.subject.keywordPlusCost benefit analysis-
dc.subject.keywordPlusCost estimating-
dc.subject.keywordPlusDeep learning-
dc.subject.keywordPlusForecasting-
dc.subject.keywordPlusLarge dataset-
dc.subject.keywordPlusNetwork architecture-
dc.subject.keywordPlusObject recognition-
dc.subject.keywordPlusOptical radar-
dc.subject.keywordPlusRegression analysis-
dc.subject.keywordPlusObject detection-
dc.subject.keywordPlus3D object-
dc.subject.keywordPlus3d object detection-
dc.subject.keywordPlusAutonomous driving-
dc.subject.keywordPlusAuxiliary supervision-
dc.subject.keywordPlusDeep learning-
dc.subject.keywordPlusDepth Estimation-
dc.subject.keywordPlusDepthmap-
dc.subject.keywordPlusDetection networks-
dc.subject.keywordPlusMonocular image-
dc.subject.keywordPlusObjects detection-
dc.subject.keywordAuthor3D object detection-
dc.subject.keywordAuthormonocular image-
dc.subject.keywordAuthorauxiliary supervision-
dc.subject.keywordAuthorautonomous driving-
dc.subject.keywordAuthordeep learning-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/9966379-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Jun Won photo

Choi, Jun Won
COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE