Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding

Full metadata record
DC Field Value Language
dc.contributor.authorHwang, Minyoung-
dc.contributor.authorJeong, Jaeyeon-
dc.contributor.authorKim, Minsoo-
dc.contributor.authoroh, yoonseon-
dc.contributor.authorOh, Songhwai-
dc.date.accessioned2023-09-11T01:52:58Z-
dc.date.available2023-09-11T01:52:58Z-
dc.date.issued2023-08-
dc.identifier.issn1063-6919-
dc.identifier.issn2575-7075-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/190385-
dc.description.abstractThe main challenge in vision-and-language navigation (VLN) is how to understand natural-language instructions in an unseen environment. The main limitation of conventional VLN algorithms is that if an action is mistaken, the agent fails to follow the instructions or explores unnecessary regions, leading the agent to an irrecoverable path. To tackle this problem, we propose Meta-Explore, a hierarchical navigation method deploying an exploitation policy to correct misled recent actions. We show that an exploitation policy, which moves the agent toward a well-chosen local goal among unvisited but observable states, outperforms a method which moves the agent to a previously visited state. We also highlight the demand for imagining regretful explorations with semantically meaningful clues. The key to our approach is understanding the object placements around the agent in spectral-domain. Specifically, we present a novel visual representation, called scene object spectrum (SOS), which performs category-wise 2D Fourier transform of detected objects. Combining exploitation policy and SOS features, the agent can correct its path by choosing a promising local goal. We evaluate our method in three VLN benchmarks: R2R, SOON, and REVERIE. Meta-Explore outper-forms other baselines and shows significant generalization performance. In addition, local goal search using the proposed spectral-domain SOS features significantly improves the success rate by 17.1% and SPL by 20. 6% against the state-of-the-art method of the SOON benchmark. Project page: https://rllab-snu.github.io/projects/Meta-Explore/doc.html-
dc.format.extent11-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE-
dc.titleMeta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/CVPR52729.2023.00646-
dc.identifier.scopusid2-s2.0-85218002930-
dc.identifier.wosid001058542607004-
dc.identifier.bibliographicCitationProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 6683 - 6693-
dc.citation.titleProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition-
dc.citation.startPage6683-
dc.citation.endPage6693-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Interdisciplinary Applications-
dc.relation.journalWebOfScienceCategoryComputer Science, Software Engineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.subject.keywordPlusBenchmarking-
dc.subject.keywordPlusElectric grounding-
dc.subject.keywordPlusFourier transforms-
dc.subject.keywordPlusRobotics-
dc.subject.keywordPlusVisual languages-
dc.subject.keywordAuthorRobotics-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10205004-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher oh, yoonseon photo

oh, yoonseon
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE