Task-Aware Semantic Map++: Cost-Efficient Task Assignment with Advanced Benchmark
- Authors
- Choi, Daewon; Hwang, Soeun; Oh, Yoonseon
- Issue Date
- Mar-2026
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Semantic scene understanding; mapping; AI-based methods
- Citation
- IEEE Robotics and Automation Letters, v.11, no.3, pp 3278 - 3285
- Pages
- 8
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Robotics and Automation Letters
- Volume
- 11
- Number
- 3
- Start Page
- 3278
- End Page
- 3285
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210895
- DOI
- 10.1109/LRA.2026.3656794
- ISSN
- 2377-3774
2377-3766
- Abstract
- Enabling robots to perform diverse tasks autonomously requires a sophisticated semantic understanding of 3D scenes. However, conventional scene representations, which primarily rely on static attributes like visual information or object labels, have significant limitations in allowing robots to infer context-aware actions. We introduce Task-Aware Semantic Map++ (TASMap++), the framework that overcomes these limitations by constructing a map that assigns appropriate tasks to objects based on their holistic context. While prior work like TASMap pioneered this task-centric approach, it suffered from high computational costs and inaccuracies due to its reliance on single-frame analysis, which often fails to capture an object's complete state. In contrast, TASMap++ resolves these issues with a multi-view synthesis pipeline that integrates multiple perspectives of an object for task assignment, resulting in significantly improved computational efficiency over its predecessor. Furthermore, to overcome biases in the existing TASMap evaluation, we established a reliable benchmark derived from the consensus of 32 participants across 231 cluttered scenes. On this benchmark, TASMap++ demonstrates superior accuracy over baselines. Finally, we introduce context-aware grounding, a paradigm distinct from conventional object grounding that relies on visual and spatial attributes. We present a downstream application of TASMap++ as a method to address this challenge and show experimentally that conventional grounding methods struggle in this setting, whereas TASMap++ is markedly more effective. To confirm these findings, the framework's robustness and practicality were validated through extensive experiments on 3D indoor datasets, including real-world scan datasets.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.