Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

From Language to Grasp: Object Retrieval and Grasping Through Explicit and Implicit Linguistic Commands

Full metadata record
DC Field Value Language
dc.contributor.authorYoon, Dongmin-
dc.contributor.authorCha, Seonghun-
dc.contributor.authorOh, Yoonseon-
dc.date.accessioned2025-02-12T08:00:34Z-
dc.date.available2025-02-12T08:00:34Z-
dc.date.issued2024-10-
dc.identifier.issn1598-7833-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206472-
dc.description.abstractIn human-centered environments, assistive robots are required to understand verbal commands to retrieve and grasp objects within complex scenes. We propose a novel Language Understanding Object Retrieval module (LUOR) by fine-tuning the CLIP text encoder to enhance robot manipulators' understanding of both explicit and implicit natural language commands. A new dataset with 712 verb-object pairs is created for training. This dataset includes 78 verbs associated with 244 ImageNet classes, providing a comprehensive range of scenarios. Additionally, 336 verb-object pairs cover 54 verbs for 138 ObjectNet classes, further expanding the model's applicability. Experimental results demonstrate that LUOR outperforms existing baselines in both accuracy and efficiency, particularly in handling implicit commands. The integrated system with the Multi-Task Detection module (MTD) shows strong performance in real-world robotic applications using a Panda Franka manipulator. These findings confirm the practical applicability of our approach and suggest potential for further improvements in robotic grasping and manipulation tasks.-
dc.format.extent2-
dc.language영어-
dc.language.isoENG-
dc.titleFrom Language to Grasp: Object Retrieval and Grasping Through Explicit and Implicit Linguistic Commands-
dc.typeArticle-
dc.identifier.doi10.23919/ICCAS63016.2024.10773029-
dc.identifier.scopusid2-s2.0-85214363266-
dc.identifier.bibliographicCitationInternational Conference on Control, Automation and Systems, pp 1565 - 1566-
dc.citation.titleInternational Conference on Control, Automation and Systems-
dc.citation.startPage1565-
dc.citation.endPage1566-
dc.type.docTypeConference paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusAdversarial machine learning-
dc.subject.keywordPlusContent based retrieval-
dc.subject.keywordPlusContrastive Learning-
dc.subject.keywordPlusIndustrial robots-
dc.subject.keywordPlusLinguistics-
dc.subject.keywordPlusModular robots-
dc.subject.keywordPlusMulti-task learning-
dc.subject.keywordPlusNatural language processing systems-
dc.subject.keywordPlusObject detection-
dc.subject.keywordPlusObject recognition-
dc.subject.keywordPlusRobot applications-
dc.subject.keywordPlusRobot learning-
dc.subject.keywordAuthorgrasp detection-
dc.subject.keywordAuthormulti-modal learning-
dc.subject.keywordAuthorRobotic object retrieval-
Files in This Item
There are no files associated with this item.
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher oh, yoonseon photo

oh, yoonseon
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE