Detailed Information

Task Planning for Long-Horizon Cooking Tasks Based on Large Language Models

Full metadata record
DC Field | Value | Language
dc.contributor.author | Shin, Jungkyoo | -
dc.contributor.author | Han, Jieun | -
dc.contributor.author | Kim, SeungJun | -
dc.contributor.author | Oh, Yoonseon | -
dc.contributor.author | Kim, Eunwoo | -
dc.date.accessioned | 2025-02-20T08:00:10Z | -
dc.date.available | 2025-02-20T08:00:10Z | -
dc.date.issued | 2024-10 | -
dc.identifier.issn | 2153-0858 | -
dc.identifier.issn | 2153-0866 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206514 | -
dc.description.abstract | In the field of robot manipulation, learnable task planners are gaining attention, especially for long-horizon tasks such as cooking. However, existing methods that predominantly rely on symbolic representations suffer from limitations in generalization capabilities, particularly in handling unseen objects. Given that objects may vary in real-world environments, this limitation may constrain their practical applicability. To address this issue, we propose a novel task-planning framework that leverages a pretrained large language model (LLM) for environmental interpretation. Our proposed framework extracts semantic features directly from textual data, enabling the planner to accommodate unfamiliar objects. We further incorporate a transformer-based encoder-decoder framework to understand environmental attributes derived from the language model and generate sequential predictions in line with object-oriented subgoals. To validate the effectiveness of our model, we utilize a dataset focused on cooking recipes. Going a step further, we propose a method that automatically generates object-oriented data from natural language description using recurrent LLM, enhancing the framework to manage previously unseen targets as well. Our framework shows an average success rate of 95% when validated with test sets that involve unseen objects. By providing the automatically generated dataset to the framework, we achieve a significant 27% increase in success rate on unknown target recipes. We also provide evidence of the real-world viability of our planner by successfully deploying it on a robot platform. | -
dc.format.extent | 7 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | -
dc.title | Task Planning for Long-Horizon Cooking Tasks Based on Large Language Models | -
dc.type | Article | -
dc.publisher.location | United States | -
dc.identifier.doi | 10.1109/IROS58592.2024.10801687 | -
dc.identifier.scopusid | 2-s2.0-85216490298 | -
dc.identifier.wosid | 001433985300779 | -
dc.identifier.bibliographicCitation | IEEE International Conference on Intelligent Robots and Systems, pp 13613 - 13619 | -
dc.citation.title | IEEE International Conference on Intelligent Robots and Systems | -
dc.citation.startPage | 13613 | -
dc.citation.endPage | 13619 | -
dc.type.docType | Proceedings Paper | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Automation & Control Systems | -
dc.relation.journalResearchArea | Robotics | -
dc.relation.journalWebOfScienceCategory | Automation & Control Systems | -
dc.relation.journalWebOfScienceCategory | Robotics | -
dc.identifier.url | https://ieeexplore.ieee.org/document/10801687 | -
Appears in Collections
College of Engineering (Seoul) > School of Electronic Engineering (Seoul) > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

oh, yoonseon
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)