A Controllable Agent by Subgoals in Path Planning Using Goal-Conditioned Reinforcement Learning
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Gyeong Taek | - |
dc.contributor.author | Kim, Kangjin | - |
dc.date.accessioned | 2024-03-20T13:30:27Z | - |
dc.date.available | 2024-03-20T13:30:27Z | - |
dc.date.issued | 2023-04 | - |
dc.identifier.issn | 2169-3536 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/90765 | - |
dc.description.abstract | The aim of path planning is to search for a path from the starting point to the goal. Numerous studies, however, have dealt with a single predefined goal. That is, an agent who has completed learning cannot reach other goals that have not been visited in the training. In the present study, we propose a novel reinforcement learning (RL) framework for an agent reachable to any subgoal as well as the final goal in path planning. To do this, we utilize goal-conditioned RL and propose bidirectional memory editing to obtain various bidirectional trajectories of the agent. Bidirectional memory editing can generate various behavior and subgoals of the agent from the limited trajectory. Then, the generated subgoals and behaviors of the agent are trained on the policy network so that the agent can reach any subgoals from any starting point. In addition, we present reward shaping for the short path of the agent to reach the goal. In the experimental result, the agent was able to reach the various goals that had never been visited by the agent during the training. We confirmed that the agent could perform difficult missions, such as a round trip, and the agent used the shorter route with reward shaping. | - |
dc.format.extent | 14 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | A Controllable Agent by Subgoals in Path Planning Using Goal-Conditioned Reinforcement Learning | - |
dc.type | Article | - |
dc.identifier.wosid | 000970912700001 | - |
dc.identifier.doi | 10.1109/ACCESS.2023.3264264 | - |
dc.identifier.bibliographicCitation | IEEE ACCESS, v.11, pp 33812 - 33825 | - |
dc.description.isOpenAccess | Y | - |
dc.identifier.scopusid | 2-s2.0-85153333561 | - |
dc.citation.endPage | 33825 | - |
dc.citation.startPage | 33812 | - |
dc.citation.title | IEEE ACCESS | - |
dc.citation.volume | 11 | - |
dc.type.docType | Article | - |
dc.publisher.location | 미국 | - |
dc.subject.keywordAuthor | Trajectory | - |
dc.subject.keywordAuthor | Training | - |
dc.subject.keywordAuthor | Behavioral sciences | - |
dc.subject.keywordAuthor | Robots | - |
dc.subject.keywordAuthor | Reinforcement learning | - |
dc.subject.keywordAuthor | Task analysis | - |
dc.subject.keywordAuthor | Memory | - |
dc.subject.keywordAuthor | Controllable agent | - |
dc.subject.keywordAuthor | path planning | - |
dc.subject.keywordAuthor | goal-conditioned reinforcement learning | - |
dc.subject.keywordAuthor | bidirectional memory editing | - |
dc.subject.keywordPlus | MEMORY | - |
dc.subject.keywordPlus | ENVIRONMENTS | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114
COPYRIGHT 2020 Gachon University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.