A Controllable Agent by Subgoals in Path Planning Using Goal-Conditioned Reinforcement Learning

Lee, Gyeong Taek; Kim, Kangjin

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lee, Gyeong Taek	-
dc.contributor.author	Kim, Kangjin	-
dc.date.accessioned	2024-03-20T13:30:27Z	-
dc.date.available	2024-03-20T13:30:27Z	-
dc.date.issued	2023-04	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/90765	-
dc.description.abstract	The aim of path planning is to find a path from a starting point to a goal. Most studies, however, have dealt with a single predefined goal. That is, an agent that has completed training cannot reach goals it did not visit during training. In the present study, we propose a novel reinforcement learning (RL) framework in which the agent can reach any subgoal as well as the final goal in path planning. To do this, we use goal-conditioned RL and propose bidirectional memory editing to obtain various bidirectional trajectories of the agent. Bidirectional memory editing can generate various behaviors and subgoals of the agent from limited trajectories. The generated subgoals and behaviors are then used to train the policy network so that the agent can reach any subgoal from any starting point. In addition, we present reward shaping that encourages the agent to take a shorter path to the goal. In the experiments, the agent was able to reach various goals that it had never visited during training. We confirmed that the agent could perform difficult missions, such as a round trip, and that the agent used a shorter route with reward shaping.	-
dc.format.extent	14	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.title	A Controllable Agent by Subgoals in Path Planning Using Goal-Conditioned Reinforcement Learning	-
dc.type	Article	-
dc.identifier.wosid	000970912700001	-
dc.identifier.doi	10.1109/ACCESS.2023.3264264	-
dc.identifier.bibliographicCitation	IEEE ACCESS, v.11, pp 33812 - 33825	-
dc.description.isOpenAccess	Y	-
dc.identifier.scopusid	2-s2.0-85153333561	-
dc.citation.endPage	33825	-
dc.citation.startPage	33812	-
dc.citation.title	IEEE ACCESS	-
dc.citation.volume	11	-
dc.type.docType	Article	-
dc.publisher.location	United States	-
dc.subject.keywordAuthor	Trajectory	-
dc.subject.keywordAuthor	Training	-
dc.subject.keywordAuthor	Behavioral sciences	-
dc.subject.keywordAuthor	Robots	-
dc.subject.keywordAuthor	Reinforcement learning	-
dc.subject.keywordAuthor	Task analysis	-
dc.subject.keywordAuthor	Memory	-
dc.subject.keywordAuthor	Controllable agent	-
dc.subject.keywordAuthor	path planning	-
dc.subject.keywordAuthor	goal-conditioned reinforcement learning	-
dc.subject.keywordAuthor	bidirectional memory editing	-
dc.subject.keywordPlus	MEMORY	-
dc.subject.keywordPlus	ENVIRONMENTS	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Telecommunications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Telecommunications	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
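
The abstract describes generating extra subgoals and behaviors from a limited set of trajectories. The record does not include the authors' algorithm, so the following is only a minimal sketch of one way such an idea could look, under stated assumptions: a hypothetical obstacle-free grid world, four invertible actions, and training samples stored as (state, goal, action) tuples. All names (rollout, reverse_trajectory, relabel_with_subgoals, bidirectional_edit) are illustrative and are not taken from the paper.

```python
from typing import List, Tuple

State = Tuple[int, int]
Transition = Tuple[State, str, State]   # (state, action, next_state)
GoalSample = Tuple[State, State, str]   # (state, goal, action)

# Assumption: a 4-connected grid where every action has an exact inverse.
MOVES = {"up": (0, -1), "down": (0, 1), "left": (-1, 0), "right": (1, 0)}
INVERSE = {"up": "down", "down": "up", "left": "right", "right": "left"}


def rollout(start: State, actions: List[str]) -> List[Transition]:
    """Apply actions on an obstacle-free grid and record each transition."""
    trajectory, state = [], start
    for action in actions:
        dx, dy = MOVES[action]
        next_state = (state[0] + dx, state[1] + dy)
        trajectory.append((state, action, next_state))
        state = next_state
    return trajectory


def reverse_trajectory(trajectory: List[Transition]) -> List[Transition]:
    """Build the return trip: walk the path backwards using inverse actions."""
    return [(nxt, INVERSE[a], s) for (s, a, nxt) in reversed(trajectory)]


def relabel_with_subgoals(trajectory: List[Transition]) -> List[GoalSample]:
    """Treat every state reached at or after step i as a goal for step i."""
    samples = []
    for i, (state, action, _) in enumerate(trajectory):
        for (_, _, goal) in trajectory[i:]:
            samples.append((state, goal, action))
    return samples


def bidirectional_edit(trajectory: List[Transition]) -> List[GoalSample]:
    """Combine relabeled samples from the forward and reversed trajectory."""
    forward = relabel_with_subgoals(trajectory)
    backward = relabel_with_subgoals(reverse_trajectory(trajectory))
    return forward + backward
```

For example, bidirectional_edit(rollout((0, 0), ["right", "right", "down"])) turns a single three-step trajectory into 12 goal-labeled samples, 6 per direction, including samples whose goal is the original starting point. A goal-conditioned policy could then be trained on such samples; how the paper actually trains its policy network and shapes its reward is not specified in this record.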

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles

Related Researcher

Lee, GyeongTaek: Engineering (Department of Mechanical, Smart and Industrial Engineering (Smart Factory Major))
