Train a real-world local path planner in one hour via partially decoupled reinforcement learning and vectorized diversity

Xin, Jinghao; Kim, Jinwoo; Li, Zhi; Li, Ning

doi:10.1016/j.engappai.2024.109726

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Train a real-world local path planner in one hour via partially decoupled reinforcement learning and vectorized diversity

Authors: Xin, Jinghao; Kim, Jinwoo; Li, Zhi; Li, Ning

Issue Date: Feb-2025

Publisher: Pergamon Press Ltd.

Keywords: Deep reinforcement learning; Local path planning; Mobile robot

Citation: Engineering Applications of Artificial Intelligence, v.141, pp 1 - 20

Pages: 20

Indexed: SCIE
SCOPUS

Journal Title: Engineering Applications of Artificial Intelligence

Volume: 141

Start Page: 1

End Page: 20

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207565

DOI: 10.1016/j.engappai.2024.109726

ISSN: 0952-1976
1873-6769

Abstract: Deep Reinforcement Learning (DRL) has exhibited efficacy in resolving the Local Path Planning (LPP) problem. However, its practical application remains significantly constrained due to its limited training efficiency and generalization capability. To address these challenges, we propose a solution termed Color, which includes an Actor-Sharer-Learner (ASL) training framework designed to improve efficiency, and a fast yet diverse simulator named Sparrow aimed at elevating both efficiency and generalization. Specifically, the ASL employs a Vectorized Data Collection (VDC) mode to enhance data collection, decouples the model optimization from data collection to expedite data consumption, and partially connects the two procedures with a Time Feedback Mechanism (TFM) to evade data underuse or overuse. Meanwhile, the Sparrow simulator utilizes a 2-Dimensional (2D) grid-based world, simplified kinematics, matrix operation, and conversion-free data flow to achieve a lightweight design. The lightness facilitates vectorized diversity, allowing for rapid and diversified simulation across numerous copies of the vectorized environments, thereby significantly enhancing both efficiency and generalization capacity. Comprehensive experiments demonstrate that with merely one hour of simulation training, Color achieves impressive arrival rates of 84% and 90% on 32 simulated and 42 real-world LPP scenarios, respectively. The code and video of this paper are accessible on our website.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 건설환경공학과 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Kim, Jinwoo photo

Kim, Jinwoo: COLLEGE OF ENGINEERING (DEPARTMENT OF CIVIL AND ENVIRONMENTAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE