Train a real-world local path planner in one hour via partially decoupled reinforcement learning and vectorized diversity
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Xin, Jinghao | - |
| dc.contributor.author | Kim, Jinwoo | - |
| dc.contributor.author | Li, Zhi | - |
| dc.contributor.author | Li, Ning | - |
| dc.date.accessioned | 2025-06-13T07:30:27Z | - |
| dc.date.available | 2025-06-13T07:30:27Z | - |
| dc.date.issued | 2025-02 | - |
| dc.identifier.issn | 0952-1976 | - |
| dc.identifier.issn | 1873-6769 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207565 | - |
| dc.description.abstract | Deep Reinforcement Learning (DRL) has exhibited efficacy in resolving the Local Path Planning (LPP) problem. However, its practical application remains significantly constrained due to its limited training efficiency and generalization capability. To address these challenges, we propose a solution termed Color, which includes an Actor-Sharer-Learner (ASL) training framework designed to improve efficiency, and a fast yet diverse simulator named Sparrow aimed at elevating both efficiency and generalization. Specifically, the ASL employs a Vectorized Data Collection (VDC) mode to enhance data collection, decouples the model optimization from data collection to expedite data consumption, and partially connects the two procedures with a Time Feedback Mechanism (TFM) to evade data underuse or overuse. Meanwhile, the Sparrow simulator utilizes a 2-Dimensional (2D) grid-based world, simplified kinematics, matrix operation, and conversion-free data flow to achieve a lightweight design. The lightness facilitates vectorized diversity, allowing for rapid and diversified simulation across numerous copies of the vectorized environments, thereby significantly enhancing both efficiency and generalization capacity. Comprehensive experiments demonstrate that with merely one hour of simulation training, Color achieves impressive arrival rates of 84% and 90% on 32 simulated and 42 real-world LPP scenarios, respectively. The code and video of this paper are accessible on our website. | - |
| dc.format.extent | 20 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | Pergamon Press Ltd. | - |
| dc.title | Train a real-world local path planner in one hour via partially decoupled reinforcement learning and vectorized diversity | - |
| dc.type | Article | - |
| dc.publisher.location | United Kingdom | - |
| dc.identifier.doi | 10.1016/j.engappai.2024.109726 | - |
| dc.identifier.scopusid | 2-s2.0-85211066248 | - |
| dc.identifier.wosid | 001373763500001 | - |
| dc.identifier.bibliographicCitation | Engineering Applications of Artificial Intelligence, v.141, pp. 1-20 | - |
| dc.citation.title | Engineering Applications of Artificial Intelligence | - |
| dc.citation.volume | 141 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 20 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Automation & Control Systems | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalWebOfScienceCategory | Automation & Control Systems | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.subject.keywordPlus | Adversarial machine learning | - |
| dc.subject.keywordPlus | Deep learning | - |
| dc.subject.keywordPlus | Mobile robots | - |
| dc.subject.keywordPlus | Motion planning | - |
| dc.subject.keywordPlus | Reinforcement learning | - |
| dc.subject.keywordPlus | Robot programming | - |
| dc.subject.keywordAuthor | Deep reinforcement learning | - |
| dc.subject.keywordAuthor | Local path planning | - |
| dc.subject.keywordAuthor | Mobile robot | - |
| dc.identifier.url | https://www.sciencedirect.com/science/article/pii/S0952197624018840?via%3Dihub | - |
