Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Prianto, Evan | - |
dc.contributor.author | Kim, Myeongseop | - |
dc.contributor.author | Park, Jae-Han | - |
dc.contributor.author | Bae, Ji-Hun | - |
dc.contributor.author | Kim, Jung-Su | - |
dc.date.accessioned | 2024-04-09T03:02:21Z | - |
dc.date.available | 2024-04-09T03:02:21Z | - |
dc.date.issued | 2020-10 | - |
dc.identifier.issn | 1424-8220 | - |
dc.identifier.issn | 1424-3210 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/118598 | - |
dc.description.abstract | Since path planning for multi-arm manipulators is a complicated high-dimensional problem, effective and fast path generation is not easy for the arbitrarily given start and goal locations of the end effector. Especially, when it comes to deep reinforcement learning-based path planning, high-dimensionality makes it difficult for existing reinforcement learning-based methods to have efficient exploration which is crucial for successful training. The recently proposed soft actor–critic (SAC) is well known to have good exploration ability due to the use of the entropy term in the objective function. Motivated by this, in this paper, a SAC-based path planning algorithm is proposed. The hindsight experience replay (HER) is also employed for sample efficiency and configuration space augmentation is used in order to deal with complicated configuration space of the multi-arms. To show the effectiveness of the proposed algorithm, both simulation and experiment results are given. By comparing with existing results, it is demonstrated that the proposed method outperforms the existing results. © 2020 by the authors. Licensee MDPI, Basel, Switzerland. | - |
dc.format.extent | 22 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Multidisciplinary Digital Publishing Institute (MDPI) | - |
dc.title | Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay | - |
dc.type | Article | - |
dc.publisher.location | 스위스 | - |
dc.identifier.doi | 10.3390/s20205911 | - |
dc.identifier.scopusid | 2-s2.0-85093682118 | - |
dc.identifier.wosid | 000585809000001 | - |
dc.identifier.bibliographicCitation | Sensors, v.20, no.20, pp 1 - 22 | - |
dc.citation.title | Sensors | - |
dc.citation.volume | 20 | - |
dc.citation.number | 20 | - |
dc.citation.startPage | 1 | - |
dc.citation.endPage | 22 | - |
dc.type.docType | 정기학술지(Article(Perspective Article포함)) | - |
dc.description.isOpenAccess | Y | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Chemistry | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Instruments & Instrumentation | - |
dc.relation.journalWebOfScienceCategory | Chemistry, Analytical | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Instruments & Instrumentation | - |
dc.subject.keywordAuthor | Collision avoidance | - |
dc.subject.keywordAuthor | Hindsight Experience Replay (HER) | - |
dc.subject.keywordAuthor | Multi-arm manipulators | - |
dc.subject.keywordAuthor | Path planning | - |
dc.subject.keywordAuthor | Reinforcement learning | - |
dc.subject.keywordAuthor | Soft Actor-Critic (SAC) | - |
dc.identifier.url | https://www.mdpi.com/1424-8220/20/20/5911 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.