Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay

Prianto, Evan; Kim, Myeongseop; Park, Jae-Han; Bae, Ji-Hun; Kim, Jung-Su

doi:10.3390/s20205911

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay

Full metadata record

DC Field	Value	Language
dc.contributor.author	Prianto, Evan	-
dc.contributor.author	Kim, Myeongseop	-
dc.contributor.author	Park, Jae-Han	-
dc.contributor.author	Bae, Ji-Hun	-
dc.contributor.author	Kim, Jung-Su	-
dc.date.accessioned	2024-04-09T03:02:21Z	-
dc.date.available	2024-04-09T03:02:21Z	-
dc.date.issued	2020-10	-
dc.identifier.issn	1424-8220	-
dc.identifier.issn	1424-3210	-
dc.identifier.uri	https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/118598	-
dc.description.abstract	Since path planning for multi-arm manipulators is a complicated high-dimensional problem, effective and fast path generation is not easy for the arbitrarily given start and goal locations of the end effector. Especially, when it comes to deep reinforcement learning-based path planning, high-dimensionality makes it difficult for existing reinforcement learning-based methods to have efficient exploration which is crucial for successful training. The recently proposed soft actor–critic (SAC) is well known to have good exploration ability due to the use of the entropy term in the objective function. Motivated by this, in this paper, a SAC-based path planning algorithm is proposed. The hindsight experience replay (HER) is also employed for sample efficiency and configuration space augmentation is used in order to deal with complicated configuration space of the multi-arms. To show the effectiveness of the proposed algorithm, both simulation and experiment results are given. By comparing with existing results, it is demonstrated that the proposed method outperforms the existing results. © 2020 by the authors. Licensee MDPI, Basel, Switzerland.	-
dc.format.extent	22	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Multidisciplinary Digital Publishing Institute (MDPI)	-
dc.title	Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay	-
dc.type	Article	-
dc.publisher.location	스위스	-
dc.identifier.doi	10.3390/s20205911	-
dc.identifier.scopusid	2-s2.0-85093682118	-
dc.identifier.wosid	000585809000001	-
dc.identifier.bibliographicCitation	Sensors, v.20, no.20, pp 1 - 22	-
dc.citation.title	Sensors	-
dc.citation.volume	20	-
dc.citation.number	20	-
dc.citation.startPage	1	-
dc.citation.endPage	22	-
dc.type.docType	정기학술지(Article(Perspective Article포함))	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Chemistry	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Instruments & Instrumentation	-
dc.relation.journalWebOfScienceCategory	Chemistry, Analytical	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Instruments & Instrumentation	-
dc.subject.keywordAuthor	Collision avoidance	-
dc.subject.keywordAuthor	Hindsight Experience Replay (HER)	-
dc.subject.keywordAuthor	Multi-arm manipulators	-
dc.subject.keywordAuthor	Path planning	-
dc.subject.keywordAuthor	Reinforcement learning	-
dc.subject.keywordAuthor	Soft Actor-Critic (SAC)	-
dc.identifier.url	https://www.mdpi.com/1424-8220/20/20/5911	-

Files in This Item: Go to Link

Appears in Collections: COLLEGE OF ENGINEERING SCIENCES > DEPARTMENT OF ROBOT ENGINEERING > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Bae, Ji Hun photo

Bae, Ji Hun: ERICA 공학대학 (DEPARTMENT OF ROBOT ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE