DDPG Reinforcement Learning Experiment for Improving the Stability of Bipedal Walking of Humanoid Robots

Chun, Yeonghun; Choi, Junghun; Min, Injoon; Ahn, Minsung; Han, Jeakweon

doi:10.1109/SII55687.2023.10039306

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

DDPG Reinforcement Learning Experiment for Improving the Stability of Bipedal Walking of Humanoid Robots

Full metadata record

DC Field	Value	Language
dc.contributor.author	Chun, Yeonghun	-
dc.contributor.author	Choi, Junghun	-
dc.contributor.author	Min, Injoon	-
dc.contributor.author	Ahn, Minsung	-
dc.contributor.author	Han, Jeakweon	-
dc.date.accessioned	2023-05-03T09:32:03Z	-
dc.date.available	2023-05-03T09:32:03Z	-
dc.date.issued	2023-02	-
dc.identifier.issn	0000-0000	-
dc.identifier.uri	https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/112517	-
dc.description.abstract	To improve the stability of bipedal walking of humanoid robots, we developed a method of setting trajectory parameters using reinforcement learning on a treadmill like testbed in a real-world environment. A deep deterministic policy gradient (DDPG) was used as the reinforcement learning algorithm. By improving the reward using a zero moment point (ZMP), the optimum value of walking stability and walking speed was determined. The robot was designed to measure the ZMP and mount weights on the upper body. In addition, a treadmill was manufactured to operate at the same speed as the walking speed of the robot. Reinforcement learning was divided into unweighted cases and cases with a weight of 1kg. At approximately 100 min, 300 episodes were performed, and reward improvements of 16.71% and 26.25% reward improvements were made. The ZMP measurements indicated that bipedal walking was performed in a safe area. Therefore, we demonstrated that the biped walking performance of a humanoid robot can be improved by the reinforcement learning of walking speed and ZMP similarity. © 2023 IEEE.	-
dc.format.extent	7	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	DDPG Reinforcement Learning Experiment for Improving the Stability of Bipedal Walking of Humanoid Robots	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/SII55687.2023.10039306	-
dc.identifier.scopusid	2-s2.0-85149113531	-
dc.identifier.wosid	000972217000102	-
dc.identifier.bibliographicCitation	2023 IEEE/SICE International Symposium on System Integration, SII 2023, pp 1 - 7	-
dc.citation.title	2023 IEEE/SICE International Symposium on System Integration, SII 2023	-
dc.citation.startPage	1	-
dc.citation.endPage	7	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	other	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalWebOfScienceCategory	Computer Science, Interdisciplinary Applications	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.subject.keywordAuthor	Bipedal Walking	-
dc.subject.keywordAuthor	Humanoid and Bipedal Locomotion	-
dc.subject.keywordAuthor	Reinforcement Learning	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10039306	-

Files in This Item: Go to Link

Appears in Collections: COLLEGE OF ENGINEERING SCIENCES > DEPARTMENT OF ROBOT ENGINEERING > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Han, Jea kweon photo

Han, Jea kweon: ERICA 공학대학 (DEPARTMENT OF ROBOT ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE