DDPG Reinforcement Learning Experiment for Improving the Stability of Bipedal Walking of Humanoid Robots
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chun, Yeonghun | - |
dc.contributor.author | Choi, Junghun | - |
dc.contributor.author | Min, Injoon | - |
dc.contributor.author | Ahn, Minsung | - |
dc.contributor.author | Han, Jeakweon | - |
dc.date.accessioned | 2023-05-03T09:32:03Z | - |
dc.date.available | 2023-05-03T09:32:03Z | - |
dc.date.issued | 2023-02 | - |
dc.identifier.issn | 0000-0000 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/112517 | - |
dc.description.abstract | To improve the stability of bipedal walking of humanoid robots, we developed a method of setting trajectory parameters using reinforcement learning on a treadmill like testbed in a real-world environment. A deep deterministic policy gradient (DDPG) was used as the reinforcement learning algorithm. By improving the reward using a zero moment point (ZMP), the optimum value of walking stability and walking speed was determined. The robot was designed to measure the ZMP and mount weights on the upper body. In addition, a treadmill was manufactured to operate at the same speed as the walking speed of the robot. Reinforcement learning was divided into unweighted cases and cases with a weight of 1kg. At approximately 100 min, 300 episodes were performed, and reward improvements of 16.71% and 26.25% reward improvements were made. The ZMP measurements indicated that bipedal walking was performed in a safe area. Therefore, we demonstrated that the biped walking performance of a humanoid robot can be improved by the reinforcement learning of walking speed and ZMP similarity. © 2023 IEEE. | - |
dc.format.extent | 7 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | DDPG Reinforcement Learning Experiment for Improving the Stability of Bipedal Walking of Humanoid Robots | - |
dc.type | Article | - |
dc.publisher.location | 미국 | - |
dc.identifier.doi | 10.1109/SII55687.2023.10039306 | - |
dc.identifier.scopusid | 2-s2.0-85149113531 | - |
dc.identifier.wosid | 000972217000102 | - |
dc.identifier.bibliographicCitation | 2023 IEEE/SICE International Symposium on System Integration, SII 2023, pp 1 - 7 | - |
dc.citation.title | 2023 IEEE/SICE International Symposium on System Integration, SII 2023 | - |
dc.citation.startPage | 1 | - |
dc.citation.endPage | 7 | - |
dc.type.docType | Proceedings Paper | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | other | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.subject.keywordAuthor | Bipedal Walking | - |
dc.subject.keywordAuthor | Humanoid and Bipedal Locomotion | - |
dc.subject.keywordAuthor | Reinforcement Learning | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/10039306 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.