Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

DDPG Reinforcement Learning Experiment for Improving the Stability of Bipedal Walking of Humanoid Robots

Full metadata record
DC Field Value Language
dc.contributor.authorChun, Yeonghun-
dc.contributor.authorChoi, Junghun-
dc.contributor.authorMin, Injoon-
dc.contributor.authorAhn, Minsung-
dc.contributor.authorHan, Jeakweon-
dc.date.accessioned2023-05-03T09:32:03Z-
dc.date.available2023-05-03T09:32:03Z-
dc.date.issued2023-02-
dc.identifier.issn0000-0000-
dc.identifier.urihttps://scholarworks.bwise.kr/erica/handle/2021.sw.erica/112517-
dc.description.abstractTo improve the stability of bipedal walking of humanoid robots, we developed a method of setting trajectory parameters using reinforcement learning on a treadmill like testbed in a real-world environment. A deep deterministic policy gradient (DDPG) was used as the reinforcement learning algorithm. By improving the reward using a zero moment point (ZMP), the optimum value of walking stability and walking speed was determined. The robot was designed to measure the ZMP and mount weights on the upper body. In addition, a treadmill was manufactured to operate at the same speed as the walking speed of the robot. Reinforcement learning was divided into unweighted cases and cases with a weight of 1kg. At approximately 100 min, 300 episodes were performed, and reward improvements of 16.71% and 26.25% reward improvements were made. The ZMP measurements indicated that bipedal walking was performed in a safe area. Therefore, we demonstrated that the biped walking performance of a humanoid robot can be improved by the reinforcement learning of walking speed and ZMP similarity. © 2023 IEEE.-
dc.format.extent7-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleDDPG Reinforcement Learning Experiment for Improving the Stability of Bipedal Walking of Humanoid Robots-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/SII55687.2023.10039306-
dc.identifier.scopusid2-s2.0-85149113531-
dc.identifier.wosid000972217000102-
dc.identifier.bibliographicCitation2023 IEEE/SICE International Symposium on System Integration, SII 2023, pp 1 - 7-
dc.citation.title2023 IEEE/SICE International Symposium on System Integration, SII 2023-
dc.citation.startPage1-
dc.citation.endPage7-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassother-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Interdisciplinary Applications-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordAuthorBipedal Walking-
dc.subject.keywordAuthorHumanoid and Bipedal Locomotion-
dc.subject.keywordAuthorReinforcement Learning-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10039306-
Files in This Item
Go to Link
Appears in
Collections
COLLEGE OF ENGINEERING SCIENCES > DEPARTMENT OF ROBOT ENGINEERING > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Han, Jea kweon photo

Han, Jea kweon
ERICA 공학대학 (DEPARTMENT OF ROBOT ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE