Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Learning to Walk a Tripod Mobile Robot Using Nonlinear Soft Vibration Actuators With Entropy Adaptive Reinforcement Learning

Authors
Kim, Jae InHong, MineuiLee, KyungjaeKim, DongWookPark, Yong-LaeOh, Songhwai
Issue Date
Apr-2020
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Keywords
Modeling; control; and learning for soft robots; hydraulic; pneumatic actuators; motion and path planning
Citation
IEEE ROBOTICS AND AUTOMATION LETTERS, v.5, no.2, pp 2317 - 2324
Pages
8
Journal Title
IEEE ROBOTICS AND AUTOMATION LETTERS
Volume
5
Number
2
Start Page
2317
End Page
2324
URI
https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/59366
DOI
10.1109/LRA.2020.2970945
ISSN
2377-3766
Abstract
Soft mobile robots have shown great potential in unstructured and confined environments by taking advantage of their excellent adaptability and high dexterity. However, there are several issues to be addressed, such as actuating speeds and controllability, in soft robots. In this letter, a new vibration actuator is proposed using the nonlinear stiffness characteristic of a hyperelastic material, which creates continuous vibration of the actuator. By integrating three proposed actuators, we also present an advanced soft mobile robot with high degrees of freedom of movement. However, since the dynamic model of the soft mobile robot is generally hard to obtain(intractable), it is difficult to design a controller for the robot. In this regard, we present a method to train a controller, using a novel reinforcement learning (RL) algorithm called adaptive soft actor-critic (ASAC). ASAC gradually reduces a parameter called an entropy temperature, which regulates the entropy of the control policy. In this way, the proposed method can narrow down the search space during training, and reduce the duration of demanding data collection processes in real-world experiments. For the verification of the robustness and the controllability of our robot and the RL algorithm, experiments for zig-zagging path tracking and obstacle avoidance were conducted, and the robot successfully finished the missions with only an hour of training time.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Software > Department of Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Kyungjae photo

Lee, Kyungjae
소프트웨어대학 (AI학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE