Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Deep reinforcement learning for cooperative robots based on adaptive sentiment feedback

Authors
Jeon, HaeinKim, Dae-WonKang, Bo-Yeong
Issue Date
Jun-2024
Publisher
Elsevier Ltd
Keywords
Deep reinforcement learning; Human-in-the-loop; Human–robot interaction; Interactive reinforcement learning; Reward shaping
Citation
Expert Systems with Applications, v.243
Journal Title
Expert Systems with Applications
Volume
243
URI
https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/71331
DOI
10.1016/j.eswa.2023.121198
ISSN
0957-4174
1873-6793
Abstract
Human–robot cooperative tasks have gained importance with the emergence of robotics and artificial intelligence technology. In interactive reinforcement learning techniques, robots learn target tasks by receiving feedback from an experienced human trainer. However, most interactive reinforcement learning studies require a separate process to integrate the trainer's feedback into the training dataset, making it challenging for robots to learn new tasks from humans in real-time. Furthermore, the types of feedback sentences that trainers can use are limited in previous research. To address these limitations, this paper proposes a robot teaching strategy that uses deep RL via human–robot interaction to learn table balancing tasks interactively. The proposed system employs Deep Q-Network with real-time sentiment feedback delivered through the trainer's speech to learn cooperative tasks. We designed a novel reward function that incorporates sentiment feedback from human speech in real-time during the learning process. The paper presents an improved reward shaping technique based on subdivided feedback levels and shrinking feedback. This function serves as a guide for the robot to engage in natural interactions with humans and enables it to learn the tasks effectively. Experimental results demonstrate that the proposed interactive deep reinforcement learning model achieved a high success rate of up to 99.06%, outperforming the model without sentiment feedback. © 2023
Files in This Item
Appears in
Collections
College of Software > School of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Dae-Won photo

Kim, Dae-Won
소프트웨어대학 (소프트웨어학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE