Robust Learning From Demonstrations With Mixed Qualities Using Leveraged Gaussian Processes
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Choi, Sungjoon | - |
dc.contributor.author | Lee, Kyungjae | - |
dc.contributor.author | Oh, Songhwai | - |
dc.date.accessioned | 2022-11-28T01:57:58Z | - |
dc.date.available | 2022-11-28T01:57:58Z | - |
dc.date.issued | 2019-06 | - |
dc.identifier.issn | 1552-3098 | - |
dc.identifier.issn | 1941-0468 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/59368 | - |
dc.description.abstract | In this paper, we focus on the problem of learning from demonstration (LfD) where demonstrations with different proficiencies are provided without labeling. To this end, we model multiple policies with different qualities as correlated Gaussian processes and present a leverage optimization method that estimates the leverage of each policy where the difference between two leverages defines the correlation between the corresponding policies. To recover a single policy function of an expert, we present a sparsity constraint on the leverage parameters. We first show that the proposed leverage optimization method can recover the correlations between sensory fields where the fields are realized from correlated Gaussian processes and sensor measurements are collected from the fields. Furthermore, we applied the proposed method to autonomous driving experiments, where demonstrations are collected from three different driving modes. While the driving policies are not realized from correlated processes, the proposed method assigns reasonable leverages to the driving demonstrations. The estimated driving policy of an expert, which incorporates the optimized leverages, outperforms previous LfD methods in terms of both safety and driving quality. | - |
dc.format.extent | 13 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | Robust Learning From Demonstrations With Mixed Qualities Using Leveraged Gaussian Processes | - |
dc.type | Article | - |
dc.identifier.doi | 10.1109/TRO.2019.2891173 | - |
dc.identifier.bibliographicCitation | IEEE TRANSACTIONS ON ROBOTICS, v.35, no.3, pp 564 - 576 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.wosid | 000517266900003 | - |
dc.identifier.scopusid | 2-s2.0-85067108220 | - |
dc.citation.endPage | 576 | - |
dc.citation.number | 3 | - |
dc.citation.startPage | 564 | - |
dc.citation.title | IEEE TRANSACTIONS ON ROBOTICS | - |
dc.citation.volume | 35 | - |
dc.type.docType | Article | - |
dc.publisher.location | 미국 | - |
dc.subject.keywordAuthor | Autonomous navigation | - |
dc.subject.keywordAuthor | learning from demonstration (LfD) | - |
dc.subject.keywordAuthor | leveraged Gaussian processes (LGPs) | - |
dc.subject.keywordAuthor | robust estimation | - |
dc.subject.keywordPlus | ROBOT | - |
dc.subject.keywordPlus | NAVIGATION | - |
dc.relation.journalResearchArea | Robotics | - |
dc.relation.journalWebOfScienceCategory | Robotics | - |
dc.description.journalRegisteredClass | sci | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
84, Heukseok-ro, Dongjak-gu, Seoul, Republic of Korea (06974)02-820-6194
COPYRIGHT 2019 Chung-Ang University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.