Kernel-based actor-critic approach with applications
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 주백석 | - |
dc.contributor.author | 정근우 | - |
dc.contributor.author | 박주영 | - |
dc.date.available | 2020-04-24T13:25:35Z | - |
dc.date.created | 2020-03-31 | - |
dc.date.issued | 2011 | - |
dc.identifier.issn | 1598-2645 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/2721 | - |
dc.description.abstract | Recently, actor-critic methods have drawn significant interest in the area of reinforcement learning, and several algorithms have been studied along the lines of the actor-critic strategy. In this paper, we consider a new type of actor-critic algorithm employing kernel methods, which have recently been shown to be very effective tools in various fields of machine learning, and we investigate combining the actor-critic strategy with kernel methods. More specifically, this paper studies actor-critic algorithms that use kernel-based least-squares estimation and policy gradients; in the critic’s part, a sliding-window-based kernel least-squares method is used, which leads to fast and efficient value-function estimation in a nonparametric setting. The applicability of the considered algorithms is illustrated via a robot locomotion problem and a tunnel ventilation control problem. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | 한국지능시스템학회 | - |
dc.title | Kernel-based actor-critic approach with applications | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | 주백석 | - |
dc.identifier.bibliographicCitation | International Journal of Fuzzy Logic and Intelligent Systems, v.11, no.4, pp.267 - 274 | - |
dc.citation.title | International Journal of Fuzzy Logic and Intelligent Systems | - |
dc.citation.volume | 11 | - |
dc.citation.number | 4 | - |
dc.citation.startPage | 267 | - |
dc.citation.endPage | 274 | - |
dc.type.rims | ART | - |
dc.identifier.kciid | ART001611869 | - |
dc.description.journalClass | 2 | - |
dc.subject.keywordAuthor | reinforcement learning | - |
dc.subject.keywordAuthor | actor-critic algorithm | - |
dc.subject.keywordAuthor | kernel methods | - |
dc.subject.keywordAuthor | least-squares | - |
dc.subject.keywordAuthor | sliding-windows | - |