Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Kernel-based actor-critic approach with applications

Full metadata record
DC Field Value Language
dc.contributor.author주백석-
dc.contributor.author정근우-
dc.contributor.author박주영-
dc.date.available2020-04-24T13:25:35Z-
dc.date.created2020-03-31-
dc.date.issued2011-
dc.identifier.issn1598-2645-
dc.identifier.urihttps://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/2721-
dc.description.abstractRecently, actor-critic methods have drawn significant interests in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy. In this paper, we consider a new type of actor-critic algorithms employing the kernel methods, which have recently shown to be very effective tools in the various fields of machine learning, and have performed investigations on combining the actor-critic strategy together with kernel methods. More specifically, this paper studies actor-critic algorithms utilizing the kernel-based least-squares estimation and policy gradient, and in its critic’s part, the study uses a sliding-window-based kernel least-squares method, which leads to a fast and efficient value-function-estimation in a nonparametric setting. The applicability of the considered algorithms is illustrated via a robot locomotion problem and a tunnel ventilation control problem.-
dc.language영어-
dc.language.isoen-
dc.publisher한국지능시스템학회-
dc.titleKernel-based actor-critic approach with applications-
dc.typeArticle-
dc.contributor.affiliatedAuthor주백석-
dc.identifier.bibliographicCitationInternational Journal of Fuzzy Logic and Intelligent systems, v.11, no.4, pp.267 - 274-
dc.citation.titleInternational Journal of Fuzzy Logic and Intelligent systems-
dc.citation.volume11-
dc.citation.number4-
dc.citation.startPage267-
dc.citation.endPage274-
dc.type.rimsART-
dc.identifier.kciidART001611869-
dc.description.journalClass2-
dc.subject.keywordAuthorreinforcement learning-
dc.subject.keywordAuthoractor-critic algorithm-
dc.subject.keywordAuthorkernel methods-
dc.subject.keywordAuthorleast-squares-
dc.subject.keywordAuthorsliding-windows-
Files in This Item
There are no files associated with this item.
Appears in
Collections
School of Mechanical System Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher CHU, BAEK SUK photo

CHU, BAEK SUK
College of Engineering (School of Mechanical System Engineering)
Read more

Altmetrics

Total Views & Downloads

BROWSE