Kernel-based actor-critic approach with applications

주백석; 정근우; 박주영

Full metadata record

DC Field	Value	Language
dc.contributor.author	주백석	-
dc.contributor.author	정근우	-
dc.contributor.author	박주영	-
dc.date.available	2020-04-24T13:25:35Z	-
dc.date.created	2020-03-31	-
dc.date.issued	2011	-
dc.identifier.issn	1598-2645	-
dc.identifier.uri	https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/2721	-
dc.description.abstract	Actor-critic methods have recently drawn significant interest in reinforcement learning, and several algorithms have been studied along the lines of the actor-critic strategy. This paper considers a new type of actor-critic algorithm that employs kernel methods, which have recently been shown to be very effective tools in various fields of machine learning, and investigates combining the actor-critic strategy with them. More specifically, the paper studies actor-critic algorithms that use kernel-based least-squares estimation and policy gradient. The critic uses a sliding-window-based kernel least-squares method, which leads to fast and efficient value-function estimation in a nonparametric setting. The applicability of the considered algorithms is illustrated via a robot locomotion problem and a tunnel ventilation control problem.	-
dc.language	English	-
dc.language.iso	en	-
dc.publisher	Korean Institute of Intelligent Systems (한국지능시스템학회)	-
dc.title	Kernel-based actor-critic approach with applications	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	주백석	-
dc.identifier.bibliographicCitation	International Journal of Fuzzy Logic and Intelligent Systems, v.11, no.4, pp.267 - 274	-
dc.citation.title	International Journal of Fuzzy Logic and Intelligent Systems	-
dc.citation.volume	11	-
dc.citation.number	4	-
dc.citation.startPage	267	-
dc.citation.endPage	274	-
dc.type.rims	ART	-
dc.identifier.kciid	ART001611869	-
dc.description.journalClass	2	-
dc.subject.keywordAuthor	reinforcement learning	-
dc.subject.keywordAuthor	actor-critic algorithm	-
dc.subject.keywordAuthor	kernel methods	-
dc.subject.keywordAuthor	least-squares	-
dc.subject.keywordAuthor	sliding-windows	-
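
The abstract describes a critic that estimates the value function with a sliding-window kernel least-squares method. The record does not give the algorithm itself, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a Gaussian kernel, a ridge regularizer, and regression targets such as sampled returns, and it refits from scratch on every update rather than using a fast recursive update. All names (`SlidingWindowKernelLS`, `gaussian_kernel`, `solve_linear`) and parameter values are hypothetical.

```python
import math
from collections import deque


def gaussian_kernel(x, y, width=1.0):
    """Gaussian (RBF) kernel between two state vectors (assumed kernel choice)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * width ** 2))


def solve_linear(A, b):
    """Solve A z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    z = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * z[j] for j in range(i + 1, n))
        z[i] = (M[i][n] - tail) / M[i][i]
    return z


class SlidingWindowKernelLS:
    """Regularized kernel least squares over the most recent `window` samples."""

    def __init__(self, window=20, ridge=1e-3, width=1.0):
        # A bounded deque drops the oldest sample once the window is full.
        self.samples = deque(maxlen=window)
        self.ridge = ridge
        self.width = width
        self.alpha = []

    def update(self, state, target):
        """Add one (state, target) pair and refit on the current window."""
        self.samples.append((tuple(state), float(target)))
        states = [s for s, _ in self.samples]
        targets = [t for _, t in self.samples]
        # Gram matrix plus ridge term on the diagonal: (K + ridge * I) alpha = y
        K = [[gaussian_kernel(si, sj, self.width) + (self.ridge if i == j else 0.0)
              for j, sj in enumerate(states)]
             for i, si in enumerate(states)]
        self.alpha = solve_linear(K, targets)

    def predict(self, state):
        """Nonparametric value estimate: weighted sum of kernels on window states."""
        return sum(a * gaussian_kernel(s, state, self.width)
                   for a, (s, _) in zip(self.alpha, self.samples))
```

In an actor-critic loop, a model of this kind would play the critic's role: each new transition updates the window, and the actor's policy-gradient step would read value estimates from `predict`. Refitting costs cubic time in the window size, which is why a bounded window keeps the per-step cost fixed.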

Appears in Collections: School of Mechanical System Engineering > 1. Journal Articles

Related Researcher

CHU, BAEK SUK: College of Engineering (School of Mechanical System Engineering)
