Detailed Information

An Enhanced Proximal Policy Optimization-Based Reinforcement Learning Method with Random Forest for Hyperparameter Optimization

Full metadata record
DC Field: Value
dc.contributor.author: Ma, Zhixin
dc.contributor.author: Cui, Shengmin
dc.contributor.author: Joe, Inwhee
dc.date.accessioned: 2023-07-05T03:53:22Z
dc.date.available: 2023-07-05T03:53:22Z
dc.date.created: 2022-09-08
dc.date.issued: 2022-07
dc.identifier.issn: 2076-3417
dc.identifier.uri: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/186187
dc.description.abstract: For most machine learning and deep learning models, the selection of hyperparameters has a significant impact on the performance of the model. Therefore, deep learning and data analysis experts have to spend a lot of time on hyperparameter tuning when building a model for accomplishing a task. Although there are many algorithms for solving hyperparameter optimization (HPO), these methods require the results of actual trials at each epoch to guide the search. To reduce the number of trials, model-based reinforcement learning adopts a multilayer perceptron (MLP) to capture the relationship between hyperparameter settings and model performance. However, the MLP needs to be carefully designed because there is a risk of overfitting. Thus, we propose a random forest-enhanced proximal policy optimization (RFEPPO) reinforcement learning algorithm to solve the HPO problem. In addition, reinforcement learning as a solution to HPO encounters the sparse reward problem, eventually leading to slow convergence. To address this problem, we employ an intrinsic reward, which introduces the prediction error as the reward signal. Experiments carried out on nine tabular datasets and two image classification datasets demonstrate the effectiveness of our model.
dc.language: English
dc.language.iso: en
dc.publisher: MDPI
dc.title: An Enhanced Proximal Policy Optimization-Based Reinforcement Learning Method with Random Forest for Hyperparameter Optimization
dc.type: Article
dc.contributor.affiliatedAuthor: Joe, Inwhee
dc.identifier.doi: 10.3390/app12147006
dc.identifier.scopusid: 2-s2.0-85137367350
dc.identifier.wosid: 000834406900001
dc.identifier.bibliographicCitation: APPLIED SCIENCES-BASEL, v.12, no.14, pp.1 - 20
dc.relation.isPartOf: APPLIED SCIENCES-BASEL
dc.citation.title: APPLIED SCIENCES-BASEL
dc.citation.volume: 12
dc.citation.number: 14
dc.citation.startPage: 1
dc.citation.endPage: 20
dc.type.rims: ART
dc.type.docType: Article
dc.description.journalClass: 1
dc.description.isOpenAccess: Y
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Chemistry
dc.relation.journalResearchArea: Engineering
dc.relation.journalResearchArea: Materials Science
dc.relation.journalResearchArea: Physics
dc.relation.journalWebOfScienceCategory: Chemistry, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Engineering, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Materials Science, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Physics, Applied
dc.subject.keywordAuthor: hyperparameter optimization (HPO)
dc.subject.keywordAuthor: proximal policy optimization
dc.subject.keywordAuthor: random forest
dc.subject.keywordAuthor: reinforcement learning
dc.identifier.url: https://www.mdpi.com/2076-3417/12/14/7006
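
The abstract describes two ideas that can be illustrated in isolation: a random forest surrogate that predicts model performance from a hyperparameter setting (replacing an MLP that risks overfitting), and an intrinsic reward defined as the surrogate's prediction error. The following is a minimal sketch of those two pieces using scikit-learn, not the paper's actual RFEPPO implementation; the toy objective, the `surrogate` model, and the `intrinsic_reward` helper are illustrative assumptions.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Toy trial history: 2-D hyperparameter vectors (e.g. learning rate, dropout)
# and the validation scores the real model achieved with them. The quadratic
# objective below is a stand-in for an actual training run.
def true_score(x):
    return 1.0 - (x[..., 0] - 0.3) ** 2 - (x[..., 1] - 0.7) ** 2

X_trials = rng.uniform(0.0, 1.0, size=(50, 2))
y_scores = true_score(X_trials)

# Random forest surrogate: lets the RL agent query predicted performance
# for a candidate configuration without running a full training trial.
surrogate = RandomForestRegressor(n_estimators=100, random_state=0)
surrogate.fit(X_trials, y_scores)

def intrinsic_reward(x, observed_score):
    """Prediction error of the surrogate, used as an exploration bonus:
    configurations the surrogate models poorly yield a larger reward,
    counteracting the sparse extrinsic reward signal."""
    predicted = surrogate.predict(x.reshape(1, -1))[0]
    return abs(observed_score - predicted)

# Evaluate the bonus for a new candidate configuration.
x_new = np.array([0.3, 0.7])
bonus = intrinsic_reward(x_new, true_score(x_new))
```

In the paper's setting, this bonus would be added to the extrinsic reward (the achieved validation score) inside the PPO update loop, so the policy is pushed toward regions of the hyperparameter space where the surrogate is still uncertain.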
Appears in Collections
Seoul College of Engineering > Seoul School of Computer Software > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Joe, Inwhee
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
