A hybrid deep Q-network for the SVM Lagrangian

Kim; C.; Kim, Hyeyoung; H.-Y.

Detailed Information

Cited 0 time in webofscience

Cited 1 time in scopus

Metadata Downloads

A hybrid deep Q-network for the SVM Lagrangian

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim	-
dc.contributor.author	C.	-
dc.contributor.author	Kim, Hyeyoung	-
dc.contributor.author	H.-Y.	-
dc.date.available	2021-03-17T07:51:01Z	-
dc.date.created	2021-02-26	-
dc.date.issued	2019	-
dc.identifier.issn	1876-1100	-
dc.identifier.uri	https://scholarworks.bwise.kr/hongik/handle/2020.sw.hongik/12695	-
dc.description.abstract	The setting hyperparameters in the support vector machine (SVM) is very important with regard to its accuracy and efficiency. In this paper, we employ a novel definition of the reinforcement learning state, actions and reward function that allows a deep Q-network (DQN) to learn to control the optimization hyperparameters for the SVM deep neural networks by supervised Big-Data. In this framework, the DQN algorithm with experience replay is based on the off-policy reinforcement learning for the expected discounted return of rewards, or q-values, connected to the actions of adjusting the hyperprameters in the SVM. We propose the two deep neural networks, one with the SVM and the other with Q-network (DQN). The SVM deep neural networks learns a policy for the optimization hyperparameters, but differ in the number of allowed actions. The SVM deep neural networks trains the hyperparameters of the SVM simultaneously such as the Lagrangian multiplier. The proposed algorithm is called a Hybrid DQN combined with SVM deep neural networks. This algorithm could be considered as the classifier in the real-world domains such as network anomalies in the distributed server loads, because the SVM is suitable for the application in a classification, especially for the one-againstthe others. Algorithm comparisons show that our proposed algorithm leads to good optimization of the Lagrangian multiplier and can prevent overfitting to a certain extent automatically without human system designers. In terms of the classification performance of the proposed algorithm can be compared to the original LIBSVM with no controls of the hyperparameters.	-
dc.publisher	SPRINGER	-
dc.title	A hybrid deep Q-network for the SVM Lagrangian	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	Kim, Hyeyoung	-
dc.identifier.doi	10.1007/978-981-13-1056-0_63	-
dc.identifier.scopusid	2-s2.0-85051064954	-
dc.identifier.wosid	000454443400061	-
dc.identifier.bibliographicCitation	Lecture Notes in Electrical Engineering, v.514, pp.643 - 651	-
dc.relation.isPartOf	Lecture Notes in Electrical Engineering	-
dc.citation.title	Lecture Notes in Electrical Engineering	-
dc.citation.volume	514	-
dc.citation.startPage	643	-
dc.citation.endPage	651	-
dc.type.rims	ART	-
dc.type.docType	Proceedings Paper	-
dc.description.journalClass	1	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Information Science & Library Science	-
dc.relation.journalWebOfScienceCategory	Information Science & Library Science	-
dc.subject.keywordAuthor	SVM deep neural networks	-
dc.subject.keywordAuthor	Network anomalies in distributed server loads	-
dc.subject.keywordAuthor	Hybrid deep Q-Network reinforcement learning	-
dc.subject.keywordAuthor	Hyperprameters	-

Files in This Item: There are no files associated with this item.

Appears in Collections: School of Games > Game Software Major > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kim, Hye Young photo

Kim, Hye Young: Game (Major in Game Software)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :2,623,089; Today View :1,710

RSS_1.0 RSS_2.0 ATOM_1.0

94, Wausan-ro, Mapo-gu, Seoul, 04066, Korea02-320-1314

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE