Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Learning to schedule network resources throughput and delay optimally using q+-learning

Full metadata record
DC Field Value Language
dc.contributor.authorBae, Jeongmin-
dc.contributor.authorLee, Joohyun-
dc.contributor.authorChong, Song-
dc.date.accessioned2021-11-08T04:34:46Z-
dc.date.available2021-11-08T04:34:46Z-
dc.date.issued2021-04-
dc.identifier.issn1063-6692-
dc.identifier.issn1558-2566-
dc.identifier.urihttps://scholarworks.bwise.kr/erica/handle/2021.sw.erica/106214-
dc.description.abstractAs network architecture becomes complex and the user requirement gets diverse, the role of efficient network resource management becomes more important. However, existing throughput-optimal scheduling algorithms such as the max-weight algorithm suffer from poor delay performance. In this paper, we present reinforcement learning-based network scheduling algorithms for a single-hop downlink scenario which achieve throughput-optimality and converge to minimal delay. To this end, we first formulate the network optimization problem as a Markov decision process (MDP) problem. Then, we introduce a new state-action value function called Q+-function and develop a reinforcement learning algorithm called Q+-learning with UCB (Upper Confidence Bound) exploration which guarantees small performance loss during a learning process. We also derive an upper bound of the sample complexity in our algorithm, which is more efficient than the best known bound from Q-learning with UCB exploration by a factor of γ2 where γ is the discount factor of the MDP problem. Finally, via simulation, we verify that our algorithm shows a delay reduction of up to 40.8% compared to the max-weight algorithm over various scenarios. We also show that the Q+-learning with UCB exploration converges to an ϵ-epsilon-optimal policy 10 times faster than Q-learning with UCB.-
dc.format.extent14-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleLearning to schedule network resources throughput and delay optimally using q+-learning-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/tnet.2021.3051663-
dc.identifier.scopusid2-s2.0-85100475440-
dc.identifier.wosid000641964600020-
dc.identifier.bibliographicCitationIEEE/ACM Transactions on Networking, v.29, no.2, pp 750 - 763-
dc.citation.titleIEEE/ACM Transactions on Networking-
dc.citation.volume29-
dc.citation.number2-
dc.citation.startPage750-
dc.citation.endPage763-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Hardware & Architecture-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusALLOCATION-
dc.subject.keywordPlusOPTIMIZATION-
dc.subject.keywordPlusMODEL-
dc.subject.keywordAuthorNetwork resource management-
dc.subject.keywordAuthorthroughput and delay optimality-
dc.subject.keywordAuthorreinforcement learning-
dc.subject.keywordAuthorupper confidence bound-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/9336288-
Files in This Item
Go to Link
Appears in
Collections
COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Joo hyun photo

Lee, Joo hyun
ERICA 공학대학 (SCHOOL OF ELECTRICAL ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE