Reinforcement Learning for Random Access in Multi-cell Networks
- Authors
- Lee, D.; Zhao, Y.; Lee, J.
- Issue Date
- Apr-2021
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Exponential-weight algorithm for Exploration and Exploitation; Multi-Armed bandit; Random access; Reinforcement learning
- Citation
- 3rd International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2021, pp 335 - 338
- Pages
- 4
- Indexed
- SCIE
SCOPUS
- Journal Title
- 3rd International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2021
- Start Page
- 335
- End Page
- 338
- URI
- https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/105791
- DOI
- 10.1109/ICAIIC51459.2021.9415281
- ISSN
- 0000-0000
- Abstract
- In this paper, our goal is to maximize the system throughput in a time-slotted uplink multi-cell random access communication system. To this end, we propose a two-stage reinforcement learning (RL)-based algorithm based on the exponential-weight algorithm for exploration and exploitation (EXP3). In each macro-Time slot that consists of multiple time slots, users run the RL-based algorithm to choose the associated access point (AP). Then, a transmission policy determines the sub-Time slot that user will transmit data in each time slot. Another RL-based learning algorithm is used to obtain an optimal transmission policy. To show that our method is efficient, we compare our proposed algorithm with the $\epsilon$-greedy algorithm in two different scenarios. The simulation results show that the average system throughput of our algorithm outperforms that of $\epsilon$-greedy exploration. © 2021 IEEE.
- Files in This Item
-
Go to Link
- Appears in
Collections - COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.