Deep Reinforcement Learning for Scalable Dynamic Bandwidth Allocation in RAN Slicing With Highly Mobile Users

Choi, Sihyun; Choi, Siyoung; Lee, Goodsol; Yoon, Sung-Guk; Bahk, Saewoong

doi:10.1109/TVT.2023.3302416

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Deep Reinforcement Learning for Scalable Dynamic Bandwidth Allocation in RAN Slicing With Highly Mobile Users

Authors: Choi, Sihyun; Choi, Siyoung; Lee, Goodsol; Yoon, Sung-Guk; Bahk, Saewoong

Issue Date: Jan-2024

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Quality of service; Quality of experience; 5G mobile communication; Resource management; Deep learning; Bandwidth; Reinforcement learning; RAN slicing; deep reinforcement learning; long short-term memories; action factorization; dynamic bandwidth allocation; high mobility

Citation: IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, v.73, no.1, pp 576 - 590

Pages: 15

Journal Title: IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY

Volume: 73

Number: 1

Start Page: 576

End Page: 590

URI: https://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/49541

DOI: 10.1109/TVT.2023.3302416

ISSN: 0018-9545
1939-9359

Abstract: Radio Access Network (RAN) slicing is a key technology in 5G communication systems. It dynamically allocates network resources such as bandwidth and time slots to each RAN slice, meeting the quality of service (QoS) requirements of each slice on a common underlying 5G infrastructure. This RAN slicing problem normally has a large number of resource combinations with a practical number of RAN slices. However, most Q-learning based Deep reinforcement learning (DRL) algorithms cannot successfully converge with the size of the action space. To address this issue, we introduce the architecture of Action Factorization (AF) with a soft-max layer, which aids exploration by decomposing a large action space into multiple independent sub-action spaces. In addition, RAN slicing problem is facing a performance issue for highly mobile users. To improve the performance of this problem, we use current channel information and future channel information predicted by long short-term memory (LSTM). We then propose a DRL architecture combined with AF and LSTM for bandwidth allocation in RAN slicing. Furthermore, we point out that the QoS requirements used as a performance metric in existing studies are inconsistent with the QoE achievement from the user's point of view. Therefore, we introduce new metrics, data rate indicators (DRI), to compensate the discrepancy. Through extensive simulations, we confirm that our proposed solution efficiently allocates bandwidth to each slice for a reasonable number of slices by maximizing the sum of rewards from QoE achievement for each user under high mobility.

Files in This Item: Go to Link

Appears in Collections: College of Engineering > School of Electrical Engineering > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Yoon, Sung Guk photo

Yoon, Sung Guk: College of Engineering (School of Electrical Engineering)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :8,222,750; Today View :2,294

RSS_1.0 RSS_2.0 ATOM_1.0

Soongsil University Library 369 Sangdo-Ro, Dongjak-Gu, Seoul, Korea (06978)02-820-0733

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE