Deep Reinforcement Learning for Distributed Dynamic MISO Downlink-Beamforming Coordination

Ge, Jungang; Liang, Ying-Chang; Joung, Jingon; Sun, Sumei

doi:10.1109/TCOMM.2020.3004524

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Deep Reinforcement Learning for Distributed Dynamic MISO Downlink-Beamforming Coordination

Authors: Ge, Jungang; Liang, Ying-Chang; Joung, Jingon; Sun, Sumei

Issue Date: Oct-2020

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Downlink; Intercell interference; Wireless communication; MISO communication; Data communication; Reinforcement learning; Downlink-beamforming coordination; multi-input single-output (MISO) interference channel; deep reinforcement learning; interference mitigation

Citation: IEEE TRANSACTIONS ON COMMUNICATIONS, v.68, no.10, pp 6070 - 6085

Pages: 16

Journal Title: IEEE TRANSACTIONS ON COMMUNICATIONS

Volume: 68

Number: 10

Start Page: 6070

End Page: 6085

URI: https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/53850

DOI: 10.1109/TCOMM.2020.3004524

ISSN: 0090-6778
1558-0857

Abstract: We consider a homogeneous cellular network where a multi-antenna base station (BS) in each cell transmits messages to its intended user over a common frequency band. To improve the system capacity of this multi-cell multi-input single-output (MISO) interference channel, one of the state-of-the-art algorithms, namely, downlink-beamforming coordination, allows all BSs to cooperate with one another to mitigate the effect of inter-cell interference. However, most existing algorithms are suboptimal and impractical in a dynamic wireless environment, due to the high computational complexity and the overhead involved in collecting global channel state information (CSI). In this study, we exploit deep reinforcement learning (DRL) and propose a distributed dynamic downlink-beamforming coordination (DDBC) method with partial observability of the CSI. Each BS is able to train its own deep Q-network and employs appropriate beamformer depending on its environment, which is observed through a designed limited-information exchange protocol. The simulation results show that the proposed DRL-based DDBC method, with a considerably lower system overhead, achieves a system capacity that is very close to that of the fractional programming algorithm with global and instantaneous CSI measurements. In addition, this work demonstrates the potential of utilizing DRL to solve DDBC problems in a more practical manner.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of ICT Engineering > School of Electrical and Electronics Engineering > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Joung, Jin Gon photo

Joung, Jin Gon: 창의ICT공과대학 (전자전기공학부)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :7,415,034; Today View :2,532

RSS_1.0 RSS_2.0 ATOM_1.0

84, Heukseok-ro, Dongjak-gu, Seoul, Republic of Korea (06974)02-820-6194

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE