DRL-Based Resource Allocation for NOMA-Enabled D2D Communications Underlay Cellular Networks

Jeong, Yun Jae; Yu, Seoyoung; Lee, Jeong Woo

doi:10.1109/ACCESS.2023.3341585

Title: DRL-Based Resource Allocation for NOMA-Enabled D2D Communications Underlay Cellular Networks (open access)

Authors: Jeong, Yun Jae; Yu, Seoyoung; Lee, Jeong Woo

Issue Date: 2023

Publisher: Institute of Electrical and Electronics Engineers Inc.

Keywords: cellular network; Cellular networks; deep reinforcement learning; Device-to-device communication; Device-to-device communications; Downlink; NOMA; non-orthogonal multiple access; Protocols; Quality of service; resource allocation; Resource management

Citation: IEEE Access, v.11, pp 140270 - 140286

Pages: 17

Journal Title: IEEE Access

Volume: 11

Start Page: 140270

End Page: 140286

URI: https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/71363

DOI: 10.1109/ACCESS.2023.3341585

ISSN: 2169-3536

Abstract: Since the emergence of device-to-device (D2D) communications, there has been continuous demand for an efficient, low-complexity resource allocation (RA) scheme suited to highly variable network environments. As a solution, we propose an RA scheme based on deep reinforcement learning (DRL) for D2D communications that use a cluster-wise non-orthogonal multiple access (NOMA) protocol and underlay cellular networks. The goal of RA is to allocate transmit power and channel spectrum to D2D links so as to maximize a benefit. We analyze and formulate the outage of NOMA-enabled D2D links and investigate performance measures. To reduce system overhead and computational complexity while maintaining high benefit, we propose a sub-optimal RA scheme under a centralized multi-agent DRL framework. Each agent, one per D2D cluster, trains its own artificial neural networks cyclically with a timing offset. The proposed DRL-based RA scheme allocates resources to D2D links promptly, based on observations of the time-varying environment. The proposed RA scheme outperforms other schemes in terms of benefit, energy efficiency, fairness and coordination of D2D users, and the performance gain becomes significant when the mutual interference among user equipments is severe. In a cell of 100-meter radius with target rates of 2 and 8 bits/s/Hz for D2D and cellular links, respectively, the proposed RA scheme improves normalized benefit, energy efficiency, fairness and coordination of D2D users by 18%, 23%, 75% and 80%, respectively, over a greedy scheme. The corresponding improvements over a random RA scheme are 152%, 164%, 87% and 77%, respectively.
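The abstract refers to the outage of NOMA-enabled D2D links against a target rate. As a rough illustration only, the sketch below checks outage for a textbook two-receiver power-domain NOMA link with successive interference cancellation (SIC). It is not the paper's formulation: the function names, the two-receiver cluster, the power-split parameter `alpha_weak`, and the omission of interference from cellular users and other clusters are all assumptions made for this example.

```python
import math


def noma_two_user_rates(p_total, alpha_weak, g_strong, g_weak, noise):
    """Rates in bits/s/Hz for a generic two-receiver power-domain NOMA link.

    Hypothetical helper: alpha_weak is the fraction of transmit power given
    to the receiver with the weaker channel gain (g_weak < g_strong).
    Interference from other links is ignored in this simplified model.
    """
    p_weak = alpha_weak * p_total
    p_strong = (1.0 - alpha_weak) * p_total

    # Weak receiver decodes its own signal and treats the other signal as noise.
    sinr_weak = p_weak * g_weak / (p_strong * g_weak + noise)

    # Strong receiver first decodes the weak receiver's signal so it can cancel it (SIC).
    sinr_sic = p_weak * g_strong / (p_strong * g_strong + noise)

    # After cancellation, the strong receiver sees only noise.
    snr_strong = p_strong * g_strong / noise

    return (
        math.log2(1.0 + sinr_weak),
        math.log2(1.0 + sinr_sic),
        math.log2(1.0 + snr_strong),
    )


def in_outage(rates, target_rate):
    """Declare outage if any decoding step falls below the target rate."""
    return any(rate < target_rate for rate in rates)


if __name__ == "__main__":
    # Illustrative numbers only; 2 bits/s/Hz mirrors the D2D target rate in the abstract.
    rates = noma_two_user_rates(
        p_total=1.0, alpha_weak=0.8, g_strong=100.0, g_weak=10.0, noise=1.0
    )
    print("rates (weak, SIC step, strong):", rates)
    print("outage at 2 bits/s/Hz:", in_outage(rates, target_rate=2.0))
```

In this simplified model a resource allocator would vary `p_total` and `alpha_weak` per cluster and reject choices that put the link in outage; the DRL agents described in the abstract would learn such choices from observations rather than enumerate them.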

Files in This Item

DRL-Based Resource Allocation for NOMA-Enabled D2D Communications Underlay Cellular Networks.pdf 2.44 MB

Appears in Collections: College of ICT Engineering > School of Electrical and Electronics Engineering > 1. Journal Articles

Related Researcher

Lee, Jeong Woo: College of ICT Engineering (School of Electrical and Electronics Engineering)
