Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Multi-agent reinforcement learning based optimal energy sensing threshold control in distributed cognitive radio networks with directional antennaopen accessMulti-agent reinforcement learning based optimal energy sensing threshold control in distributed cognitive radio networks with directional antenna

Authors
Pham, Thi Thu HienNoh, WonjongCho, Sungrae
Issue Date
2024
Publisher
Korean Institute of Communications and Information Sciences
Keywords
Cognitive radio networks; CRNs; Cooperative spectrum sensing; CSS; Directional antennas; Multi-agent deep deterministic policy gradient; MADDPG; Reinforcement learning; RL
Citation
ICT Express, v.10, no.3, pp 472 - 478
Pages
7
Journal Title
ICT Express
Volume
10
Number
3
Start Page
472
End Page
478
URI
https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/72787
DOI
10.1016/j.icte.2024.01.001
ISSN
2405-9595
2405-9595
Abstract
In CRNs, it is crucial to develop an efficient and reliable spectrum detector that consistently provides accurate information about the channel state. In this work, we investigate a CSS in a fully-distributed environment where all secondary users (SUs) are equipped with directional antennas and make decisions based solely on their local knowledge without information sharing between SUs. First, we establish a stochastic sequential optimization problem, which is an NP-hard, that maximizes the SU's detection accuracy by the dynamic and optimal control of the energy sensing/detection threshold. It can enable SUs to select an available channel and sector without causing interference to the primary network. To address it in a distributed environment, the problem is transformed into a decentralized partially observed Markov decision process (Dec-POMDP) problem. Second, in order to determine the best control for the Dec-POMDP in a practical environment without any prior knowledge of state–action transition probabilities, we develop a multi-agent deep deterministic policy gradient (MADDPG)-based algorithm, which is referred to as MA-DCSS. This algorithm adopts the centralized training and decentralized execution (CTDE) architecture. Third, we analyzed its computational complexity and showed the proposed approach's scalability by the polynomial computational complexity, in terms of the number of channels, sectors, and SUs. Lastly, the simulation confirms that the proposed scheme provides enhanced performance in terms of convergence speed, accurate detection, and false alarm probabilities when it is compared to baseline algorithms. © 2024 The Author(s)
Files in This Item
Appears in
Collections
College of Software > School of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Cho, Sung Rae photo

Cho, Sung Rae
소프트웨어대학 (소프트웨어학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE