Layerwise Quantum Deep Reinforcement Learning for Joint Optimization of UAV Trajectory and Resource Allocation
- Authors
- Silvirianti; Narottama, Bhaskara; Shin, Soo Young
- Issue Date
- Jan-2024
- Publisher
- IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- Keywords
- Quantum computing; Training; Quantum state; Optimization; Autonomous aerial vehicles; Trajectory; Resource management; Deep reinforcement learning; joint optimization; layerwise training; local loss; quantum embedding; unmanned aerial vehicle (UAV)
- Citation
- IEEE INTERNET OF THINGS JOURNAL, v.11, no.1, pp 430 - 443
- Pages
- 14
- Journal Title
- IEEE INTERNET OF THINGS JOURNAL
- Volume
- 11
- Number
- 1
- Start Page
- 430
- End Page
- 443
- URI
- https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/28498
- DOI
- 10.1109/JIOT.2023.3285968
- ISSN
- 2327-4662
- Abstract
- This study proposes a layerwise quantum-based deep reinforcement learning (LQ-DRL) method for optimizing continuous large space and time series problems using deep-layer training. The actions in LQ-DRL are optimized using a layerwise quantum embedding that leverages the advantages of quantum computing to maximize reward and reduce training loss. Moreover, this study employs a local loss to minimize the occurrence of barren plateaus phenomena and further enhance performance. As a particular case, the proposed scheme is employed to jointly optimize: 1) unmanned aerial vehicle (UAV) trajectory planning; 2) user grouping; and 3) power allocation for higher energy efficiency of a UAV as the reward. The combination of these optimized factors is referred to as action space in the presented LQ-DRL. The LQ-DRL is employed to solve the optimization problem due to its nonconvexity, continuous and large action space, and time-series domain. In a practical view, LQ-DRL aims to solve the issue of energy consumption related to limited-battery energy of a UAV base station (BS) while maintaining Quality of Service (QoS) for users, by gaining maximum energy efficiency as the reward. One of real applications, as an example, LQ-DRL can be employed to maximize the energy efficiency of a UAV BS in UAV empowered disaster recovery networks scenario. The quantum circuits of layerwise quantum embedding are presented to show the practical implementation in noisy intermediate-scale quantum computers. Based on the results, LQ-DRL outperformed the classical DRL by achieving higher effective dimension, rewards, and lower learning losses. In addition, better performances were achieved using more layers.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - School of Electronic Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.