Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Layerwise Quantum Deep Reinforcement Learning for Joint Optimization of UAV Trajectory and Resource Allocation

Authors
SilviriantiNarottama, BhaskaraShin, Soo Young
Issue Date
Jan-2024
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Keywords
Quantum computing; Training; Quantum state; Optimization; Autonomous aerial vehicles; Trajectory; Resource management; Deep reinforcement learning; joint optimization; layerwise training; local loss; quantum embedding; unmanned aerial vehicle (UAV)
Citation
IEEE INTERNET OF THINGS JOURNAL, v.11, no.1, pp 430 - 443
Pages
14
Journal Title
IEEE INTERNET OF THINGS JOURNAL
Volume
11
Number
1
Start Page
430
End Page
443
URI
https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/28498
DOI
10.1109/JIOT.2023.3285968
ISSN
2327-4662
Abstract
This study proposes a layerwise quantum-based deep reinforcement learning (LQ-DRL) method for optimizing continuous large space and time series problems using deep-layer training. The actions in LQ-DRL are optimized using a layerwise quantum embedding that leverages the advantages of quantum computing to maximize reward and reduce training loss. Moreover, this study employs a local loss to minimize the occurrence of barren plateaus phenomena and further enhance performance. As a particular case, the proposed scheme is employed to jointly optimize: 1) unmanned aerial vehicle (UAV) trajectory planning; 2) user grouping; and 3) power allocation for higher energy efficiency of a UAV as the reward. The combination of these optimized factors is referred to as action space in the presented LQ-DRL. The LQ-DRL is employed to solve the optimization problem due to its nonconvexity, continuous and large action space, and time-series domain. In a practical view, LQ-DRL aims to solve the issue of energy consumption related to limited-battery energy of a UAV base station (BS) while maintaining Quality of Service (QoS) for users, by gaining maximum energy efficiency as the reward. One of real applications, as an example, LQ-DRL can be employed to maximize the energy efficiency of a UAV BS in UAV empowered disaster recovery networks scenario. The quantum circuits of layerwise quantum embedding are presented to show the practical implementation in noisy intermediate-scale quantum computers. Based on the results, LQ-DRL outperformed the classical DRL by achieving higher effective dimension, rewards, and lower learning losses. In addition, better performances were achieved using more layers.
Files in This Item
There are no files associated with this item.
Appears in
Collections
School of Electronic Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher SHIN, SOO YOUNG photo

SHIN, SOO YOUNG
College of Engineering (School of Electronic Engineering)
Read more

Altmetrics

Total Views & Downloads

BROWSE