Layerwise Quantum Deep Reinforcement Learning for Joint Optimization of UAV Trajectory and Resource Allocation

Silvirianti; Narottama, Bhaskara; Shin, Soo Young

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Layerwise Quantum Deep Reinforcement Learning for Joint Optimization of UAV Trajectory and Resource Allocation

Authors: Silvirianti; Narottama, Bhaskara; Shin, Soo Young

Issue Date: Jan-2024

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Quantum computing; Training; Quantum state; Optimization; Autonomous aerial vehicles; Trajectory; Resource management; Deep reinforcement learning; joint optimization; layerwise training; local loss; quantum embedding; unmanned aerial vehicle (UAV)

Citation: IEEE INTERNET OF THINGS JOURNAL, v.11, no.1, pp 430 - 443

Pages: 14

Journal Title: IEEE INTERNET OF THINGS JOURNAL

Volume: 11

Number: 1

Start Page: 430

End Page: 443

URI: https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/28498

DOI: 10.1109/JIOT.2023.3285968

ISSN: 2327-4662

Abstract: This study proposes a layerwise quantum-based deep reinforcement learning (LQ-DRL) method for optimizing continuous large space and time series problems using deep-layer training. The actions in LQ-DRL are optimized using a layerwise quantum embedding that leverages the advantages of quantum computing to maximize reward and reduce training loss. Moreover, this study employs a local loss to minimize the occurrence of barren plateaus phenomena and further enhance performance. As a particular case, the proposed scheme is employed to jointly optimize: 1) unmanned aerial vehicle (UAV) trajectory planning; 2) user grouping; and 3) power allocation for higher energy efficiency of a UAV as the reward. The combination of these optimized factors is referred to as action space in the presented LQ-DRL. The LQ-DRL is employed to solve the optimization problem due to its nonconvexity, continuous and large action space, and time-series domain. In a practical view, LQ-DRL aims to solve the issue of energy consumption related to limited-battery energy of a UAV base station (BS) while maintaining Quality of Service (QoS) for users, by gaining maximum energy efficiency as the reward. One of real applications, as an example, LQ-DRL can be employed to maximize the energy efficiency of a UAV BS in UAV empowered disaster recovery networks scenario. The quantum circuits of layerwise quantum embedding are presented to show the practical implementation in noisy intermediate-scale quantum computers. Based on the results, LQ-DRL outperformed the classical DRL by achieving higher effective dimension, rewards, and lower learning losses. In addition, better performances were achieved using more layers.

Files in This Item: There are no files associated with this item.

Appears in Collections: School of Electronic Engineering > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher SHIN, SOO YOUNG photo

SHIN, SOO YOUNG: College of Engineering (School of Electronic Engineering)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :1,521,506; Today View :606

RSS_1.0 RSS_2.0 ATOM_1.0

350-27, Gumi-daero, Gumi-si, Gyeongsangbuk-do, Republic of Korea (39253)054-478-7170

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE