Evolutionary Reinforcement Learning with Double Replay Buffers for UAV Online Target Tracking
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yu, Bai-Jiang | - |
dc.contributor.author | Wei, Feng-Feng | - |
dc.contributor.author | Hu, Xiao-Min | - |
dc.contributor.author | Jeon, Sang-Woon | - |
dc.contributor.author | Luo, Wen-Jian | - |
dc.contributor.author | Chen, Wei-Neng | - |
dc.date.accessioned | 2025-06-13T07:00:26Z | - |
dc.date.available | 2025-06-13T07:00:26Z | - |
dc.date.issued | 2025-01 | - |
dc.identifier.issn | 1062-922X | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/125618 | - |
dc.description.abstract | Target tracking has broad applications like disaster relief, and unmanned aerial vehicles (UAVs) have been universally applied in target tracking in recent years. Due to the strong responsiveness to deceptive reward signals and diverse exploration, evolutionary reinforcement learning (ERL) is a more noteworthy option for training UAVs than common reinforcement learning. However, for ERL contains too many neural networks, its training efficiency is not satisfactory enough. To address this shortcoming, this paper proposes an evolutionary reinforcement learning with double replay buffers (ERLDRB) for UAV online target tracking problem. Firstly, considering the energy consumption and the possible delay of feedback signals to the UAV, a more realistic model of UAV online target tracking problem is designed. Then based on the problem formulation, ERLDRB utilizes a double experience replay buffers technique to increase learning efficiency in the training stage, which can better solve real-world UAV online target tracking problem. Simulation results show that ERLDRB outperforms multiple contrasting algorithms on the designed model. © 2024 IEEE. | - |
dc.format.extent | 8 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | Evolutionary Reinforcement Learning with Double Replay Buffers for UAV Online Target Tracking | - |
dc.type | Article | - |
dc.identifier.doi | 10.1109/SMC54092.2024.10831604 | - |
dc.identifier.scopusid | 2-s2.0-85217878505 | - |
dc.identifier.bibliographicCitation | Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics, pp 1350 - 1357 | - |
dc.citation.title | Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics | - |
dc.citation.startPage | 1350 | - |
dc.citation.endPage | 1357 | - |
dc.type.docType | Conference paper | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordAuthor | Evolutionary Algorithms | - |
dc.subject.keywordAuthor | Evolutionary Reinforcement Learning | - |
dc.subject.keywordAuthor | Target tracking problem | - |
dc.subject.keywordAuthor | Unmanned aerial vehicle systems | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.