Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Moon, Jiseon | - |
dc.contributor.author | Papaioannou, Savvas | - |
dc.contributor.author | Laoudias, Christos | - |
dc.contributor.author | Kolios, Panayiotis | - |
dc.contributor.author | Kim, Sunwoo | - |
dc.date.accessioned | 2022-07-06T11:47:40Z | - |
dc.date.available | 2022-07-06T11:47:40Z | - |
dc.date.created | 2021-07-14 | - |
dc.date.issued | 2021-10 | - |
dc.identifier.issn | 2327-4662 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/140623 | - |
dc.description.abstract | In this article, we propose a novel deep reinforcement learning (DRL) approach for controlling multiple unmanned aerial vehicles (UAVs) with the ultimate purpose of tracking multiple first responders (FRs) in challenging 3-D environments in the presence of obstacles and occlusions. We assume that the UAVs receive noisy distance measurements from the FRs which are of two types, i.e., Line of Sight (LoS) and non-LoS (NLoS) measurements and which are used by the UAV agents in order to estimate the state (i.e., position) of the FRs. Subsequently, the proposed DRL-based controller selects the optimal joint control actions according to the Cramer-Rao lower bound (CRLB) of the joint measurement likelihood function to achieve high tracking performance. Specifically, the optimal UAV control actions are quantified by the proposed reward function, which considers both the CRLB of the entire system and each UAV's individual contribution to the system, called global reward and difference reward, respectively. Since the UAVs take actions that reduce the CRLB of the entire system, tracking accuracy is improved by ensuring the reception of high quality LoS measurements with high probability. Our simulation results show that the proposed DRL-based UAV controller provides a highly accurate target tracking solution with a very low runtime cost. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Kim, Sunwoo | - |
dc.identifier.doi | 10.1109/JIOT.2021.3073973 | - |
dc.identifier.scopusid | 2-s2.0-85104657401 | - |
dc.identifier.wosid | 000704110900042 | - |
dc.identifier.bibliographicCitation | IEEE INTERNET OF THINGS JOURNAL, v.8, no.20, pp.15441 - 15455 | - |
dc.relation.isPartOf | IEEE INTERNET OF THINGS JOURNAL | - |
dc.citation.title | IEEE INTERNET OF THINGS JOURNAL | - |
dc.citation.volume | 8 | - |
dc.citation.number | 20 | - |
dc.citation.startPage | 15441 | - |
dc.citation.endPage | 15455 | - |
dc.type.rims | ART | - |
dc.type.docType | Article in Press | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.subject.keywordPlus | OPTIMIZATION | - |
dc.subject.keywordPlus | NAVIGATION | - |
dc.subject.keywordPlus | COVERAGE | - |
dc.subject.keywordAuthor | Target tracking | - |
dc.subject.keywordAuthor | Unmanned aerial vehicles | - |
dc.subject.keywordAuthor | Reinforcement learning | - |
dc.subject.keywordAuthor | Navigation | - |
dc.subject.keywordAuthor | Location awareness | - |
dc.subject.keywordAuthor | Time measurement | - |
dc.subject.keywordAuthor | State estimation | - |
dc.subject.keywordAuthor | Multiagent deep reinforcement learning (DRL) | - |
dc.subject.keywordAuthor | multitarget tracking | - |
dc.subject.keywordAuthor | unmanned aerial vehicle (UAV) | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/9406813 | - |
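
The abstract describes a reward built from the CRLB of the whole system (global reward) and from each UAV's individual contribution (difference reward). The following is a minimal sketch of that idea, not the authors' implementation. It assumes independent Gaussian range measurements, a single target, a larger noise standard deviation standing in for NLoS links, and a weak isotropic prior (`prior_info`) so the information matrix stays invertible when one UAV is left out. All function names, parameters, and numbers are hypothetical.

```python
import math

def fisher_information(uavs, target, sigmas, prior_info=1e-2):
    """3x3 Fisher information for Gaussian range measurements.

    Assumption: each UAV contributes u u^T / sigma^2, where u is the unit
    vector from the UAV to the target; prior_info * I is an assumed prior.
    """
    J = [[prior_info if r == c else 0.0 for c in range(3)] for r in range(3)]
    for pos, sigma in zip(uavs, sigmas):
        d = [t - p for t, p in zip(target, pos)]
        dist = math.sqrt(sum(x * x for x in d))  # assumes UAV is not at the target
        u = [x / dist for x in d]
        for r in range(3):
            for c in range(3):
                J[r][c] += u[r] * u[c] / sigma ** 2
    return J

def crlb_trace(J):
    """trace(J^-1) of a 3x3 matrix: sum of principal 2x2 minors over det."""
    (a, b, c), (d, e, f), (g, h, i) = J
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    minors = (e * i - f * h) + (a * i - c * g) + (a * e - b * d)
    return minors / det

def global_reward(uavs, target, sigmas):
    """Negative CRLB trace: a lower position-error bound gives a higher reward."""
    return -crlb_trace(fisher_information(uavs, target, sigmas))

def difference_reward(k, uavs, target, sigmas):
    """Global reward minus the reward the team would get without UAV k."""
    others = [p for j, p in enumerate(uavs) if j != k]
    other_sigmas = [s for j, s in enumerate(sigmas) if j != k]
    return (global_reward(uavs, target, sigmas)
            - global_reward(others, target, other_sigmas))

# Hypothetical geometry: three UAVs at 10 m altitude, one target on the ground.
uavs = [(0.0, 0.0, 10.0), (20.0, 0.0, 10.0), (0.0, 20.0, 10.0)]
target = (5.0, 5.0, 0.0)
sigmas_los = [1.0, 1.0, 1.0]    # all links LoS (assumed 1 m noise)
sigmas_nlos = [1.0, 1.0, 5.0]   # third link NLoS (assumed 5 m noise)
```

Under these assumptions, `global_reward(uavs, target, sigmas_los)` is higher than `global_reward(uavs, target, sigmas_nlos)`, which mirrors the abstract's point that actions reducing the system CRLB favor LoS measurements. `difference_reward(k, ...)` is always positive in this sketch, because adding a measurement can only tighten the bound.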