Reinforcement Learning-Based Adaptive Critic Control Design for Leader-Followers Multi-Agent Systems in Zero-Sum Games

Oh, Yuna; Lee, Jinyoung; Moon, Jun

doi:10.1109/TASE.2026.3689894

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Reinforcement Learning-Based Adaptive Critic Control Design for Leader-Followers Multi-Agent Systems in Zero-Sum Games

Full metadata record

DC Field	Value	Language
dc.contributor.author	Oh, Yuna	-
dc.contributor.author	Lee, Jinyoung	-
dc.contributor.author	Moon, Jun	-
dc.date.accessioned	2026-06-01T07:00:07Z	-
dc.date.available	2026-06-01T07:00:07Z	-
dc.date.issued	2026-05	-
dc.identifier.issn	1545-5955	-
dc.identifier.issn	1558-3783	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212919	-
dc.description.abstract	This paper proposes the reinforcement learning (RL)-based adaptive critic controller for multi-agent systems with one leader and N followers. The overall problem is formulated by the zero-sum game framework, where the followers minimize while the leader maximizes the same cost to capture the formation problem between the leader and followers. The optimal value function is approximated using the RL approach, and an adaptive critic network with an additional stabilization term is employed to derive the weight update rule. By incorporating the additional stabilization term, the need for decomposition in the controller design is eliminated, and the analysis for the closed-loop system stability in the sense of uniformly ultimately boundedness is simplified. In addition, an asymmetric barrier Lyapunov function is utilized to effectively handle output constraints and enhance the control performance. The proposed RL-based adaptive controllers for the leader and followers do not require the exact modeling information, since it is estimated by the neural network-based identifier. The performance of the proposed method is demonstrated through various simulations, including the leader-follower vehicle tracking problem using the CarSim environment. Note to Practitioners - In practical multi-agent systems, controllers usually operate with a partially unknown system and constrained outputs. These constraints sometimes make it challenging to design stable control strategies. The proposed controller assumes that the system dynamics is partially unknown, and the constraints of the output and tracking error exist. These challenges commonly arise in real-world applications such as autonomous driving, cooperative robotics, and autonomous aerial vehicle (AAV) swarms. The proposed methods achieve these challenges through a neural network (NN)-based identifier, an asymmetric barrier Lyapunov function (ABLF) candidate. The NN-based identifier enables the inference of physical properties that are difficult to describe analytically. This improves reliability and makes the algorithm easier to implement in practical systems. The ABLF candidate also allows practitioners to ensure different upper and lower output boundaries. In addition, compared to complex backstepping procedures, the proposed method incorporates an additional stabilization term in the adaptive controller. Consequently, this simplifies the controller design and enables stable learning.	-
dc.format.extent	16	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Reinforcement Learning-Based Adaptive Critic Control Design for Leader-Followers Multi-Agent Systems in Zero-Sum Games	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/TASE.2026.3689894	-
dc.identifier.scopusid	2-s2.0-105038293588	-
dc.identifier.wosid	001764917400027	-
dc.identifier.bibliographicCitation	IEEE Transactions on Automation Science and Engineering, v.23, pp 9146 - 9161	-
dc.citation.title	IEEE Transactions on Automation Science and Engineering	-
dc.citation.volume	23	-
dc.citation.startPage	9146	-
dc.citation.endPage	9161	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Automation & Control Systems	-
dc.relation.journalWebOfScienceCategory	Automation & Control Systems	-
dc.subject.keywordPlus	Adaptive control systems	-
dc.subject.keywordPlus	Closed loop systems	-
dc.subject.keywordPlus	Controllers	-
dc.subject.keywordPlus	Game theory	-
dc.subject.keywordPlus	Intelligent agents	-
dc.subject.keywordPlus	Lyapunov methods	-
dc.subject.keywordPlus	Multi agent systems	-
dc.subject.keywordPlus	Reinforcement learning	-
dc.subject.keywordPlus	Stabilization	-
dc.subject.keywordPlus	Tracking (position)	-
dc.subject.keywordAuthor	Adaptive critic	-
dc.subject.keywordAuthor	asymmetric barrier Lyapunov function	-
dc.subject.keywordAuthor	multi-agent	-
dc.subject.keywordAuthor	reinforcement learning (RL)	-
dc.subject.keywordAuthor	zero-sum game	-
dc.identifier.url	https://ieeexplore.ieee.org/document/11505877	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Moon, Jun photo

Moon, Jun: COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE