Gameplanner: A Game-Theoretical Landmark Planning for Multi-Agent Goal-Conditioned Reinforcement Learning

Heo, Sunhaeng; Moon, Jun

doi:10.1007/s12555-026-00088-5

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Gameplanner: A Game-Theoretical Landmark Planning for Multi-Agent Goal-Conditioned Reinforcement Learning

Full metadata record

DC Field	Value	Language
dc.contributor.author	Heo, Sunhaeng	-
dc.contributor.author	Moon, Jun	-
dc.date.accessioned	2026-06-11T01:00:10Z	-
dc.date.available	2026-06-11T01:00:10Z	-
dc.date.issued	2026-06	-
dc.identifier.issn	1598-6446	-
dc.identifier.issn	2005-4092	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/213228	-
dc.description.abstract	Off-policy multi-agent reinforcement learning in decentralized settings faces key problems of non-stationarity and sparse rewards. When applied to goal-conditioned tasks, conventional planners often lead to conflicts and deadlocks, as each agent plans optimally only for itself while treating others as dynamic obstacles. To solve these problems, we propose Gameplanner, a framework that coordinates agents using a game-theoretical planner for goal selection, starting point selection, and landmark selection. In these three games, each agent’s choice is treated as a strategy, and a mixed Nash Equilibrium (NE) is computed to determine a mutually stable selection. Moreover, concerning the payoff for the landmark selection game, Gameplanner grounds game-theoretical planning in learning through a Learning-Guided Payoff (LGP), which uses each agent’s learned critic values to construct the game’s payoff matrix. This ensures that game-theoretical decisions are guided by individual learning progress. We demonstrate the effectiveness of our method in the AntMaze goal-reaching environment, where Gameplanner increases the success rate and enables stable, independent learning without centralized communication.	-
dc.format.extent	14	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	INST CONTROL ROBOTICS & SYSTEMS	-
dc.title	Gameplanner: A Game-Theoretical Landmark Planning for Multi-Agent Goal-Conditioned Reinforcement Learning	-
dc.type	Article	-
dc.publisher.location	대한민국	-
dc.identifier.doi	10.1007/s12555-026-00088-5	-
dc.identifier.scopusid	2-s2.0-105037523223	-
dc.identifier.wosid	001750913800001	-
dc.identifier.bibliographicCitation	INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, v.24, no.6, pp 1560 - 1573	-
dc.citation.title	INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS	-
dc.citation.volume	24	-
dc.citation.number	6	-
dc.citation.startPage	1560	-
dc.citation.endPage	1573	-
dc.type.docType	Article in press	-
dc.identifier.kciid	ART003341683	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.description.journalRegisteredClass	kci	-
dc.relation.journalResearchArea	Automation & Control Systems	-
dc.relation.journalWebOfScienceCategory	Automation & Control Systems	-
dc.subject.keywordPlus	Computation theory	-
dc.subject.keywordPlus	Computational methods	-
dc.subject.keywordPlus	Intelligent agents	-
dc.subject.keywordPlus	Machine learning	-
dc.subject.keywordPlus	Multi agent systems	-
dc.subject.keywordPlus	Nash equilibrium	-
dc.subject.keywordPlus	Problem solving	-
dc.subject.keywordAuthor	Multi-agent system	-
dc.subject.keywordAuthor	Goal-conditioned reinforcement learning	-
dc.subject.keywordAuthor	Game theory	-
dc.subject.keywordAuthor	Mixed nash equilibrium	-
dc.subject.keywordAuthor	Decentralized coordination	-
dc.identifier.url	https://link.springer.com/article/10.1007/s12555-026-00088-5	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Moon, Jun photo

Moon, Jun: COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE