Gameplanner: A Game-Theoretical Landmark Planning for Multi-Agent Goal-Conditioned Reinforcement Learning
- Authors
- Heo, Sunhaeng; Moon, Jun
- Issue Date
- Jun-2026
- Publisher
- INST CONTROL ROBOTICS & SYSTEMS
- Keywords
- Multi-agent system; Goal-conditioned reinforcement learning; Game theory; Mixed Nash equilibrium; Decentralized coordination
- Citation
- INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, v.24, no.6, pp 1560 - 1573
- Pages
- 14
- Indexed
- SCIE; SCOPUS; KCI
- Journal Title
- INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS
- Volume
- 24
- Number
- 6
- Start Page
- 1560
- End Page
- 1573
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/213228
- DOI
- 10.1007/s12555-026-00088-5
- ISSN
- 1598-6446; 2005-4092
- Abstract
- Off-policy multi-agent reinforcement learning in decentralized settings faces key problems of non-stationarity and sparse rewards. When applied to goal-conditioned tasks, conventional planners often lead to conflicts and deadlocks, as each agent plans optimally only for itself while treating others as dynamic obstacles. To solve these problems, we propose Gameplanner, a framework that coordinates agents using a game-theoretical planner for goal selection, starting point selection, and landmark selection. In these three games, each agent’s choice is treated as a strategy, and a mixed Nash Equilibrium (NE) is computed to determine a mutually stable selection. Moreover, concerning the payoff for the landmark selection game, Gameplanner grounds game-theoretical planning in learning through a Learning-Guided Payoff (LGP), which uses each agent’s learned critic values to construct the game’s payoff matrix. This ensures that game-theoretical decisions are guided by individual learning progress. We demonstrate the effectiveness of our method in the AntMaze goal-reaching environment, where Gameplanner increases the success rate and enables stable, independent learning without centralized communication.
- Appears in Collections
- Seoul College of Engineering > Seoul Major in Electrical Engineering > 1. Journal Articles
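
The mixed Nash equilibrium selection described in the abstract can be illustrated with a minimal sketch. It assumes two agents choosing between two landmarks, a payoff equal to each agent's critic value minus a penalty when both pick the same landmark, and the closed-form indifference solution for a 2x2 game. The function names, the critic values, and the penalty term are hypothetical; the paper's actual payoff construction and equilibrium solver are not given in this record.

```python
from typing import List, Optional, Tuple

Matrix = List[List[float]]


def build_payoffs(
    critic_0: List[float], critic_1: List[float], conflict_penalty: float
) -> Tuple[Matrix, Matrix]:
    """Build payoff matrices from critic values (assumed form, not the paper's).

    Entry [a0][a1] is the payoff when agent 0 picks landmark a0 and agent 1
    picks landmark a1. Picking the same landmark costs both agents a penalty.
    """
    A = [[critic_0[a0] - conflict_penalty * (a0 == a1) for a1 in range(2)]
         for a0 in range(2)]
    B = [[critic_1[a1] - conflict_penalty * (a0 == a1) for a1 in range(2)]
         for a0 in range(2)]
    return A, B


def mixed_ne_2x2(A: Matrix, B: Matrix) -> Optional[Tuple[float, float]]:
    """Return (p, q) for a fully mixed equilibrium of a 2x2 game, if one exists.

    p is the probability that agent 0 picks landmark 0, q the probability that
    agent 1 picks landmark 0. Each agent mixes so that the other is indifferent
    between its two choices.
    """
    denom_p = B[0][0] - B[0][1] - B[1][0] + B[1][1]
    denom_q = A[0][0] - A[0][1] - A[1][0] + A[1][1]
    if denom_p == 0 or denom_q == 0:
        return None  # no interior solution from the indifference conditions
    p = (B[1][1] - B[1][0]) / denom_p  # makes agent 1 indifferent
    q = (A[1][1] - A[0][1]) / denom_q  # makes agent 0 indifferent
    if not (0.0 < p < 1.0 and 0.0 < q < 1.0):
        return None  # equilibrium is not fully mixed
    return p, q


# Hypothetical critic values for two landmarks per agent.
A, B = build_payoffs([1.0, 0.6], [0.9, 0.7], conflict_penalty=1.0)
equilibrium = mixed_ne_2x2(A, B)
print(equilibrium)  # approximately (0.6, 0.7)
```

In this toy case both agents prefer landmark 0, so neither can commit to it without risking a conflict; the mixed equilibrium has each agent favor landmark 0 only with a probability that leaves the other agent indifferent. Larger games with more agents or landmarks would need a general solver rather than this closed form.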

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.