Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Gameplanner: A Game-Theoretical Landmark Planning for Multi-Agent Goal-Conditioned Reinforcement Learning

Full metadata record
DC Field Value Language
dc.contributor.authorHeo, Sunhaeng-
dc.contributor.authorMoon, Jun-
dc.date.accessioned2026-06-11T01:00:10Z-
dc.date.available2026-06-11T01:00:10Z-
dc.date.issued2026-06-
dc.identifier.issn1598-6446-
dc.identifier.issn2005-4092-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/213228-
dc.description.abstractOff-policy multi-agent reinforcement learning in decentralized settings faces key problems of non-stationarity and sparse rewards. When applied to goal-conditioned tasks, conventional planners often lead to conflicts and deadlocks, as each agent plans optimally only for itself while treating others as dynamic obstacles. To solve these problems, we propose Gameplanner, a framework that coordinates agents using a game-theoretical planner for goal selection, starting point selection, and landmark selection. In these three games, each agent’s choice is treated as a strategy, and a mixed Nash Equilibrium (NE) is computed to determine a mutually stable selection. Moreover, concerning the payoff for the landmark selection game, Gameplanner grounds game-theoretical planning in learning through a Learning-Guided Payoff (LGP), which uses each agent’s learned critic values to construct the game’s payoff matrix. This ensures that game-theoretical decisions are guided by individual learning progress. We demonstrate the effectiveness of our method in the AntMaze goal-reaching environment, where Gameplanner increases the success rate and enables stable, independent learning without centralized communication.-
dc.format.extent14-
dc.language영어-
dc.language.isoENG-
dc.publisherINST CONTROL ROBOTICS & SYSTEMS-
dc.titleGameplanner: A Game-Theoretical Landmark Planning for Multi-Agent Goal-Conditioned Reinforcement Learning-
dc.typeArticle-
dc.publisher.location대한민국-
dc.identifier.doi10.1007/s12555-026-00088-5-
dc.identifier.scopusid2-s2.0-105037523223-
dc.identifier.wosid001750913800001-
dc.identifier.bibliographicCitationINTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, v.24, no.6, pp 1560 - 1573-
dc.citation.titleINTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS-
dc.citation.volume24-
dc.citation.number6-
dc.citation.startPage1560-
dc.citation.endPage1573-
dc.type.docTypeArticle in press-
dc.identifier.kciidART003341683-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.description.journalRegisteredClasskci-
dc.relation.journalResearchAreaAutomation & Control Systems-
dc.relation.journalWebOfScienceCategoryAutomation & Control Systems-
dc.subject.keywordPlusComputation theory-
dc.subject.keywordPlusComputational methods-
dc.subject.keywordPlusIntelligent agents-
dc.subject.keywordPlusMachine learning-
dc.subject.keywordPlusMulti agent systems-
dc.subject.keywordPlusNash equilibrium-
dc.subject.keywordPlusProblem solving-
dc.subject.keywordAuthorMulti-agent system-
dc.subject.keywordAuthorGoal-conditioned reinforcement learning-
dc.subject.keywordAuthorGame theory-
dc.subject.keywordAuthorMixed nash equilibrium-
dc.subject.keywordAuthorDecentralized coordination-
dc.identifier.urlhttps://link.springer.com/article/10.1007/s12555-026-00088-5-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Moon, Jun photo

Moon, Jun
COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE