Solving Two-Player General-Sum Games Between Swarms

Ghimire, Mukesh; Zhang, Lei; Zhang, Wenlong; Ren, Yi; Xu, Zhe

Computer Science > Multiagent Systems

arXiv:2310.01682 (cs)

[Submitted on 2 Oct 2023 (v1), last revised 3 Nov 2023 (this version, v2)]

Title:Solving Two-Player General-Sum Games Between Swarms

Authors:Mukesh Ghimire, Lei Zhang, Wenlong Zhang, Yi Ren, Zhe Xu

View PDF

Abstract:Hamilton-Jacobi-Isaacs (HJI) PDEs are the governing equations for the two-player general-sum games. Unlike Reinforcement Learning (RL) methods, which are data-intensive methods for learning value function, learning HJ PDEs provide a guaranteed convergence to the Nash Equilibrium value of the game when it exists. However, a caveat is that solving HJ PDEs becomes intractable when the state dimension increases. To circumvent the curse of dimensionality (CoD), physics-informed machine learning methods with supervision can be used and have been shown to be effective in generating equilibrial policies in two-player general-sum games. In this work, we extend the existing work on agent-level two-player games to a two-player swarm-level game, where two sub-swarms play a general-sum game. We consider the \textit{Kolmogorov forward equation} as the dynamic model for the evolution of the densities of the swarms. Results show that policies generated from the physics-informed neural network (PINN) result in a higher payoff than a Nash Double Deep Q-Network (Nash DDQN) agent and have comparable performance with numerical solvers.

Comments:	Submitted to ACC 2024. Revised Version, fixed typo in algorithm (DQN instead of DDQN)
Subjects:	Multiagent Systems (cs.MA); Computer Science and Game Theory (cs.GT); Robotics (cs.RO)
Cite as:	arXiv:2310.01682 [cs.MA]
	(or arXiv:2310.01682v2 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2310.01682

Submission history

From: Mukesh Ghimire [view email]
[v1] Mon, 2 Oct 2023 22:35:10 UTC (7,787 KB)
[v2] Fri, 3 Nov 2023 00:36:22 UTC (7,787 KB)

Computer Science > Multiagent Systems

Title:Solving Two-Player General-Sum Games Between Swarms

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:Solving Two-Player General-Sum Games Between Swarms

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators