On Gradient-Based Learning in Continuous Games

Mazumdar, Eric; Ratliff, Lillian J.; Sastry, S. Shankar

doi:10.1137/18M1231298

Computer Science > Machine Learning

arXiv:1804.05464 (cs)

[Submitted on 16 Apr 2018 (v1), last revised 20 Feb 2020 (this version, v3)]

Title:On Gradient-Based Learning in Continuous Games

Authors:Eric Mazumdar, Lillian J. Ratliff, S. Shankar Sastry

View PDF

Abstract:We formulate a general framework for competitive gradient-based learning that encompasses a wide breadth of multi-agent learning algorithms, and analyze the limiting behavior of competitive gradient-based learning algorithms using dynamical systems theory. For both general-sum and potential games, we characterize a non-negligible subset of the local Nash equilibria that will be avoided if each agent employs a gradient-based learning algorithm. We also shed light on the issue of convergence to non-Nash strategies in general- and zero-sum games, which may have no relevance to the underlying game, and arise solely due to the choice of algorithm. The existence and frequency of such strategies may explain some of the difficulties encountered when using gradient descent in zero-sum games as, e.g., in the training of generative adversarial networks. To reinforce the theoretical contributions, we provide empirical results that highlight the frequency of linear quadratic dynamic games (a benchmark for multi-agent reinforcement learning) that admit global Nash equilibria that are almost surely avoided by policy gradient.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1804.05464 [cs.LG]
	(or arXiv:1804.05464v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1804.05464
Journal reference:	SIAM Journal on Mathematics of Data Science 2020 2:1, 103-131
Related DOI:	https://doi.org/10.1137/18M1231298

Submission history

From: Eric Mazumdar [view email]
[v1] Mon, 16 Apr 2018 01:14:17 UTC (2,655 KB)
[v2] Thu, 27 Sep 2018 03:54:44 UTC (3,119 KB)
[v3] Thu, 20 Feb 2020 18:26:35 UTC (634 KB)

Computer Science > Machine Learning

Title:On Gradient-Based Learning in Continuous Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On Gradient-Based Learning in Continuous Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators