Follower Agnostic Methods for Stackelberg Games

Maheshwari, Chinmay; Cheng, James; Sasty, S. Shankar; Ratliff, Lillian; Mazumdar, Eric

Mathematics > Optimization and Control

arXiv:2302.01421 (math)

[Submitted on 2 Feb 2023 (v1), last revised 27 Mar 2024 (this version, v3)]

Title:Follower Agnostic Methods for Stackelberg Games

Authors:Chinmay Maheshwari, James Cheng, S. Shankar Sasty, Lillian Ratliff, Eric Mazumdar

View PDF HTML (experimental)

Abstract:In this paper, we present an efficient algorithm to solve online Stackelberg games, featuring multiple followers, in a follower-agnostic manner. Unlike previous works, our approach works even when leader has no knowledge about the followers' utility functions or strategy space. Our algorithm introduces a unique gradient estimator, leveraging specially designed strategies to probe followers. In a departure from traditional assumptions of optimal play, we model followers' responses using a convergent adaptation rule, allowing for realistic and dynamic interactions. The leader constructs the gradient estimator solely based on observations of followers' actions. We provide both non-asymptotic convergence rates to stationary points of the leader's objective and demonstrate asymptotic convergence to a \emph{local Stackelberg equilibrium}. To validate the effectiveness of our algorithm, we use this algorithm to solve the problem of incentive design on a large-scale transportation network, showcasing its robustness even when the leader lacks access to followers' demand.

Comments:	31 pages
Subjects:	Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Dynamical Systems (math.DS)
MSC classes:	91A65
Cite as:	arXiv:2302.01421 [math.OC]
	(or arXiv:2302.01421v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2302.01421

Submission history

From: Chinmay Maheshwari [view email]
[v1] Thu, 2 Feb 2023 21:21:14 UTC (24 KB)
[v2] Wed, 24 May 2023 17:43:37 UTC (31 KB)
[v3] Wed, 27 Mar 2024 00:38:33 UTC (627 KB)

Mathematics > Optimization and Control

Title:Follower Agnostic Methods for Stackelberg Games

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Follower Agnostic Methods for Stackelberg Games

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators