
Truthful Bandit Mechanisms for Repeated Two-stage Ad Auctions

Authors:

Jian Xu,

Fan Wu

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Pages 1565 - 1575

https://doi.org/10.1145/3637528.3671813

Published: 24 August 2024

Abstract

Online advertising platforms leverage a two-stage auction architecture to deliver personalized ads to users with low latency. The first stage efficiently selects a small subset of promising candidates out of the complete pool of ads. In the second stage, an auction is conducted within the subset to determine the winning ad for display, using click-through-rate predictions from the second-stage machine learning model. In this work, we investigate the online learning process of the first-stage subset selection policy, while ensuring game-theoretic properties in repeated two-stage ad auctions. Specifically, we model the problem as designing a combinatorial bandit mechanism with a general reward function, subject to the additional requirements of truthfulness and individual rationality (IR). We establish an Ω(T) regret lower bound for truthful bandit mechanisms, which demonstrates the challenge of simultaneously achieving allocation efficiency and truthfulness. To circumvent this impossibility result, we introduce truthful α-approximation oracles and evaluate the bandit mechanism through α-approximation regret. Two mechanisms are proposed, both of which are ex-post truthful and ex-post IR. The first is an explore-then-commit mechanism with regret O(T^{2/3}). The second achieves an improved O(log T / Δ_Φ²) regret, where Δ_Φ is a distribution-dependent gap, but requires additional assumptions on the oracles and information about the strategic bidders.
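The explore-then-commit idea named in the abstract can be illustrated with a minimal sketch of the generic bandit technique. This is not the paper's mechanism: it omits bids, payments, subset selection, and the α-approximation oracle, and it assumes Bernoulli rewards. The function name `explore_then_commit`, the even split of the exploration budget, and the parameters `arm_means`, `horizon`, and `seed` are all hypothetical choices made for illustration.

```python
import math
import random


def explore_then_commit(arm_means, horizon, seed=0):
    """Generic explore-then-commit on Bernoulli arms (illustrative only).

    Assumptions (not from the paper): rewards are Bernoulli with the given
    means, and the exploration budget of order T^(2/3) is split evenly.
    Returns (committed_arm, explore_rounds, realized_regret).
    """
    num_arms = len(arm_means)
    assert horizon >= num_arms, "need at least one round per arm"
    rng = random.Random(seed)

    # Exploration budget of order T^(2/3), at least one pull per arm,
    # and never more than the horizon allows.
    pulls_per_arm = max(1, math.ceil(horizon ** (2 / 3) / num_arms))
    pulls_per_arm = min(pulls_per_arm, horizon // num_arms)
    explore_rounds = pulls_per_arm * num_arms

    successes = [0] * num_arms
    total_reward = 0

    # Exploration phase: round-robin pulls on a schedule fixed in advance,
    # so the arm chosen in each round does not depend on observed rewards.
    for t in range(explore_rounds):
        arm = t % num_arms
        reward = 1 if rng.random() < arm_means[arm] else 0
        successes[arm] += reward
        total_reward += reward

    # Commit phase: play the empirically best arm for all remaining rounds.
    committed = max(range(num_arms), key=lambda a: successes[a])
    for _ in range(horizon - explore_rounds):
        total_reward += 1 if rng.random() < arm_means[committed] else 0

    # Realized regret against always playing the best arm in expectation.
    regret = max(arm_means) * horizon - total_reward
    return committed, explore_rounds, regret
```

For example, `explore_then_commit([0.1, 0.9], 10_000)` spends a fixed number of rounds exploring and then commits to one arm. With an exploration phase of order T^{2/3}, regret of order T^{2/3} comes from balancing the cost of exploring against the cost of committing to a wrong arm.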

Supplemental Material

MP4 File - rtfp0782-video

Promotional Video for paper "Truthful Bandit Mechanisms for Repeated Two-stage Ad Auctions".


