Cited By
View all- Liu X(2025)Scalable and Robust Online Learning for AI-powered Networked SystemsACM SIGMETRICS Performance Evaluation Review10.1145/3712170.371218352:3(39-42)Online publication date: 9-Jan-2025
We study federated contextual linear bandits, where M agents cooperate with each other to solve a global contextual linear bandit problem with the help of a central server. We consider the asynchronous setting, where all agents work independently and the ...
We study the federated pure exploration problem of multi-armed bandits and linear bandits, where M agents cooperatively identify the best arm via communicating with the central server. To enhance the robustness against latency and unavailability of ...
Motivated by the important and urgent need for efficient optimization in online recommender systems, we revisit the cascading bandit model proposed by Kveton et al. (2015a). While Thompson sampling (TS) algorithms have been shown to be empirically ...
AAAI Press