Optimal Adaptive Learning in Uncontrolled Restless Bandit Problems

Tekin, Cem; Liu, Mingyan

Mathematics > Optimization and Control

arXiv:1107.4042 (math)

[Submitted on 20 Jul 2011 (v1), last revised 29 Jan 2015 (this version, v3)]

Title:Optimal Adaptive Learning in Uncontrolled Restless Bandit Problems

Authors:Cem Tekin, Mingyan Liu

View PDF

Abstract:In this paper we consider the problem of learning the optimal policy for uncontrolled restless bandit problems. In an uncontrolled restless bandit problem, there is a finite set of arms, each of which when pulled yields a positive reward. There is a player who sequentially selects one of the arms at each time step. The goal of the player is to maximize its undiscounted reward over a time horizon T. The reward process of each arm is a finite state Markov chain, whose transition probabilities are unknown by the player. State transitions of each arm is independent of the selection of the player. We propose a learning algorithm with logarithmic regret uniformly over time with respect to the optimal finite horizon policy. Our results extend the optimal adaptive learning of MDPs to POMDPs.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG)
Cite as:	arXiv:1107.4042 [math.OC]
	(or arXiv:1107.4042v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1107.4042

Submission history

From: Cem Tekin [view email]
[v1] Wed, 20 Jul 2011 17:33:43 UTC (19 KB)
[v2] Wed, 17 Oct 2012 05:06:22 UTC (181 KB)
[v3] Thu, 29 Jan 2015 10:15:00 UTC (212 KB)

Mathematics > Optimization and Control

Title:Optimal Adaptive Learning in Uncontrolled Restless Bandit Problems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Optimal Adaptive Learning in Uncontrolled Restless Bandit Problems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators