Asynchronous Parallel Empirical Variance Guided Algorithms for the Thresholding Bandit Problem

Zhong, Jie; Huang, Yijun; Liu, Ji

arXiv:1704.04567 (stat)

[Submitted on 15 Apr 2017 (v1), last revised 9 Jul 2017 (this version, v2)]

Abstract: This paper considers the multi-armed thresholding bandit problem, recently proposed by Locatelli et al. [2016]: identify all arms whose expected rewards are above a predefined threshold using as few pulls (or rounds) as possible. Although the algorithm of Locatelli et al. [2016] achieves the optimal round complexity in a certain sense, some issues remain unsolved. This paper proposes an asynchronous parallel thresholding algorithm and a parameter-free version of it to improve efficiency and applicability. On the one hand, the two proposed algorithms use the empirical variance to guide the pull decision at each round, and they significantly improve on the round complexity of the "optimal" algorithm when all arms have bounded high-order moments. The proposed algorithms can be proven to be optimal. On the other hand, most bandit algorithms assume either that the reward is observed immediately after each pull or that the next decision is not made until all rewards have been observed. The proposed asynchronous parallel algorithms allow the next pull to be chosen while rewards from earlier pulls are still unobserved, which avoids this unrealistic assumption and significantly improves the identification process. The theoretical analysis justifies the effectiveness and efficiency of the proposed asynchronous parallel algorithms.
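
To make the setting concrete, the following is a minimal Python sketch of a thresholding bandit loop with delayed feedback. It is not the algorithm from the paper: the function name threshold_bandit_sketch, the Gaussian reward model, the fixed delay, the round-robin warm-up, and the index sqrt(n) * |mean - tau| / (std + 0.1) are all assumptions chosen for illustration. It only shows the two ideas the abstract names, namely letting the empirical variance influence which arm is pulled next, and choosing the next pull while earlier rewards are still unobserved.

```python
import math
import random
from collections import deque


def threshold_bandit_sketch(means, sigmas, tau, budget, delay=3, seed=0):
    """Illustrative heuristic (NOT the paper's algorithm).

    Returns the set of arms whose empirical mean is >= tau after `budget`
    pulls. Assumes budget >= 2 * len(means) so every arm is observed.
    """
    rng = random.Random(seed)
    k = len(means)
    count = [0] * k       # number of observed rewards per arm
    total = [0.0] * k     # sum of observed rewards per arm
    total_sq = [0.0] * k  # sum of squared observed rewards per arm
    pending = deque()     # (arm, reward) pairs pulled but not yet observed

    def observe(arm, reward):
        count[arm] += 1
        total[arm] += reward
        total_sq[arm] += reward * reward

    def index(arm):
        # Hypothetical index: low for arms that are under-sampled,
        # close to the threshold, or noisy (large empirical variance).
        n = count[arm]
        if n < 2:
            return -math.inf  # not enough observed data yet: pull again
        mean = total[arm] / n
        var = max(total_sq[arm] / n - mean * mean, 0.0)
        # The 0.1 regularizer is an assumption that keeps the index finite.
        return math.sqrt(n) * abs(mean - tau) / (math.sqrt(var) + 0.1)

    for t in range(budget):
        # A reward becomes visible only after `delay` further decisions.
        while len(pending) > delay:
            observe(*pending.popleft())
        # Round-robin warm-up, then pull the arm with the smallest index,
        # using only the rewards observed so far.
        arm = t % k if t < 2 * k else min(range(k), key=index)
        pending.append((arm, rng.gauss(means[arm], sigmas[arm])))

    while pending:  # collect the rewards that were still in flight
        observe(*pending.popleft())

    return {arm for arm in range(k) if total[arm] / count[arm] >= tau}
```

A call such as threshold_bandit_sketch([0.1, 0.35, 0.65, 0.9], [0.1, 0.3, 0.3, 0.1], tau=0.5, budget=2000) spends most of its budget on the two noisy arms nearest the threshold, which is the qualitative behavior a variance-guided rule is meant to produce.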

Comments:	added lower bound
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1704.04567 [stat.ML]
	(or arXiv:1704.04567v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1704.04567

Submission history

From: Jie Zhong
[v1] Sat, 15 Apr 2017 02:42:30 UTC (152 KB)
[v2] Sun, 9 Jul 2017 00:14:13 UTC (157 KB)
