Data Poisoning Attacks in Contextual Bandits

Ma, Yuzhe; Jun, Kwang-Sung; Li, Lihong; Zhu, Xiaojin

arXiv:1808.05760 (cs)

[Submitted on 17 Aug 2018 (v1), last revised 24 Aug 2018 (this version, v2)]

Abstract: We study offline data poisoning attacks in contextual bandits, a class of reinforcement learning problems with important applications in online recommendation and adaptive medical treatment, among others. We provide a general attack framework based on convex optimization and show that by slightly manipulating rewards in the data, an attacker can force the bandit algorithm to pull a target arm for a target contextual vector. The target arm and target contextual vector are both chosen by the attacker. That is, the attacker can hijack the behavior of a contextual bandit. We also investigate the feasibility and the side effects of such attacks, and identify future directions for defense. Experiments on both synthetic and real-world data demonstrate the efficiency of the attack algorithm.
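
To make the idea of reward manipulation concrete, the following is a minimal sketch and not the authors' formulation. It assumes a greedy learner that fits one ridge regression per arm and pulls the arm with the highest predicted reward, and it assumes the attacker perturbs only the target arm's historical rewards. Under those assumptions the smallest (in L2 norm) perturbation that makes the target arm win by a margin `eps` at the target context has a closed form, because the problem is a quadratic objective with a single linear constraint. The names `ridge_fit`, `poison_target_arm`, `lam`, and `eps` are hypothetical and introduced only for illustration.

```python
import numpy as np

def ridge_fit(X, y, lam=1.0):
    """Ridge regression estimate (X^T X + lam I)^{-1} X^T y."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def poison_target_arm(data, target_arm, x_star, eps=0.1, lam=1.0):
    """Minimum-norm reward perturbation for the target arm only.

    Assumption: the learner is greedy on per-arm ridge estimates.
    Returns delta such that, after replacing y with y + delta, the target
    arm's predicted reward at x_star exceeds every other arm's by >= eps.
    """
    thetas = {a: ridge_fit(X, y, lam) for a, (X, y) in data.items()}
    best_other = max(x_star @ th for a, th in thetas.items() if a != target_arm)
    gap = best_other + eps - x_star @ thetas[target_arm]
    X, y = data[target_arm]
    if gap <= 0:
        # The target arm already wins by the required margin.
        return np.zeros_like(y)
    d = X.shape[1]
    # The prediction at x_star is linear in the rewards: it shifts by c @ delta.
    c = X @ np.linalg.solve(X.T @ X + lam * np.eye(d), x_star)
    # Closed-form minimizer of ||delta||_2 subject to c @ delta >= gap
    # (assumes c is not the zero vector).
    return gap * c / (c @ c)

# Synthetic data: 3 arms, 50 logged (context, reward) pairs each.
rng = np.random.default_rng(0)
data = {a: (rng.normal(size=(50, 3)), rng.normal(size=50)) for a in range(3)}
x_star = np.array([1.0, 0.5, -0.5])

delta = poison_target_arm(data, target_arm=0, x_star=x_star)
X0, y0 = data[0]
poisoned = dict(data)
poisoned[0] = (X0, y0 + delta)
scores = {a: x_star @ ridge_fit(X, y) for a, (X, y) in poisoned.items()}
print("chosen arm after poisoning:", max(scores, key=scores.get))
```

A learner that adds exploration bonuses, or an attacker allowed to modify the rewards of every arm, would lead to a larger convex program with one constraint per non-target arm; the sketch above covers only the simplest case.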

Comments:	GameSec 2018
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:1808.05760 [cs.LG]
	(or arXiv:1808.05760v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1808.05760

Submission history

From: Yuzhe Ma
[v1] Fri, 17 Aug 2018 05:25:29 UTC (830 KB)
[v2] Fri, 24 Aug 2018 03:26:42 UTC (830 KB)
