Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers

Steinhardt, Jacob; Charikar, Moses; Valiant, Gregory

Computer Science > Machine Learning

arXiv:1703.04940 (cs)

[Submitted on 15 Mar 2017 (v1), last revised 27 Nov 2017 (this version, v3)]

Title:Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers

Authors:Jacob Steinhardt, Moses Charikar, Gregory Valiant

View PDF

Abstract:We introduce a criterion, resilience, which allows properties of a dataset (such as its mean or best low rank approximation) to be robustly computed, even in the presence of a large fraction of arbitrary additional data. Resilience is a weaker condition than most other properties considered so far in the literature, and yet enables robust estimation in a broader variety of settings. We provide new information-theoretic results on robust distribution learning, robust estimation of stochastic block models, and robust mean estimation under bounded $k$th moments. We also provide new algorithmic results on robust distribution learning, as well as robust mean estimation in $\ell_p$-norms. Among our proof techniques is a method for pruning a high-dimensional distribution with bounded $1$st moments to a stable "core" with bounded $2$nd moments, which may be of independent interest.

Comments:	32 pages, full version of ITCS2018 paper (minor citation edit from v2)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:1703.04940 [cs.LG]
	(or arXiv:1703.04940v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1703.04940

Submission history

From: Jacob Steinhardt [view email]
[v1] Wed, 15 Mar 2017 05:43:48 UTC (47 KB)
[v2] Thu, 23 Nov 2017 07:22:21 UTC (82 KB)
[v3] Mon, 27 Nov 2017 03:16:54 UTC (82 KB)

Computer Science > Machine Learning

Title:Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators