Robustly Learning General Mixtures of Gaussians
Abstract
1 Introduction
Is there a statistically efficient method for learning the parameters of a mixture of Gaussians from samples?
Is there an efficient algorithm for learning the parameters?
Is there a provably robust and computationally efficient algorithm for learning mixtures of Gaussians? Can we robustify the existing learning results?
1.1 Key Challenges
1.2 Our Techniques and Main Result
1.2.1 Discussion of Assumptions and Later Improvements.
1.3 Proof Overview
1.4 Concurrent and Subsequent Work
2 Preliminaries
2.1 Problem Setup
2.2 Sum of Squares Proofs
3 Fun with Generating Functions
3.1 Polynomial Factorizations
4 Components Are Not Far Apart
4.1 Reducing to All Pairs of Parameters Equal or Separated
4.2 SOS Program Setup
4.3 Analysis
4.3.1 Algebraic Identities.
After taking derivatives and polynomial combinations of either of the above formal power series, the coefficients can still be expressed as polynomial combinations of their respective Hermite polynomials.
4.3.2 Warm-up: All Pairs of Parameters are Separated.
4.3.3 Finishing Up: Finding the Covariances and then the Means.
4.3.4 All Pairs of Parameters are Equal or Separated.
5 Robust Moment Estimation
5.1 Distance between Gaussians
5.2 Hermite Polynomial Estimation
6 Rough Clustering
6.1 SOS Program
6.2 Clustering Algorithm
6.3 Improved Clustering Result from [4]
7 Putting Everything Together
7.1 Distance Between Gaussians
7.2 Full Algorithm
7.3 Analysis of Full Algorithm
8 Identifiability
8.1 Improving the Separation Assumption
Footnotes
References
Index Terms
- Robustly Learning General Mixtures of Gaussians
Recommendations
Learning mixtures of spherical gaussians: moment methods and spectral decompositions
ITCS '13: Proceedings of the 4th conference on Innovations in Theoretical Computer ScienceThis work provides a computationally efficient and statistically consistent moment-based estimator for mixtures of spherical Gaussians. Under the condition that component means are in general position, a simple spectral decomposition technique yields ...
Robustly learning mixtures of k arbitrary Gaussians
STOC 2022: Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of ComputingWe give a polynomial-time algorithm for the problem of robustly estimating a mixture of k arbitrary Gaussians in ℝd, for any fixed k, in the presence of a constant fraction of arbitrary corruptions. This resolves the main open problem in several ...
Settling the robust learnability of mixtures of Gaussians
STOC 2021: Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of ComputingThis work represents a natural coalescence of two important lines of work – learning mixtures of Gaussians and algorithmic robust statistics. In particular we give the first provably robust algorithm for learning mixtures of any constant number of ...
Comments
Information & Contributors
Information
Published In
- Editor:
- Venkatesan Guruswami
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Check for updates
Author Tags
Qualifiers
- Research-article
Funding Sources
- Microsoft Trustworthy AI Grant
- David and Lucile Packard Fellowship and an ONR Young Investigator Award
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 1,701Total Downloads
- Downloads (Last 12 months)788
- Downloads (Last 6 weeks)81
Other Metrics
Citations
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in