skip to main content
10.5555/3042817.3042830guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Parallel Markov chain Monte Carlo for nonparametric mixture models

Published: 16 June 2013 Publication History

Abstract

Nonparametric mixture models based on the Dirichlet process are an elegant alternative to finite models when the number of underlying components is unknown, but inference in such models can be slow. Existing attempts to parallelize inference in such models have relied on introducing approximations, which can lead to inaccuracies in the posterior estimate. In this paper, we describe auxiliary variable representations for the Dirichlet process and the hierarchical Dirichlet process that allow us to perform MCMC using the correct equilibrium distribution, in a distributed manner. We show that our approach allows scalable inference without the deterioration in estimate quality that accompanies existing methods.

References

[1]
Ahmed, A., Ho, Q., Teo, C. H., Eisenstein, J., Smola, A. J., and Xing, E. P. Online inference for the infinite topic-cluster model: Storylines from streaming text. In AISTATS, 2011.
[2]
Aldous, D. J. Exchangeability and related topics. In École d'Été de probabilités de Saint-Flour XIII. 1985.
[3]
Antoniak, C. E. Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. Ann. Statist., 2(6):1152-1174, 1974.
[4]
Asuncion, A., Smyth, P., and Welling, M. Asynchronous distributed learning of topic models. In NIPS, 2008.
[5]
Blei, D. M. and Jordan, M. I. Variational methods for the Dirichlet process. In ICML, 2004.
[6]
Fearnhead, P. Particle filters for mixture models with an unknown number of components. Statistics and Computing, 14:11-21, 2004.
[7]
Ferguson, T. S. A Bayesian analysis of some nonparametric problems. Ann. Statist., 1(2):209-230, 1973.
[8]
Fox, E. B., Sudderth, E. B., Jordan, M. I., and Willsky, A. S. An HDP-HMM for systems with state persistence. In ICML, 2008.
[9]
Ghosh, J. K. and Ramamoorthi, R. V. Bayesian Nonparametrics . Springer, 2003.
[10]
Ishwaran, H. and James, L. F. Gibbs sampling methods for stick-breaking priors. JASA, 96(453):161- 173, 2001.
[11]
Kingman, J. F. C. Completely random measures. Pacific Journal of Mathematics, 21(1):59-78, 1967.
[12]
Kurihara, K., Welling, M., and Teh, Y.-W. Collapsed variational Dirichlet process mixture models. In IJCAI, 2007.
[13]
Lovell, D., Adams, R. P., and Mansingka, V. K. Parallel Markov chain Monte Carlo for Dirichlet process mixtures. In Workshop on Big Learning, NIPS, 2012.
[14]
Neal, R. M. Markov chain sampling methods for Dirichlet process mixture models. Technical Report 9815, Dept. of Statistics, University of Toronto, 1998.
[15]
Rodriguez, A. On-line learning for the infinite hidden Markov model. Communications in Statistics - Simulation and Computation, 40(6):879-893, 2011.
[16]
Sohn, K.-A. and Xing, E. P. A hierarchical Dirichlet process mixture model for haplotype reconstruction from multi-population data. Ann. Appl. Stat., 3(2): 791-821, 2009.
[17]
Sudderth, E. B., Torralba, A., Freeman, W. T., and Willsky, A. S. Describing visual scenes using transformed Dirichlet processes. In NIPS, 2005.
[18]
Teh, Y.-W., Jordan, M. I., Beal, M. J., and Blei, D. M. Hierarchical Dirichlet processes. Journal of the American Statistical Association, 101(476): 1566-1581, 2006.
[19]
Teh, Y.-W., Kurihara, K., and Welling, M. Collapsed variational inference for HDP. In NIPS, 2007.
[20]
Ulker, Y., Gunsel, B., and Cemgil, A. T. Sequential Monte Carlo samplers for Dirichlet process mixtures. In AISTATS, 2010.
[21]
Wang, C., Paisley, J., and Blei, D. M. Online variational inference for the hierarchical Dirichlet process. In AISTATS, 2011.
[22]
Xing, E. P., Ng, A. Y., Jordan, M. I., and Russell, S. Distance metric learning, with application to clustering with side-information. In NIPS, 2002.

Cited By

View all
  1. Parallel Markov chain Monte Carlo for nonparametric mixture models

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    ICML'13: Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28
    June 2013
    2534 pages

    Publisher

    JMLR.org

    Publication History

    Published: 16 June 2013

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media