nips nips2012 nips2012-339 nips2012-339-reference knowledge-graph by maker-knowledge-mining

339 nips-2012-The Time-Marginalized Coalescent Prior for Hierarchical Clustering


Source: pdf

Author: Levi Boyles, Max Welling

Abstract: We introduce a new prior for use in Nonparametric Bayesian Hierarchical Clustering. The prior is constructed by marginalizing out the time information of Kingman’s coalescent, providing a prior over tree structures which we call the Time-Marginalized Coalescent (TMC). This allows for models which factorize the tree structure and times, providing two benefits: more flexible priors may be constructed and more efficient Gibbs type inference can be used. We demonstrate this on an example model for density estimation and show the TMC achieves competitive experimental results. 1


reference text

[1] R.P. Adams, Z. Ghahramani, and M.I. Jordan. Tree-structured stick breaking for hierarchical data. Advances in Neural Information Processing Systems, 23:19–27, 2010.

[2] D Aldous. Probability distributions on cladograms. IMA Volumes in Mathematics and its . . . , 1995.

[3] David Blei, Thomas L. Griffiths, Michael I. Jordan, and Joshua B. Tenenbaum. Hierarchical topic models and the nested chinese restaurant process. In Sebastian Thrun, Lawrence Saul, and Bernhard Sch¨ lkopf, editors, Advances in Neural Information Processing Systems, o volume 16. MIT Press, Cambridge, MA, 2004.

[4] A. Drummond and A. Rambaut. Beast: Bayesian evolutionary analysis by sampling trees. BMC evolutionary biology, 7(1):214, 2007.

[5] T.S. Ferguson. Bayesian density estimation by mixtures of normal distributions. Recent advances in statistics, pages 287–303, 1983. ˘

[6] D. G¨ r¨ r, L. Boyles, and M. Welling. Scalable inference on kingman2019s coalescent using ou pair similarity. In Proceedings of AISTATS, 2012.

[7] D. G¨ r¨ r and Y. W. Teh. An efficient sequential Monte Carlo algorithm for coalescent clusou tering. In D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors, Advances in Neural Information Processing Systems 21, pages 521–528, 2009.

[8] Katherine Heller and Zoubin Ghahramani. Bayesian hierarchical clustering. In Proceedings of ICML, volume 22, 2005.

[9] J. F. C. Kingman. The coalescent. Stochastic Processes and their Applications, 13:235–248, 1982.

[10] J. F. C. Kingman. On the genealogy of large populations. Journal of Applied Probability, 19:27–43, 1982.

[11] D.A. Knowles and Z. Ghahramani. Pitman-yor diffusion trees. Arxiv preprint arXiv:1106.2494, 2011.

[12] D.A. Knowles, J. Van Gael, and Z. Ghahramani. Message passing algorithms for dirichlet diffusion trees. In Proceedings of the 28th Annual International Conference on Machine Learning, 2011.

[13] A.Y. Lo. On a class of bayesian nonparametric estimates: I. density estimates. The Annals of Statistics, 12(1):351–357, 1984.

[14] Peter McCullagh, Jim Pitman, and Matthias Winkel. Gibbs fragmentation trees. Bernoulli, 14(4):988–1002, November 2008.

[15] R. Neal. Software for flexible bayesian modeling and markov chain sampling. see http://www. cs. toronto. edu/ radford/fbm. software. html, 2003.

[16] R.M. Neal. Density modeling and clustering using dirichlet diffusion trees. Bayesian Statistics, 7:619–629, 2003.

[17] A. Rodriguez, D.B. Dunson, and A.E. Gelfand. The nested dirichlet process. Journal of the American Statistical Association, 103(483):1131–1154, 2008.

[18] R. Salakhutdinov, J. Tenenbaum, and A. Torralba. Learning to learn with compound hd models. In Advances in Neural Information Processing Systems 21, 2012.

[19] J. Steinhardt and Z. Ghahramani. Flexible martingale priors for deep hierarchies. In International Conference on Artificial Intelligence and Statistics (AISTATS), volume 43, pages 61–62, 2012.

[20] Y. W. Teh, H. Daum´ III, and D. M. Roy. Bayesian agglomerative clustering with coalescents. e In Advances in Neural Information Processing Systems, volume 20, 2008.

[21] P. Welinder, S. Branson, T. Mita, C. Wah, F. Schroff, S. Belongie, and P. Perona. CaltechUCSD Birds 200. Technical Report CNS-TR-2010-001, California Institute of Technology, 2010.

[22] E.J. Yeoh, M.E. Ross, S.A. Shurtleff, W.K. Williams, D. Patel, R. Mahfouz, F.G. Behm, S.C. Raimondi, M.V. Relling, A. Patel, et al. Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. Cancer cell, 1(2):133–143, 2002. 9