nips nips2011 nips2011-54 nips2011-54-reference knowledge-graph by maker-knowledge-mining
Author: Abhishek Kumar, Piyush Rai, Hal Daumé
Abstract: In many clustering problems, we have access to multiple views of the data, each of which could be used individually for clustering. By exploiting information from multiple views, one can hope to find a clustering that is more accurate than those obtained from the individual views. Often these different views admit the same underlying clustering of the data, so we can approach this problem by looking for clusterings that are consistent across the views, i.e., corresponding data points in each view should have the same cluster membership. We propose a spectral clustering framework that achieves this goal by co-regularizing the clustering hypotheses, and propose two co-regularization schemes to accomplish this. Experimental comparisons with a number of baselines on two synthetic and three real-world datasets establish the efficacy of our proposed approaches.
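The abstract does not spell out how agreement between views is measured, so the following is only a minimal sketch of one plausible choice. It assumes each view yields an n x k spectral embedding with orthonormal columns and scores agreement as tr(U U^T V V^T) = ||U^T V||_F^2. The function name `subspace_agreement` and the toy matrices are hypothetical and are not taken from the paper.

```python
import math


def subspace_agreement(U, V):
    """Return ||U^T V||_F^2 = tr(U U^T V V^T) for two n x k embeddings.

    U and V are lists of n rows, each a list of k floats, and are assumed
    to have orthonormal columns. Under that assumption the score lies in
    [0, k], and k means both views span the same k-dimensional subspace.
    """
    n, k = len(U), len(U[0])
    if len(V) != n or len(V[0]) != k:
        raise ValueError("embeddings must have the same shape")
    total = 0.0
    for a in range(k):
        for b in range(k):
            # Entry (a, b) of the k x k matrix U^T V.
            dot = sum(U[i][a] * V[i][b] for i in range(n))
            total += dot * dot
    return total


# Toy data (assumed for illustration): three points, k = 2.
c = s = 1.0 / math.sqrt(2.0)
U = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
V_same = [[c, -s], [s, c], [0.0, 0.0]]       # U rotated within its own plane
V_diff = [[1.0, 0.0], [0.0, 0.0], [0.0, 1.0]]  # shares only one direction with U

print(subspace_agreement(U, V_same))
print(subspace_agreement(U, V_diff))
```

The score is invariant to rotations of the embedding within its subspace, which is useful here because spectral embeddings are only determined up to such rotations.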
[1] A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In Conference on Learning Theory, 1998.
[2] Kamalika Chaudhuri, Sham M. Kakade, Karen Livescu, and Karthik Sridharan. Multi-view Clustering via Canonical Correlation Analysis. In International Conference on Machine Learning, 2009.
[3] Ulrike von Luxburg. A Tutorial on Spectral Clustering. Statistics and Computing, 2007.
[4] J. Shi and J. Malik. Normalized cuts and Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22:888–905, 2000.
[5] A. Ng, M. Jordan, and Y. Weiss. On spectral clustering: analysis and an algorithm. In Advances in Neural Information Processing Systems, 2002.
[6] Vikas Sindhwani, Partha Niyogi, and Mikhail Belkin. A Co-regularization approach to semi-supervised learning with multiple views. In Proceedings of the Workshop on Learning with Multiple Views, International Conference on Machine Learning, 2005.
[7] Alexander Strehl and Joydeep Ghosh. Cluster Ensembles - A Knowledge Reuse Framework for Combining Multiple Partitions. Journal of Machine Learning Research, pages 583–617, 2002.
[8] Donglin Niu, Jennifer G. Dy, and Michael I. Jordan. Multiple non-redundant spectral clustering views. In International Conference on Machine Learning, 2010.
[9] Corinna Cortes, Mehryar Mohri, and Afshin Rostamizadeh. Learning non-linear combination of kernels. In Advances in Neural Information Processing Systems, 2009.
[10] Matthew B. Blaschko and Christoph H. Lampert. Correlational Spectral Clustering. In Computer Vision and Pattern Recognition, 2008.
[11] Virginia R. de Sa. Spectral Clustering with two views. In Proceedings of the Workshop on Learning with Multiple Views, International Conference on Machine Learning, 2005.
[12] Xing Yi, Yunpeng Xu, and Changshui Zhang. Multi-view EM algorithm for finite mixture models. In ICAPR, Lecture Notes in Computer Science, Springer-Verlag, 2005.
[13] Massih-Reza Amini, Nicolas Usunier, and Cyril Goutte. Learning from multiple partially observed views - an application to multilingual text categorization. In Advances in Neural Information Processing Systems, 2009.
[14] D. D. Lewis, Y. Yang, T. Rose, and F. Li. RCV1: A new benchmark collection for text categorization research. Journal of Machine Learning Research, 5:361–397, 2004.
[15] Reuters. Corpus, volume 2, multilingual corpus, 1996-08-20 to 1997-08-19, 2005.
[16] Thomas Hofmann. Probabilistic latent semantic analysis. In Uncertainty in Artificial Intelligence, pages 289–296, 1999.
[17] David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, pages 993–1022, 2003.
[18] The UCSD Multiple Kernel Learning Repository. http://mkl.ucsd.edu.
[19] Steffen Bickel and Tobias Scheffer. Multi-View Clustering. In IEEE International Conference on Data Mining, 2004.
[20] Dengyong Zhou and Christopher J. C. Burges. Spectral Clustering and Transductive Learning with Multiple Views. In International Conference on Machine Learning, 2007.
[21] Abhishek Kumar and Hal Daumé. A Co-training Approach for Multiview Spectral Clustering. In International Conference on Machine Learning, 2011.
[22] Wei Tang, Zhengdong Lu, and Inderjit S. Dhillon. Clustering with Multiple Graphs. In IEEE International Conference on Data Mining, 2009.
[23] Y. Bengio, P. Vincent, and J.F. Paiement. Spectral clustering and kernel PCA are learning eigenfunctions. Technical Report 2003s-19, CIRANO, 2003.
[24] Ulrike von Luxburg, Mikhail Belkin, and Olivier Bousquet. Consistency of Spectral Clustering. Annals of Statistics, 36(2):555–586, 2008.