nips nips2010 nips2010-23 nips2010-23-reference knowledge-graph by maker-knowledge-mining

23 nips-2010-Active Instance Sampling via Matrix Partition


Source: pdf

Author: Yuhong Guo

Abstract: Recently, batch-mode active learning has attracted a lot of attention. In this paper, we propose a novel batch-mode active learning approach that selects a batch of queries in each iteration by maximizing a natural mutual information criterion between the labeled and unlabeled instances. By employing a Gaussian process framework, this mutual information based instance selection problem can be formulated as a matrix partition problem. Although matrix partition is an NP-hard combinatorial optimization problem, we show that a good local solution can be obtained by exploiting an effective local optimization technique on a relaxed continuous optimization problem. The proposed active learning approach is independent of employed classification models. Our empirical studies show this approach can achieve comparable or superior performance to discriminative batch-mode active learning methods. 1


reference text

[1] S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.

[2] K. Brinker. Incorporating diversity in active learning with support vector machines. In Proceedings of International Conference on Machine learning, 2003.

[3] T. Cover and J. Thomas. Elements of Information Theory. John Wiley & sons, 1991.

[4] C. Guestrin, A. Krause, and A. Singh. Near-optimal sensor placements in Gaussian processes. In Proceedings of International Conference on Machine Learning, 2005.

[5] Y. Guo and R. Greiner. Optimistic active learning using mutual information. In Proceedings of International Joint Conference on Artificial Intelligence, 2007.

[6] Y. Guo and D. Schuurmans. Discriminative batch mode active learning. In Proceedings of Neural Information Processing Systems, 2007.

[7] S. Hoi, R. Jin, and M. Lyu. Large-scale text categorization by batch mode active learning. In Proceedings of the International World Wide Web Conference, 2006.

[8] S. Hoi, R. Jin, J. Zhu, and M. Lyu. Batch mode active learning and its application to medical image classification. In Proceedings of International Conference on Machine Learning, 2006.

[9] S. Hoi, R. Jin, J. Zhu, and M. Lyu. Semi-supervised SVM batch mode active learning for image retrieval. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2008.

[10] A. Krause, C. Guestrin, A. Gupta, and J. Kleinberg. Near-optimal sensor placements: Maximizing information while minimizing communication cost. In International Symposium on Information Processing in Sensor Networks, 2006.

[11] C. Rasmussen and C. Williams. Gaussian Processes for Machine Learning. MIT Press, 2006.

[12] G. Schohn and D. Cohn. Less is more: Active learning with support vector machines. In Proceedings of International Conference on Machine Learning, 2000.

[13] B. Settles. Active learning literature survey. Computer Sciences Technical Report 1648, University of Wisconsin–Madison, 2009.

[14] Z. Xu, K. Yu, V. Tresp, X. Xu, and J. Wang. Representative sampling for text classification using support vector machines. In European Conference on Information Retrieval, 2003.

[15] K. Yu and J. Bi. Active learning via transductive experimental design. In In Proceedings of the International Conference on Machine Learning, 2006. 9