nips nips2010 nips2010-228 nips2010-228-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: James Petterson, Tibério S. Caetano
Abstract: Multi-label classification is the task of predicting potentially multiple labels for a given instance. This is common in several applications such as image annotation, document classification and gene function prediction. In this paper we present a formulation for this problem based on reverse prediction: we predict sets of instances given the labels. By viewing the problem from this perspective, the most popular quality measures for assessing the performance of multi-label classification admit relaxations that can be efficiently optimised. We optimise these relaxations with standard algorithms and compare our results with several state-of-the-art methods, showing excellent performance.
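The abstract refers to "the most popular quality measures" for multi-label classification without naming them; commonly these include Hamming loss and the example-based F-score. A minimal sketch of both, assuming each instance's labels are given as a binary indicator vector (these function names and conventions are illustrative, not from the paper):

```python
def hamming_loss(y_true, y_pred):
    """Fraction of label positions predicted incorrectly, averaged over instances."""
    per_instance = [
        sum(t != p for t, p in zip(yt, yp)) / len(yt)
        for yt, yp in zip(y_true, y_pred)
    ]
    return sum(per_instance) / len(per_instance)

def example_f1(y_true, y_pred):
    """Example-based F1: harmonic mean of per-instance precision and recall,
    averaged over instances."""
    scores = []
    for yt, yp in zip(y_true, y_pred):
        tp = sum(1 for t, p in zip(yt, yp) if t and p)  # true positives
        pred_pos, true_pos = sum(yp), sum(yt)
        if pred_pos == 0 and true_pos == 0:
            scores.append(1.0)  # convention: empty prediction matches empty truth
            continue
        prec = tp / pred_pos if pred_pos else 0.0
        rec = tp / true_pos if true_pos else 0.0
        scores.append(2 * prec * rec / (prec + rec) if (prec + rec) else 0.0)
    return sum(scores) / len(scores)
```

These measures are non-decomposable over individual labels, which is why (as the abstract notes) direct optimisation requires relaxations rather than per-label training.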
[1] Krzysztof Dembczynski, Weiwei Cheng, and Eyke Hüllermeier. Bayes Optimal Multilabel Classification via Probabilistic Classifier Chains. In Proc. Intl. Conf. Machine Learning, 2010.
[2] Xinhua Zhang, T. Graepel, and Ralf Herbrich. Bayesian Online Learning for Multi-label and Multi-variate Performance Measures. In Proc. Intl. Conf. on Artificial Intelligence and Statistics, volume 9, pages 956–963, 2010.
[3] Piyush Rai and Hal Daumé III. Multi-Label Prediction via Sparse Infinite CCA. In Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I. Williams, and A. Culotta, editors, Advances in Neural Information Processing Systems 22, pages 1518–1526. 2009.
[4] Jesse Read, Bernhard Pfahringer, Geoffrey Holmes, and Eibe Frank. Classifier chains for multi-label classification. In Wray L. Buntine, Marko Grobelnik, Dunja Mladenic, and John Shawe-Taylor, editors, ECML/PKDD (2), volume 5782 of Lecture Notes in Computer Science, pages 254–269. Springer, 2009.
[5] André Elisseeff and Jason Weston. A kernel method for multi-labelled classification. In Annual ACM Conference on Research and Development in Information Retrieval, pages 274–281, 2005.
[6] Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek, and Cordelia Schmid. TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Auto-Annotation. In Proc. Intl. Conf. Computer Vision, 2009.
[7] Douglas W. Oard and Jason R. Baron. Overview of the TREC 2008 Legal Track.
[8] Linli Xu, Martha White, and Dale Schuurmans. Optimal reverse prediction. Proc. Intl. Conf. Machine Learning, pages 1–8, 2009.
[9] Grigorios Tsoumakas, Ioannis Katakis, and Ioannis P. Vlahavas. Mining Multi-label Data. Springer, 2009.
[10] Grigorios Tsoumakas and Ioannis P. Vlahavas. Random k-labelsets: An ensemble method for multilabel classification. In Proceedings of the 18th European Conference on Machine Learning (ECML 2007), pages 406–417, Warsaw, Poland, 2007.
[11] Jesse Read, Bernhard Pfahringer, and Geoff Holmes. Multi-label classification using ensembles of pruned sets. In ICDM ’08: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, pages 995–1000, Washington, DC, USA, 2008. IEEE Computer Society.
[12] Shantanu Godbole and Sunita Sarawagi. Discriminative methods for multi-labeled classification. In Proceedings of the 8th Pacific-Asia Conference on Knowledge Discovery and Data Mining, pages 22–30. Springer, 2004.
[13] Martin Jansche. Maximum expected F-measure training of logistic regression models. HLT, pages 692–699, 2005.
[14] T. Joachims. A support vector method for multivariate performance measures. In Proc. Intl. Conf. Machine Learning, pages 377–384, San Francisco, California, 2005. Morgan Kaufmann Publishers.
[15] V. Vapnik. Statistical Learning Theory. John Wiley and Sons, New York, 1998.
[16] I. Tsochantaridis, T. Joachims, T. Hofmann, and Y. Altun. Large margin methods for structured and interdependent output variables. J. Mach. Learn. Res., 6:1453–1484, 2005.
[17] D. E. Knuth. The Art of Computer Programming: Fundamental Algorithms, volume 1. Addison-Wesley, Reading, Massachusetts, second edition, 1998.
[18] Choon Hui Teo, S. V. N. Vishwanathan, Alex J. Smola, and Quoc V. Le. Bundle methods for regularized risk minimization. Journal of Machine Learning Research, 11:311–365, 2010.
[19] Robert E. Schapire and Y. Singer. Improved boosting algorithms using confidence-rated predictions. Machine Learning, 37(3):297–336, 1999.
[20] Min-Ling Zhang and Zhi-Hua Zhou. ML-KNN: A lazy learning approach to multi-label learning. Pattern Recognition, 40(7):2038–2048, July 2007.