nips nips2012 nips2012-252 nips2012-252-reference knowledge-graph by maker-knowledge-mining

252 nips-2012-On Multilabel Classification and Ranking with Partial Feedback

Source: pdf

Author: Claudio Gentile, Francesco Orabona

Abstract: We present a novel multilabel/ranking algorithm working in partial information settings. The algorithm is based on 2nd-order descent methods, and relies on upper-conﬁdence bounds to trade-off exploration and exploitation. We analyze this algorithm in a partial adversarial setting, where covariates can be adversarial, but multilabel probabilities are ruled by (generalized) linear models. We show O(T 1/2 log T ) regret bounds, which improve in several ways on the existing results. We test the effectiveness of our upper-conﬁdence scheme by contrasting against full-information baselines on real-world multilabel datasets, often obtaining comparable performance. 1

reference text

[1] Y. Abbasi-Yadkori, D. Pal, and C. Szepesv´ ri. Improved algorithms for linear stochastic bana dits. In 25th NIPS, 2011.

[2] K. Amin, M. Kearns, and U. Syed. Graphical models for bandit problems. In UAI, 2011.

[3] P. Auer. Using conﬁdence bounds for exploitation-exploration trade-offs. JMLR, 3, 2003.

[4] K. Crammer and C. Gentile. Multiclass classiﬁcation with bandit feedback using adaptive regularization. In 28th ICML, 2011.

[5] V. Dani, T. Hayes, and S. Kakade. Stochastic linear optimization under bandit feedback. In 21th Colt, 2008.

[6] K. Dembczynski, W. Waegeman, W. Cheng, and E. Hullermeier. On label dependence and loss minimization in multi-label classiﬁcation. Machine Learning, to appear.

[7] S. Filippi, O. Capp´ , A. Garivier, and C. Szepesv´ ri. Parametric bandits: The generalized e a linear case. In NIPS, pages 586–594, 2010.

[8] Y. Freund, R. D. Iyer, R. E. Schapire, and Y. Singer. An efﬁcient boosting algorithm for combining preferences. JMLR, 4:933–969, 2003.

[9] J. Furnkranz, E. Hullermeier, E. Loza Menca, and K. Brinker. Multilabel classiﬁcation via calibrated label ranking. Machine Learning, 73:133–153, 2008.

[10] E. Hazan and S. Kale. Online submodular minimization. In NIPS 22, 2009.

[11] E. Hazan and S. Kale. Newtron: an efﬁcient bandit algorithm for online multiclass prediction. In NIPS, 2011.

[12] R. Herbrich, T. Graepel, and K. Obermayer. Large margin rank boundaries for ordinal regression. In Advances in Large Margin Classiﬁers. MIT Press, 2000.

[13] S. Kakade, S. Shalev-Shwartz, and A. Tewari. Efﬁcient bandit algorithms for online multiclass prediction. In 25th ICML, 2008.

[14] S. Kale, L. Reyzin, and R. Schapire. Non-stochastic bandit slate problems. In 24th NIPS, 2010.

[15] A. Krause and C. S. Ong. Contextual gaussian process bandit optimization. In 25th NIPS, 2011.

[16] T. H. Lai and H. Robbins. Asymptotically efﬁcient adaptive allocation rules. Adv. Appl. Math, 6, 1985.

[17] P. McCullagh and J.A. Nelder. Generalized linear models. Chapman and Hall, 1989.

[18] F. Pachet and P. Roy. Improving multilabel analysis of music titles: A large-scale validation of the correction approach. IEEE Trans. on Audio, Speech, and Lang. Proc., 17(2):335–343, February 2009.

[19] P. Shivaswamy and T. Joachims. Online structured prediction via coactive learning. In 29th ICML, 2012, to appear.

[20] A. Slivkins, F. Radlinski, and S. Gollapudi. Learning optimally diverse rankings over large document collections. In 27th ICML, 2010.

[21] C. G. M. Snoek, M. Worring, J.C. van Gemert, J.-M. Geusebroek, and A. W. M. Smeulders. The challenge problem for automated detection of 101 semantic concepts in multimedia. In Proc. of the 14th ACM international conference on Multimedia, MULTIMEDIA ’06, pages 421–430, New York, NY, USA, 2006.

[22] M. Streeter, D. Golovin, and A. Krause. Online learning of assignments. In 23rd NIPS, 2009.

[23] G. Tsoumakas, I. Katakis, and I. Vlahavas. Random k-labelsets for multilabel classiﬁcation. IEEE Transactions on Knowledge and Data Engineering, 23:1079–1089, 2011. 9