
168 nips-2011-Maximum Margin Multi-Instance Learning

Source: pdf

Author: Hua Wang, Heng Huang, Farhad Kamangar, Feiping Nie, Chris H. Ding

Abstract: Multi-instance learning (MIL) treats the input as bags of instances, with labels assigned to the bags rather than to individual instances. MIL is useful in many real-world applications. For example, in image categorization the semantic meanings (labels) of an image mostly arise from its regions (instances) rather than from the entire image (bag). Existing MIL methods typically build their models on the Bag-to-Bag (B2B) distance, which is often computationally expensive and may not truly reflect semantic similarity. To tackle this, in this paper we approach MIL problems from a new perspective using the Class-to-Bag (C2B) distance, which directly assesses the relationships between the classes and the bags. Taking into account the two major challenges in MIL, high data heterogeneity and weak label association, we propose a novel Maximum Margin Multi-Instance Learning (M³I) approach that parameterizes the C2B distance by introducing class-specific distance metrics and locally adaptive significance coefficients. We apply our new approach to automatic image categorization tasks on three benchmark data sets (one single-label and two multi-label). Extensive experiments demonstrate promising results that validate the proposed method.
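The abstract names the Class-to-Bag (C2B) distance but does not give its formula, so the following is only a minimal sketch under stated assumptions. It assumes a nearest-neighbor form: every instance pooled from a class's training bags is matched to its closest instance in the query bag, and the squared Euclidean distances are summed. The names `sq_dist`, `c2b_distance`, and `classify` and the toy data are hypothetical, and the sketch omits the class-specific metrics and significance coefficients that the proposed method learns.

```python
from typing import Dict, List, Sequence

Instance = Sequence[float]
Bag = List[Instance]


def sq_dist(a: Instance, b: Instance) -> float:
    """Squared Euclidean distance between two instance feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def c2b_distance(class_instances: Bag, bag: Bag) -> float:
    """Unparameterized Class-to-Bag distance (assumed form, not the paper's).

    Each instance pooled from the class's training bags is matched to its
    nearest neighbor in the query bag; the squared distances are summed.
    """
    return sum(min(sq_dist(u, x) for x in bag) for u in class_instances)


def classify(bag: Bag, classes: Dict[str, Bag]) -> str:
    """Assign the bag to the class with the smallest C2B distance."""
    return min(classes, key=lambda name: c2b_distance(classes[name], bag))


# Toy data (hypothetical): 2-D instance features pooled per class.
classes = {
    "sky": [[0.0, 0.0], [0.0, 1.0]],
    "grass": [[5.0, 5.0], [6.0, 5.0]],
}
query_bag = [[0.0, 0.5], [1.0, 0.0]]

print(c2b_distance(classes["sky"], query_bag))
print(classify(query_bag, classes))
```

Because the distance is computed once per class rather than once per training bag, classifying a query bag needs one pass over each class's pooled instances, which is the kind of saving over B2B comparison that the abstract alludes to.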

reference text

[1] O. Maron and A.L. Ratan. Multiple-instance learning for natural scene classification. In ICML, 1998.

[2] Y. Chen and J.Z. Wang. Image categorization by learning and reasoning with regions. JMLR, 5:913–939, 2004.

[3] Z.H. Zhou and M.L. Zhang. Multi-instance multi-label learning with application to scene classification. In NIPS, 2007.

[4] Z.J. Zha, X.S. Hua, T. Mei, J. Wang, G.J. Qi, and Z. Wang. Joint multi-label multi-instance learning for image classification. In CVPR, 2008.

[5] R. Jin, S. Wang, and Z.H. Zhou. Learning a distance metric from multi-instance multi-label data. In CVPR, 2009.

[6] M. Guillaumin, J. Verbeek, and C. Schmid. Multiple instance metric learning from automatically labeled bags of faces. In ECCV, 2010.

[7] H. Wang, F. Nie, and H. Huang. Learning instance specific distance for multi-instance classification. In AAAI, 2011.

[8] H. Wang, F. Nie, H. Huang, and Y. Yang. Learning frame relevance for video classification. In ACM MM, 2011.

[9] O. Boiman, E. Shechtman, and M. Irani. In defense of nearest-neighbor based image classification. In CVPR, 2008.

[10] A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. IEEE TPAMI, 22(12):1349–1380, 2000.

[11] H. Wang, H. Huang, and C. Ding. Image annotation using multi-label correlated Green’s function. In ICCV, 2009.

[12] H. Wang, H. Huang, and C. Ding. Multi-label feature transform for image classifications. In ECCV, 2010.

[13] H. Wang, C. Ding, and H. Huang. Multi-label linear discriminant analysis. In ECCV, pages 126–139. Springer, 2010.

[14] H. Wang, H. Huang, and C. Ding. Image annotation using bi-relational graph of images and semantic labels. In CVPR, 2011.

[15] M. Schultz and T. Joachims. Learning a distance metric from relative comparisons. In NIPS, 2003.

[16] A. Frome, Y. Singer, and J. Malik. Image retrieval and classification using local distance functions. In NIPS, 2007.

[17] A. Frome, Y. Singer, F. Sha, and J. Malik. Learning globally-consistent local distance functions for shape-based image retrieval and classification. In ICCV, 2007.

[18] Z. Wang, Y. Hu, and L.T. Chia. Image-to-class distance metric learning for image classification. In ECCV, 2010.

[19] P. Duygulu, K. Barnard, J. De Freitas, and D. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In ECCV, 2002.

[20] M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2010 (VOC2010) Results. http://pascallin.ecs.soton.ac.uk/challenges/VOC/voc2010/.

[21] H. Wang, C. Ding, and H. Huang. Multi-label classification: Inconsistency and class balanced k-nearest neighbor. In AAAI, 2010.

[22] J. Wang and J.D. Zucker. Solving the multiple-instance problem: A lazy learning approach. In ICML, 2000.

[23] R.E. Schapire and Y. Singer. BoosTexter: A boosting-based system for text categorization. Machine Learning, 39(2):135–168, 2000.