nips nips2011 nips2011-216 nips2011-216-reference knowledge-graph by maker-knowledge-mining

216 nips-2011-Portmanteau Vocabularies for Multi-Cue Image Representation

Source: pdf

Author: Fahad S. Khan, Joost Weijer, Andrew D. Bagdanov, Maria Vanrell

Abstract: We describe a novel technique for feature combination in the bag-of-words model of image classiﬁcation. Our approach builds discriminative compound words from primitive cues learned independently from training images. Our main observation is that modeling joint-cue distributions independently is more statistically robust for typical classiﬁcation problems than attempting to empirically estimate the dependent, joint-cue distribution directly. We use Information theoretic vocabulary compression to ﬁnd discriminative combinations of cues and the resulting vocabulary of portmanteau1 words is compact, has the cue binding property, and supports individual weighting of cues in the ﬁnal image representation. State-of-theart results on both the Oxford Flower-102 and Caltech-UCSD Bird-200 datasets demonstrate the effectiveness of our technique compared to other, signiﬁcantly more complex approaches to multi-cue image representation. 1

reference text

[1] Francis Bach. Exploring large feature spaces with hierarchical multiple kernel learning. In NIPS, 2008.

[2] A. Bosch, A. Zisserman, and X. Munoz. Scene classiﬁcation via plsa. In ECCV, 2006.

[3] Steve Branson, Catherine Wah, Florian Schroff, Boris Babenko, Peter Welinder, Pietro Perona, and Serge Belongie. Visual recognition with humans in the loop. In ECCV, 2010.

[4] G. Csurka, C. Bray, C. Dance, and L. Fan. Visual categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision, ECCV, 2004.

[5] Inderjit Dhillon, Subramanyam Mallela, and Rahul Kumar. A divisive information-theoretic feature clustering algorithm for text classiﬁcation. Journal of Machine Learning Research (JMLR), 3:1265–1287, 2003.

[6] Noha M. Elﬁky, Fahad Shahbaz Khan, Joost van de Weijer, and Jordi Gonzalez. Discriminative compact pyramids for object and scene recognition. Pattern Recgnition, 2011.

[7] Brian Fulkerson, Andrea Vedaldi, and Stefano Soatto. Localizing objects with smart dictionaries. In ECCV, 2008.

[8] Satoshi Ito and Susumu Kubota. Object classiﬁcation using hetrogeneous co-occurrence features. In ECCV, 2010.

[9] Christopher Kanan and Garrison Cottrell. Robust classiﬁcation of objects, faces, and ﬂowers using natural image statistics. In CVPR, 2010.

[10] Fahad Shahbaz Khan, Joost van de Weijer, and Maria Vanrell. Top-down color attention for object recognition. In ICCV, 2009.

[11] Svetlana Lazebnik, Cordelia Schmid, and Jean Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR, 2006.

[12] D. G. Lowe. Distinctive image features from scale-invariant points. IJCV, 60(2):91–110, 2004.

[13] M-E Nilsback and A. Zisserman. Automated ﬂower classiﬁcation over a large number of classes. In ICVGIP, 2008.

[14] Alain Rakotomamonjy, Francis Bach, Stephane Canu, and Yves Grandvalet. More efﬁciency in multiple kernel learning. In ICML, 2007.

[15] J. Sivic, B. Russell, A. Efros, A. Zisserman, and W.Freeman. Discovering object categories in image collections. In ICCV, 2005.

[16] Noam Slonim and Naftali Tishby. Agglomerative information bottleneck. In NIPS, 1999.

[17] Anne Treisman. Feature Binding, Attention and Object Perception. Philosophical Transactions: Biological Sciences, 353(1373):1295–1306, 1998.

[18] Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek. Evaluating color descriptors for object and scene recognition. PAMI, 32(9):1582–1596, 2010.

[19] J. van de Weijer, C. Schmid, Jakob J. Verbeek, and D. Larlus. Learning color names for real-world applications. IEEE Transaction in Image Processing (TIP), 18(7):1512–1524, 2009.

[20] Manik Varma and Bodla Rakesh Babu. More generality in efﬁcient multiple kernel learning. In ICML, 2009.

[21] Manik Varma and Debajyoti Ray. Learning the discriminative power-invariance trade-off. In ICCV, 2007.

[22] Jinjun Wang, Jianchao Yang, Kai Yu, Fengjun Lv, Thomas Huang, and Yihong Gong. constrained linear coding for image classiﬁcation. In CVPR, 2010. Locality-

[23] Bangpeng Yao, Aditya Khosla, and Li Fei-Fei. Combining randomization and discrimination for ﬁnegrained image categorization. In CVPR, 2011.

[24] J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid. Local features and kernels for classiﬁcation of texture and object catergories: A comprehensive study. IJCV, 73(2):213–218, 2007. 9