nips nips2011 nips2011-216 nips2011-216-reference knowledge-graph by maker-knowledge-mining

216 nips-2011-Portmanteau Vocabularies for Multi-Cue Image Representation


Source: pdf

Author: Fahad S. Khan, Joost Weijer, Andrew D. Bagdanov, Maria Vanrell

Abstract: We describe a novel technique for feature combination in the bag-of-words model of image classification. Our approach builds discriminative compound words from primitive cues learned independently from training images. Our main observation is that modeling joint-cue distributions independently is more statistically robust for typical classification problems than attempting to empirically estimate the dependent, joint-cue distribution directly. We use Information theoretic vocabulary compression to find discriminative combinations of cues and the resulting vocabulary of portmanteau1 words is compact, has the cue binding property, and supports individual weighting of cues in the final image representation. State-of-theart results on both the Oxford Flower-102 and Caltech-UCSD Bird-200 datasets demonstrate the effectiveness of our technique compared to other, significantly more complex approaches to multi-cue image representation. 1


reference text

[1] Francis Bach. Exploring large feature spaces with hierarchical multiple kernel learning. In NIPS, 2008.

[2] A. Bosch, A. Zisserman, and X. Munoz. Scene classification via plsa. In ECCV, 2006.

[3] Steve Branson, Catherine Wah, Florian Schroff, Boris Babenko, Peter Welinder, Pietro Perona, and Serge Belongie. Visual recognition with humans in the loop. In ECCV, 2010.

[4] G. Csurka, C. Bray, C. Dance, and L. Fan. Visual categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision, ECCV, 2004.

[5] Inderjit Dhillon, Subramanyam Mallela, and Rahul Kumar. A divisive information-theoretic feature clustering algorithm for text classification. Journal of Machine Learning Research (JMLR), 3:1265–1287, 2003.

[6] Noha M. Elfiky, Fahad Shahbaz Khan, Joost van de Weijer, and Jordi Gonzalez. Discriminative compact pyramids for object and scene recognition. Pattern Recgnition, 2011.

[7] Brian Fulkerson, Andrea Vedaldi, and Stefano Soatto. Localizing objects with smart dictionaries. In ECCV, 2008.

[8] Satoshi Ito and Susumu Kubota. Object classification using hetrogeneous co-occurrence features. In ECCV, 2010.

[9] Christopher Kanan and Garrison Cottrell. Robust classification of objects, faces, and flowers using natural image statistics. In CVPR, 2010.

[10] Fahad Shahbaz Khan, Joost van de Weijer, and Maria Vanrell. Top-down color attention for object recognition. In ICCV, 2009.

[11] Svetlana Lazebnik, Cordelia Schmid, and Jean Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR, 2006.

[12] D. G. Lowe. Distinctive image features from scale-invariant points. IJCV, 60(2):91–110, 2004.

[13] M-E Nilsback and A. Zisserman. Automated flower classification over a large number of classes. In ICVGIP, 2008.

[14] Alain Rakotomamonjy, Francis Bach, Stephane Canu, and Yves Grandvalet. More efficiency in multiple kernel learning. In ICML, 2007.

[15] J. Sivic, B. Russell, A. Efros, A. Zisserman, and W.Freeman. Discovering object categories in image collections. In ICCV, 2005.

[16] Noam Slonim and Naftali Tishby. Agglomerative information bottleneck. In NIPS, 1999.

[17] Anne Treisman. Feature Binding, Attention and Object Perception. Philosophical Transactions: Biological Sciences, 353(1373):1295–1306, 1998.

[18] Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek. Evaluating color descriptors for object and scene recognition. PAMI, 32(9):1582–1596, 2010.

[19] J. van de Weijer, C. Schmid, Jakob J. Verbeek, and D. Larlus. Learning color names for real-world applications. IEEE Transaction in Image Processing (TIP), 18(7):1512–1524, 2009.

[20] Manik Varma and Bodla Rakesh Babu. More generality in efficient multiple kernel learning. In ICML, 2009.

[21] Manik Varma and Debajyoti Ray. Learning the discriminative power-invariance trade-off. In ICCV, 2007.

[22] Jinjun Wang, Jianchao Yang, Kai Yu, Fengjun Lv, Thomas Huang, and Yihong Gong. constrained linear coding for image classification. In CVPR, 2010. Locality-

[23] Bangpeng Yao, Aditya Khosla, and Li Fei-Fei. Combining randomization and discrimination for finegrained image categorization. In CVPR, 2011.

[24] J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid. Local features and kernels for classification of texture and object catergories: A comprehensive study. IJCV, 73(2):213–218, 2007. 9