78 nips-2006-Fast Discriminative Visual Codebooks using Randomized Clustering Forests

Source: pdf

Author: Frank Moosmann, Bill Triggs, Frederic Jurie

Abstract: Some of the most effective recent methods for content-based image classification work by extracting dense or sparse local image descriptors, quantizing them according to a coding rule such as k-means vector quantization, accumulating histograms of the resulting “visual word” codes over the image, and classifying these with a conventional classifier such as an SVM. Large numbers of descriptors and large codebooks are needed for good results and this becomes slow using k-means. We introduce Extremely Randomized Clustering Forests – ensembles of randomly created clustering trees – and show that these provide more accurate results, much faster training and testing and good resistance to background clutter in several state-of-the-art image classification tasks.
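
The baseline pipeline described in the abstract (quantize each local descriptor to a visual word, then accumulate a per-image histogram) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `nearest_word` and `bow_histogram`, the toy 2-D descriptors and the 3-word codebook are all assumptions made for the example, and the codebook is taken as given (for instance, centers from k-means). In the paper's approach, the nearest-center step would presumably be replaced by the leaf indices of the clustering trees.

```python
def nearest_word(descriptor, codebook):
    """Return the index of the codebook center closest to the descriptor."""
    best_index, best_dist = 0, float("inf")
    for index, center in enumerate(codebook):
        # Squared Euclidean distance between descriptor and center.
        dist = sum((d - c) ** 2 for d, c in zip(descriptor, center))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def bow_histogram(descriptors, codebook):
    """Normalized histogram of visual-word counts for one image."""
    counts = [0] * len(codebook)
    for descriptor in descriptors:
        counts[nearest_word(descriptor, codebook)] += 1
    total = sum(counts)
    # Leave an all-zero histogram untouched when the image has no descriptors.
    return [c / total for c in counts] if total else [float(c) for c in counts]


# Toy data (illustrative values only): four 2-D descriptors, three visual words.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
descriptors = [(0.1, -0.1), (0.9, 1.2), (1.1, 0.8), (4.8, 5.1)]
hist = bow_histogram(descriptors, codebook)
```

The resulting histogram is the fixed-length feature vector that would be passed to a conventional classifier such as an SVM. The linear scan over the codebook also shows why large codebooks make this coding step slow.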

reference text

[1] E. Bauer and R. Kohavi. An empirical comparison of voting classification algorithms: Bagging, boosting, and variants. Machine Learning Journal, 36(1-2):105–139, 1999.

[2] K. Beyer, J. Goldstein, R. Ramakrishnan, and U. Shaft. When is “nearest neighbor” meaningful? In Int. Conf. on Database Theory, pages 217–235, 1999.

[3] H. Blockeel, L. De Raedt, and J. Ramon. Top-down induction of clustering trees. In ICML, pages 55–63, 1998.

[4] L. Breiman. Random forests. Machine Learning Journal, 45(1):5–32, 2001.

[5] G. Csurka, C. Dance, L. Fan, J. Willamowski, and C. Bray. Visual categorization with bags of keypoints. In ECCV’04 workshop on Statistical Learning in CV, pages 59–74, 2004.

[6] M. Everingham et al. (33 authors). The 2005 PASCAL visual object classes challenge. In F. d’Alche Buc, I. Dagan, and J. Quinonero, editors, Proc. 1st PASCAL Challenges Workshop. Springer LNAI, 2006.

[7] R. Fergus, L. Fei-Fei, P. Perona, and A. Zisserman. Learning object categories from Google’s image search. In ICCV, pages II: 1816–1823, 2005.

[8] P. Geurts, D. Ernst, and L. Wehenkel. Extremely randomized trees. Machine Learning Journal, 63(1), 2006.

[9] F. Jurie and B. Triggs. Creating efficient codebooks for visual recognition. In ICCV, 2005.

[10] H. Lejsek, F.H. Asmundsson, B. Thór-Jónsson, and L. Amsaleg. Scalability of local image descriptors: A comparative study. In ACM Int. Conf. on Multimedia, Santa Barbara, 2006.

[11] V. Lepetit, P. Lagger, and P. Fua. Randomized trees for real-time keypoint recognition. In CVPR ’05 Vol.2, pages 775–781, 2005.

[12] T. Leung and J. Malik. Representing and recognizing the visual appearance of materials using three-dimensional textons. IJCV, 43(1):29–44, June 2001.

[13] Bing Liu, Yiyuan Xia, and Philip S. Yu. Clustering through decision tree construction. In CIKM ’00, pages 20–29, 2000.

[14] D.G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2), 2004.

[15] R. Marée, P. Geurts, J. Piater, and L. Wehenkel. Random subwindows for robust image classification. In CVPR, volume 1, pages 34–40, 2005.

[16] F. Moosmann, D. Larlus, and F. Jurie. Learning saliency maps for object categorization. In ECCV’06 Workshop on the Representation and Use of Prior Knowledge in Vision, 2006.

[17] D. Nistér and H. Stewénius. Scalable recognition with a vocabulary tree. In CVPR, 2006.

[18] E. Nowak, F. Jurie, and B. Triggs. Sampling strategies for bag-of-features image classification. In ECCV’06, 2006.

[19] A. Opelt and A. Pinz. Object localization with boosting and weak supervision for generic object recognition. In SCIA, 2005.

[20] F. Perronnin, C. Dance, G. Csurka, and M. Bressan. Adapted vocabularies for generic visual categorization. In ECCV, 2006.

[21] U. Shaft, J. Goldstein, and K. Beyer. Nearest neighbor query performance for unstable distributions. Technical Report TR 1388, Dept. of Computer Science, Univ. of Wisconsin, 1998.

[22] J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In ICCV, volume 2, pages 1470–1477, October 2003.

[23] J. Winn and A. Criminisi. Object class recognition at a glance. In CVPR’06 - video tracks, 2006.

[24] J. Winn, A. Criminisi, and T. Minka. Object categorization by learned universal visual dictionary. In ICCV, pages II: 1800–1807, 2005.

[25] J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid. Local features and kernels for classification of texture and object categories: A comprehensive study. Int. J. Computer Vision. To appear, 2006.