nips nips2007 nips2007-113 nips2007-113-reference knowledge-graph by maker-knowledge-mining

113 nips-2007-Learning Visual Attributes

Source: pdf

Author: Vittorio Ferrari, Andrew Zisserman

Abstract: We present a probabilistic generative model of visual attributes, together with an efﬁcient learning algorithm. Attributes are visual qualities of objects, such as ‘red’, ‘striped’, or ‘spotted’. The model sees attributes as patterns of image segments, repeatedly sharing some characteristic properties. These can be any combination of appearance, shape, or the layout of segments within the pattern. Moreover, attributes with general appearance are taken into account, such as the pattern of alternation of any two colors which is characteristic for stripes. To enable learning from unsegmented training images, the model is learnt discriminatively, by optimizing a likelihood ratio. As demonstrated in the experimental evaluation, our model can learn in a weakly supervised setting and encompasses a broad range of attributes. We show that attributes can be learnt starting from a text query to Google image search, and can then be used to recognize the attribute and determine its spatial extent in novel real-world images.

reference text

[1] N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, CVPR, 2005.

[2] P. Felzenszwalb and D Huttenlocher, Efﬁcient Graph-Based Image Segmentation, IJCV, (50):2, 2004.

[3] R. Fergus, P. Perona, and A. Zisserman, Object Class Recognition by Unsupervised Scale-Invariant Learning, CVPR, 2003.

[4] N. Jojic and Y. Caspi, Capturing image structure with probabilistic index maps, CVPR, 2004

[5] S. Lazebnik, C. Schmid, and J. Ponce, A Sparse Texture Representation Using Local Afﬁne Regions, PAMI, (27):8, 2005

[6] Y. Liu, Y. Tsin, and W. Lin, The Promise and Perils of Near-Regular Texture, IJCV, (62):1, 2005

[7] J. Van de Weijer, C. Schmid, and J. Verbeek, Learning Color Names from Real-World Images, CVPR, 2007.

[8] M. Varma and A. Zisserman, Texture classiﬁcation: Are ﬁlter banks necessary?, CVPR, 2003.

[9] J. Winn, A. Criminisi, and T. Minka, Object Categorization by Learned Universal Visual Dictionary, ICCV, 2005.

[10] J. Winn and N. Jojic. LOCUS: Learning Object Classes with Unsupervised Segmentation, ICCV, 2005.

[11] K. Yanai and K. Barnard, Image Region Entropy: A Measure of ”Visualness” of Web Images Associated with One Concept, ACM Multimedia, 2005.

[12] Caltech 101 dataset: www.vision.caltech.edu/Image Datasets/Caltech101/Caltech101.html