nips nips2004 nips2004-44 nips2004-44-reference knowledge-graph by maker-knowledge-mining

44 nips-2004-Conditional Random Fields for Object Recognition


Source: pdf

Author: Ariadna Quattoni, Michael Collins, Trevor Darrell

Abstract: We present a discriminative part-based approach for the recognition of object classes from unsegmented cluttered scenes. Objects are modeled as flexible constellations of parts conditioned on local observations found by an interest operator. For each object class the probability of a given assignment of parts to local features is modeled by a Conditional Random Field (CRF). We propose an extension of the CRF framework that incorporates hidden variables and combines class conditional CRFs into a unified framework for part-based object recognition. The parameters of the CRF are estimated in a maximum likelihood framework and recognition proceeds by finding the most likely class under our model. The main advantage of the proposed CRF framework is that it allows us to relax the assumption of conditional independence of the observed data (i.e. local features) often used in generative approaches, an assumption that might be too restrictive for a considerable number of object classes.


reference text

[1] R. Fergus, P. Perona,and A. Zisserman. Object class recognition by unsupervised scale-invariant learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,volume 2, pages 264-271, June 2003.

[2] S. Kumar and M. Hebert. Discriminative random fields: A framework for contextual interaction in classification. In IEEE Int Conference on Computer Vision,volume 2, pages 1150-1157, June 2003.

[3] J. Lafferty,A. McCallum and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. Int Conf. on Machine Learning, 2001.

[4] D. Lowe. Object Recognition from local scale-invariant features. In IEEE Int Conference on Computer Vision, 1999.

[5] A. McCallum, D. Freitag, and F. Pereira. Maximum entropy markov models for information extraction and segmentation. In ICML-2000, 2000.

[6] J. Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann,1988.

[7] A. Ratnaparkhi. A maximum entropy part-of-speech tagger. In EMNLP, 1996.