66 nips-2006-Detecting Humans via Their Pose


Source: pdf

Author: Alessandro Bissacco, Ming-Hsuan Yang, Stefano Soatto

Abstract: We consider the problem of detecting humans and classifying their pose from a single image. Specifically, our goal is to devise a statistical model that simultaneously answers two questions: 1) is there a human in the image? and, if so, 2) what is a low-dimensional representation of her pose? We investigate models that can be learned in an unsupervised manner on unlabeled images of human poses, and provide information that can be used to match the pose of a new image to the ones present in the training set. Starting from a set of descriptors recently proposed for human detection, we apply the Latent Dirichlet Allocation framework to model the statistics of these features, and use the resulting model to answer the above questions. We show how our model can efficiently describe the space of images of humans with their pose, by providing an effective representation of poses for tasks such as classification and matching, while performing remarkably well in human/non-human decision problems, thus enabling its use for human detection. We validate the model with extensive quantitative experiments and comparisons with other approaches on human detection and pose matching.


reference text

[1] A. Agarwal and B. Triggs. 3D human pose from silhouettes by relevance vector regression. CVPR, 2004.

[2] A. Agarwal and B. Triggs. Hyperfeatures: Multilevel local coding for visual recognition. ECCV, 2006.

[3] D. Blei, A. Ng, and M. Jordan. Latent Dirichlet allocation. Journal of Machine Learning Research, 2003.

[4] W. Buntine and A. Jakulin. Discrete principal component analysis. HIIT Technical Report, 2005.

[5] J. Canny. GaP: a factor model for discrete data. ACM SIGIR, pages 122–129, 2004.

[6] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. CVPR, 2005.

[7] P. F. Felzenszwalb and D. P. Huttenlocher. Efficient matching of pictorial structures. CVPR, 2000.

[8] R. Fergus, L. Fei-Fei, P. Perona, and A. Zisserman. Learning object categories from Google’s image search. Proc. ICCV, pages 1816–1823, 2005.

[9] D. M. Gavrila and V. Philomin. Real-time object detection for smart vehicles. Proc. ICCV, 1999.

[10] T. L. Griffiths and M. Steyvers. Finding scientific topics. Proc. National Academy of Sciences, 2004.

[11] R. Gross and J. Shi. The CMU motion of body dataset. Technical report, CMU, 2001.

[12] G. Shakhnarovich, P. Viola, and T. Darrell. Fast pose estimation with parameter-sensitive hashing. ICCV, 2003.

[13] D. Lee and H. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 1999.

[14] D. G. Lowe. Object recognition from local scale-invariant features. Proc. ICCV, pages 1150–1157, 1999.

[15] G. Mori, X. Ren, A. A. Efros, and J. Malik. Recovering human body configurations: Combining segmentation and recognition. Proc. CVPR, 2:326–333, 2004.

[16] J. C. Niebles, H. Wang, and L. Fei-Fei. Unsupervised learning of human action categories using spatial-temporal words. Proc. BMVC, 2006.

[17] K. Nigam, A. K. McCallum, S. Thrun, and T. Mitchell. Text classification from labeled and unlabeled documents using EM. Machine Learning, pages 1–34, 2000.

[18] P. Viola, M. Jones, and D. Snow. Detecting pedestrians using patterns of motion and appearance. ICCV, 2003.

[19] R. Ronfard, C. Schmid, and B. Triggs. Learning to parse pictures of people. ECCV, 2002.

[20] R. Rosales and S. Sclaroff. Inferring body pose without tracking body parts. Proc. CVPR, 2:506–511, 2000.

[21] L. Sigal, M. Isard, B. H. Sigelman, and M. Black. Attractive people: Assembling loose-limbed models using non-parametric belief propagation. Proc. NIPS, pages 1539–1546, 2003.

[22] J. Sivic, B. C. Russell, A. A. Efros, A. Zisserman, and W. T. Freeman. Discovering object categories in image collections. Proc. ICCV, 2005.

[23] M. Weber, M. Welling, and P. Perona. Toward automatic discovery of object categories. CVPR, 2000.