cvpr cvpr2013 cvpr2013-45 cvpr2013-45-reference knowledge-graph by maker-knowledge-mining

45 cvpr-2013-Articulated Pose Estimation Using Discriminative Armlet Classifiers

Source: pdf

Author: Georgia Gkioxari, Pablo Arbeláez, Lubomir Bourdev, Jitendra Malik

Abstract: We propose a novel approach for human pose estimation in real-world cluttered scenes, and focus on the challenging problem of predicting the pose of both arms for each person in the image. For this purpose, we build on the notion of poselets [4] and train highly discriminative classifiers to differentiate among arm configurations, which we call armlets. We propose a rich representation which, in addition to standardHOGfeatures, integrates the information of strong contours, skin color and contextual cues in a principled manner. Unlike existing methods, we evaluate our approach on a large subset of images from the PASCAL VOC detection dataset, where critical visual phenomena, such as occlusion, truncation, multiple instances and clutter are the norm. Our approach outperforms Yang and Ramanan [26], the state-of-the-art technique, with an improvement from 29.0% to 37.5% PCP accuracy on the arm keypoint prediction task, on this new pose estimation dataset.

reference text

[1] M. Andriluka, S. Roth, and S. Bernt. Pictorial structures revisited: People detection and articulated pose estimation. CVPR, 2009.

[2] P. Arbelaez, M. Maire, C. Fowlkes, and J. Malik. Contour detection and hierarchical image segmentation. PAMI, 2011.

[3] L. Bourdev, S. Maji, T. Brox, and J. Malik. Detecting people using mutually consistent poselet activations. ECCV, 2010.

[4] L. Bourdev and J. Malik. Poselets: Body part detectors trained using 3d human pose annotations. ICCV, 2009.

[5] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. CVPR, 2005.

[6] D. Desai and D. Ramanan. Detecting actions, poses, and objects with relational phraselets. ECCV, 2012.

[7] M. Eichner and V. Ferrari. Better appearance models for pictorial structures. BMVC, 2009.

[8] M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2011 (VOC201 1) Results. http://www.pascalnetwork.org/challenges/VOC/voc201 1/workshop/index.html, 2011.

[9] P. Felzenszwalb and D. Huttenlocher. Efficient matching of pictorial structures. CVPR, 2000.

[10] P. F. Felzenszwalb, R. B. Girshick, D. A. McAllester, and D. Ramanan. Object detection with discriminatively trained part-based models. PAMI, 2010.

[11] V. Ferrari, M. Marin-Jimenez, and A. Zisserman. Progressive search space reduction for human pose estimation. CVPR, 2008.

[12] M. A. Fischler and R. A. Elschlager. The representation and matching of pictorial structures. IEEE Trans. Comput., 1973.

[13] S. Johnson and M. Everingham. Clustered pose and nonlinear appearance models for human pose estimation. BMVC, 2010.

[14] S. Johnson and M. Everingham. Learning effective human pose estimation from inaccurate annotation. CVPR, 2011.

[15] T. Malisiewicz, A. Gupta, and A. Efros. Ensemble of exemplar-svms for object detection and beyond. ICCV, 2011.

[16] G. Mori and J. Malik. Estimating human body configurations using shape context matching. ECCV, 2002.

[17] G. Mori and J. Malik. Recovering 3d human body configurations using shape contexts. PAMI, 2006.

[18] R. Nevatia and T. Binford. Description and recognition of curved objects. Artif. Intell., 1977.

[19] D. Ramanan. Learning to parse images of articulated bodies. NIPS, 2006.

[20] D. Ramanan and C. Sminchisescu. Training deformable models for localization. CVPR, 2006.

[21] B. Sapp, A. Toshev, and B. Taskar. Cascaded models for articulated pose estimation. ECCV, 2010.

[22] G. Shakhnarovich, P. Viola, and T. Darrell. Fast pose estimation with parameter-sensitive hashing. ICCV, 2003.

[23] Y. Tian, L. C. Zitnick, and S. G. Narasimhan. Exploring the spatial hierarchy of mixture models for human pose estimation. ECCV, 2012.

[24] A. Torralba and A. A. Efros. Unbiased look at dataset bias. CVPR, 2011.

[25] Y. Wang, D. Tran, and Z. Liao. Learning hierarchical poselets for human parsing. CVPR, 2011.

[26] Y. Yang and D. Ramanan. Articulated pose estimation with flexible mixtures-of-parts. CVPR, 2011. 333333444977