108 nips-2001-Learning Body Pose via Specialized Maps

Source: pdf

Author: Rómer Rosales, Stan Sclaroff

Abstract: A nonlinear supervised learning model, the Specialized Mappings Architecture (SMA), is described and applied to the estimation of human body pose from monocular images. The SMA consists of several specialized forward mapping functions and an inverse mapping function. Each specialized function maps certain domains of the input space (image features) onto the output space (body pose parameters). The key algorithmic problems faced are those of learning the specialized domains and mapping functions in an optimal way, as well as performing inference given inputs and knowledge of the inverse function. Solutions to these problems employ the EM algorithm and alternating choices of conditional independence assumptions. Performance of the approach is evaluated with synthetic and real video sequences of human motion.

reference text

[1] M. Brand. Shadow puppetry. In ICCV, 1999.

[2] C. Bregler. Tracking people with twists and exponential maps. In CVPR, 1998.

[3] I. Csiszár and G. Tusnády. Information geometry and alternating minimization procedures. Statistics and Decisions, 1:205-237, 1984.

[4] A. Dempster, N. Laird, and D. Rubin. Maximum likelihood estimation from incomplete data. Journal of the Royal Statistical Society (B), 39(1), 1977.

[5] J. Deutscher, A. Blake, and I. Reid. Articulated body motion capture by annealed particle filtering. In CVPR, 2000.

[6] J. H. Friedman. Multivariate adaptive regression splines. The Annals of Statistics, 19:1-141, 1991.

[7] G. Hinton, B. Sallans, and Z. Ghahramani. A hierarchical community of experts. Learning in Graphical Models, M. Jordan (editor), 1998.

[8] N. Howe, M. Leventon, and B. Freeman. Bayesian reconstruction of 3D human motion from single-camera video. In NIPS-12, 2000.

[9] M. Isard and A. Blake. Contour tracking by stochastic propagation of conditional density. In ECCV, 1996.

[10] G. Johansson. Visual perception of biological motion and a model for its analysis. Perception and Psychophysics, 14(2):210-211, 1973.

[11] M. I. Jordan and R. A. Jacobs. Hierarchical mixtures of experts and the EM algorithm. Neural Computation, 6:181-214, 1994.

[12] R. Neal and G. Hinton. A view of the EM algorithm that justifies incremental, sparse, and other variants. Learning in Graphical Models, M. Jordan (editor), 1998.

[13] Dirk Ormoneit, Hedvig Sidenbladh, Michael J. Black, and Trevor Hastie. Learning and tracking cyclic human motion. In NIPS-13, 2001.

[14] Vladimir Pavlovic, James M. Rehg, and John MacCormick. Learning switching linear models of human motion. In NIPS-13, 2001.

[15] J. M. Rehg and T. Kanade. Model-based tracking of self-occluding articulated objects. In ICCV, 1995.

[16] R. Rosales and S. Sclaroff. Specialized mappings and the estimation of body pose from a single image. In IEEE Human Motion Workshop, 2000.

[17] Y. Song, Xiaoling Feng, and P. Perona. Towards detection of human motion. In CVPR, 2000.