cvpr cvpr2013 cvpr2013-426 knowledge-graph by maker-knowledge-mining

426 cvpr-2013-Tensor-Based Human Body Modeling

Source: pdf

Author: Yinpeng Chen, Zicheng Liu, Zhengyou Zhang

Abstract: In this paper, we present a novel approach to model 3D human body with variations on both human shape and pose, by exploring a tensor decomposition technique. 3D human body modeling is important for 3D reconstruction and animation of realistic human body, which can be widely used in Tele-presence and video game applications. It is challenging due to a wide range of shape variations over different people and poses. The existing SCAPE model [4] is popular in computer vision for modeling 3D human body. However, it considers shape and pose deformations separately, which is not accurate since pose deformation is persondependent. Our tensor-based model addresses this issue by jointly modeling shape and pose deformations. Experimental results demonstrate that our tensor-based model outperforms the SCAPE model quite significantly. We also apply our model to capture human body using Microsoft Kinect sensors with excellent results.

Reference: text

Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 com iu Abstract In this paper, we present a novel approach to model 3D human body with variations on both human shape and pose, by exploring a tensor decomposition technique. [sent-2, score-0.553]

2 3D human body modeling is important for 3D reconstruction and animation of realistic human body, which can be widely used in Tele-presence and video game applications. [sent-3, score-0.335]

3 However, it considers shape and pose deformations separately, which is not accurate since pose deformation is persondependent. [sent-6, score-0.574]

4 We also apply our model to capture human body using Microsoft Kinect sensors with excellent results. [sent-9, score-0.272]

5 Introduction 3D human body modeling has numerous applications in Computer Vision, Graphics, and Multimedia. [sent-11, score-0.251]

6 The problem is challenging because the variation in 3D human body geometry (over different people and poses) is a complicated function over multiple shape and pose variables. [sent-14, score-0.467]

7 Among the early work in human body modeling [2, 13, 4, 3, 10], the SCAPE model [4] has been widely used in estimating human shape and pose as well as in reshaping human body in images and videos. [sent-15, score-0.819]

8 It learns a pose deformation model from a subject with multiple poses and learns a shape model from many subjects with a neutral pose. [sent-16, score-0.808]

9 However, the decoupling of shape and pose deformations in the SCAPE model has a major limitation - 3D meshes of different individuals change in the similar manner for the same pose change. [sent-17, score-0.421]

10 We fit the SCAPE model on a female subject at the neutral pose Figure 1. [sent-19, score-0.307]

11 The shape parameters for both SCAPE and TenBo models are estimated from the original neutral pose data. [sent-22, score-0.327]

12 Using the tensor decomposition technique, we model the deformation as a joint function over both shape and pose parameters to preserve the dependency between them. [sent-29, score-0.646]

13 We also apply our TenBo model to capture human body using Microsoft Kinect sensors with excellent results. [sent-34, score-0.272]

14 Then, we introduce mesh deformation definition and describe our TenBo model in details. [sent-37, score-0.371]

15 Related Work 3D Human Body Models: The early human body models, including [2] and [13], focused on modeling the shape variations in similar poses. [sent-41, score-0.36]

16 Allen [3] used maximum posteriori estimation to learn a correlated model of identity and pose-dependent body shape variation. [sent-43, score-0.328]

17 The SCAPE model [4] is a widely used model which decouples shape and pose deformations. [sent-46, score-0.258]

18 However, due to the decoupling, the pose deformation model is shared by all individuals, i. [sent-47, score-0.349]

19 Human Shape and Pose Estimation: The SCAPE model [4] has been used widely in human shape and pose estimation [8, 5, 6, 17, 20]. [sent-51, score-0.29]

20 Guan [8] estimated human shape and pose from a single image using shading information. [sent-52, score-0.269]

21 Weiss [17] scaned 3D human body from noisy image and range data by using silhouette objective. [sent-54, score-0.251]

22 In [9], Hasler presented a bilinear model of shape and pose to estimate 3D meshes of dressed subjects from images. [sent-56, score-0.4]

23 Tensor Faces: Tensor based approaches have been successfully applied to face modeling [15, 16], which motivated us to extend them to human body modeling. [sent-59, score-0.251]

24 TenBo allows each subject to perform only a small subset of the poses rather than the full set and allows large variations among different subjects to perform the same pose, while TensorFaces requires the same capture configuration for all subjects (e. [sent-61, score-0.338]

25 Overview Each 3D human body mesh can be considered as a deformation from a reference mesh. [sent-65, score-0.63]

26 Our TenBo model considers the deformation D as a joint function D(v, θ) over shape parameters v and pose parameters θ to integrate shape deformation (due to different persons) and pose deformation (due to different poses) using tensor technique. [sent-66, score-1.331]

27 Compared with the SCAPE model, which separates the shape deformation S(v) and the pose deformation Q(θ) as D = S(v)Q(θ), our TenBo model is able to preserve the dependency between the shape and pose deformations. [sent-67, score-0.895]

28 The SCAPE model only uses one subject (with multiple poses) to train the pose model and only uses one pose (from multiple persons) to train the shape model. [sent-69, score-0.434]

29 In comparison, our TenBo model uses multiple poses from multiple subjects to combine the shape and pose deformations together. [sent-70, score-0.443]

30 Mesh Deformation Definition We use the same mesh deformation definition as in the SCAPE model [4]. [sent-76, score-0.371]

31 The deformation for an arbitrary 3D body mesh Y = {VY , P} indicates the difference between the mesh Y and {thVe r,ePfe}ren incdei mateessh t hXe. [sent-91, score-0.677]

32 three vertices xn,1, xn,2, xn,3 of the triangle pn on the reference mesh X is deformed to yn,1, yn,2, 111000666 SCAPE ModelTenBo Model SPTMrhosadiepnDelinD gfeiorfmerantiocen TpCUor saeni smTfidraouebnlmtreiopsn1hml. [sent-94, score-0.267]

33 The deformation of triangle pn from X to Y is represented as the linear transformation of two edges ( Δxn,1 = xn,2 −xn,1 , Δxn,2 = xn,3 −xn,1) as follows: yn,3 Δyn,q = yn,q+1 − yn,1 = Rl[n]DnΔxn,q, q = 1, 2, (1) where Rl[n] is the rotation matrix (from X to Y ) for the body segment l[n] (e. [sent-100, score-0.551]

34 torso, upper arm) that includes the triangle pn, Dn is the non-rigid deformation matrix for the triangle pn. [sent-102, score-0.379]

35 a tNricoetes {thDat Rl[n] i asl lsh traiarend- × by all triangles belonging to the body segment l[n] . [sent-106, score-0.291]

36 The calculation of Rl[n] and Dn for a given mesh Y and the 3D reconstruction of mesh Y using Rl[n] and Dn has been solved in the SCAPE model [4]. [sent-107, score-0.279]

37 Tensor-based Human Body Model (TenBo) Our TenBo model includes two parts - (a) model for an individual body segment (e. [sent-110, score-0.333]

38 We first introduce a tensor-based method to model the deformation of an individual body segment. [sent-113, score-0.44]

39 Then we will discuss how to integrate local shape vector (refer to shape parameters for a body segment) into global shape vector (refer to shape parameters for the whole body). [sent-114, score-0.712]

40 We use sl to denote the local shape vector on the lth body segment and use v to denote the global shape vector. [sent-115, score-0.639]

41 Model for an Individual Body Segment We model the deformation for each body segment as a joint function over both shape and pose parameters using tensor technique. [sent-121, score-0.903]

42 We rearrange the deformation matrix Dn column by column as a 9 1vector for every triangle on the lctohl segment (including nl triangles) faonrd e group aial n vgelect oonrs t as × ×× Figure 2. [sent-122, score-0.373]

43 Tensor decomposition for the deformation of a body segment . [sent-123, score-0.478]

44 Th×e Kloc)a,l a shape vector sl encodes the shape of the lth segment using It parameters. [sent-126, score-0.417]

45 The joint angle vector θl includes joint angles from the two nearest joints of the lth segment (e. [sent-127, score-0.28]

46 The deformation basis matrix Bl includes Kt deformation bases, which represent the deformation of the lth segment × in a low dimensional space. [sent-132, score-0.82]

47 GAfter training is completed, we can compute the deformation dl based on the local shape parameters sl and the joint angles θl , which are estimated for a specific person with a specific pose. [sent-150, score-0.59]

48 Model for the Whole Body The local shape vectors sl from different body segments are highly correlated to the global shape vector v that encodes the shape of the whole body (e. [sent-153, score-0.863]

49 We model this correlation using linear transform as: sl = Alv, (4) where sl includes It local shape parameters for the lth segment, v includes Iv global shape parameters, and Al is a × transform matrix (It Iv) for the lth segment. [sent-156, score-0.66]

50 By replacing the local shape vecto×r sl with the global shape vector v, eq. [sent-157, score-0.34]

51 (5) Combining all body segments, the entire TenBo model has L(ItJtKt + ItIv) + 9KtN parameters (ItJtKt parameters in Gl, ItIv parameters in Al, 9Ktnl parameters in Bl). [sent-159, score-0.327]

52 Once we finish training the TenBo model, we can apply it to estimate shape parameters v and pose parameters θl using a 3D point cloud of a human body surface as input. [sent-161, score-0.552]

53 Furthermore, we can generate animations for any subject (assuming shape vector v is available) with different pose sequences. [sent-162, score-0.308]

54 Learning the TenBo Model The TenBo model is learnt based on a training dataset that includes 3D human body meshes from multiple subjects (each subject has one or multiple poses). [sent-165, score-0.515]

55 Let us denote the number of subjects as I, the total number of poses as J and denote the number of poses for the ith subject as Ji (? [sent-167, score-0.328]

56 Preprocessing × In preprocessing, for every mesh in the training dataset, we compute the rotation matrix Rl and the joint angle vector θl for every body segment, and compute the deformation matrix Dn for every triangle (see calculation details in the SCAPE model [4]). [sent-173, score-0.727]

57 Then, we rearrange Dn to generate segment deformation tensor dl (a 1 1 9nl tensor). [sent-174, score-0.472]

58 We denote the deformation tensor a(nad 1th ×e joint angle vector for the jth pose for the ith subject as and respectively. [sent-175, score-0.579]

59 Optimization θli,j The goal of training is to search for the optimum tensor core Gl, shape transform matrix Al and deformation bassoisr m coarterix G Bl for every body segment as well as the global shape vector vi for every training subject to minimize L2 ×× × × Figure 3. [sent-178, score-0.93]

60 The deformation tensor Dl and joint angle matrix Θl . [sent-179, score-0.425]

61 dli,j distance between the actual deformation and the deformation generated by the tensor model (eq. [sent-180, score-0.582]

62 Firstly, we group the deformation tensors and joint angle vectors of different poses for every subject as Dli = [dil,1 , . [sent-206, score-0.436]

63 Note that training a TenBo model needs three inputs: dimension of the local shape vector It, dimension of the global shape vector Iv and the number of deformation bases Kt. [sent-254, score-0.504]

64 Therefore, we can change the dimension of global shape parameters Iv by selecting the first Iv rows of as the global shape parameter matrix V . [sent-273, score-0.315]

65 , zMp captured from a human body surface, determine the global shape parameters v and the pose parameters (or joint angles) θ, such that the difference between the reconstructed 3D human body based on v and θ and the original human body is minimum”. [sent-286, score-1.089]

66 The rotation Rl[n] is a function of joint angles θ and the deformation Dn is a joint function of shape parameters v and joint angles θ. [sent-311, score-0.563]

67 When using Microsoft Kinect, we can estimate the corresponding body segment l[zm] for each point zm using skeleton information. [sent-313, score-0.346]

68 This is useful in searching for the closest vertex yclosest(zm) since it can significantly reduce the searching scope to the vertices on the body segment l[zm] . [sent-314, score-0.348]

69 Other subjects perform either 10 predefined poses (randomly selected) or just the neutral pose. [sent-324, score-0.26]

70 50 subjects only have the neutral pose and the other 39 subjects have more poses. [sent-328, score-0.377]

71 A subject with more than two poses (except the reference subject) is selected as the validation data, and the remaining subjects are used for training both SCAPE and TenBo models. [sent-332, score-0.311]

72 Accurate prediction requires (a) a good model to capture the relationship between the deformation and shape/pose parameters, and (b) accurate estimation of global shape parameters v. [sent-338, score-0.45]

73 We use the average deformation error in the prediction over all validation subjects as the evaluation measure. [sent-339, score-0.422]

74 The deformation error between a predicted mesh Yr (including vertices {yr1 . [sent-340, score-0.431]

75 error is approximately proportional to the difference in deformation matrix Dn because: Δyrn,q − Δyno,q ≈ Rl[n] (Drn − Dno)Δxn,q, (10) where Drn is the deformation matrix to generate the predicted mesh Yr, Dno is the ground truth deformation matrix obtained from the original mesh Yo. [sent-355, score-1.01]

76 In our experiment, the deformation error is highly correlated to the average deformation matrix difference (En | |Drn − Dno | |) with Pearson correlation 0. [sent-356, score-0.487]

77 Since the deformation error and the Hausdorff distance have the same trend in results, we only show the deformation error for the sake of brevity. [sent-361, score-0.488]

78 When training the TenBo model, we heuristically choose the local shape dimension It = 4 due to the low dimensional shape of body segment. [sent-367, score-0.416]

79 In the prediction step, we predict the 3D geometry for non-neutral poses for the validation subject using the global shape vector v (estimated under the neutral pose) and correspondingjoint angles θ for the non-neutral poses (computed in preprocessing). [sent-368, score-0.577]

80 We estimate the global shape vector v using the neutral pose in two different ways: (a) using entire body scan as input, and (b) using sampled vertices as input. [sent-369, score-0.58]

81 1 Using Entire Body Mesh for Shape Estimation Using the entire body scan as input, the deformation tensor dl and joint angles θ are available (calculated in preprocessing). [sent-372, score-0.673]

82 Figure (5) shows the prediction error for the SCAPE model and four TenBo models over different number of global shape parameters Iv. [sent-375, score-0.252]

83 This is likely because the mesh resolution is low and the mesh alignment is not perfect. [sent-386, score-0.258]

84 In the rest of this section, we use 4 global shape parameters (Iv = 4) for both SCAPE and TenBo models, and use 10 deformation bases (Kt = 10) for TenBo. [sent-388, score-0.401]

85 For each validation subject, we randomly sample vertices on the 3D mesh of the neutral pose. [sent-399, score-0.308]

86 Different from using the entire mesh, we can not compute the deformation and joint angles directly from the sampled vertices. [sent-401, score-0.303]

87 Therefore, we use the fitting algorithm in Section 7 to estimate both shape and pose parameters. [sent-402, score-0.255]

88 We have three observations - (a) TenBo model outperforms the SCAPE model, (b) prediction error converges when the sampling rate is above 10%, so we do not need a lot of data to estimate body shape, and (c) sampling the whole body helps when the sampling rate is lower than 10%. [sent-405, score-0.572]

89 The fitting algorithm is run on a single desktop without GPU acceleration to fit 100 point clouds, which are generate by random vertex sampling on 100 3D body scans (at sampling rate 20%). [sent-416, score-0.361]

90 We use the average shape at the neutral pose as the initial guess. [sent-417, score-0.3]

91 Note that both models have the same complexity in animation where shape parameters are known and only pose parameters change over time. [sent-429, score-0.301]

92 3D Reconstruction Using Microsoft Kinect We also apply the TenBo model to capture 3D human body using Microsoft Kinect. [sent-432, score-0.272]

93 With the help of skeleton, the body segment correspondence for pixels in the depth map can be easily estimated. [sent-433, score-0.257]

94 Conclusion This paper presents a novel tensor-based 3D human body model (TenBo model). [sent-440, score-0.272]

95 Compared with the popular SCAPE model which separates the shape and pose deformations, the proposed approach jointly models shape and pose deformations in a systematic manner. [sent-441, score-0.483]

96 Our TenBo model is capable of capturing the human body shape using the depth map and skeleton provided by Microsoft Kinect sensors and generating animations. [sent-443, score-0.406]

97 Learning a correlated model of identity and pose-dependent body shape variation for real-time synthesis. [sent-468, score-0.328]

98 Estimating human shape and pose from a single image. [sent-513, score-0.269]

99 Multilinear pose and body shape estimation of dressed subjects from image sets. [sent-524, score-0.53]

100 A statistical model of human pose and body shape. [sent-533, score-0.379]

similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('tenbo', 0.665), ('scape', 0.476), ('deformation', 0.221), ('body', 0.198), ('mesh', 0.129), ('tensor', 0.119), ('shape', 0.109), ('pose', 0.107), ('gl', 0.107), ('sl', 0.098), ('subjects', 0.093), ('bl', 0.088), ('rl', 0.084), ('neutral', 0.084), ('poses', 0.083), ('dn', 0.072), ('subject', 0.069), ('zm', 0.064), ('hasler', 0.063), ('segment', 0.059), ('iv', 0.059), ('vertices', 0.058), ('human', 0.053), ('dl', 0.053), ('triangle', 0.051), ('prediction', 0.048), ('meshes', 0.047), ('kt', 0.045), ('kinect', 0.044), ('microsoft', 0.042), ('lth', 0.042), ('joint', 0.042), ('angles', 0.04), ('fitting', 0.039), ('alv', 0.038), ('itjtkt', 0.038), ('yclosest', 0.038), ('multilinear', 0.038), ('validation', 0.037), ('guan', 0.036), ('scans', 0.035), ('drn', 0.034), ('includes', 0.034), ('triangles', 0.034), ('vertex', 0.033), ('male', 0.032), ('balan', 0.031), ('freifeld', 0.031), ('animation', 0.031), ('cloud', 0.031), ('deformations', 0.03), ('allen', 0.03), ('reference', 0.029), ('jt', 0.029), ('dno', 0.028), ('dli', 0.028), ('sampling', 0.028), ('reshaping', 0.027), ('parameters', 0.027), ('female', 0.026), ('principal', 0.026), ('bltbl', 0.026), ('dlv', 0.026), ('drape', 0.026), ('itiv', 0.026), ('msec', 0.026), ('naked', 0.026), ('tensorfaces', 0.026), ('vtalt', 0.026), ('weiss', 0.025), ('skeleton', 0.025), ('preprocessing', 0.024), ('global', 0.024), ('ji', 0.023), ('error', 0.023), ('dressed', 0.023), ('animations', 0.023), ('vasilescu', 0.023), ('vtm', 0.023), ('hirshberg', 0.023), ('matrix', 0.022), ('angle', 0.021), ('model', 0.021), ('ahlen', 0.021), ('thorm', 0.021), ('zuffi', 0.021), ('bases', 0.02), ('um', 0.02), ('rearrange', 0.02), ('slt', 0.02), ('forearm', 0.02), ('al', 0.019), ('hausdorff', 0.019), ('pearson', 0.019), ('rosenhahn', 0.019), ('cross', 0.019), ('segments', 0.018), ('mna', 0.018), ('mpi', 0.018), ('popovi', 0.018)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99999982 426 cvpr-2013-Tensor-Based Human Body Modeling

Author: Yinpeng Chen, Zicheng Liu, Zhengyou Zhang

2 0.17210639 424 cvpr-2013-Templateless Quasi-rigid Shape Modeling with Implicit Loop-Closure

Author: Ming Zeng, Jiaxiang Zheng, Xuan Cheng, Xinguo Liu

Abstract: This paper presents a method for quasi-rigid objects modeling from a sequence of depth scans captured at different time instances. As quasi-rigid objects, such as human bodies, usually have shape motions during the capture procedure, it is difficult to reconstruct their geometries. We represent the shape motion by a deformation graph, and propose a model-to-partmethod to gradually integrate sampled points of depth scans into the deformation graph. Under an as-rigid-as-possible assumption, the model-to-part method can adjust the deformation graph non-rigidly, so as to avoid error accumulation in alignment, which also implicitly achieves loop-closure. To handle the drift and topological error for the deformation graph, two algorithms are introduced. First, we use a two-stage registration to largely keep the rigid motion part. Second, in the step of graph integration, we topology-adaptively integrate new parts and dynamically control the regularization effect of the deformation graph. We demonstrate the effectiveness and robustness of our method by several depth sequences of quasi-rigid objects, and an application in human shape modeling.

3 0.12598328 316 cvpr-2013-Optical Flow Estimation Using Laplacian Mesh Energy

Author: Wenbin Li, Darren Cosker, Matthew Brown, Rui Tang

Abstract: In this paper we present a novel non-rigid optical flow algorithm for dense image correspondence and non-rigid registration. The algorithm uses a unique Laplacian Mesh Energy term to encourage local smoothness whilst simultaneously preserving non-rigid deformation. Laplacian deformation approaches have become popular in graphics research as they enable mesh deformations to preserve local surface shape. In this work we propose a novel Laplacian Mesh Energy formula to ensure such sensible local deformations between image pairs. We express this wholly within the optical flow optimization, and show its application in a novel coarse-to-fine pyramidal approach. Our algorithm achieves the state-of-the-art performance in all trials on the Garg et al. dataset, and top tier performance on the Middlebury evaluation.

4 0.12518233 334 cvpr-2013-Pose from Flow and Flow from Pose

Author: Katerina Fragkiadaki, Han Hu, Jianbo Shi

Abstract: Human pose detectors, although successful in localising faces and torsos of people, often fail with lower arms. Motion estimation is often inaccurate under fast movements of body parts. We build a segmentation-detection algorithm that mediates the information between body parts recognition, and multi-frame motion grouping to improve both pose detection and tracking. Motion of body parts, though not accurate, is often sufficient to segment them from their backgrounds. Such segmentations are crucialfor extracting hard to detect body parts out of their interior body clutter. By matching these segments to exemplars we obtain pose labeled body segments. The pose labeled segments and corresponding articulated joints are used to improve the motion flow fields by proposing kinematically constrained affine displacements on body parts. The pose-based articulated motion model is shown to handle large limb rotations and displacements. Our algorithm can detect people under rare poses, frequently missed by pose detectors, showing the benefits of jointly reasoning about pose, segmentation and motion in videos.

5 0.12140173 40 cvpr-2013-An Approach to Pose-Based Action Recognition

Author: Chunyu Wang, Yizhou Wang, Alan L. Yuille

Abstract: We address action recognition in videos by modeling the spatial-temporal structures of human poses. We start by improving a state of the art method for estimating human joint locations from videos. More precisely, we obtain the K-best estimations output by the existing method and incorporate additional segmentation cues and temporal constraints to select the “best” one. Then we group the estimated joints into five body parts (e.g. the left arm) and apply data mining techniques to obtain a representation for the spatial-temporal structures of human actions. This representation captures the spatial configurations ofbodyparts in one frame (by spatial-part-sets) as well as the body part movements(by temporal-part-sets) which are characteristic of human actions. It is interpretable, compact, and also robust to errors on joint estimations. Experimental results first show that our approach is able to localize body joints more accurately than existing methods. Next we show that it outperforms state of the art action recognizers on the UCF sport, the Keck Gesture and the MSR-Action3D datasets.

6 0.12129438 14 cvpr-2013-A Joint Model for 2D and 3D Pose Estimation from a Single Image

7 0.11650816 206 cvpr-2013-Human Pose Estimation Using Body Parts Dependent Joint Regressors

8 0.10592746 335 cvpr-2013-Poselet Conditioned Pictorial Structures

9 0.098611869 284 cvpr-2013-Mesh Based Semantic Modelling for Indoor and Outdoor Scenes

10 0.095215827 444 cvpr-2013-Unconstrained Monocular 3D Human Pose Estimation by Action Detection and Cross-Modality Regression Forest

11 0.092114866 301 cvpr-2013-Multi-target Tracking by Rank-1 Tensor Approximation

12 0.088611826 207 cvpr-2013-Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation

13 0.087692387 297 cvpr-2013-Multi-resolution Shape Analysis via Non-Euclidean Wavelets: Applications to Mesh Segmentation and Surface Alignment Problems

14 0.072355106 97 cvpr-2013-Correspondence-Less Non-rigid Registration of Triangular Surface Meshes

15 0.071453989 459 cvpr-2013-Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots

16 0.068948537 82 cvpr-2013-Class Generative Models Based on Feature Regression for Pose Estimation of Object Categories

17 0.067552283 89 cvpr-2013-Computationally Efficient Regression on a Dependency Graph for Human Pose Estimation

18 0.066074073 306 cvpr-2013-Non-rigid Structure from Motion with Diffusion Maps Prior

19 0.062460948 60 cvpr-2013-Beyond Physical Connections: Tree Models in Human Pose Estimation

20 0.060864605 438 cvpr-2013-Towards Pose Robust Face Recognition

similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.122), (1, 0.039), (2, -0.007), (3, -0.041), (4, -0.009), (5, -0.033), (6, 0.028), (7, 0.03), (8, 0.044), (9, -0.091), (10, -0.013), (11, 0.132), (12, -0.116), (13, 0.002), (14, 0.052), (15, -0.032), (16, 0.019), (17, 0.02), (18, -0.043), (19, 0.003), (20, -0.098), (21, -0.001), (22, -0.005), (23, 0.011), (24, -0.104), (25, -0.018), (26, -0.002), (27, -0.031), (28, 0.001), (29, -0.046), (30, 0.058), (31, -0.044), (32, -0.046), (33, -0.036), (34, 0.02), (35, 0.021), (36, -0.067), (37, -0.006), (38, -0.009), (39, 0.037), (40, -0.058), (41, -0.054), (42, 0.046), (43, 0.038), (44, 0.011), (45, -0.02), (46, -0.044), (47, 0.069), (48, -0.039), (49, 0.002)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.93373412 426 cvpr-2013-Tensor-Based Human Body Modeling

Author: Yinpeng Chen, Zicheng Liu, Zhengyou Zhang

2 0.644898 335 cvpr-2013-Poselet Conditioned Pictorial Structures

Author: Leonid Pishchulin, Mykhaylo Andriluka, Peter Gehler, Bernt Schiele

Abstract: In this paper we consider the challenging problem of articulated human pose estimation in still images. We observe that despite high variability of the body articulations, human motions and activities often simultaneously constrain the positions of multiple body parts. Modelling such higher order part dependencies seemingly comes at a cost of more expensive inference, which resulted in their limited use in state-of-the-art methods. In this paper we propose a model that incorporates higher order part dependencies while remaining efficient. We achieve this by defining a conditional model in which all body parts are connected a-priori, but which becomes a tractable tree-structured pictorial structures model once the image observations are available. In order to derive a set of conditioning variables we rely on the poselet-based features that have been shown to be effective for people detection but have so far found limited application for articulated human pose estimation. We demon- strate the effectiveness of our approach on three publicly available pose estimation benchmarks improving or being on-par with state of the art in each case.

3 0.62214202 14 cvpr-2013-A Joint Model for 2D and 3D Pose Estimation from a Single Image

Author: Edgar Simo-Serra, Ariadna Quattoni, Carme Torras, Francesc Moreno-Noguer

Abstract: We introduce a novel approach to automatically recover 3D human pose from a single image. Most previous work follows a pipelined approach: initially, a set of 2D features such as edges, joints or silhouettes are detected in the image, and then these observations are used to infer the 3D pose. Solving these two problems separately may lead to erroneous 3D poses when the feature detector has performed poorly. In this paper, we address this issue by jointly solving both the 2D detection and the 3D inference problems. For this purpose, we propose a Bayesian framework that integrates a generative model based on latent variables and discriminative 2D part detectors based on HOGs, and perform inference using evolutionary algorithms. Real experimentation demonstrates competitive results, and the ability of our methodology to provide accurate 2D and 3D pose estimations even when the 2D detectors are inaccurate.

4 0.61509389 206 cvpr-2013-Human Pose Estimation Using Body Parts Dependent Joint Regressors

Author: Matthias Dantone, Juergen Gall, Christian Leistner, Luc Van_Gool

Abstract: In this work, we address the problem of estimating 2d human pose from still images. Recent methods that rely on discriminatively trained deformable parts organized in a tree model have shown to be very successful in solving this task. Within such a pictorial structure framework, we address the problem of obtaining good part templates by proposing novel, non-linear joint regressors. In particular, we employ two-layered random forests as joint regressors. The first layer acts as a discriminative, independent body part classifier. The second layer takes the estimated class distributions of the first one into account and is thereby able to predict joint locations by modeling the interdependence and co-occurrence of the parts. This results in a pose estimation framework that takes dependencies between body parts already for joint localization into account and is thus able to circumvent typical ambiguities of tree structures, such as for legs and arms. In the experiments, we demonstrate that our body parts dependent joint regressors achieve a higher joint localization accuracy than tree-based state-of-the-art methods.

5 0.6048165 297 cvpr-2013-Multi-resolution Shape Analysis via Non-Euclidean Wavelets: Applications to Mesh Segmentation and Surface Alignment Problems

Author: Won Hwa Kim, Moo K. Chung, Vikas Singh

Abstract: The analysis of 3-D shape meshes is a fundamental problem in computer vision, graphics, and medical imaging. Frequently, the needs of the application require that our analysis take a multi-resolution view of the shape ’s local and global topology, and that the solution is consistent across multiple scales. Unfortunately, the preferred mathematical construct which offers this behavior in classical image/signal processing, Wavelets, is no longer applicable in this general setting (data with non-uniform topology). In particular, the traditional definition does not allow writing out an expansion for graphs that do not correspond to the uniformly sampled lattice (e.g., images). In this paper, we adapt recent results in harmonic analysis, to derive NonEuclidean Wavelets based algorithms for a range of shape analysis problems in vision and medical imaging. We show how descriptors derived from the dual domain representation offer native multi-resolution behavior for characterizing local/global topology around vertices. With only minor modifications, the framework yields a method for extracting interest/key points from shapes, a surprisingly simple algorithm for 3-D shape segmentation (competitive with state of the art), and a method for surface alignment (without landmarks). We give an extensive set of comparison results on a large shape segmentation benchmark and derive a uniqueness theorem for the surface alignment problem.

6 0.59712911 2 cvpr-2013-3D Pictorial Structures for Multiple View Articulated Pose Estimation

7 0.58337468 207 cvpr-2013-Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation

8 0.58333784 97 cvpr-2013-Correspondence-Less Non-rigid Registration of Triangular Surface Meshes

9 0.57972658 89 cvpr-2013-Computationally Efficient Regression on a Dependency Graph for Human Pose Estimation

10 0.54117388 424 cvpr-2013-Templateless Quasi-rigid Shape Modeling with Implicit Loop-Closure

11 0.52966839 277 cvpr-2013-MODEC: Multimodal Decomposable Models for Human Pose Estimation

12 0.52844322 45 cvpr-2013-Articulated Pose Estimation Using Discriminative Armlet Classifiers

13 0.52734786 444 cvpr-2013-Unconstrained Monocular 3D Human Pose Estimation by Action Detection and Cross-Modality Regression Forest

14 0.52448392 40 cvpr-2013-An Approach to Pose-Based Action Recognition

15 0.52381665 82 cvpr-2013-Class Generative Models Based on Feature Regression for Pose Estimation of Object Categories

16 0.52248806 321 cvpr-2013-PDM-ENLOR: Learning Ensemble of Local PDM-Based Regressions

17 0.5155893 60 cvpr-2013-Beyond Physical Connections: Tree Models in Human Pose Estimation

18 0.51002151 141 cvpr-2013-Efficient Computation of Shortest Path-Concavity for 3D Meshes

19 0.48836711 334 cvpr-2013-Pose from Flow and Flow from Pose

20 0.48090333 159 cvpr-2013-Expressive Visual Text-to-Speech Using Active Appearance Models

similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(10, 0.122), (16, 0.025), (26, 0.044), (33, 0.243), (59, 0.017), (67, 0.055), (69, 0.051), (76, 0.237), (80, 0.012), (87, 0.064), (93, 0.013)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.84212512 317 cvpr-2013-Optimal Geometric Fitting under the Truncated L2-Norm

Author: Erik Ask, Olof Enqvist, Fredrik Kahl

Abstract: This paper is concerned with model fitting in the presence of noise and outliers. Previously it has been shown that the number of outliers can be minimized with polynomial complexity in the number of measurements. This paper improves on these results in two ways. First, it is shown that for a large class of problems, the statistically more desirable truncated L2-norm can be optimized with the same complexity. Then, with the same methodology, it is shown how to transform multi-model fitting into a purely combinatorial problem—with worst-case complexity that is polynomial in the number of measurements, though exponential in the number of models. We apply our framework to a series of hard registration and stitching problems demonstrating that the approach is not only of theoretical interest. It gives a practical method for simultaneously dealing with measurement noise and large amounts of outliers for fitting problems with lowdimensional models.

same-paper 2 0.83118075 426 cvpr-2013-Tensor-Based Human Body Modeling

Author: Yinpeng Chen, Zicheng Liu, Zhengyou Zhang

3 0.82602549 341 cvpr-2013-Procrustean Normal Distribution for Non-rigid Structure from Motion

Author: Minsik Lee, Jungchan Cho, Chong-Ho Choi, Songhwai Oh

Abstract: Non-rigid structure from motion is a fundamental problem in computer vision, which is yet to be solved satisfactorily. The main difficulty of the problem lies in choosing the right constraints for the solution. In this paper, we propose new constraints that are more effective for non-rigid shape recovery. Unlike the other proposals which have mainly focused on restricting the deformation space using rank constraints, our proposal constrains the motion parameters so that the 3D shapes are most closely aligned to each other, which makes the rank constraints unnecessary. Based on these constraints, we define a new class ofprobability distribution called the Procrustean normal distribution and propose a new NRSfM algorithm, EM-PND. The experimental results show that the proposed method outperforms the existing methods, and it works well even if there is no temporal dependence between the observed samples.

4 0.82243866 434 cvpr-2013-Topical Video Object Discovery from Key Frames by Modeling Word Co-occurrence Prior

Author: Gangqiang Zhao, Junsong Yuan, Gang Hua

Abstract: A topical video object refers to an object that is frequently highlighted in a video. It could be, e.g., the product logo and the leading actor/actress in a TV commercial. We propose a topic model that incorporates a word co-occurrence prior for efficient discovery of topical video objects from a set of key frames. Previous work using topic models, such as Latent Dirichelet Allocation (LDA), for video object discovery often takes a bag-of-visual-words representation, which ignored important co-occurrence information among the local features. We show that such data driven co-occurrence information from bottom-up can conveniently be incorporated in LDA with a Gaussian Markov prior, which combines top down probabilistic topic modeling with bottom up priors in a unified model. Our experiments on challenging videos demonstrate that the proposed approach can discover different types of topical objects despite variations in scale, view-point, color and lighting changes, or even partial occlusions. The efficacy of the co-occurrence prior is clearly demonstrated when comparing with topic models without such priors.

5 0.8152287 201 cvpr-2013-Heterogeneous Visual Features Fusion via Sparse Multimodal Machine

Author: Hua Wang, Feiping Nie, Heng Huang, Chris Ding

Abstract: To better understand, search, and classify image and video information, many visual feature descriptors have been proposed to describe elementary visual characteristics, such as the shape, the color, the texture, etc. How to integrate these heterogeneous visual features and identify the important ones from them for specific vision tasks has become an increasingly critical problem. In this paper, We propose a novel Sparse Multimodal Learning (SMML) approach to integrate such heterogeneous features by using the joint structured sparsity regularizations to learn the feature importance of for the vision tasks from both group-wise and individual point of views. A new optimization algorithm is also introduced to solve the non-smooth objective with rigorously proved global convergence. We applied our SMML method to five broadly used object categorization and scene understanding image data sets for both singlelabel and multi-label image classification tasks. For each data set we integrate six different types of popularly used image features. Compared to existing scene and object cat- egorization methods using either single modality or multimodalities of features, our approach always achieves better performances measured.

6 0.80442697 332 cvpr-2013-Pixel-Level Hand Detection in Ego-centric Videos

7 0.77142143 248 cvpr-2013-Learning Collections of Part Models for Object Recognition

8 0.77110726 413 cvpr-2013-Story-Driven Summarization for Egocentric Video

9 0.76772082 414 cvpr-2013-Structure Preserving Object Tracking

10 0.76680696 225 cvpr-2013-Integrating Grammar and Segmentation for Human Pose Estimation

11 0.76643753 325 cvpr-2013-Part Discovery from Partial Correspondence

12 0.76607352 446 cvpr-2013-Understanding Indoor Scenes Using 3D Geometric Phrases

13 0.76593524 365 cvpr-2013-Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities

14 0.76575702 408 cvpr-2013-Spatiotemporal Deformable Part Models for Action Detection

15 0.76562858 285 cvpr-2013-Minimum Uncertainty Gap for Robust Visual Tracking

16 0.76439911 104 cvpr-2013-Deep Convolutional Network Cascade for Facial Point Detection

17 0.76394874 445 cvpr-2013-Understanding Bayesian Rooms Using Composite 3D Object Models

18 0.76338822 98 cvpr-2013-Cross-View Action Recognition via a Continuous Virtual Path

19 0.7632153 14 cvpr-2013-A Joint Model for 2D and 3D Pose Estimation from a Single Image

20 0.76286787 242 cvpr-2013-Label Propagation from ImageNet to 3D Point Clouds