cvpr cvpr2013 cvpr2013-415 knowledge-graph by maker-knowledge-mining

415 cvpr-2013-Structured Face Hallucination


Source: pdf

Author: Chih-Yuan Yang, Sifei Liu, Ming-Hsuan Yang

Abstract: The goal of face hallucination is to generate highresolution images with fidelity from low-resolution ones. In contrast to existing methods based on patch similarity or holistic constraints in the image space, we propose to exploit local image structures for face hallucination. Each face image is represented in terms of facial components, contours and smooth regions. The image structure is maintained via matching gradients in the reconstructed highresolution output. For facial components, we align input images to generate accurate exemplars and transfer the high-frequency details for preserving structural consistency. For contours, we learn statistical priors to generate salient structures in the high-resolution images. A patch matching method is utilized on the smooth regions where the image gradients are preserved. Experimental results demonstrate that the proposed algorithm generates hallucinated face images with favorable quality and adaptability.

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Structured Face Hallucination Chih-Yuan Yang Sifei Liu Ming-Hsuan Yang Electrical Engineering and Computer Science University of California at Merced {cyang3 5 , sl 2 iu3 , Abstract The goal of face hallucination is to generate highresolution images with fidelity from low-resolution ones. [sent-1, score-0.473]

2 In contrast to existing methods based on patch similarity or holistic constraints in the image space, we propose to exploit local image structures for face hallucination. [sent-2, score-0.196]

3 Each face image is represented in terms of facial components, contours and smooth regions. [sent-3, score-0.425]

4 For facial components, we align input images to generate accurate exemplars and transfer the high-frequency details for preserving structural consistency. [sent-5, score-0.338]

5 A patch matching method is utilized on the smooth regions where the image gradients are preserved. [sent-7, score-0.201]

6 Experimental results demonstrate that the proposed algorithm generates hallucinated face images with favorable quality and adaptability. [sent-8, score-0.252]

7 Introduction Face hallucination is a domain-specific super-resolution problem with the goal to generate high-resolution (HR) images from low-resolution (LR) inputs, which finds numerous vision applications. [sent-10, score-0.329]

8 Since a LR image can be modeled from a HR image by a linear convolution process with downsampling, the hallucination problem can be viewed as an inverse task to reconstruct the high-frequency details. [sent-11, score-0.263]

9 In this paper, we propose a face hallucination algorithm that exploits domain-specific image structures to generate HR results with high fidelity. [sent-13, score-0.424]

10 A landmark detection algorithm is utilized to locate facial components and contours, and process facial alignment in both frontal faces and those at different poses. [sent-15, score-0.675]

11 In this work, the exemplar face dataset consists both LR face images and the corresponding HR ones. [sent-16, score-0.51]

12 The landmark points of each HR exemplar face mhyang} @ucmerced . [sent-17, score-0.463]

13 From the set of HR exemplar images, the corresponding LR images with landmarks and labels are generated. [sent-19, score-0.333]

14 Given a test LR image, the pose and landmark points are extracted from an intermediate HR image via bicubic interpolation. [sent-20, score-0.214]

15 Based on the pose and landmark points, the aligned facial components of the input images are compared with those of the training LR images. [sent-21, score-0.459]

16 The LR exemplar images with most similar components are selected, and their gradients are preserved in reconstructing the output HR image. [sent-22, score-0.463]

17 To preserve the structure of edges, we generate HR edges through an anisotropic interpolation and restore the sharpness of edges via statistical priors. [sent-23, score-0.431]

18 For other remaining smooth region, we generate the HR details through a patch match method. [sent-24, score-0.22]

19 Extensive experiments with comparisons to the state-of-the-art methods show that high-quality images with richer details can be generated by the proposed algorithm without assuming faces are well aligned, at fixed pose and without facial expression change. [sent-26, score-0.409]

20 Related Work In contrast to generic super-resolution algorithms, recent work in face hallucination aims to learn the mapping between HR and LR patches from a set of exemplar images to recover the missing details of an input frame. [sent-28, score-0.71]

21 In [1], the relationship between LR and HR image patches are modeled in a probabilistic framework such that high-frequency details can be transferred from exemplar images for face hallucination. [sent-29, score-0.525]

22 For every query patch cropped from an input image, the most similar LR patch is retrieved from an exemplar set and the corresponding HR patch is transferred in terms of the first and second order derivatives. [sent-30, score-0.454]

23 The gen- erated face images contain significantly richer details than those by bicubic interpolation, but some artifacts also can be introduced as the transferred HR patches are not structurally consistent although their LR patches are similar to the LR test patches. [sent-31, score-0.441]

24 (e) Based on the component masks, the corresponding HR components are found and the set of gradient maps Uc are generated. [sent-38, score-0.289]

25 (f) Priors are used to restore edge sharpness and generate the set of gradient maps Ue. [sent-39, score-0.432]

26 (g) Three sets of gradient maps based on components, edges and smooth regions are generated. [sent-40, score-0.283]

27 In [9], a face hallucination method is proposed which enforces linear constraints for HR face images using a subspace learned from a set of training images via Principal Component Analysis (PCA). [sent-45, score-0.552]

28 The global linear constraints of subspace representations are replaced by multiple local constraints learned from exemplar patches [10]. [sent-52, score-0.337]

29 When the exemplar and test images are precisely aligned with similar appearance, the adopted local linear constraints are effective as the mapping between HR and LR local patches can be modeled via manifold learning [3]. [sent-53, score-0.371]

30 However, the resulting HR images may contain significant noisy artifacts along contours since the number of training patches collected along edges are relatively less than that of smooth regions and thus the sparse representation dictionary is not effective in reconstructing these regions. [sent-56, score-0.31]

31 This method performs well when training faces are highly similar to the test face in terms ofthe identity, pose, and expression. [sent-58, score-0.209]

32 Proposed Algorithm Given a LR test image Il, we generate a set of HR gradient maps U from exemplar images such that we can generate a HR image Ih based on matching HR gradients and LR intensity values by Ih= argImin? [sent-61, score-0.689]

33 GWaue group image asntrduc ↓t ruerepsr eosfe a tfsa ace d ionwton tsharmeep lcinatgegories including facial components, edges, and smooth regions, whose gradients are generated by specific methods to produce the best visual quality. [sent-66, score-0.356]

34 The gradients of facial components are transferred from the corresponding components of exemplar images to preserve the consistency of highfrequency details. [sent-67, score-0.755]

35 In addition, we ex- ploit the similarity between the test image and the training images to drive an efficient patch matching algorithm to reduce the computational load of retrieving exemplar patches. [sent-73, score-0.374]

36 Gradient Maps for Facial Components In order to generate effective gradients of facial components, we prepare a dataset in which every face image is associated with a set of landmark points and two label sets indicating the pose of the face and the existence of glasses on the face. [sent-77, score-0.867]

37 The landmark points are used to generate an aligned image while the pose and glasses labels restrict the search domains. [sent-78, score-0.406]

38 Given a LR test image Il, we generate an intermediate HR image Ib by bicubic interpolation, localize its landmark points and estimate the pose of the test image using [17]. [sent-80, score-0.29]

39 We use the estimated results to select a set of exemplar images in the dataset which have the same pose as Ib. [sent-81, score-0.32]

40 Each face is annotated by several landmark points such that all the facial components and contours are known (Figure 1(b)). [sent-82, score-0.538]

41 Suppose a facial component is annotated by n landmark points denoted as {xbi, ybi}in=1 of Ib and t{axteei,d y beiy}in n= l1a nodfm an exemplar image. [sent-83, score-0.569]

42 We use the estimated parameters to generate an aligned exemplar image, denoted by H. [sent-93, score-0.371]

43 Note that the alignment is carried out for each facial component individually, which is different from existing methods [9, 16, 10] in which faces are aligned based on eyes locations. [sent-94, score-0.398]

44 The proposed alignment approach is more flexible for dealing with face images containing various expressions and shapes because they cannot be effectively aligned by eye positions only. [sent-95, score-0.216]

45 Suppose {Hj } is a set of aligned iHniRn exemplar images fso. [sent-97, score-0.319]

46 We generate the corresponding LR exemplar image Lj = (Hj ⊗ G) ↓, (3) and compare Lj and the Il to determine the best exemplar image for the component c. [sent-101, score-0.622]

47 Based on the landmark points belonging to component c estimated from Ib, we create a HR mask map Figure 2. [sent-103, score-0.188]

48 3 and determine the best exemplar by j∗= argj∈mSinp? [sent-112, score-0.248]

49 The index set S is determined by the component c and the labels of glasses associated with the exemplar images {Hj }. [sent-116, score-0.461]

50 As we determine the best exemplar by comparing features in LR, to prevent artifacts caused by selecting an incorrect HR exemplar image, we utilize the labels of glasses to exclude gradient maps Vj from index set S if the component c may be covered by glasses. [sent-120, score-0.897]

51 On the contrary, if the component c is irrelevant to glasses such as a mouth, all Vj are included in S. [sent-121, score-0.189]

52 Figure 2 shows an example of four matched images based on different facial components (the dark boundary of each LR exemplar is the result of alignment). [sent-122, score-0.55]

53 Note that the best matched facial components are matched from images of different subjects. [sent-123, score-0.336]

54 Once the best LR exemplar image Lj∗ is determined for a component, we transfer the gradients of the corresponding source HR image Hj∗ for the pixels whose values in the mask Mh are 1 as the gradients in the set of gradient maps Uc. [sent-125, score-0.65]

55 The same process is carried out for each component to generate the most effective image gradients, and together they form the gradient map Uc. [sent-126, score-0.21]

56 (c) The set of LR similarity maps are upsampled to a set of HR maps through bilinear interpolation to preserve the directions of edges. [sent-131, score-0.433]

57 Although the generated edges are visually pleasing, the HR image may contain significant artifacts (especially along sharp edges) as they are generated by enhancing the contrast of edges from a bicubic interpolated image where edges are jaggy. [sent-138, score-0.396]

58 In this work, we propose to preserve the structure of edges and restore their sharpness through learned statistical priors. [sent-139, score-0.245]

59 Rather than generating sharp edges based on interpolated images, we develop a direction-preserving upsampling function that eliminates the artifacts for prior learning. [sent-140, score-0.21]

60 We use t,h teh upsampled rdviirengcti tohneal d isriemc-ilarity maps {Tk} to regularize an under-constrained optiimlaizriattyio mn problem Id= argImin? [sent-157, score-0.244]

61 Edges in the upsampled image Id are clear and smooth but the not sharp enough because the sharpness is not modeled in the regularization term of Eq. [sent-167, score-0.335]

62 Since the structure of edges are highly symmetric with greatest magnitude of gradients along the center, we label the pixels at edge centers (Figure 4(d)) in Id by C(p) =? [sent-173, score-0.244]

63 We collect millions of samples from images of the exemplar dataset, and separate the domain of (mp, mc, t) into thousands of bins. [sent-179, score-0.272]

64 Suppose Ud are the gradient maps of Id, we generate the Suppose Ud is the set of the gradient maps of Id, we generate the set of gradient maps for facial contours Ue by Ue(p) =m m¯? [sent-190, score-0.916]

65 (8) According to the definition of magnitude of gradients mp = ? [sent-192, score-0.213]

66 We generate LR exemplar images from the matched dataset using Eq. [sent-200, score-0.382]

67 3 and utilize the PatchMatch algorithm [2] to reduce the computational load of retrieving the most similar LR exemplar patches. [sent-201, score-0.273]

68 (h) An image generated by the restored gradients to show the effectiveness of the restored edge sharpness. [sent-214, score-0.238]

69 We extract the gradients of the back-projected HR image as the gradients of smooth regions, denoted by Ub (Figure 1(g)). [sent-218, score-0.242]

70 Integrating Gradient Maps In order to generate the required gradient map set U for producing the output HR image, we generate two weight maps wc and we. [sent-221, score-0.357]

71 We set map wc as the summation of all HR mask maps Mh (Figure 1(b)), and set we (p) = min{1, α m¯e}, where m¯ e is the gradient magnitude in Eq. [sent-222, score-0.302]

72 It is clear that the use of each gradient map facilitates generating better results for different facial structures. [sent-232, score-0.283]

73 (a)(b) The generated gradient maps Uc ensure consistency of high-frequency details at components. [sent-236, score-0.242]

74 (c)(d) The generated gradient maps Ue ensure clear and sharp edges. [sent-237, score-0.237]

75 The pose labels and landmarks of each image are given in the dataset, and we manually generate the glasses labels for training images. [sent-240, score-0.324]

76 One set with 2,184 320 240 images at upright frontal pose of 289 winidthiv 2id,u1a84ls 3is2 0us ×ed 2 as t ihme training dparitagshett f froorn tfaalce p ohsaellu ofc i2n8a9tion experiments (Figure 6, Figure 7 and Figure 8). [sent-241, score-0.197]

77 We generate the input LR images by downsampling the original HR test images through Eq. [sent-244, score-0.192]

78 The ground truth HR images in the test set are used for compar- × isons with the generated hallucination results. [sent-247, score-0.287]

79 For color images, we apply the proposed algorithm on grayscaling channel and the color channels are upsampled by bicubic interpolation to make fair comparisons with existing methods [9, 16, 10]. [sent-251, score-0.277]

80 To label facial landmarks, we use the algorithm of [17] which produces the landmarks as the active appearance model [4] with 68 points as shown in Figure 1(b). [sent-254, score-0.236]

81 As the landmarks of eyebrows and nose do not form a close polygons, we mask eyebrows as the rectangles where the landmarks are the center vertical segments. [sent-256, score-0.305]

82 We implement several state-of-the-art face hallucination 1 1 1 1 1 10 0 03 1 1 algorithms [9, 16, 10] for comparisons. [sent-258, score-0.348]

83 Figures 6, 7 and 8 show hallucinated faces of frontal pose where the input images are enlarged by nearest neighbor interpolation for illustration purpose. [sent-260, score-0.386]

84 , one eye of Figure 6(e) contains glasses and the other one does not) with significant ghosty effects. [sent-268, score-0.218]

85 While the algorithms based on sparse coding [16] and position-patch [10] generate high-frequency textures, the results do not contain fine facial details such as contours and hair (See Figure 6). [sent-269, score-0.432]

86 The proposed algorithm reconstruct fine details of facial components such as the spots and moles in Figure 7(f) and individual tooth in Figure 6(f). [sent-271, score-0.306]

87 In addition, the proposed algorithm also generates bet- ter details due to the glasses label and the component-level alignment. [sent-279, score-0.207]

88 We also compare the hallucination results on faces at different pose. [sent-281, score-0.319]

89 As shown in Figure 9 and Figure 10, the subspace-based methods [10] and [9] do not perform well as both the subspace learning and the patch reconstruction at fix positions require precise face alignment. [sent-283, score-0.21]

90 However, it is more difficult to align face images at different pose such that PCA subspace can be well constructed for hallucination. [sent-284, score-0.228]

91 Conclusion A novel approach that exploits image structures for face hallucination is proposed in this paper. [sent-290, score-0.348]

92 The image struc- tures of a face are grouped into three categories including facial components, edges, and smooth regions. [sent-291, score-0.346]

93 Their gradient maps are generated and integrated to produce HR results with the best visual quality. [sent-292, score-0.204]

94 Experimental results show that the proposed algorithm generates hallucinated face images with fine and consistent details over state-of-the-art algorithms. [sent-293, score-0.314]

95 Qualitative comparison for 4 times upsampled upright frontal faces (results best viewed on a high-resolution display). [sent-425, score-0.407]

96 Qualitative comparison for 4 times upsampled upright frontal faces (results best viewed on a high-resolution display). [sent-445, score-0.407]

97 Qualitative comparison for 4 times upsampled upright frontal faces (results best viewed on a high-resolution display). [sent-465, score-0.407]

98 Qualitative comparison for 4 times upsampled non-frontal faces (results best viewed on a high-resolution display). [sent-485, score-0.282]

99 Qualitative comparison for 4 times upsampled non-frontal faces (results best viewed on a high-resolution display). [sent-505, score-0.282]

100 Qualitative comparison for 4 times upsampled upright frontal faces (results best viewed on a high-resolution display). [sent-525, score-0.407]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('hr', 0.568), ('lr', 0.393), ('exemplar', 0.248), ('hallucination', 0.229), ('facial', 0.175), ('upsampled', 0.158), ('diivine', 0.141), ('glasses', 0.139), ('ssim', 0.122), ('face', 0.119), ('landmark', 0.096), ('gradients', 0.095), ('psnr', 0.095), ('sharpness', 0.092), ('faces', 0.09), ('maps', 0.086), ('gradient', 0.084), ('ghosty', 0.079), ('hallucinated', 0.079), ('contours', 0.079), ('generate', 0.076), ('bicubic', 0.07), ('frontal', 0.07), ('components', 0.069), ('downsampling', 0.068), ('display', 0.066), ('ud', 0.066), ('il', 0.066), ('mp', 0.063), ('landmarks', 0.061), ('edges', 0.061), ('restore', 0.061), ('hj', 0.056), ('magnitude', 0.055), ('upright', 0.055), ('mouth', 0.055), ('id', 0.054), ('patch', 0.054), ('patches', 0.052), ('smooth', 0.052), ('component', 0.05), ('infinite', 0.05), ('upsampling', 0.05), ('eyebrows', 0.049), ('hallucinating', 0.049), ('interpolation', 0.049), ('ue', 0.049), ('pose', 0.048), ('aligned', 0.047), ('mh', 0.046), ('transferred', 0.044), ('nose', 0.043), ('artifacts', 0.042), ('lj', 0.042), ('mask', 0.042), ('mc', 0.04), ('hair', 0.04), ('blocky', 0.04), ('fk', 0.04), ('details', 0.038), ('uc', 0.038), ('restored', 0.038), ('ub', 0.038), ('ib', 0.038), ('ih', 0.037), ('subspace', 0.037), ('qualitative', 0.037), ('superresolution', 0.036), ('eyes', 0.036), ('argimin', 0.035), ('wc', 0.035), ('vj', 0.035), ('generated', 0.034), ('matched', 0.034), ('viewed', 0.034), ('sharp', 0.033), ('edge', 0.033), ('suppose', 0.031), ('preserve', 0.031), ('generates', 0.03), ('directional', 0.028), ('polygons', 0.028), ('upsample', 0.028), ('preserved', 0.027), ('expressions', 0.026), ('enlarged', 0.026), ('patchmatch', 0.026), ('structural', 0.025), ('tip', 0.025), ('load', 0.025), ('qk', 0.025), ('highresolution', 0.025), ('tk', 0.024), ('fine', 0.024), ('masks', 0.024), ('generating', 0.024), ('sk', 0.024), ('images', 0.024), ('similarity', 0.023), ('pleasing', 0.023)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 1.0000001 415 cvpr-2013-Structured Face Hallucination

Author: Chih-Yuan Yang, Sifei Liu, Ming-Hsuan Yang

Abstract: The goal of face hallucination is to generate highresolution images with fidelity from low-resolution ones. In contrast to existing methods based on patch similarity or holistic constraints in the image space, we propose to exploit local image structures for face hallucination. Each face image is represented in terms of facial components, contours and smooth regions. The image structure is maintained via matching gradients in the reconstructed highresolution output. For facial components, we align input images to generate accurate exemplars and transfer the high-frequency details for preserving structural consistency. For contours, we learn statistical priors to generate salient structures in the high-resolution images. A patch matching method is utilized on the smooth regions where the image gradients are preserved. Experimental results demonstrate that the proposed algorithm generates hallucinated face images with favorable quality and adaptability.

2 0.25112838 119 cvpr-2013-Detecting and Aligning Faces by Image Retrieval

Author: Xiaohui Shen, Zhe Lin, Jonathan Brandt, Ying Wu

Abstract: Detecting faces in uncontrolled environments continues to be a challenge to traditional face detection methods[24] due to the large variation in facial appearances, as well as occlusion and clutter. In order to overcome these challenges, we present a novel and robust exemplarbased face detector that integrates image retrieval and discriminative learning. A large database of faces with bounding rectangles and facial landmark locations is collected, and simple discriminative classifiers are learned from each of them. A voting-based method is then proposed to let these classifiers cast votes on the test image through an efficient image retrieval technique. As a result, faces can be very efficiently detected by selecting the modes from the voting maps, without resorting to exhaustive sliding window-style scanning. Moreover, due to the exemplar-based framework, our approach can detect faces under challenging conditions without explicitly modeling their variations. Evaluation on two public benchmark datasets shows that our new face detection approach is accurate and efficient, and achieves the state-of-the-art performance. We further propose to use image retrieval for face validation (in order to remove false positives) and for face alignment/landmark localization. The same methodology can also be easily generalized to other facerelated tasks, such as attribute recognition, as well as general object detection.

3 0.24165717 152 cvpr-2013-Exemplar-Based Face Parsing

Author: Brandon M. Smith, Li Zhang, Jonathan Brandt, Zhe Lin, Jianchao Yang

Abstract: In this work, we propose an exemplar-based face image segmentation algorithm. We take inspiration from previous works on image parsing for general scenes. Our approach assumes a database of exemplar face images, each of which is associated with a hand-labeled segmentation map. Given a test image, our algorithm first selects a subset of exemplar images from the database, Our algorithm then computes a nonrigid warp for each exemplar image to align it with the test image. Finally, we propagate labels from the exemplar images to the test image in a pixel-wise manner, using trained weights to modulate and combine label maps from different exemplars. We evaluate our method on two challenging datasets and compare with two face parsing algorithms and a general scene parsing algorithm. We also compare our segmentation results with contour-based face alignment results; that is, we first run the alignment algorithms to extract contour points and then derive segments from the contours. Our algorithm compares favorably with all previous works on all datasets evaluated.

4 0.18626465 161 cvpr-2013-Facial Feature Tracking Under Varying Facial Expressions and Face Poses Based on Restricted Boltzmann Machines

Author: Yue Wu, Zuoguan Wang, Qiang Ji

Abstract: Facial feature tracking is an active area in computer vision due to its relevance to many applications. It is a nontrivial task, sincefaces may have varyingfacial expressions, poses or occlusions. In this paper, we address this problem by proposing a face shape prior model that is constructed based on the Restricted Boltzmann Machines (RBM) and their variants. Specifically, we first construct a model based on Deep Belief Networks to capture the face shape variations due to varying facial expressions for near-frontal view. To handle pose variations, the frontal face shape prior model is incorporated into a 3-way RBM model that could capture the relationship between frontal face shapes and non-frontal face shapes. Finally, we introduce methods to systematically combine the face shape prior models with image measurements of facial feature points. Experiments on benchmark databases show that with the proposed method, facial feature points can be tracked robustly and accurately even if faces have significant facial expressions and poses.

5 0.14078362 257 cvpr-2013-Learning Structured Low-Rank Representations for Image Classification

Author: Yangmuzi Zhang, Zhuolin Jiang, Larry S. Davis

Abstract: An approach to learn a structured low-rank representation for image classification is presented. We use a supervised learning method to construct a discriminative and reconstructive dictionary. By introducing an ideal regularization term, we perform low-rank matrix recovery for contaminated training data from all categories simultaneously without losing structural information. A discriminative low-rank representation for images with respect to the constructed dictionary is obtained. With semantic structure information and strong identification capability, this representation is good for classification tasks even using a simple linear multi-classifier. Experimental results demonstrate the effectiveness of our approach.

6 0.12905176 77 cvpr-2013-Capturing Complex Spatio-temporal Relations among Facial Muscles for Facial Expression Recognition

7 0.11721221 438 cvpr-2013-Towards Pose Robust Face Recognition

8 0.11225116 92 cvpr-2013-Constrained Clustering and Its Application to Face Clustering in Videos

9 0.10788668 166 cvpr-2013-Fast Image Super-Resolution Based on In-Place Example Regression

10 0.10671437 64 cvpr-2013-Blessing of Dimensionality: High-Dimensional Feature and Its Efficient Compression for Face Verification

11 0.10113302 172 cvpr-2013-Finding Group Interactions in Social Clutter

12 0.099975795 104 cvpr-2013-Deep Convolutional Network Cascade for Facial Point Detection

13 0.093764715 399 cvpr-2013-Single-Sample Face Recognition with Image Corruption and Misalignment via Sparse Illumination Transfer

14 0.091795884 338 cvpr-2013-Probabilistic Elastic Matching for Pose Variant Face Verification

15 0.090423696 427 cvpr-2013-Texture Enhanced Image Denoising via Gradient Histogram Preservation

16 0.086992532 182 cvpr-2013-Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild

17 0.086542301 36 cvpr-2013-Adding Unlabeled Samples to Categories by Learned Attributes

18 0.084843241 325 cvpr-2013-Part Discovery from Partial Correspondence

19 0.081238829 107 cvpr-2013-Deformable Spatial Pyramid Matching for Fast Dense Correspondences

20 0.080540836 189 cvpr-2013-Graph-Based Discriminative Learning for Location Recognition


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.162), (1, -0.028), (2, -0.042), (3, 0.051), (4, 0.049), (5, 0.004), (6, -0.027), (7, -0.05), (8, 0.198), (9, -0.169), (10, 0.098), (11, -0.007), (12, 0.077), (13, 0.092), (14, 0.051), (15, 0.021), (16, 0.027), (17, -0.012), (18, 0.049), (19, 0.065), (20, -0.001), (21, 0.041), (22, 0.034), (23, -0.036), (24, 0.051), (25, 0.044), (26, 0.04), (27, -0.084), (28, 0.035), (29, 0.074), (30, -0.043), (31, -0.068), (32, 0.0), (33, 0.018), (34, -0.109), (35, -0.046), (36, 0.049), (37, -0.106), (38, -0.07), (39, 0.005), (40, 0.001), (41, -0.034), (42, -0.026), (43, 0.043), (44, 0.055), (45, -0.018), (46, -0.038), (47, -0.075), (48, -0.048), (49, -0.053)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.94516462 415 cvpr-2013-Structured Face Hallucination

Author: Chih-Yuan Yang, Sifei Liu, Ming-Hsuan Yang

Abstract: The goal of face hallucination is to generate highresolution images with fidelity from low-resolution ones. In contrast to existing methods based on patch similarity or holistic constraints in the image space, we propose to exploit local image structures for face hallucination. Each face image is represented in terms of facial components, contours and smooth regions. The image structure is maintained via matching gradients in the reconstructed highresolution output. For facial components, we align input images to generate accurate exemplars and transfer the high-frequency details for preserving structural consistency. For contours, we learn statistical priors to generate salient structures in the high-resolution images. A patch matching method is utilized on the smooth regions where the image gradients are preserved. Experimental results demonstrate that the proposed algorithm generates hallucinated face images with favorable quality and adaptability.

2 0.78372908 152 cvpr-2013-Exemplar-Based Face Parsing

Author: Brandon M. Smith, Li Zhang, Jonathan Brandt, Zhe Lin, Jianchao Yang

Abstract: In this work, we propose an exemplar-based face image segmentation algorithm. We take inspiration from previous works on image parsing for general scenes. Our approach assumes a database of exemplar face images, each of which is associated with a hand-labeled segmentation map. Given a test image, our algorithm first selects a subset of exemplar images from the database, Our algorithm then computes a nonrigid warp for each exemplar image to align it with the test image. Finally, we propagate labels from the exemplar images to the test image in a pixel-wise manner, using trained weights to modulate and combine label maps from different exemplars. We evaluate our method on two challenging datasets and compare with two face parsing algorithms and a general scene parsing algorithm. We also compare our segmentation results with contour-based face alignment results; that is, we first run the alignment algorithms to extract contour points and then derive segments from the contours. Our algorithm compares favorably with all previous works on all datasets evaluated.

3 0.74500114 161 cvpr-2013-Facial Feature Tracking Under Varying Facial Expressions and Face Poses Based on Restricted Boltzmann Machines

Author: Yue Wu, Zuoguan Wang, Qiang Ji

Abstract: Facial feature tracking is an active area in computer vision due to its relevance to many applications. It is a nontrivial task, sincefaces may have varyingfacial expressions, poses or occlusions. In this paper, we address this problem by proposing a face shape prior model that is constructed based on the Restricted Boltzmann Machines (RBM) and their variants. Specifically, we first construct a model based on Deep Belief Networks to capture the face shape variations due to varying facial expressions for near-frontal view. To handle pose variations, the frontal face shape prior model is incorporated into a 3-way RBM model that could capture the relationship between frontal face shapes and non-frontal face shapes. Finally, we introduce methods to systematically combine the face shape prior models with image measurements of facial feature points. Experiments on benchmark databases show that with the proposed method, facial feature points can be tracked robustly and accurately even if faces have significant facial expressions and poses.

4 0.70087838 119 cvpr-2013-Detecting and Aligning Faces by Image Retrieval

Author: Xiaohui Shen, Zhe Lin, Jonathan Brandt, Ying Wu

Abstract: Detecting faces in uncontrolled environments continues to be a challenge to traditional face detection methods[24] due to the large variation in facial appearances, as well as occlusion and clutter. In order to overcome these challenges, we present a novel and robust exemplarbased face detector that integrates image retrieval and discriminative learning. A large database of faces with bounding rectangles and facial landmark locations is collected, and simple discriminative classifiers are learned from each of them. A voting-based method is then proposed to let these classifiers cast votes on the test image through an efficient image retrieval technique. As a result, faces can be very efficiently detected by selecting the modes from the voting maps, without resorting to exhaustive sliding window-style scanning. Moreover, due to the exemplar-based framework, our approach can detect faces under challenging conditions without explicitly modeling their variations. Evaluation on two public benchmark datasets shows that our new face detection approach is accurate and efficient, and achieves the state-of-the-art performance. We further propose to use image retrieval for face validation (in order to remove false positives) and for face alignment/landmark localization. The same methodology can also be easily generalized to other facerelated tasks, such as attribute recognition, as well as general object detection.

5 0.68389589 420 cvpr-2013-Supervised Descent Method and Its Applications to Face Alignment

Author: Xuehan Xiong, Fernando De_la_Torre

Abstract: Many computer vision problems (e.g., camera calibration, image alignment, structure from motion) are solved through a nonlinear optimization method. It is generally accepted that 2nd order descent methods are the most robust, fast and reliable approaches for nonlinear optimization ofa general smoothfunction. However, in the context of computer vision, 2nd order descent methods have two main drawbacks: (1) The function might not be analytically differentiable and numerical approximations are impractical. (2) The Hessian might be large and not positive definite. To address these issues, thispaperproposes a Supervised Descent Method (SDM) for minimizing a Non-linear Least Squares (NLS) function. During training, the SDM learns a sequence of descent directions that minimizes the mean of NLS functions sampled at different points. In testing, SDM minimizes the NLS objective using the learned descent directions without computing the Jacobian nor the Hessian. We illustrate the benefits of our approach in synthetic and real examples, and show how SDM achieves state-ofthe-art performance in the problem of facial feature detec- tion. The code is available at www. .human sen sin g. . cs . cmu . edu/in t ra fa ce.

6 0.676557 359 cvpr-2013-Robust Discriminative Response Map Fitting with Constrained Local Models

7 0.60589522 77 cvpr-2013-Capturing Complex Spatio-temporal Relations among Facial Muscles for Facial Expression Recognition

8 0.6038515 399 cvpr-2013-Single-Sample Face Recognition with Image Corruption and Misalignment via Sparse Illumination Transfer

9 0.59364647 159 cvpr-2013-Expressive Visual Text-to-Speech Using Active Appearance Models

10 0.58167523 438 cvpr-2013-Towards Pose Robust Face Recognition

11 0.57006437 463 cvpr-2013-What's in a Name? First Names as Facial Attributes

12 0.56367499 104 cvpr-2013-Deep Convolutional Network Cascade for Facial Point Detection

13 0.53674835 182 cvpr-2013-Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild

14 0.53093064 385 cvpr-2013-Selective Transfer Machine for Personalized Facial Action Unit Detection

15 0.53064746 96 cvpr-2013-Correlation Filters for Object Alignment

16 0.52367032 64 cvpr-2013-Blessing of Dimensionality: High-Dimensional Feature and Its Efficient Compression for Face Verification

17 0.50454974 338 cvpr-2013-Probabilistic Elastic Matching for Pose Variant Face Verification

18 0.46846747 358 cvpr-2013-Robust Canonical Time Warping for the Alignment of Grossly Corrupted Sequences

19 0.46712023 160 cvpr-2013-Face Recognition in Movie Trailers via Mean Sequence Sparse Representation-Based Classification

20 0.45843184 220 cvpr-2013-In Defense of Sparsity Based Face Recognition


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(10, 0.108), (16, 0.03), (26, 0.106), (28, 0.019), (33, 0.24), (39, 0.021), (67, 0.073), (69, 0.033), (80, 0.012), (87, 0.082), (91, 0.2)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.84381145 415 cvpr-2013-Structured Face Hallucination

Author: Chih-Yuan Yang, Sifei Liu, Ming-Hsuan Yang

Abstract: The goal of face hallucination is to generate highresolution images with fidelity from low-resolution ones. In contrast to existing methods based on patch similarity or holistic constraints in the image space, we propose to exploit local image structures for face hallucination. Each face image is represented in terms of facial components, contours and smooth regions. The image structure is maintained via matching gradients in the reconstructed highresolution output. For facial components, we align input images to generate accurate exemplars and transfer the high-frequency details for preserving structural consistency. For contours, we learn statistical priors to generate salient structures in the high-resolution images. A patch matching method is utilized on the smooth regions where the image gradients are preserved. Experimental results demonstrate that the proposed algorithm generates hallucinated face images with favorable quality and adaptability.

2 0.83930111 394 cvpr-2013-Shading-Based Shape Refinement of RGB-D Images

Author: Lap-Fai Yu, Sai-Kit Yeung, Yu-Wing Tai, Stephen Lin

Abstract: We present a shading-based shape refinement algorithm which uses a noisy, incomplete depth map from Kinect to help resolve ambiguities in shape-from-shading. In our framework, the partial depth information is used to overcome bas-relief ambiguity in normals estimation, as well as to assist in recovering relative albedos, which are needed to reliably estimate the lighting environment and to separate shading from albedo. This refinement of surface normals using a noisy depth map leads to high-quality 3D surfaces. The effectiveness of our algorithm is demonstrated through several challenging real-world examples.

3 0.83836162 259 cvpr-2013-Learning a Manifold as an Atlas

Author: Nikolaos Pitelis, Chris Russell, Lourdes Agapito

Abstract: In this work, we return to the underlying mathematical definition of a manifold and directly characterise learning a manifold as finding an atlas, or a set of overlapping charts, that accurately describe local structure. We formulate the problem of learning the manifold as an optimisation that simultaneously refines the continuous parameters defining the charts, and the discrete assignment of points to charts. In contrast to existing methods, this direct formulation of a manifold does not require “unwrapping ” the manifold into a lower dimensional space and allows us to learn closed manifolds of interest to vision, such as those corresponding to gait cycles or camera pose. We report state-ofthe-art results for manifold based nearest neighbour classification on vision datasets, and show how the same techniques can be applied to the 3D reconstruction of human motion from a single image.

4 0.82149565 311 cvpr-2013-Occlusion Patterns for Object Class Detection

Author: Bojan Pepikj, Michael Stark, Peter Gehler, Bernt Schiele

Abstract: Despite the success of recent object class recognition systems, the long-standing problem of partial occlusion remains a major challenge, and a principled solution is yet to be found. In this paper we leave the beaten path of methods that treat occlusion as just another source of noise instead, we include the occluder itself into the modelling, by mining distinctive, reoccurring occlusion patterns from annotated training data. These patterns are then used as training data for dedicated detectors of varying sophistication. In particular, we evaluate and compare models that range from standard object class detectors to hierarchical, part-based representations of occluder/occludee pairs. In an extensive evaluation we derive insights that can aid further developments in tackling the occlusion challenge. –

5 0.81741643 152 cvpr-2013-Exemplar-Based Face Parsing

Author: Brandon M. Smith, Li Zhang, Jonathan Brandt, Zhe Lin, Jianchao Yang

Abstract: In this work, we propose an exemplar-based face image segmentation algorithm. We take inspiration from previous works on image parsing for general scenes. Our approach assumes a database of exemplar face images, each of which is associated with a hand-labeled segmentation map. Given a test image, our algorithm first selects a subset of exemplar images from the database, Our algorithm then computes a nonrigid warp for each exemplar image to align it with the test image. Finally, we propagate labels from the exemplar images to the test image in a pixel-wise manner, using trained weights to modulate and combine label maps from different exemplars. We evaluate our method on two challenging datasets and compare with two face parsing algorithms and a general scene parsing algorithm. We also compare our segmentation results with contour-based face alignment results; that is, we first run the alignment algorithms to extract contour points and then derive segments from the contours. Our algorithm compares favorably with all previous works on all datasets evaluated.

6 0.817092 440 cvpr-2013-Tracking People and Their Objects

7 0.81177902 280 cvpr-2013-Maximum Cohesive Grid of Superpixels for Fast Object Localization

8 0.80705011 185 cvpr-2013-Generalized Domain-Adaptive Dictionaries

9 0.80552471 104 cvpr-2013-Deep Convolutional Network Cascade for Facial Point Detection

10 0.80393153 248 cvpr-2013-Learning Collections of Part Models for Object Recognition

11 0.80318362 380 cvpr-2013-Scene Coordinate Regression Forests for Camera Relocalization in RGB-D Images

12 0.8020705 423 cvpr-2013-Template-Based Isometric Deformable 3D Reconstruction with Sampling-Based Focal Length Self-Calibration

13 0.80109853 4 cvpr-2013-3D Visual Proxemics: Recognizing Human Interactions in 3D from a Single Image

14 0.8005836 225 cvpr-2013-Integrating Grammar and Segmentation for Human Pose Estimation

15 0.79970205 365 cvpr-2013-Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities

16 0.7987383 331 cvpr-2013-Physically Plausible 3D Scene Tracking: The Single Actor Hypothesis

17 0.79839253 414 cvpr-2013-Structure Preserving Object Tracking

18 0.79743457 30 cvpr-2013-Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses

19 0.79721975 254 cvpr-2013-Learning SURF Cascade for Fast and Accurate Object Detection

20 0.79706752 325 cvpr-2013-Part Discovery from Partial Correspondence