cvpr cvpr2013 cvpr2013-195 knowledge-graph by maker-knowledge-mining

195 cvpr-2013-HDR Deghosting: How to Deal with Saturation?


Source: pdf

Author: Jun Hu, Orazio Gallo, Kari Pulli, Xiaobai Sun

Abstract: We present a novel method for aligning images in an HDR (high-dynamic-range) image stack to produce a new exposure stack where all the images are aligned and appear as if they were taken simultaneously, even in the case of highly dynamic scenes. Our method produces plausible results even where the image used as a reference is either too dark or bright to allow for an accurate registration.

Reference: text


Summary: the most important sentences generated by the tfidf model

sentIndex sentText sentNum sentScore

1 Our method produces plausible results even where the image used as a reference is either too dark or bright to allow for an accurate registration. [sent-3, score-0.401]

2 The limited dynamic range of most imaging sensors often fails to capture the irradiance range visible to the human eye in common real-world scenes. [sent-6, score-0.229]

3 A relatively cheap way to address this limitation is to capture a stack of differently exposed pictures of the same scene and merge them, effectively extending the captured range [18, 6]. [sent-7, score-0.419]

4 However, because the merging process assumes that the pixels of the different images are aligned, any motion—either due to the motion of the camera or to anything moving in the scene—will cause ghosting artifacts (if the motion is large) or blurring artifacts (if the motion is small). [sent-8, score-0.406]

5 A common approach to address the artifacts due to the camera motion is to first register the low-dynamic-range (LDR) images, a task complicated by the dramatic changes in brightness across the stack, since most registration algorithms rely on the brightness constancy assumption [31]. [sent-9, score-0.396]

6 [12] address the brightness changes by binarizing each exposure and determining the optimal translation and rotation, respectively. [sent-11, score-0.555]

7 [26] compute the gradient map for each exposure and find a similarity transformation in the Fourier domain. [sent-13, score-0.515]

8 Our approach allows gathering data from the images in the stack even for regions that are severely under- or over-exposed in the reference, a main limitation of many state-of-the-art approaches. [sent-19, score-0.353]

9 camera is static, or that a global registration of the background can be performed. [sent-20, score-0.124]

10 [7] model the exposure change and determine patches that might contain moving objects by counting the pixels that deviate from the predicted behavior. [sent-22, score-0.654]

11 Raman and Chaudhuri [23] follow a similar idea, but they model the intensity change and detect the motion in irregular patches obtained by grouping pixels into super-pixels. [sent-23, score-0.204]

12 These algorithms pay for the reduction of motion artifacts with a potentially reduced dynamic range, as they drop data that does not follow the registration of the background. [sent-24, score-0.273]

13 [12] detect pixels that would cause ghosting based on the variance and entropy across the exposure stack. [sent-27, score-0.598]

14 [10] use a weight that emphasizes well-exposed pixels and a second weight that enforces consistency across spatial and exposure domains. [sent-31, score-0.563]

15 Zhang and Cham [29] propose to weight each pixel using local gradients across the exposure stack as a measure of consistency. [sent-32, score-0.851]

16 While computationally efficient, these approaches have the drawback that they downweight or completely ignore pixels of moving objects except, possibly, in one of the images. [sent-33, score-0.125]

17 More sophisticated methods attempt to establish dense correspondences between the reference image and the other images in the stack. [sent-35, score-0.268]

18 However, standard optical flow algorithms [2] rely on the brightness constancy assumption, which is always violated, by construction, in the case of exposure stacks. [sent-36, score-0.555]

19 [13] boost the image intensity to compensate for this and use a standard optical flow to refine the correspondence mapping initialized by a global registration. [sent-38, score-0.173]

20 [24] converts each image into a linear space by inverting the camera response function, and selects an image as the reference for the final HDR image. [sent-44, score-0.276]

21 Using a variant of PatchMatch, they reconstruct an HDR image which maximizes the similarity with the reference image at the pixel level while minimizing the bidirectional similarity metric with the remaining images. [sent-45, score-0.282]

22 In general, methods dealing with non-rigid scene motion fall in one of two categories, each with its limitations: • Algorithms that do not define a reference image and incorporate de-ghosting in the definition of the pixel weights. [sent-46, score-0.319]

23 • Approaches requiring the definition of a reference image. [sent-48, score-0.24]

24 However, while it capitalizes on the benefits of selecting a reference image (producing a consistent image [7]), it also enables us to recover from the other images the regions that contain clipped pixels (either too dark or too bright) in the reference image. [sent-53, score-0.726]

25 In a nutshell, from each source image S in the stack we attempt to build a new image that looks as if it was taken at the same time as the reference R, but with the exposure settings of S. [sent-54, score-1.177]

26 For the areas where R provides sufficient detail, the process is driven by the reference image to ensure consistency. [sent-55, score-0.301]

27 For the remaining areas, we use other constraints, as reliable direct registration becomes impossible, and we rely mostly on the information in the source image. [sent-56, score-0.139]

28 To get consistent results even when parts of the scene are moving, we ensure that the boundaries of the saturated regions are consistent with both R and S. [sent-57, score-0.382]

29 Our contribution is a novel method for generating a registered stack from a set of mis-aligned images of dynamic scenes, similar to Hu et al. [sent-58, score-0.423]

30 However, as opposed to their work, our algorithm can be applied to generic non-linearized exposure stacks, and is also capable of dealing with large saturated regions in the reference image, even under large camera motion or scene object displacements. [sent-61, score-1.21]

31 Besides, our method propagates both intensity and gradient information in the reconstruction process, so we can preserve more detail from the exposure stacks. [sent-62, score-0.585]

32 Method Our algorithm works by first selecting the image with the highest number of well-exposed pixels to be the reference image R [13, 7]. [sent-64, score-0.288]

33 Then, for each source image S in the stack, it synthesizes a new image L (the latent image) that looks like the reference image R, only exposed like S. [sent-65, score-0.502]

34 First, where the reference R is properly exposed, L has image content that is geometrically compatible with R. [sent-67, score-0.274]

35 In Figure 1, where the reference R is the middle exposure, this means for instance that the arms of the woman in the latent images L must appear in the same location as they appear in R. [sent-68, score-0.374]

36 If the reference had been the darkest image (top row in Figure 1), the areas posing these difficulties would have been the dark areas, where details are lost due to clipping. [sent-70, score-0.365]

37 For each source image S in the stack, we want to synthesize the latent image L we would have if we had captured it at the same time as the reference image R, but with the same exposure settings as used to capture S. [sent-72, score-0.965]

38 τ is an intensity mapping function accounting for how the pixel values change under the exposure change. [sent-73, score-0.701]

39 Suppose the images in the stack are 1, . . . , 5 (ordered by exposure time); if the reference image is 3, we first register 2 and 4 to the reference 3, then 2 acts as the reference for 1, and 4 acts as the reference for 5. [sent-80, score-1.526]
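
This chaining keeps the exposure gap between each registered pair small. A minimal sketch of the resulting pairing order (the function name and 0-based indexing are my own, not from the paper):

```python
# Each image is paired with its already-processed neighbor on the side of
# the reference, so every pair differs by only one exposure step.
def pairing_order(n_images, ref):
    """For n_images=5, ref=2 (0-based) returns
    [(1, 2), (0, 1), (3, 2), (4, 3)]: (source, reference) pairs."""
    pairs = []
    for s in range(ref - 1, -1, -1):      # walk toward shorter exposures
        pairs.append((s, s + 1))
    for s in range(ref + 1, n_images):    # walk toward longer exposures
        pairs.append((s, s - 1))
    return pairs
```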

40 The reference R is on the left (red), the source S is on the right (blue), and we want to create a latent image L in the center (green) so the shapes of objects in L look like they do in R, except that they have the luminance range of S. [sent-82, score-0.448]

41 We first initialize L by applying a color mapping function τ to R, where τ is initialized using the intensity histograms of the images [8], and is later refined as L is updated. [sent-83, score-0.173]
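
One plausible reading of this initialization step, sketched below, is per-channel cumulative-histogram matching; this is an assumption about [8], not the authors' exact implementation:

```python
import numpy as np

def estimate_imf(R, S, bins=256):
    """Estimate a per-channel lookup table tau mapping R's tones toward
    S's. R, S: float images in [0, 1] with shape (H, W, 3)."""
    tau = np.empty((bins, 3))
    for c in range(3):
        r_hist, edges = np.histogram(R[..., c], bins=bins, range=(0, 1))
        s_hist, _ = np.histogram(S[..., c], bins=bins, range=(0, 1))
        r_cdf = np.cumsum(r_hist) / r_hist.sum()
        s_cdf = np.cumsum(s_hist) / s_hist.sum()
        centers = 0.5 * (edges[:-1] + edges[1:])
        # For each reference tone, pick the source tone of the same rank.
        tau[:, c] = np.interp(r_cdf, s_cdf, centers)
    return tau

def apply_imf(R, tau):
    """Apply the lookup table tau to every pixel of R."""
    idx = np.clip((R * (tau.shape[0] - 1)).astype(int), 0, tau.shape[0] - 1)
    return np.stack([tau[idx[..., c], c] for c in range(3)], axis=-1)
```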

42 If the reference patch PiR is not clipped, that is, it consists mostly of mid-tones and contains no overly dark or bright pixels, PatchMatch looks for a match from S. [sent-85, score-0.524]

43 However, if PiR is clipped, neither the color mapping τ nor direct registration is reliable. [sent-86, score-0.162]

44 In this case we modify PatchMatch to find a patch PiS that could plausibly match PiR: pixels in PiS should match the pixels in PiR that are not clipped, and the rest of the pixels in PiS would clip under the current τ. [sent-87, score-0.349]
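
A minimal sketch of such a matching rule is given below; the clipping thresholds, the penalty weight, and the test for "would clip under τ" are all assumptions, and only the saturated-bright case is shown (the too-dark case is symmetric):

```python
import numpy as np

LO, HI = 0.05, 0.95  # assumed clipping thresholds in [0, 1]

def clipped_mask(P_R):
    """True where a reference patch pixel is too dark or too bright."""
    return np.any((P_R <= LO) | (P_R >= HI), axis=-1)

def clipped_patch_distance(P_R_mapped, P_S, clipped, penalty=1.0):
    """P_R_mapped: tau applied to the reference patch, shape (p, p, 3);
    P_S: candidate source patch, same shape; clipped: (p, p) bool mask.
    Lower is better."""
    valid = ~clipped
    # Well-exposed reference pixels must actually match the source patch.
    ssd = np.sum((P_R_mapped[valid] - P_S[valid]) ** 2)
    # Under saturated reference pixels the source value should be one that
    # would clip under the current tau; here we require it to be at least
    # as bright as tau's value at the clipped pixel.
    violations = np.sum(P_S[clipped] < P_R_mapped[clipped])
    return ssd + penalty * violations
```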

45 As we progress, the intensity mapping function τ is updated and refined based on the dense correspondence. [sent-89, score-0.144]

46 To avoid a bad local minimum and to better synthesize clipped areas, these processes are executed iteratively using a coarse-to-fine schedule. [sent-90, score-0.212]
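
For the coarse-to-fine schedule, a simple image pyramid suffices; below is a minimal 2x-average pyramid sketch (my own helper, not the paper's code), within which the PatchMatch, voting, and IMF-refinement steps above would be iterated at each level:

```python
import numpy as np

def build_pyramid(img, n_levels):
    """img: (H, W, 3) float array. Returns [finest, ..., coarsest] by
    repeated 2x box-filter downsampling."""
    pyr = [np.asarray(img, dtype=float)]
    for _ in range(n_levels - 1):
        cur = pyr[-1]
        h, w = (cur.shape[0] // 2) * 2, (cur.shape[1] // 2) * 2
        cur = cur[:h, :w]  # crop to even size before averaging 2x2 blocks
        pyr.append(0.25 * (cur[0::2, 0::2] + cur[1::2, 0::2]
                           + cur[0::2, 1::2] + cur[1::2, 1::2]))
    return pyr
```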

47 Two-picture Synthesis Algorithm We wish to synthesize the latent image L that looks just as if R was taken using the exposure setting of S: in other words, L should be consistent with R everywhere in geometry. [sent-94, score-0.716]

48 [5], but we account for a generic intensity mapping function τ: Cr(L,R,τ) = ? [sent-99, score-0.144]
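
The right-hand side of this equation did not survive extraction. As a hedged guess at its spirit (a Wexler-style patch SSD with τ applied to the reference; the paper's exact weighting may differ), the term could look like:

```python
import numpy as np

def reference_consistency(L, R_mapped, p=7):
    """Sum of squared differences between co-located p x p patches of L
    and tau(R). L, R_mapped: (H, W, 3) float arrays."""
    H, W = L.shape[:2]
    cost = 0.0
    for i in range(0, H - p + 1):
        for j in range(0, W - p + 1):
            d = L[i:i + p, j:j + p] - R_mapped[i:i + p, j:j + p]
            cost += np.sum(d * d)
    return cost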

49 In addition to boosting the details of the texture [1, 21], using gradients helps to compensate for exposure changes [30]. [sent-109, score-0.515]

50 The intensity mapping function τ describes how the RGB values change from the reference to the source image. [sent-110, score-0.435]

51 , where PiS is a p × p patch centered at i in image S (same for PiL and L), and u(i) maps patches in L into the corresponding patches in S, see Figure 2. [sent-117, score-0.168]

52 We operate in the RGB color space and only search over translations, which makes the updates of L faster but does not lower the quality of our results, given the expected changes in an exposure stack. [sent-121, score-0.515]

53 The optimal solution can therefore be reduced to finding the nearest-neighbor patches in S for each patch PiL. [sent-132, score-0.119]

54 1 and 2, and summing over the pixels in the patches rather than over the patches themselves, Eq. [sent-136, score-0.146]

55 T is basically a weighted average of the similar pixels in S and the patch in τ(R), while ∇T denotes the weighted average of the gradients. [sent-140, score-0.118]

56 L(i) = (1/Z(i)) [wτ(i) τ(R)(i) + Σ_{j∈n(i)} wu(j) S(i + u(j))], (6) where wτ(i) and wu(i) reflect the confidence of the intensity mapping function τ(·) and the geometric mapping u(·) for pixel i, and Z(i) normalizes the weights. [sent-145, score-0.295]
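
A sketch of such a blend is below; for brevity a single nearest-neighbor offset per pixel stands in for the full neighborhood n(i), and all names are assumptions:

```python
import numpy as np

def blend_vote(R_mapped, S, u, w_tau, w_u):
    """R_mapped: tau(R); S: source image; u: (H, W, 2) integer offsets
    into S; w_tau, w_u: (H, W) confidence maps. Images are (H, W, 3)."""
    H, W = S.shape[:2]
    ys, xs = np.mgrid[0:H, 0:W]
    sy = np.clip(ys + u[..., 0], 0, H - 1)
    sx = np.clip(xs + u[..., 1], 0, W - 1)
    warped = S[sy, sx]                       # S(i + u(i))
    wt = w_tau[..., None]
    wu = w_u[..., None]
    # Confidence-weighted average of the mapped reference and warped source.
    return (wt * R_mapped + wu * warped) / np.maximum(wt + wu, 1e-8)
```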

57 The intensity mapping function τ, which describes how the RGB values change from the reference to the source image, cannot be accurate across the whole range, due to saturation and under-exposure. [sent-153, score-0.435]

58 For example, if S was captured with a shorter exposure time (darker) than R, and if the top of the range in the domain of R is saturated, τ will be flat in that area, thus not providing any relevant information; all the useful information for registration and HDR image creation is in S. [sent-154, score-0.701]

59 The opposite may be true when S was captured with a longer exposure time; see the inset, where red bands show the range in which the mapping τ is not reliable. [sent-155, score-0.633]
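
The unreliable range can be flagged directly from τ: wherever the fitted mapping is nearly flat, it carries no information for registration. A minimal sketch (the flatness threshold is an assumption):

```python
import numpy as np

def unreliable_bins(tau, eps=1e-3):
    """tau: (bins, 3) per-channel lookup table. Returns a (bins-1,) bool
    array marking reference-intensity bins where tau is nearly flat in
    every channel, i.e., where the mapping is uninformative."""
    slopes = np.diff(tau, axis=0)        # discrete derivative of tau
    return np.all(np.abs(slopes) < eps, axis=1)
```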

60 However, consider an area that is saturated in R and assume that we are working with an S that is darker, and therefore better exposed. [sent-166, score-0.353]

61 In such regions, τ(PiR) is not reliable and we want to relax the requirement that patches from S have to match, or we would reject all the patches in that area. [sent-167, score-0.135]

62 On the other hand, if a patch in S is so dark that it couldn’t possibly become saturated in R, we also don’t want to allow its use. [sent-168, score-0.524]

63 In this way, the clipped areas of R in L can be reasonably synthesized using the information from S. [sent-171, score-0.166]

64 In the third and last step, given the existing L, we need to re-estimate the intensity mapping function (IMF) τ (Eq. [sent-174, score-0.144]

65 Second, in addition to the hard monotonicity constraint, we also require the function to be within [0, 1], and to be convex (or concave) if the exposure time of R is longer (or shorter) than that of S. [sent-181, score-0.515]
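
A sketch of re-fitting τ under the monotonicity and range constraints, using weighted isotonic regression (pool-adjacent-violators); the convexity/concavity requirement is omitted for brevity, and this solver is a stand-in, not the authors' method:

```python
import numpy as np

def fit_monotone_imf(y, w=None):
    """y: per-bin mean source intensity indexed by reference-intensity
    bin; w: optional per-bin weights (e.g., sample counts). Returns the
    nondecreasing least-squares fit, clipped to [0, 1]."""
    y = np.asarray(y, dtype=float)
    n = len(y)
    w = np.ones(n) if w is None else np.asarray(w, dtype=float)
    vals, wts, sizes = [], [], []          # blocks of pooled bins
    for yi, wi in zip(y, w):
        vals.append(yi); wts.append(wi); sizes.append(1)
        # Merge adjacent blocks while monotonicity is violated.
        while len(vals) > 1 and vals[-2] > vals[-1]:
            merged = (wts[-2] * vals[-2] + wts[-1] * vals[-1]) / (wts[-2] + wts[-1])
            vals[-2] = merged
            wts[-2] += wts[-1]
            sizes[-2] += sizes[-1]
            vals.pop(); wts.pop(); sizes.pop()
    tau = np.concatenate([np.full(s, v) for v, s in zip(vals, sizes)])
    return np.clip(tau, 0.0, 1.0)
```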

66 When moving from a level to a finer one, three variables need to be propagated; we transfer τ as is, and linearly interpolate the mapping u. [sent-188, score-0.116]

67 Otherwise, it should be initialized using the source image S (using the mapping u derived from the previous level). [sent-191, score-0.154]

68 When the reference image is reasonably well-exposed everywhere, our method produces results very similar to those of Hu et al. [sent-201, score-0.288]

69 However, when part of the reference is saturated, as in Figure 5, Hu et al. [sent-202, score-0.288]

70 discard valuable information from the shorter exposure (first row, middle image); our method, on the other hand, successfully captures all the available information in the synthesized latent image (second row, middle image). [sent-203, score-0.79]

71 Figure 6 shows another case with a large saturated region. [sent-207, score-0.353]

72 are not caused by the tonemapping algorithm; rather, they are artifacts of their registration algorithm. [sent-219, score-0.155]

73 In our result (bottom, rightmost image in Figure 6) the sky is more faithful to the original images and no artifacts are introduced. [sent-220, score-0.16]

74 As we mentioned in the previous section, we attempt to preserve as much information as possible from the exposure stack by using both the intensity and the gradients in our reconstruction. [sent-221, score-0.907]

75 Figure 7 shows an extreme case of a stack comprising only two images, with a region that is saturated in both images, demonstrating one of the limitations of our method. [sent-225, score-0.647]

76 Our method can register the images correctly despite selecting a reference image that has a completely saturated sky. [sent-227, score-0.679]

77 However, since the sun is saturated in both images, our algorithm fills in the saturated sun using non-saturated pixels from S. [sent-228, score-0.834]

78 Notice that the sky is almost completely saturated, causing their algorithm to disregard useful information in the short exposure (top row, middle image), and leading to poor quality in the fusion result (top right). [sent-246, score-0.724]

79 Sen’s algorithm is designed to work on linear exposure stacks. [sent-247, score-0.515]

80 For this non-linear stack, a reliable estimation of the camera response function would require acquiring a stack of registered images. [sent-248, score-0.33]

81 With the same reference frame our algorithm can synthesize a novel image which is completely consistent with the reference, and also captures all the details of the sky (bottom row, middle image). [sent-250, score-0.492]

82 This directly reflects in the high quality of our exposure fusion result (bottom row, rightmost image). [sent-251, score-0.585]

83 The first column shows the original images in the stack; the middle exposure is selected as the reference. [sent-255, score-0.602]

84 , we first linearize the original images and use the linearized exposure stacks as the input. [sent-257, score-0.612]

85 For example, the blurred sky in the saturated region and the halo around the dome are unexpected. [sent-260, score-0.45]

86 Note that the halo in the reconstructed shorter exposure is not caused by tone mapping but by errors in the HDR reconstruction. [sent-261, score-0.74]

87 For the tone-mapped HDR image (top right), the sky reconstructed in the region that is saturated in the reference does not look natural. [sent-262, score-0.463]

88 Our algorithm can synthesize an image (bottom middle) that is completely consistent with the reference and also preserves as much information as possible from the whole exposure stack. [sent-263, score-0.865]

89 The original images (left) are dramatically separated in terms of exposure time: the areas that are correctly exposed in one are barely visible in the other. [sent-267, score-0.657]

90 An interesting feature of this stack is that the region around the sun is saturated in both images. [sent-268, score-0.687]

91 Note that the longer exposure, which we selected as the reference (left bottom), is completely saturated in the sky; our algorithm attempts to synthesize the saturated region in the source image from other pixels in the same image, thus effectively removing the sun (middle top). [sent-269, score-1.195]

92 The last column shows the exposure fusion result for the standard patch size (top) and for the larger patches (bottom). [sent-271, score-0.666]

93 Conclusions We have presented a novel method to generate a perfectly aligned stack from a set of images of a dynamic scene, captured with a hand-held camera. [sent-273, score-0.375]

94 Four previous methods can deal with both the camera and scene object motion at the same time: Kang et al. [sent-274, score-0.121]

95 It successfully deals with large saturated regions in the reference image, which is the most common limitation for algorithms that select a reference frame. [sent-280, score-0.862]

96 Ghost removal in high dynamic range images. [sent-389, score-0.125]

97 Being ‘undigital’ with digital cameras: Extending dynamic range by combining differently exposed pictures. [sent-420, score-0.206]

98 A perceptual framework for contrast processing of high dynamic range images. [sent-427, score-0.125]

99 Image registration for multi-exposure high dynamic range image acquisition. [sent-472, score-0.213]

100 Fast, robust image registration for compositing high-dynamic range photographs from handheld exposures. [sent-484, score-0.132]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('exposure', 0.515), ('saturated', 0.353), ('stack', 0.294), ('reference', 0.24), ('hdr', 0.234), ('sen', 0.188), ('patchmatch', 0.107), ('clipped', 0.105), ('zimmer', 0.105), ('pir', 0.097), ('gallo', 0.095), ('pis', 0.091), ('registration', 0.088), ('middle', 0.087), ('hacohen', 0.084), ('dynamic', 0.081), ('exposed', 0.081), ('synthesize', 0.075), ('kang', 0.074), ('mapping', 0.074), ('wexler', 0.073), ('hu', 0.072), ('mantiuk', 0.071), ('plausibly', 0.071), ('intensity', 0.07), ('patch', 0.07), ('bright', 0.069), ('stacks', 0.069), ('artifacts', 0.067), ('dark', 0.064), ('darabi', 0.063), ('areas', 0.061), ('siggraph', 0.056), ('tone', 0.055), ('sky', 0.055), ('shorter', 0.054), ('goldman', 0.053), ('source', 0.051), ('register', 0.051), ('looks', 0.049), ('patches', 0.049), ('et', 0.048), ('pixels', 0.048), ('mertens', 0.047), ('raman', 0.047), ('tico', 0.047), ('tomaszewska', 0.047), ('latent', 0.047), ('range', 0.044), ('halo', 0.042), ('tzimiropoulos', 0.042), ('moving', 0.042), ('pixel', 0.042), ('sun', 0.04), ('brightness', 0.04), ('shechtman', 0.04), ('drucker', 0.039), ('duke', 0.039), ('pulli', 0.039), ('rightmost', 0.038), ('want', 0.037), ('motion', 0.037), ('dramatic', 0.037), ('courtesy', 0.037), ('pil', 0.037), ('cr', 0.036), ('camera', 0.036), ('completely', 0.035), ('isp', 0.035), ('ghosting', 0.035), ('wu', 0.035), ('jacobs', 0.034), ('content', 0.034), ('synthesizes', 0.034), ('radiance', 0.032), ('heo', 0.032), ('match', 0.032), ('bad', 0.032), ('fusion', 0.032), ('nvidia', 0.031), ('irradiance', 0.031), ('coarsest', 0.031), ('chances', 0.03), ('everywhere', 0.03), ('kopf', 0.03), ('ct', 0.03), ('fourier', 0.03), ('severely', 0.03), ('jun', 0.03), ('barnes', 0.03), ('regions', 0.029), ('initialized', 0.029), ('impossible', 0.029), ('luminance', 0.029), ('bottom', 0.029), ('imaging', 0.029), ('rgb', 0.028), ('linearized', 0.028), ('plausible', 0.028), ('attempt', 0.028)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.9999997 195 cvpr-2013-HDR Deghosting: How to Deal with Saturation?

Author: Jun Hu, Orazio Gallo, Kari Pulli, Xiaobai Sun

Abstract: We present a novel method for aligning images in an HDR (high-dynamic-range) image stack to produce a new exposure stack where all the images are aligned and appear as if they were taken simultaneously, even in the case of highly dynamic scenes. Our method produces plausible results even where the image used as a reference is either too dark or bright to allow for an accurate registration.

2 0.11073313 108 cvpr-2013-Dense 3D Reconstruction from Severely Blurred Images Using a Single Moving Camera

Author: Hee Seok Lee, Kyoung Mu Lee

Abstract: Motion blur frequently occurs in dense 3D reconstruction using a single moving camera, and it degrades the quality of the 3D reconstruction. To handle motion blur caused by rapid camera shakes, we propose a blur-aware depth reconstruction method, which utilizes a pixel correspondence that is obtained by considering the effect of motion blur. Motion blur is dependent on 3D geometry; thus, given the camera motion, the blurred appearance of images can be parameterized by scene depth, and a depth map can be accurately estimated from the blur-considered pixel correspondence. The estimated depth is then converted into pixel-wise blur kernels, and non-uniform motion blur is easily removed with low computational cost. The obtained blur kernel is depth-dependent, thus it effectively addresses scene-depth variation, which is a challenging problem in conventional non-uniform deblurring methods.

3 0.10780361 177 cvpr-2013-FrameBreak: Dramatic Image Extrapolation by Guided Shift-Maps

Author: Yinda Zhang, Jianxiong Xiao, James Hays, Ping Tan

Abstract: We significantly extrapolate the field of view of a photograph by learning from a roughly aligned, wide-angle guide image of the same scene category. Our method can extrapolate typical photos into complete panoramas. The extrapolation problem is formulated in the shift-map image synthesis framework. We analyze the self-similarity of the guide image to generate a set of allowable local transformations and apply them to the input image. Our guided shift-map method preserves the scene layout of the guide image when extrapolating a photograph. While conventional shiftmap methods only support translations, this is not expressive enough to characterize the self-similarity of complex scenes. Therefore we additionally allow image transformations of rotation, scaling and reflection. To handle this increase in complexity, we introduce a hierarchical graph optimization method to choose the optimal transformation at each output pixel. We demonstrate our approach on a variety of indoor, outdoor, natural, and man-made scenes.

4 0.088014543 107 cvpr-2013-Deformable Spatial Pyramid Matching for Fast Dense Correspondences

Author: Jaechul Kim, Ce Liu, Fei Sha, Kristen Grauman

Abstract: We introduce a fast deformable spatial pyramid (DSP) matching algorithm for computing dense pixel correspondences. Dense matching methods typically enforce both appearance agreement between matched pixels as well as geometric smoothness between neighboring pixels. Whereas the prevailing approaches operate at the pixel level, we propose a pyramid graph model that simultaneously regularizes match consistency at multiple spatial extents—ranging from an entire image, to coarse grid cells, to every single pixel. This novel regularization substantially improves pixel-level matching in the face of challenging image variations, while the “deformable ” aspect of our model overcomes the strict rigidity of traditional spatial pyramids. Results on LabelMe and Caltech show our approach outperforms state-of-the-art methods (SIFT Flow [15] and PatchMatch [2]), both in terms of accuracy and run time.

5 0.086840674 321 cvpr-2013-PDM-ENLOR: Learning Ensemble of Local PDM-Based Regressions

Author: Yen H. Le, Uday Kurkure, Ioannis A. Kakadiaris

Abstract: Statistical shape models, such as Active Shape Models (ASMs), suffer from their inability to represent a large range of variations of a complex shape and to account for the large errors in detection of model points. We propose a novel method (dubbed PDM-ENLOR) that overcomes these limitations by locating each shape model point individually using an ensemble of local regression models and appearance cues from selected model points. Our method first detects a set of reference points which were selected based on their saliency during training. For each model point, an ensemble of regressors is built. From the locations of the detected reference points, each regressor infers a candidate location for that model point using local geometric constraints, encoded by a point distribution model (PDM). The final location of that point is determined as a weighted linear combination, whose coefficients are learnt from the training data, of candidates proposed from its ensemble’s component regressors. We use different subsets of reference points as explanatory variables for the component regressors to provide varying degrees of locality for the models in each ensemble. This helps our ensemble model to capture a larger range of shape variations as compared to a single PDM. We demonstrate the advantages of our method on the challenging problem of segmenting gene expression images of mouse brain.

6 0.082817085 412 cvpr-2013-Stochastic Deconvolution

7 0.080203474 326 cvpr-2013-Patch Match Filter: Efficient Edge-Aware Filtering Meets Randomized Search for Fast Correspondence Field Estimation

8 0.079530649 352 cvpr-2013-Recovering Stereo Pairs from Anaglyphs

9 0.07690604 244 cvpr-2013-Large Displacement Optical Flow from Nearest Neighbor Fields

10 0.074700877 424 cvpr-2013-Templateless Quasi-rigid Shape Modeling with Implicit Loop-Closure

11 0.072924055 360 cvpr-2013-Robust Estimation of Nonrigid Transformation for Point Set Registration

12 0.070683695 166 cvpr-2013-Fast Image Super-Resolution Based on In-Place Example Regression

13 0.069971383 330 cvpr-2013-Photometric Ambient Occlusion

14 0.069491021 115 cvpr-2013-Depth Super Resolution by Rigid Body Self-Similarity in 3D

15 0.0665906 307 cvpr-2013-Non-uniform Motion Deblurring for Bilayer Scenes

16 0.063508779 397 cvpr-2013-Simultaneous Super-Resolution of Depth and Images Using a Single Camera

17 0.061650224 54 cvpr-2013-BRDF Slices: Accurate Adaptive Anisotropic Appearance Acquisition

18 0.06148601 111 cvpr-2013-Dense Reconstruction Using 3D Object Shape Priors

19 0.059138093 117 cvpr-2013-Detecting Changes in 3D Structure of a Scene from Multi-view Images Captured by a Vehicle-Mounted Camera

20 0.057951976 158 cvpr-2013-Exploring Weak Stabilization for Motion Feature Extraction


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.15), (1, 0.091), (2, 0.007), (3, 0.034), (4, -0.014), (5, 0.025), (6, -0.009), (7, -0.01), (8, 0.004), (9, 0.005), (10, 0.012), (11, 0.032), (12, 0.026), (13, -0.02), (14, 0.041), (15, -0.067), (16, 0.013), (17, -0.041), (18, 0.058), (19, 0.01), (20, -0.001), (21, 0.015), (22, 0.006), (23, -0.09), (24, -0.006), (25, -0.063), (26, -0.02), (27, -0.036), (28, 0.034), (29, -0.005), (30, -0.056), (31, -0.066), (32, 0.034), (33, 0.003), (34, -0.004), (35, -0.053), (36, -0.049), (37, 0.005), (38, -0.051), (39, 0.004), (40, -0.001), (41, 0.004), (42, -0.005), (43, -0.046), (44, 0.051), (45, 0.015), (46, -0.048), (47, -0.048), (48, -0.054), (49, 0.012)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.94127947 195 cvpr-2013-HDR Deghosting: How to Deal with Saturation?

Author: Jun Hu, Orazio Gallo, Kari Pulli, Xiaobai Sun

Abstract: We present a novel method for aligning images in an HDR (high-dynamic-range) image stack to produce a new exposure stack where all the images are aligned and appear as if they were taken simultaneously, even in the case of highly dynamic scenes. Our method produces plausible results even where the image used as a reference is either too dark or bright to allow for an accurate registration.

2 0.87086469 177 cvpr-2013-FrameBreak: Dramatic Image Extrapolation by Guided Shift-Maps

Author: Yinda Zhang, Jianxiong Xiao, James Hays, Ping Tan

Abstract: We significantly extrapolate the field of view of a photograph by learning from a roughly aligned, wide-angle guide image of the same scene category. Our method can extrapolate typical photos into complete panoramas. The extrapolation problem is formulated in the shift-map image synthesis framework. We analyze the self-similarity of the guide image to generate a set of allowable local transformations and apply them to the input image. Our guided shift-map method preserves the scene layout of the guide image when extrapolating a photograph. While conventional shiftmap methods only support translations, this is not expressive enough to characterize the self-similarity of complex scenes. Therefore we additionally allow image transformations of rotation, scaling and reflection. To handle this increase in complexity, we introduce a hierarchical graph optimization method to choose the optimal transformation at each output pixel. We demonstrate our approach on a variety of indoor, outdoor, natural, and man-made scenes.

3 0.70955515 166 cvpr-2013-Fast Image Super-Resolution Based on In-Place Example Regression

Author: Jianchao Yang, Zhe Lin, Scott Cohen

Abstract: We propose a fast regression model for practical single image super-resolution based on in-place examples, by leveraging two fundamental super-resolution approaches— learning from an external database and learning from selfexamples. Our in-place self-similarity refines the recently proposed local self-similarity by proving that a patch in the upper scale image has good matches around its origin location in the lower scale image. Based on the in-place examples, a first-order approximation of the nonlinear mapping function from low- to high-resolution image patches is learned. Extensive experiments on benchmark and real-world images demonstrate that our algorithm can produce natural-looking results with sharp edges and preserved fine details, while the current state-of-the-art algorithms are prone to visual artifacts. Furthermore, our model can easily extend to deal with noise by combining the regression results on multiple in-place examples for robust estimation. The algorithm runs fast and is particularly useful for practical applications, where the input images typically contain diverse textures and they are potentially contaminated by noise or compression artifacts.

4 0.68051773 81 cvpr-2013-City-Scale Change Detection in Cadastral 3D Models Using Images

Author: Aparna Taneja, Luca Ballan, Marc Pollefeys

Abstract: In this paper, we propose a method to detect changes in the geometry of a city using panoramic images captured by a car driving around the city. We designed our approach to account for all the challenges involved in a large scale application of change detection, such as, inaccuracies in the input geometry, errors in the geo-location data of the images, as well as, the limited amount of information due to sparse imagery. We evaluated our approach on an area of 6 square kilometers inside a city, using 3420 images downloaded from Google StreetView. These images, besides being publicly available, are also a good example of panoramic images captured with a driving vehicle, and hence demonstrate all the possible challenges resulting from such an acquisition. We also quantitatively compared the performance of our approach with respect to a ground truth, as well as to prior work. This evaluation shows that our approach outperforms the current state of the art.

5 0.66997886 47 cvpr-2013-As-Projective-As-Possible Image Stitching with Moving DLT

Author: Julio Zaragoza, Tat-Jun Chin, Michael S. Brown, David Suter

Abstract: We investigate projective estimation under model inadequacies, i.e., when the underpinning assumptions of the projective model are not fully satisfied by the data. We focus on the task of image stitching which is customarily solved by estimating a projective warp — a model that is justified when the scene is planar or when the views differ purely by rotation. Such conditions are easily violated in practice, and this yields stitching results with ghosting artefacts that necessitate the usage of deghosting algorithms. To this end we propose as-projective-as-possible warps, i.e., warps that aim to be globally projective, yet allow local non-projective deviations to account for violations to the assumed imaging conditions. Based on a novel estimation technique called Moving Direct Linear Transformation (Moving DLT), our method seamlessly bridges image regions that are inconsistent with the projective model. The result is highly accurate image stitching, with significantly reduced ghosting effects, thus lowering the dependency on post hoc deghosting.

6 0.6558314 352 cvpr-2013-Recovering Stereo Pairs from Anaglyphs

7 0.65185028 283 cvpr-2013-Megastereo: Constructing High-Resolution Stereo Panoramas

8 0.6368773 169 cvpr-2013-Fast Patch-Based Denoising Using Approximated Patch Geodesic Paths

9 0.63176203 162 cvpr-2013-FasT-Match: Fast Affine Template Matching

10 0.62150377 393 cvpr-2013-Separating Signal from Noise Using Patch Recurrence across Scales

11 0.61257887 464 cvpr-2013-What Makes a Patch Distinct?

12 0.60855711 107 cvpr-2013-Deformable Spatial Pyramid Matching for Fast Dense Correspondences

13 0.59301275 427 cvpr-2013-Texture Enhanced Image Denoising via Gradient Histogram Preservation

14 0.58920002 453 cvpr-2013-Video Editing with Temporal, Spatial and Appearance Consistency

15 0.58556259 90 cvpr-2013-Computing Diffeomorphic Paths for Large Motion Interpolation

16 0.57965863 424 cvpr-2013-Templateless Quasi-rigid Shape Modeling with Implicit Loop-Closure

17 0.57346147 37 cvpr-2013-Adherent Raindrop Detection and Removal in Video

18 0.5711019 451 cvpr-2013-Unsupervised Salience Learning for Person Re-identification

19 0.56968737 333 cvpr-2013-Plane-Based Content Preserving Warps for Video Stabilization

20 0.56776386 360 cvpr-2013-Robust Estimation of Nonrigid Transformation for Point Set Registration


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(10, 0.144), (13, 0.036), (16, 0.052), (21, 0.174), (26, 0.044), (33, 0.23), (65, 0.042), (67, 0.041), (69, 0.034), (80, 0.011), (87, 0.071), (96, 0.017)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.91159445 454 cvpr-2013-Video Enhancement of People Wearing Polarized Glasses: Darkening Reversal and Reflection Reduction

Author: Mao Ye, Cha Zhang, Ruigang Yang

Abstract: With the widespread adoption of consumer 3D-TV technology, stereoscopic videoconferencing systems are emerging. However, the special glasses participants wear to see 3D can create distracting images. This paper presents a computational framework to reduce undesirable artifacts in the eye regions caused by these 3D glasses. More specifically, we add polarized filters to the stereo camera so that partial images of reflection can be captured. A novel Bayesian model is then developed to describe the imaging process of the eye regions including darkening and reflection, and infer the eye regions based on Classification Expectation-Maximization (EM). The recovered eye regions under the glasses are brighter and with little reflections, leading to a more natural videoconferencing experience. Qualitative evaluations and user studies are conducted to demonstrate the substantial improvement our approach can achieve.

2 0.87531984 214 cvpr-2013-Image Understanding from Experts' Eyes by Modeling Perceptual Skill of Diagnostic Reasoning Processes

Author: Rui Li, Pengcheng Shi, Anne R. Haake

Abstract: Eliciting and representing experts’ remarkable perceptual capability of locating, identifying and categorizing objects in images specific to their domains of expertise will benefit image understanding in terms of transferring human domain knowledge and perceptual expertise into image-based computational procedures. In this paper, we present a hierarchical probabilistic framework to summarize the stereotypical and idiosyncratic eye movement patterns shared within 11 board-certified dermatologists while they are examining and diagnosing medical images. Each inferred eye movement pattern characterizes the similar temporal and spatial properties of its corresponding segments of the experts’ eye movement sequences. We further discover a subset of distinctive eye movement patterns which are commonly exhibited across multiple images. Based on the combinations of the exhibitions of these eye movement patterns, we are able to categorize the images from the perspective of experts’ viewing strategies. In each category, images share similar lesion distributions and configurations. The performance of our approach shows that modeling physicians’ diagnostic viewing behaviors informs medical image understanding toward correct diagnosis.

same-paper 3 0.86207366 195 cvpr-2013-HDR Deghosting: How to Deal with Saturation?

Author: Jun Hu, Orazio Gallo, Kari Pulli, Xiaobai Sun

Abstract: We present a novel method for aligning images in an HDR (high-dynamic-range) image stack to produce a new exposure stack where all the images are aligned and appear as if they were taken simultaneously, even in the case of highly dynamic scenes. Our method produces plausible results even where the image used as a reference is either too dark or bright to allow for an accurate registration.

4 0.85648966 96 cvpr-2013-Correlation Filters for Object Alignment

Author: Vishnu Naresh Boddeti, Takeo Kanade, B.V.K. Vijaya Kumar

Abstract: Alignment of 3D objects from 2D images is one of the most important and well studied problems in computer vision. A typical object alignment system consists of a landmark appearance model which is used to obtain an initial shape and a shape model which refines this initial shape by correcting the initialization errors. Since errors in landmark initialization from the appearance model propagate through the shape model, it is critical to have a robust landmark appearance model. While there has been much progress in designing sophisticated and robust shape models, there has been relatively less progress in designing robust landmark detection models. In this paper we present an efficient and robust landmark detection model which is designed specifically to minimize localization errors thereby leading to state-of-the-art object alignment performance. We demonstrate the efficacy and speed of the proposed approach on the challenging task of multi-view car alignment.

5 0.84013325 461 cvpr-2013-Weakly Supervised Learning for Attribute Localization in Outdoor Scenes

Author: Shuo Wang, Jungseock Joo, Yizhou Wang, Song-Chun Zhu

Abstract: In this paper, we propose a weakly supervised method for simultaneously learning scene parts and attributes from a collection of images associated with attributes in text, where the precise localization of each attribute is left unknown. Our method includes three aspects. (i) Compositional scene configuration. We learn the spatial layouts of the scene by Hierarchical Space Tiling (HST) representation, which can generate an excessive number of scene configurations through the hierarchical composition of a relatively small number of parts. (ii) Attribute association. The scene attributes contain nouns and adjectives corresponding to the objects and their appearance descriptions respectively. We assign the nouns to the nodes (parts) in HST using nonmaximum suppression of their correlation, then train an appearance model for each noun+adjective attribute pair. (iii) Joint inference and learning. For an image, we compute the most probable parse tree with the attributes as an instantiation of the HST by dynamic programming. We then update the HST and attribute association based on the inferred parse trees. We evaluate the proposed method by (i) showing the improvement of attribute recognition accuracy; and (ii) comparing the average precision of localizing attributes to the scene parts.

6 0.82672018 277 cvpr-2013-MODEC: Multimodal Decomposable Models for Human Pose Estimation

7 0.82619584 360 cvpr-2013-Robust Estimation of Nonrigid Transformation for Point Set Registration

8 0.82596171 302 cvpr-2013-Multi-task Sparse Learning with Beta Process Prior for Action Recognition

9 0.82482833 447 cvpr-2013-Underwater Camera Calibration Using Wavelength Triangulation

10 0.82452703 225 cvpr-2013-Integrating Grammar and Segmentation for Human Pose Estimation

11 0.82384408 248 cvpr-2013-Learning Collections of Part Models for Object Recognition

12 0.82374328 400 cvpr-2013-Single Image Calibration of Multi-axial Imaging Systems

13 0.82315397 414 cvpr-2013-Structure Preserving Object Tracking

14 0.82289803 80 cvpr-2013-Category Modeling from Just a Single Labeling: Use Depth Information to Guide the Learning of 2D Models

15 0.82260871 42 cvpr-2013-Analytic Bilinear Appearance Subspace Construction for Modeling Image Irradiance under Natural Illumination and Non-Lambertian Reflectance

16 0.8223443 285 cvpr-2013-Minimum Uncertainty Gap for Robust Visual Tracking

17 0.82150644 365 cvpr-2013-Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities

18 0.82146335 245 cvpr-2013-Layer Depth Denoising and Completion for Structured-Light RGB-D Cameras

19 0.82136399 408 cvpr-2013-Spatiotemporal Deformable Part Models for Action Detection

20 0.82134843 324 cvpr-2013-Part-Based Visual Tracking with Online Latent Structural Learning