iccv iccv2013 iccv2013-346 knowledge-graph by maker-knowledge-mining

346 iccv-2013-Rectangling Stereographic Projection for Wide-Angle Image Visualization

Source: pdf

Author: Che-Han Chang, Min-Chun Hu, Wen-Huang Cheng, Yung-Yu Chuang

Abstract: This paper proposes a new projection model for mapping a hemisphere to a plane. Such a model can be useful for viewing wide-angle images. Our model consists of two steps. In the first step, the hemisphere is projected onto a swung surface constructed by a circular profile and a rounded rectangular trajectory. The second step maps the projected image on the swung surface onto the image plane through the perspective projection. We also propose a method for automatically determining proper parameters for the projection model based on image content. The proposed model has several advantages. It is simple, efficient and easy to control. Most importantly, it makes a better compromise between distortion minimization and line preserving than popular projection models, such as stereographic and Pannini projections. Experiments and analysis demonstrate the effectiveness of our model.

Reference: text

Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 In the first step, the hemisphere is projected onto a swung surface constructed by a circular profile and a rounded rectangular trajectory. [sent-4, score-1.018]

2 The second step maps the projected image on the swung surface onto the image plane through the perspective projection. [sent-5, score-0.585]

3 Most importantly, it makes a better compromise between distortion minimization and line preserving than popular projection models, such as stereographic and Pannini projections. [sent-9, score-0.699]

4 For viewing wide-angle images defined on a viewing sphere, it is often required to map from the viewing sphere to an image plane. [sent-15, score-0.55]

5 Thus, most projection models trade off different types of distortions and none can avoid all distortions. [sent-17, score-0.357]

6 (2) Line preserving: scene lines should remain straight in the projection because humans are often more sensitive to distortions of straight lines. [sent-23, score-0.675]

7 Vertical/horizontal lines and lines passing through vanishing points are especially important. [sent-24, score-0.385]

8 Rectilinear projection (or gnomonic projection) preserves most scene lines, but exhibits extensive stretches for a large FOV. [sent-29, score-0.359]

9 However, they reduce the perspective effects because scene lines towards the vanishing point become curved. [sent-32, score-0.334]

10 Additionally, equirectangular projection severely distorts the area near the zenith and nadir. [sent-33, score-0.531]

11 To preserve conformality, Mercator projection requires infinite space to show the whole view along the vertical direction. [sent-34, score-0.465]

12 Stereographic projection performs well on minimizing distortions, but several prominent lines become curved in the projection and the resultant image is not rectangular. [sent-36, score-0.741]

13 Pannini projection, on the other hand, produces a rectangular image, and preserves vertical lines and scene lines towards the vanishing point. [sent-37, score-0.689]

14 But it does not preserve horizontal lines and generates severe distortions around the zenith and nadir. [sent-38, score-0.411]

15 This paper proposes a projection model which strikes a good compromise among these desired properties. [sent-40, score-0.315]

16 As shown in Figure 1, similar to stereographic projection, our model has a good conformality property and fewer content distortions. [sent-41, score-0.346]

17 In addition, it preserves vertical scene lines and scene lines passing through the vanishing point in the same way Pannini projection does. [sent-43, score-0.923]

18 Finally, it also better preserves horizontal lines, which Pannini projection does not. [sent-44, score-0.541]

19 Figure 1: Comparisons of the proposed projection model with several popular projection models (panels: Rectilinear, Equirectangular, Mercator, Stereographic, Pannini, Ours). [sent-46, score-0.568]

20 The first step projects the viewing hemisphere onto a swung surface. [sent-48, score-0.554]

21 The second step maps the projection on the swung surface onto the image plane through the perspective projection. [sent-49, score-0.835]

22 We construct our swung surface by rotating and scaling a circular profile along a rounded rectangular trajectory. [sent-51, score-0.864]

23 We show that the proposed model unifies several projection models. [sent-52, score-0.331]

24 [10] proposed a multi-plane projection as an alternative to cylindrical projection. [sent-56, score-0.345]

25 Thus, the resulting panorama is composed of several perspective projections with the same viewpoint but with different projection planes. [sent-58, score-0.42]

26 They employed an adaptive projection which allows for smooth transition between the rectilinear and cylindrical projections. [sent-61, score-0.465]

27 For wide-angle images, Zorin and Barr [11] proposed a one-parameter family of projections that interpolate between the rectilinear and stereographic projections. [sent-62, score-0.423]

28 [6] proposed Pannini projection for viewing wide-angle perspective images. [sent-65, score-0.491]

29 The proposed projection model. This section describes our projection model for mapping a point p̄ on the viewing sphere to a point p̂ on the image plane. [sent-81, score-0.834]

30 For projection, the viewing direction is towards the positive z axis and the projection plane is located at z = 1. [sent-86, score-0.574]

31 A point p̄ on the viewing sphere can be described by a 3D spherical coordinate (r, θp, φp), where r = 1 for the unit viewing sphere, φp is the angle measured from the positive z axis and θp is the angle measured on the xy plane from the positive x axis. [sent-87, score-0.733]
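
The coordinate convention above can be written out as a small conversion routine. This is only an illustrative sketch: the function name `sphere_to_cartesian` is hypothetical and does not come from the paper; the formulas follow directly from the stated definitions of θp and φp.

```python
import math

def sphere_to_cartesian(theta, phi, r=1.0):
    """Convert the spherical coordinate (r, theta, phi) to (x, y, z).

    phi is measured from the positive z axis (the viewing direction);
    theta is measured on the xy plane from the positive x axis.
    """
    x = r * math.sin(phi) * math.cos(theta)
    y = r * math.sin(phi) * math.sin(theta)
    z = r * math.cos(phi)
    return x, y, z

# The viewing direction corresponds to phi = 0 (any theta).
print(sphere_to_cartesian(0.0, 0.0))
```

With this convention the hemisphere facing the viewer is the set of points with φp between 0 and π/2.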

32 The two-step projection model Our projection model is a two-step model that maps from a hemisphere to the projection plane as illustrated in Figure 2, which only shows an octant. [sent-91, score-1.059]

33 Given a point p̄ on the hemisphere (the orange surface), the first step projects p̄ onto a point p on a surface S (the blue surface) through a line emanating from the center of the hemisphere. [sent-92, score-0.379]

34 By construction, the 3D spherical coordinate of p is (rp, θp, φp) where its polar and azimuth angles are the same as those of p̄ while the radial distance changes to rp depending on the definition of the surface S. [sent-94, score-0.432]

35 In the second step, p on the surface S is projected onto a point p̂ on the image plane by the perspective projection. [sent-96, score-0.327]

36 As shown in Figure 2(b), we let the center of projection lie on the negative z axis with coordinate (0, 0, −d). [sent-97, score-0.401]

37 rp denotes the radial distance in the 3D spherical coordinate system while r̂p is for the 2D polar coordinate system. [sent-103, score-0.375]

38 With Equations 1 and 2, one can relate the 3D spherical coordinate (1, θp, φp) of a point p̄ on the hemisphere with its projection p̂ on the image plane with 2D polar coordinate (r̂p, θ̂p). [sent-104, score-0.745]
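
The equations themselves are not reproduced in this summary, but the second step can be re-derived from the stated geometry (center of projection at (0, 0, −d), image plane at z = 1). The sketch below is such a re-derivation, not a transcription of the paper's Equations 1 and 2; the function names are hypothetical.

```python
import math

def perspective_to_plane(point, d):
    """Project a 3D point through the center (0, 0, -d) onto the plane z = 1."""
    x, y, z = point
    if z + d <= 0.0:
        raise ValueError("point is not in front of the center of projection")
    t = (1.0 + d) / (z + d)  # ray parameter at which the ray reaches z = 1
    return t * x, t * y

def image_radius(r_p, phi, d):
    """Radial image coordinate of the surface point (r_p, theta, phi).

    Follows from perspective_to_plane with x^2 + y^2 = (r_p sin(phi))^2 and
    z = r_p cos(phi); the azimuth theta is unchanged by the projection.
    """
    return (1.0 + d) * r_p * math.sin(phi) / (r_p * math.cos(phi) + d)

# With a unit sphere (r_p = 1) and d = 1 this reduces to 2 tan(phi / 2),
# the stereographic radius for an image plane at z = 1.
print(image_radius(1.0, 1.0, 1.0), 2.0 * math.tan(0.5))
```

Under this derivation, d = 0 with r_p = 1 gives tan(φ), the rectilinear radius.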

39 Constructing a surface of revolution requires a profile curve γ and a revolution axis. [sent-109, score-0.586]

40 By rotating the profile curve about the revolution axis, one can generate a surface of revolution. [sent-113, score-0.465]

41 In addition to the profile curve and the revolution axis, a trajectory curve R is required for swung surfaces. [sent-115, score-0.778]

42 Similar to surfaces of revolution, the profile curve rotates about the revolution axis. [sent-121, score-0.359]

43 More specifically, the profile curve γ is scaled along the line of intersection between the rotated Aγ and the AR, such that the rotated γ intersects with the trajectory curve R. [sent-124, score-0.543]

44 We set the revolution axis to be the z axis which is also the optical axis of the perspective projection in the second step of our projection model. [sent-126, score-0.977]

45 The trajectory curve is defined on the xy plane and is in a rounded rectangular shape (Figure 4(b)), which can be parameterized by θ as R(θ). [sent-128, score-0.582]

46 Note that the discussion in this subsection applies to all trajectory curves parameterized as R(θ) but not limited to the rounded rectangular curve we chose. [sent-131, score-0.497]

47 Figure 4(c) demonstrates the construction of the swung surface S. [sent-132, score-0.369]

48 The circular arc is rotated (the green arrow) and then scaled (the red arrow) to touch the trajectory curve. [sent-133, score-0.376]

49 A scaled circular arc becomes an elliptical arc by the construction process. [sent-135, score-0.417]

50 Note that no matter how the circular arc is rotated and scaled, the resulting elliptical arc always passes through the point (0, 0, 1). [sent-137, score-0.431]

51 Remember that, for computing Equation 2, given a point p̄ = (1, θp, φp) on the hemisphere, we need to find the radial distance rp of its projection p on S. [sent-138, score-0.405]

52 For our swung surface, p lies on the elliptical arc generated by rotating a unit circular arc (γ) by θp and then scaling it by R(θp). [sent-139, score-0.709]

53 (5) Thus, once the trajectory is defined by specifying R(θ), given a point p̄ on the hemisphere with azimuthal and polar angles θp and φp, one can find its projection’s polar coordinate (r̂p, θ̂p) on the image plane using Equations 1 and 5. [sent-146, score-0.533]
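
Since Equation 5 is not reproduced in this summary, the sketch below re-derives the radial distance from the construction described above. It assumes the unit circular arc is scaled horizontally by R(θp), giving the elliptical arc (ρ/R)² + z² = 1 with ρ = rp sin φp and z = rp cos φp. The names `swung_radius` and `project` are hypothetical.

```python
import math

def swung_radius(R_theta, phi):
    """Radial distance r_p at which the ray with polar angle phi meets the
    elliptical arc (rho / R)^2 + z^2 = 1 (assumed form of the scaled profile)."""
    return 1.0 / math.sqrt((math.sin(phi) / R_theta) ** 2 + math.cos(phi) ** 2)

def project(theta, phi, trajectory, d):
    """Two-step map: hemisphere -> swung surface -> image plane at z = 1.

    trajectory is any callable R(theta); d places the center of projection
    at (0, 0, -d). Returns the 2D polar coordinate (r_hat, theta_hat).
    """
    r_p = swung_radius(trajectory(theta), phi)
    r_hat = (1.0 + d) * r_p * math.sin(phi) / (r_p * math.cos(phi) + d)
    return r_hat, theta

# The arc always starts at (0, 0, 1) and ends on the trajectory curve.
print(swung_radius(2.0, 0.0), swung_radius(2.0, math.pi / 2))
```

The two printed values should be close to 1 and to the chosen R, matching the remark that every elliptical arc passes through (0, 0, 1).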

54 Figure 4: The construction of the swung surface S. Panels: (a) profile curve, (b) trajectory curve, (c) surface construction, (d) elliptical arc, (e) rounded rectangle. [sent-150, score-0.442]

55 For example, when using a circle (R(θ) = 1) as the trajectory, the constructed surface is a sphere and the projection model becomes stereographic projection. [sent-153, score-0.84]

56 If two vertical lines are employed as the trajectory, we obtain a cylinder as the surface and the model becomes Pannini projection. [sent-154, score-0.444]
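
These two special cases can be checked numerically. The sketch below assumes the elliptical-arc form (ρ/R)² + z² = 1 for the scaled profile, and places the two vertical lines at x = ±1 so that R(θ) = 1/|cos θ|; both choices are illustrative assumptions, and all names are hypothetical.

```python
import math

def swung_point(theta, phi, trajectory):
    """3D point where the viewing ray (theta, phi) meets the swung surface."""
    R = trajectory(theta)
    r_p = 1.0 / math.sqrt((math.sin(phi) / R) ** 2 + math.cos(phi) ** 2)
    return (r_p * math.sin(phi) * math.cos(theta),
            r_p * math.sin(phi) * math.sin(theta),
            r_p * math.cos(phi))

def circle(theta):
    return 1.0

def two_vertical_lines(theta):
    # Distance from the origin to the line x = +1 or x = -1 along direction theta.
    return 1.0 / abs(math.cos(theta))

for theta, phi in [(0.3, 0.5), (1.0, 1.2), (-0.7, 0.9)]:
    x, y, z = swung_point(theta, phi, circle)
    sphere_residual = x * x + y * y + z * z - 1.0   # zero on the unit sphere
    x, y, z = swung_point(theta, phi, two_vertical_lines)
    cylinder_residual = x * x + z * z - 1.0         # zero on the cylinder x^2 + z^2 = 1
    print(sphere_residual, cylinder_residual)
```

Both residuals should vanish up to rounding, which is consistent with the sphere and cylinder described above.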

57 In this paper, we adopt a specific family of closed curves as the trajectory curve: rounded rectangles. [sent-155, score-0.342]

58 Figure 5(d) shows the swung surface constructed using a rounded rectangle as the trajectory curve. [sent-156, score-0.784]

59 As shown in Figure 4(e), a rounded rectangle is parameterized by three parameters: width (w), height (h), and the radius of the circle (l) for rounding corners. [sent-158, score-0.35]

60 Therefore, there are three parameters in our projection model: the focal length d, the aspect ratio h and the roundness l. [sent-161, score-0.531]
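
One way to evaluate R(θ) for a rounded rectangle is to intersect a ray from the origin with the boundary. The sketch below assumes a rectangle centered at the origin with full width w, full height h and corner radius l (with l no larger than half the shorter side); the paper's exact normalization of these parameters is not given in this summary, and the function name is hypothetical.

```python
import math

def rounded_rect_radius(theta, w, h, l):
    """Distance from the origin to a centered rounded rectangle along theta."""
    a, b = 0.5 * w, 0.5 * h
    if not 0.0 <= l <= min(a, b):
        raise ValueError("corner radius must lie in [0, min(w, h) / 2]")
    # By symmetry it is enough to work in the first quadrant.
    c, s = abs(math.cos(theta)), abs(math.sin(theta))
    if c > 0.0:
        t = a / c                      # hit on the vertical straight segment x = a
        if t * s <= b - l:
            return t
    if s > 0.0:
        t = b / s                      # hit on the horizontal straight segment y = b
        if t * c <= a - l:
            return t
    # Otherwise the ray meets the corner arc centered at (a - l, b - l).
    cx, cy = a - l, b - l
    proj = c * cx + s * cy
    disc = proj * proj - (cx * cx + cy * cy) + l * l
    return proj + math.sqrt(max(disc, 0.0))  # far root; clamp guards rounding

print(rounded_rect_radius(math.pi / 4, 2.0, 2.0, 0.5))
```

Setting l to min(w, h)/2 with w = h gives a circle, and l = 0 gives a sharp rectangle, so the same routine covers both ends of the roundness range.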

61 The last but the most important reason is that four line segments of the rounded rectangle better preserve lines. [sent-184, score-0.41]

62 We used Tissot’s indicatrix [5] to compare these projection models on distortions. [sent-195, score-0.353]

63 The first column of Figure 7 shows the Tissot’s indicatrix of rectilinear projection, stereographic projection, Pannini projection and the proposed model with a rounded square. [sent-197, score-0.949]

64 It shows how a circle on the viewing sphere is mapped in the projection. [sent-198, score-0.327]

65 Since stereographic projection is conformal, circles on sphere remain circular on the projection plane. [sent-199, score-1.025]
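
For radially symmetric projections, the shape of a Tissot ellipse can be estimated from two local scale factors: one along the radial direction and one along the tangential direction. The sketch below is a generic finite-difference illustration, not the paper's evaluation code; it uses the standard radial formulas 2 tan(φ/2) for stereographic and tan(φ) for rectilinear projection, and the function names are hypothetical.

```python
import math

def local_scales(radial_map, phi, eps=1e-6):
    """Radial and tangential scale factors of r_hat = radial_map(phi).

    A unit step in phi on the sphere maps to d(r_hat)/d(phi) in the image;
    a step along a circle of latitude (length sin(phi) per unit azimuth)
    maps to an arc of length r_hat per unit azimuth.
    """
    radial = (radial_map(phi + eps) - radial_map(phi - eps)) / (2.0 * eps)
    tangential = radial_map(phi) / math.sin(phi)
    return radial, tangential

def stereographic(phi):
    return 2.0 * math.tan(phi / 2.0)

def rectilinear(phi):
    return math.tan(phi)

for name, f in [("stereographic", stereographic), ("rectilinear", rectilinear)]:
    radial, tangential = local_scales(f, 1.0)
    print(name, radial / tangential)  # a ratio of 1 means circles stay circles
```

A ratio near 1 indicates a locally conformal mapping, while the rectilinear ratio grows as 1/cos φ, which reflects the stretching at a large FOV.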

66 Both vertical and horizontal lines become curved in stereographic projection. [sent-203, score-0.647]

67 Pannini projection performs well on vertical lines but poorly on horizontal lines. [sent-204, score-0.645]

68 Our model is only second to rectilinear projection in terms of line preserving. [sent-206, score-0.492]

69 Figure 8(a) gives an example on why a vertical scene line keeps straight under the projection with a rounded rectangle. [sent-207, score-0.837]

70 A vertical scene line is represented by a circular arc which passes through both the zenith and nadir (the orange arc). [sent-208, score-0.649]

71 In the first step, when projecting onto the surface, it becomes a straight line (the blue line segment) because its nearby geometry is an elliptical cylinder (the blue-shaded surface). [sent-209, score-0.396]

72 In the second step, the blue line remains straight since perspective projection preserves lines. [sent-210, score-0.551]
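
The argument can be checked numerically for rays that hit a straight trajectory segment. The sketch below assumes the segment lies on x = a, so that R(θ) = a/|cos θ| there, takes the y axis as the vertical direction, and uses the elliptical-arc form of the scaled profile; these are illustrative assumptions and the names are hypothetical. It is only valid for directions that actually meet the straight segment rather than a rounded corner.

```python
import math

def project_via_straight_segment(direction, a, d):
    """Two-step projection for a ray that meets the trajectory segment x = a."""
    x, y, z = direction                      # unit vector on the viewing sphere
    theta = math.atan2(y, x)
    phi = math.acos(z)
    R = a / abs(math.cos(theta))             # trajectory radius on the segment
    r_p = 1.0 / math.sqrt((math.sin(phi) / R) ** 2 + math.cos(phi) ** 2)
    t = (1.0 + d) / (r_p * z + d)            # perspective from (0, 0, -d) to z = 1
    return t * r_p * x, t * r_p * y

# Sample a vertical scene line: an arc through the zenith and nadir at a
# fixed longitude, with varying latitude beta.
lam = math.radians(40.0)
xs = []
for beta_deg in (-20, -10, 0, 10, 20):
    beta = math.radians(beta_deg)
    direction = (math.sin(lam) * math.cos(beta),
                 math.sin(beta),
                 math.cos(lam) * math.cos(beta))
    x_hat, _ = project_via_straight_segment(direction, a=1.0, d=1.0)
    xs.append(x_hat)

spread = max(xs) - min(xs)
print(spread)  # near zero: the projected points share one image x coordinate
```

A spread near zero means the sampled arc maps to a vertical straight line in the image, in line with the explanation above.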

73 Thus, the straight line segments in the rounded rectangular trajectory produce a surface that is a portion of an elliptical cylinder which better preserves lines. [sent-211, score-0.854]

74 If a scene line is parallel to the straight line segment of the trajectory, and its corresponding arc on the sphere (orange arc) lies in the orange-shaded surface, then the straightness of the scene line is preserved after the projection. [sent-212, score-0.664]

75 From the above discussion, we found that (1) stereographic projection performs better on distortion minimization because it is conformal. [sent-213, score-0.585]

76 Pannini projection can preserve vertical scene lines because its trajectory curve con- Figure 7: Tissot’s indicatrix and grid patterns (columns: Tissot’s indicatrix, grid pattern, projection; rows (a) to (d)). [sent-214, score-0.912]

77 The second column shows the projection of three sets of orthogonal scene lines. [sent-224, score-0.315]

78 Our projection model softly blends the advantages of stereographic projection and Pannini projection by a rounded rectangle composing both straight line segments and circular arcs. [sent-226, score-1.632]

79 In addition, compared to Pannini projection, it contains horizontal line segments in the trajectory and can preserve horizontal scene lines in addition to vertical lines. [sent-227, score-0.705]

80 For the trajectory curve, we first find a good aspect ratio of the rectangle with zero roundness and then look for the best roundness of the rounded rectangles with that aspect ratio. [sent-236, score-0.863]

81 For finding line structures, we used a cube map to project the viewing sphere onto six perspective views, then used the LSD line segment detector [7] to find line structures. [sent-240, score-0.627]

82 Each line segment corresponds to an arc si of a great circle on the viewing sphere. [sent-241, score-0.43]
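
A detected segment can be lifted to its great circle by taking the cross product of the viewing rays of its two endpoints. The sketch below also shows one possible test for vertical scene lines, whose great circles pass through the zenith and nadir. The up direction (taken here as the y axis), the angular tolerance, and the function names are assumptions for illustration, not details from the paper.

```python
import math

def great_circle_normal(p, q):
    """Unit normal of the plane through the sphere center and two viewing rays."""
    n = (p[1] * q[2] - p[2] * q[1],
         p[2] * q[0] - p[0] * q[2],
         p[0] * q[1] - p[1] * q[0])
    norm = math.sqrt(sum(c * c for c in n))
    if norm == 0.0:
        raise ValueError("endpoints are parallel; the great circle is undefined")
    return tuple(c / norm for c in n)

def is_vertical(p, q, up=(0.0, 1.0, 0.0), tol_deg=2.0):
    """True if the great circle through p and q (nearly) contains the zenith.

    The circle contains the up direction exactly when its normal is
    perpendicular to up, so the test measures the angle between up and the plane.
    """
    n = great_circle_normal(p, q)
    dot = abs(sum(a * b for a, b in zip(n, up)))
    return math.degrees(math.asin(min(1.0, dot))) <= tol_deg

# Endpoints are given as (not necessarily unit) viewing rays.
print(is_vertical((1.0, 0.0, 1.0), (1.0, 1.0, 1.0)))    # a vertical edge
print(is_vertical((-1.0, 0.5, 1.0), (1.0, 0.5, 1.0)))   # a horizontal edge
```

Segments from the six cube-map views would first need to be rotated into a common sphere frame before this test is applied.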

83 From the previous analysis, we know that vertical (horizontal) scene lines in subregion Uv (θ) (Uh (θ)) remain straight. [sent-245, score-0.351]

84 On the other hand, vertical lines located in Uh (θ) will be severely distorted and the same for horizontal lines and Uv (θ). [sent-246, score-0.549]

85 ∈Sh λ |θ − (π/4)|, (6) where 1[·] is the indicator function, and Sv and Sh are the sets of vertical and horizontal scene lines, respectively. [sent-255, score-0.392]

86 The first term counts the number of vertical scene lines lying in the subregion Uh, and the second term counts horizontal lines for Uv. [sent-257, score-0.564]

87 For example, we want that vertical scene lines lie in either U? [sent-288, score-0.326]

88 For an output image with the 800 Figure 10: When the horizontal FOV is very wide, the line distortion of stereographic projection worsens quickly. [sent-306, score-0.739]

89 (a)(b)(c) Figure 11: Comparisons with general Pannini projection (Pannini projection followed by vertical compression). [sent-308, score-0.716]

90 Figure 11 compares our projection with the general Pannini projection [6], which is a two-stage process: the Pannini projection followed by vertical compression (VC). [sent-321, score-1.035]

91 To compare our model with general Pannini projection more intuitively, one could interpret our model as a two-stage process: the Pannini projection followed by a warping from Pannini’s result (Figure 7(c)) to our result (Figure 11(c)). [sent-323, score-0.568]

92 By doing so, some horizontal lines can be straightened while keeping vertical lines straight. [sent-328, score-0.508]

93 However, it introduces distortions at both sides of the image and bends vanishing lines (Figure 11(a)(b)). [sent-329, score-0.353]

94 Although some vertical lines become curved, all radial lines are kept straight (as shown in the grid pattern of Figure 7(d)). [sent-332, score-0.542]

95 Our method can also be used for viewing full spherical panoramas or creating thumbnails for them. [sent-340, score-0.336]

96 With our model, the vanishing lines passing through vanishing points will be kept straight. [sent-343, score-0.329]

97 Equirectangular projection severely distorts the content near the zenith and nadir. [sent-347, score-0.47]

98 Pannini projection is less effective in the vertical FOV. [sent-349, score-0.432]

99 Figure 12: Comparisons with stereographic projection, Pannini projection and content-preserving projection. [sent-359, score-0.544]

100 Pannini: a new projection for rendering wide angle perspective images. [sent-405, score-0.374]

similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('pannini', 0.499), ('projection', 0.284), ('stereographic', 0.26), ('swung', 0.258), ('rounded', 0.216), ('revolution', 0.155), ('vertical', 0.148), ('lines', 0.147), ('viewing', 0.142), ('trajectory', 0.126), ('sphere', 0.124), ('hemisphere', 0.122), ('roundness', 0.121), ('rectilinear', 0.12), ('arc', 0.113), ('surface', 0.111), ('fisheye', 0.107), ('tissot', 0.103), ('zenith', 0.092), ('profile', 0.091), ('rp', 0.091), ('vanishing', 0.091), ('line', 0.088), ('equirectangular', 0.086), ('uh', 0.085), ('plane', 0.085), ('rectangular', 0.081), ('aspect', 0.08), ('elliptical', 0.08), ('carroll', 0.076), ('curve', 0.074), ('spherical', 0.073), ('circular', 0.073), ('polar', 0.073), ('rectangle', 0.073), ('distortions', 0.073), ('fov', 0.071), ('straight', 0.07), ('indicatrix', 0.069), ('horizontal', 0.066), ('perspective', 0.065), ('axis', 0.063), ('circle', 0.061), ('conformality', 0.061), ('cylindrical', 0.061), ('panoramas', 0.061), ('coordinate', 0.054), ('nadir', 0.052), ('zorin', 0.052), ('quadrant', 0.049), ('unifies', 0.047), ('ratio', 0.046), ('vc', 0.044), ('kopf', 0.044), ('preserves', 0.044), ('projections', 0.043), ('bends', 0.042), ('severely', 0.041), ('distortion', 0.041), ('surfaces', 0.039), ('cylinder', 0.038), ('scaled', 0.038), ('catadioptric', 0.037), ('uv', 0.036), ('compression', 0.035), ('gigapixel', 0.034), ('mercator', 0.034), ('sharpless', 0.034), ('thumbnails', 0.034), ('projected', 0.034), ('rotating', 0.034), ('preserve', 0.033), ('onto', 0.032), ('ying', 0.031), ('scene', 0.031), ('straightness', 0.031), ('barr', 0.031), ('strikes', 0.031), ('radial', 0.03), ('hemispheres', 0.028), ('panorama', 0.028), ('distorts', 0.028), ('sin', 0.028), ('inclination', 0.027), ('axial', 0.027), ('curved', 0.026), ('passes', 0.026), ('si', 0.026), ('creating', 0.026), ('rotated', 0.026), ('street', 0.026), ('orange', 0.026), ('preserving', 0.026), ('angle', 0.025), ('subregion', 0.025), ('lsd', 0.025), ('cos', 0.025), ('prefer', 0.025), ('content', 0.025), ('square', 0.025)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99999994 346 iccv-2013-Rectangling Stereographic Projection for Wide-Angle Image Visualization

Author: Che-Han Chang, Min-Chun Hu, Wen-Huang Cheng, Yung-Yu Chuang

2 0.1215737 250 iccv-2013-Lifting 3D Manhattan Lines from a Single Image

Author: Srikumar Ramalingam, Matthew Brand

Abstract: We propose a novel and efficient method for reconstructing the 3D arrangement of lines extracted from a single image, using vanishing points, orthogonal structure, and an optimization procedure that considers all plausible connectivity constraints between lines. Line detection identifies a large number of salient lines that intersect or nearly intersect in an image, but relatively few of these apparent junctions correspond to real intersections in the 3D scene. We use linear programming (LP) to identify a minimal set of least-violated connectivity constraints that are sufficient to unambiguously reconstruct the 3D lines. In contrast to prior solutions that primarily focused on well-behaved synthetic line drawings with severely restricting assumptions, we develop an algorithm that can work on real images. The algorithm produces line reconstruction by identifying 95% correct connectivity constraints in York Urban database, with a total computation time of 1 second per image.

3 0.11513561 90 iccv-2013-Content-Aware Rotation

Author: Kaiming He, Huiwen Chang, Jian Sun

Abstract: We present an image editing tool called Content-Aware Rotation. Casually shot photos can appear tilted, and are often corrected by rotation and cropping. This trivial solution may remove desired content and hurt image integrity. Instead of doing rigid rotation, we propose a warping method that creates the perception of rotation and avoids cropping. Human vision studies suggest that the perception of rotation is mainly due to horizontal/vertical lines. We design an optimization-based method that preserves the rotation of horizontal/vertical lines, maintains the completeness of the image content, and reduces the warping distortion. An efficient algorithm is developed to address the challenging optimization. We demonstrate our content-aware rotation method on a variety of practical cases.

4 0.11456856 343 iccv-2013-Real-World Normal Map Capture for Nearly Flat Reflective Surfaces

Author: Bastien Jacquet, Christian Häne, Kevin Köser, Marc Pollefeys

Abstract: Although specular objects have gained interest in recent years, virtually no approaches exist for markerless reconstruction of reflective scenes in the wild. In this work, we present a practical approach to capturing normal maps in real-world scenes using video only. We focus on nearly planar surfaces such as windows, facades from glass or metal, or frames, screens and other indoor objects and show how normal maps of these can be obtained without the use of an artificial calibration object. Rather, we track the reflections of real-world straight lines, while moving with a hand-held or vehicle-mounted camera in front of the object. In contrast to error-prone local edge tracking, we obtain the reflections by a robust, global segmentation technique of an ortho-rectified 3D video cube that also naturally allows efficient user interaction. Then, at each point of the reflective surface, the resulting 2D-curve to 3D-line correspondence provides a novel quadratic constraint on the local surface normal. This allows to globally solve for the shape by integrability and smoothness constraints and easily supports the usage of multiple lines. We demonstrate the technique on several objects and facades.

5 0.10313336 436 iccv-2013-Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach

Author: R. Melo, M. Antunes, J.P. Barreto, G. Falcão, N. Gonçalves

Abstract: Estimating the amount and center of distortion from lines in the scene has been addressed in the literature by the so-called “plumb-line” approach. In this paper we propose a new geometric method to estimate not only the distortion parameters but the entire camera calibration (up to an “angular” scale factor) using a minimum of 3 lines. We propose a new framework for the unsupervised simultaneous detection of natural image of lines and camera parameters estimation, enabling a robust calibration from a single image. Comparative experiments with existing automatic approaches for the distortion estimation and with ground truth data are presented.

6 0.10298225 280 iccv-2013-Multi-view 3D Reconstruction from Uncalibrated Radially-Symmetric Cameras

7 0.094966047 79 iccv-2013-Coherent Object Detection with 3D Geometric Context from a Single Image

8 0.094602369 140 iccv-2013-Elastic Net Constraints for Shape Matching

9 0.091017894 152 iccv-2013-Extrinsic Camera Calibration without a Direct View Using Spherical Mirror

10 0.083575115 292 iccv-2013-Non-convex P-Norm Projection for Robust Sparsity

11 0.082645208 410 iccv-2013-Support Surface Prediction in Indoor Scenes

12 0.07433518 397 iccv-2013-Space-Time Tradeoffs in Photo Sequencing

13 0.07236854 281 iccv-2013-Multi-view Normal Field Integration for 3D Reconstruction of Mirroring Objects

14 0.067334004 342 iccv-2013-Real-Time Solution to the Absolute Pose Problem with Unknown Radial Distortion and Focal Length

15 0.066014752 319 iccv-2013-Point-Based 3D Reconstruction of Thin Objects

16 0.063655533 49 iccv-2013-An Enhanced Structure-from-Motion Paradigm Based on the Absolute Dual Quadric and Images of Circular Points

17 0.062542081 439 iccv-2013-Video Co-segmentation for Meaningful Action Extraction

18 0.05976874 183 iccv-2013-Geometric Registration Based on Distortion Estimation

19 0.056874327 1 iccv-2013-3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding

20 0.05670974 323 iccv-2013-Pose Estimation with Unknown Focal Length Using Points, Directions and Lines

similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.113), (1, -0.1), (2, -0.029), (3, 0.017), (4, -0.024), (5, 0.045), (6, 0.019), (7, -0.073), (8, 0.031), (9, -0.015), (10, 0.043), (11, 0.006), (12, -0.056), (13, 0.028), (14, 0.048), (15, 0.007), (16, 0.033), (17, 0.134), (18, -0.026), (19, -0.021), (20, -0.007), (21, -0.05), (22, 0.047), (23, -0.017), (24, -0.039), (25, 0.043), (26, -0.003), (27, 0.022), (28, -0.027), (29, -0.043), (30, -0.059), (31, 0.053), (32, -0.04), (33, -0.054), (34, -0.014), (35, -0.023), (36, -0.037), (37, -0.058), (38, 0.045), (39, -0.083), (40, 0.139), (41, -0.094), (42, -0.049), (43, -0.012), (44, 0.038), (45, 0.031), (46, -0.013), (47, -0.032), (48, 0.031), (49, 0.016)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.97128254 346 iccv-2013-Rectangling Stereographic Projection for Wide-Angle Image Visualization

Author: Che-Han Chang, Min-Chun Hu, Wen-Huang Cheng, Yung-Yu Chuang

2 0.81552029 436 iccv-2013-Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach

Author: R. Melo, M. Antunes, J.P. Barreto, G. Falcão, N. Gonçalves

3 0.7632342 343 iccv-2013-Real-World Normal Map Capture for Nearly Flat Reflective Surfaces

Author: Bastien Jacquet, Christian Häne, Kevin Köser, Marc Pollefeys

4 0.75290823 250 iccv-2013-Lifting 3D Manhattan Lines from a Single Image

Author: Srikumar Ramalingam, Matthew Brand

5 0.66667926 152 iccv-2013-Extrinsic Camera Calibration without a Direct View Using Spherical Mirror

Author: Amit Agrawal

Abstract: We consider the problem of estimating the extrinsic parameters (pose) of a camera with respect to a reference 3D object without a direct view. Since the camera does not view the object directly, previous approaches have utilized reflections in a planar mirror to solve this problem. However, a planar mirror based approach requires a minimum of three reflections and has degenerate configurations where estimation fails. In this paper, we show that the pose can be obtained using a single reflection in a spherical mirror of known radius. This makes our approach simpler and easier in practice. In addition, unlike planar mirrors, the spherical mirror based approach does not have any degenerate configurations, leading to a robust algorithm. While a planar mirror reflection results in a virtual perspective camera, a spherical mirror reflection results in a non-perspective axial camera. The axial nature of rays allows us to compute the axis (direction of sphere center) and few pose parameters in a linear fashion. We then derive an analytical solution to obtain the distance to the sphere center and remaining pose parameters and show that it corresponds to solving a 16th degree equation. We present comparisons with a recent method that uses planar mirrors and show that our approach recovers more accurate pose in the presence of noise. Extensive simulations and results on real data validate our algorithm.

6 0.6665684 49 iccv-2013-An Enhanced Structure-from-Motion Paradigm Based on the Absolute Dual Quadric and Images of Circular Points

7 0.65967894 348 iccv-2013-Refractive Structure-from-Motion on Underwater Images

8 0.65165573 280 iccv-2013-Multi-view 3D Reconstruction from Uncalibrated Radially-Symmetric Cameras

9 0.62033147 84 iccv-2013-Complex 3D General Object Reconstruction from Line Drawings

10 0.56939733 281 iccv-2013-Multi-view Normal Field Integration for 3D Reconstruction of Mirroring Objects

11 0.55982268 323 iccv-2013-Pose Estimation with Unknown Focal Length Using Points, Directions and Lines

12 0.55715656 27 iccv-2013-A Robust Analytical Solution to Isometric Shape-from-Template with Focal Length Calibration

13 0.5483638 342 iccv-2013-Real-Time Solution to the Absolute Pose Problem with Unknown Radial Distortion and Focal Length

14 0.52976763 90 iccv-2013-Content-Aware Rotation

15 0.52709258 397 iccv-2013-Space-Time Tradeoffs in Photo Sequencing

16 0.522645 139 iccv-2013-Elastic Fragments for Dense Scene Reconstruction

17 0.50821167 319 iccv-2013-Point-Based 3D Reconstruction of Thin Objects

18 0.49503866 140 iccv-2013-Elastic Net Constraints for Shape Matching

19 0.46884891 148 iccv-2013-Example-Based Facade Texture Synthesis

20 0.46532974 183 iccv-2013-Geometric Registration Based on Distortion Estimation

similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(2, 0.047), (7, 0.038), (26, 0.072), (31, 0.078), (35, 0.026), (40, 0.01), (42, 0.079), (56, 0.294), (64, 0.032), (73, 0.029), (89, 0.16), (98, 0.016)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.84718448 262 iccv-2013-Matching Dry to Wet Materials

Author: Yaser Yacoob

Abstract: When a translucent liquid is spilled over a rough surface it causes a significant change in the visual appearance of the surface. This wetting phenomenon is easily detected by humans, and an early model was devised by the physicist Andres Jonas Angstrom nearly a century ago. In this paper we investigate the problem of determining if a wet/dry relationship between two image patches explains the differences in their visual appearance. Water tends to be the typical liquid involved and therefore it is the main objective. At the same time, we consider the general problem where the liquid has some of the characteristics of water (i.e., a similar refractive index), but has an unknown spectral absorption profile (e.g., coffee, tea, wine, etc.). We report on several experiments using our own images, a publicly available dataset, and images downloaded from the web.

This work was partially supported by the Office of Naval Research under Grant N00014-10-1-0934.

1. Background

When a material absorbs a liquid it changes visual appearance due to richer light reflection and refraction processes. Humans easily detect wet versus dry surfaces, and are capable of integrating this ability in object detection and segmentation. As a result, a wet part of a surface is associated with the dry part of the same surface despite significant differences in their appearance. For example, when driving over a partially wet road surface it is easily recognized as a drivable surface. Similarly, a wine spill on a couch is recognized as a stain and not a separate object. The same capability is harder to implement in computer vision since the basic attributes of edges, color distributions and texture are disrupted in the wetting process. Engineering algorithms around these changes has not received attention in published research. Nevertheless, such capability is needed to cope with partial wetting of surfaces.

The emphasis of this paper is on surfaces combining both
dry and wet parts. Distinguishing between completely wet and dry surfaces in independent images requires accounting for the illumination variations in the scenes, and may be subject to increased ambiguity in the absence of context. For example, comparing an image of a dry T-shirt to an image of the same T-shirt taken out of a washing machine is a more challenging problem since the straightforward solution is to consider them as different colored T-shirts. However, the algorithms we develop in this paper apply to this scenario assuming illumination is the same in both images.

Figure 1. A partially wet concrete pavement, water spilled on wood, water stain on a cap, and coffee spilled on a carpet.

Figure 1 shows examples we analyze: (a) partially wet concrete pavement, (b) water spilled on a piece of wood, (c) water stain on a cap, and (d) coffee spilled on a carpet. We assume that the wet and dry patches have been pre-segmented and focus on whether the dry patch can be synthesized to appear wet under unknown parameters employing a well-known optical model.

There are several factors that determine the visual appearance of wet versus dry surfaces. Specifically:

• The physical properties of the liquid involved. The translucence (or light absorption) of the liquid determines if interreflection occurs and is visually observed. Water is translucent, while paint is near opaque. The light absorption of the liquid as a function of wavelengths affects the overall spectral appearance of the wet area. Water absorbs slightly more of the green and red wavelengths and less of the blue wavelength, while olive oil absorbs more of the blue wavelength and much less of the red and green wavelengths.

• The size and shape of the liquid affect the optical properties of the scene. For example, liquid droplets create a complex optical phenomenon as the curvature of each droplet acts as a lens (e.g., a drop of water can operate as a magnifying lens as well as cause light dispersion).
• The illuminant contributes to the appearance of both the dry and wet patches since it determines the wavelengths that are reaching the scene and the absorptions of the surface and liquid.

• The liquid absorption rate of the material determines whether a thin film of liquid remains floating apart on top of the material surface. For example, some plastics or highly polished metals absorb very little liquid and therefore a wetting phenomenon without absorption occurs. Nevertheless, non-absorbed liquids do change the appearance of the surface as they form droplets.

• Specular reflections may occur at parts of the wet surface and therefore mask the light refraction from air to liquid and the interreflections that occur within the liquid-material complex.

In this paper we study the problem of determining if two patches within the same image (or two images taken under similar illumination conditions) can be explained as wet and dry instances of the same material given that the material, liquid and illumination are unknown. The paper's contribution is an algorithm for searching a high-dimensional space of possible liquids, materials and imaging parameters to determine a plausible wetting process that explains the appearance differences between two patches. Beyond the basic aspects of the problem, the results are relevant to fundamental capabilities such as detection, segmentation and recognition.

2. Related Research

Wet surfaces were considered first as an optics albedo measurement of various surfaces by Angstrom in 1925 [1]. The proposed model assumed that light reaching the observer stems solely from rays at or exceeding the critical angle, and thus the model predicted less light than experimental data showed. Lekner and Dorf [3] expanded this model by accounting for the probability of internal reflections in the water film and the effect of the decrease of the relative refractive index at the liquid-to-material surface.
Their model was shown to agree more closely with experimental data. In computer graphics, Jensen et al. [5] rendered wet surfaces by combining a reflection model for surface water with subsurface scattering. Gu et al. [6] observed empirically the process of surface drying of several materials but no physical model for drying was offered. There has been little interest in wet surfaces in computer vision. Mall and da Vitoria Lobo [4] adopted the Lekner and Dorf model [3] to convert a dry material into a wet appearance and vice versa. The algorithm was described for greyscale images and fixed physical parameters. This work forms the basis of our paper. Teshima and Saito [2] developed a temporal approach for detection of wet road surfaces based on the occurrence of specular reflections across multiple images.

3. Approach

Given two patches, Pd presumed dry, and Pw possibly wet, the objective is to determine if a liquid of unknown properties can synthesize the dry patch so that it appears visually similar to the wet patch. We employ the term material to describe the surface that absorbs the thin film of liquid to create the wet patch. We leverage the optical model developed by [3] and used by [4], by formulating a search over the parameter space of possible materials and liquids. In this paper we focus on a partial set of liquid-on-material appearances. Specifically, we exclude specular reflections, non-absorbing materials, and liquid droplets.

3.1. Optics Model

Figure 2 shows the basic model developed in [3]. A light ray enters the liquid film over the rough material surface with a probability of 1 − Rl, where Rl is the reflectance at the air-liquid interface. A fraction, a, of this light is absorbed by the material surface, and thus (1 − Rl) ∗ (1 − a) is reflected back to the liquid surface. Let p be the fraction of light reflected back into the liquid at the liquid-air surface.
The total probability of absorption by the rough surface as this process repeats is described by

$$A = (1 - R_l)\left[a + a(1-a)p + a(1-a)^2 p^2 + \dots\right] = \frac{(1 - R_l)\,a}{1 - p(1 - a)} \qquad (1)$$

Lekner and Dorf [3] show that p can be written in terms of the liquid's refractive index nl and the average reflectance R of an isotropically illuminated surface:

$$p = 1 - \frac{1}{n_l^2}\left[1 - R(n_l)\right] \qquad (2)$$

where, for n > 1,

$$R(n) = \frac{3n^2 + 2n + 1}{3(n+1)^2} - \frac{2n^3(n^2 + 2n - 1)}{(n^2+1)^2(n^2-1)} + \frac{n^2(n^2+1)}{(n^2-1)^2}\log(n) - \frac{n^2(n^2-1)^2}{(n^2+1)^3}\log\!\left(\frac{n(n+1)}{n-1}\right) \qquad (3)$$

Figure 2. The light air-to-liquid and liquid-to-surface model.

Lekner and Dorf [3] proposed that the light absorption rates of the dry and wet materials are different, and that the wet material will always have a higher absorption rate. Let ad and aw be the light absorption rates of the dry and wet materials respectively, so that aw > ad. Thus the albedo values for the dry and wet surfaces are 1 − ad and A = 1 − aw, respectively, assuming isotropic illumination. Let nr be the refractive index of the material. For small absorptions, ad ≈ 1 − R(nr) and aw ≈ 1 − R(nr/nl), and therefore

$$a_w \approx a_d\,\frac{1 - R(n_r/n_l)}{1 - R(n_r)} \qquad (4)$$

while for large absorptions aw ≈ ad. An interpolation of the two values can be expressed as

$$a_w = a_d(1 - a_d)\,\frac{1 - R(n_r/n_l)}{1 - R(n_r)} + a_d^2 \qquad (5)$$

3.2. Imaging Model

Lekner and Dorf [3] and Mall and da Vitoria Lobo [4] focused on the albedo change between dry and wet surfaces. The model is suitable for estimating reflectance of a single wavelength but requires extension to aggregated wavelengths captured by greyscale or color images. In [4], the model was applied to greyscale images where the true albedo was approximated by using the maximum observed brightness in the patch. This assumes that micro-facet orientations of the material are widely distributed. Color images present two additional issues: cameras (1) integrate light across spectral zones, and (2) apply image processing, enhancement and compression to the raw images.
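The optics model of Section 3.1 (Equations 1 to 5) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, the air-liquid reflectance Rl is passed in as a plain argument because the text does not say how it is obtained, and Equation 5 is used in the form that reduces to Equation 4 for small ad and to aw = ad for ad = 1.

```python
import math


def isotropic_reflectance(n: float) -> float:
    """Average reflectance R(n) under isotropic illumination (Equation 3)."""
    if n <= 1.0:
        raise ValueError("Equation 3 is stated for n > 1")
    n2 = n * n
    term1 = (3 * n2 + 2 * n + 1) / (3 * (n + 1) ** 2)
    term2 = 2 * n ** 3 * (n2 + 2 * n - 1) / ((n2 + 1) ** 2 * (n2 - 1))
    term3 = n2 * (n2 + 1) / (n2 - 1) ** 2 * math.log(n)
    term4 = n2 * (n2 - 1) ** 2 / (n2 + 1) ** 3 * math.log(n * (n + 1) / (n - 1))
    return term1 - term2 + term3 - term4


def internal_reflection_fraction(n_l: float) -> float:
    """Fraction p reflected back into the liquid at the liquid-air surface (Equation 2)."""
    return 1.0 - (1.0 - isotropic_reflectance(n_l)) / (n_l * n_l)


def total_absorption(a: float, p: float, r_l: float) -> float:
    """Closed form of the geometric series in Equation 1."""
    return (1.0 - r_l) * a / (1.0 - p * (1.0 - a))


def wet_absorption(a_d: float, n_r: float, n_l: float) -> float:
    """Interpolated wet absorption a_w (Equation 5); assumes n_r > n_l > 1."""
    ratio = (1.0 - isotropic_reflectance(n_r / n_l)) / (1.0 - isotropic_reflectance(n_r))
    return a_d * ((1.0 - a_d) * ratio + a_d)


if __name__ == "__main__":
    print(round(isotropic_reflectance(1.33), 4))
    print(round(wet_absorption(0.3, 2.0, 1.33), 4))
```

Because Equation 3 is only stated for n > 1, the sketch raises an error when nr/nl falls at or below 1. How such material indices should be handled is left open here.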
As a result, the input image is a function of the actual physical process but may not be quantitatively accurate. Our objective is to estimate the albedo of the homogeneous dry patch, Pd, for each of the RGB channels (overlooking the real spectral wavelengths), despite unknown imaging parameters. It is critical to note that the camera acquires an image that is a function of the albedo, surface normal and illuminant attributes (direction, intensity and emitted wavelengths) at each pixel, so that estimating the true physical albedo is challenging in the absence of information about the scene. In the following we first describe a representation of the relative albedo in RGB and then describe how it is re-formulated to derive possible absolute albedo values. Let the albedo of the homogeneous dry material be AR, AG , AB with respect to the RGB channels. Then, AR = 1 − aR, AG = 1 − aG, AB = 1 − aB (6) where aR, aG , aB are the absorption rates of light in the red, green and blue channels, respectively. Since the value of each absorption parameter is between 0 and 1, it is possible to search this three dimensional space in small increments of aR, aG , aB values. However, these absorption rates are confounded with the variable surface normals across the patch as we consider RGB values. Instead, we observe that the colors of pixels reflect, approximately, the relative absorption rates of red, green and blue. For example, a grey pixel indicates equal absorption in red, green and blue regardless of the level of the greyness. The surface normal contributes to a scalar that modifies the amount of light captured by the camera, but does not alter the relative albedos. Therefore, we can parametrize the albedo values as AR ∗ (1, rGR, rBR), where rGR and rBR are the relative albedo values green-to-red and blue-to-red, respectively. This parametrization does not, theoretically, change due to variation in surface normals. 
Specifically, consider a homogeneous patch of constant albedo but variable surface normals. Assuming a Lambertian model, the image reflectance can be expressed as

$$I_R(x,y) = A_R\,(N(x,y)\cdot S(x,y)),\qquad I_G(x,y) = A_G\,(N(x,y)\cdot S(x,y)),\qquad I_B(x,y) = A_B\,(N(x,y)\cdot S(x,y)) \qquad (7)$$

where N(x, y) and S(x, y) are the surface normal and the illuminant direction at (x, y), respectively (S(x, y) = S for a distant point light source). The two ratios rGR = IG/IR and rBR = IB/IR are constant for all pixels (x, y), independent of the dot product of the normal and illumination vectors (N(x, y) · S(x, y)), since it cancels out. In practice, however, due to imaging artifacts, the ratios are more diffuse and therefore multiple ratios may be detectable over a patch. Given a dry patch, Pd, we compute a set of (rGR, rBR) pairs. If the patch were perfectly uniform (in terms of surface normals), a single pair would be found, but for complex surfaces there may be several such pairs. We histogram the normalized G/R and B/R values to compute these pairs. Let Sd denote the set of these ratios computed over Pd. As a result of the above parametrization, the red albedo, AR, is unknown and is searched for the optimal fit, while AG and AB are computed from the Sd ratios.

Mall and da Vitoria Lobo [4] proposed that, assuming a rough surface, the maximum reflected brightness, Imax, can be used as a denominator to normalize all values and generate relative albedo values. In reality, even under these assumptions, Imax is only the lower bound of the value that should be used as the denominator to infer the albedo of the patch. Moreover, the values acquired by the camera are subject to automatic gain, white balance and other processing that tend to change numerical values. For example, a surface with albedo equal to 1 may have a value of 180 (out of 256 levels), and therefore mislead the recovery of the true surface albedo (i.e., suggesting a lower albedo than 1).
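The ratio computation described in this section can be illustrated with a small sketch. The paper only states that the normalized G/R and B/R values are histogrammed, so the bin width, the minimum bin population, and the function name below are all assumptions chosen for illustration.

```python
from collections import defaultdict


def dominant_ratio_pairs(pixels, bin_width=0.05, min_fraction=0.1):
    """Return (r_GR, r_BR) pairs from well-populated bins of a 2-D ratio histogram.

    pixels: iterable of (R, G, B) intensities from the dry patch.
    bin_width and min_fraction are hypothetical tuning parameters.
    """
    bins = defaultdict(list)
    usable = 0
    for r, g, b in pixels:
        if r <= 0:
            continue  # the ratios are undefined when the red value is zero
        usable += 1
        r_gr, r_br = g / r, b / r
        key = (int(r_gr // bin_width), int(r_br // bin_width))
        bins[key].append((r_gr, r_br))

    pairs = []
    for members in bins.values():
        if len(members) >= min_fraction * usable:
            mean_gr = sum(m[0] for m in members) / len(members)
            mean_br = sum(m[1] for m in members) / len(members)
            pairs.append((mean_gr, mean_br))
    return sorted(pairs)


if __name__ == "__main__":
    # One albedo seen under three shading factors: the ratios do not change.
    albedo = (0.8, 0.42, 0.22)
    shaded = [tuple(255 * a * s for a in albedo) for s in (0.2, 0.5, 1.0)]
    print(dominant_ratio_pairs(shaded))
```

In the usage example the shading factor multiplies all three channels equally, so every pixel lands in the same bin, which is the invariance that Equation 7 predicts for a Lambertian surface.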
The optics framework requires absolute albedo values to predict the wet albedo of the surface. Therefore, the reflectance values should be normalized with respect to an unknown Rwhite ≥ Imax (typically), which represents the absolute value that corresponds to the intensity of a fully reflective surface under the same imaging conditions (including unknown camera imaging parameters, and a normal and illuminant dot product equal to 1.0). Note that for an ideal image acquisition an albedo of 1 corresponds to Rwhite = 256, but in practice Rwhite can be lower (e.g., for white balance) or higher than 256 (e.g., camera gain). Determining Rwhite involves a search for the best value in the range Imax to IUpperBound. While IUpperBound can be chosen as a large number, the computational cost is prohibitive. Instead, we observe that if we assume that the patch includes all possible surface normal orientations, then the maximum intensity Imax corresponds to (N(x, y) · S(x, y)) being 1.0 while the minimum intensity Imin corresponds to (N(x, y) · S(x, y)) near zero, for the unknown albedo A (see Equation 7). Let n denote a vector of the values of all the normals multiplied by the illuminant direction (these values span the range 0..1). Therefore, the brightness of an object with an albedo of 1 in these unknown imaging conditions (and including the camera's image processing) can be computed as

$$I_{UpperBound} = 256 \cdot \max(A \cdot n) + 256 \cdot \max((1 - A) \cdot n) \qquad (8)$$

where 256 is the camera's intensity output range (assuming no saturation occurred). This is equal to

$$I_{UpperBound} = I_{max} + (256 - I_{min}) \qquad (9)$$

Imax and Imin may be subject to noise and imaging factors that may create outliers, so we approximate the intensity values as a Gaussian distribution with a standard deviation σ and assign Imax − Imin = 4 ∗ σ, cropping the tail values and capturing near 97% of the distribution, so that IUpperBound = 256 + 4 ∗ σ.
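As a small worked illustration of Equation 9 and its 4σ form, the sketch below computes the upper bound both ways. The function names are hypothetical, and the use of the population standard deviation is an assumption, since the text does not say which estimator is used.

```python
import statistics


def upper_bound_from_extremes(i_max: float, i_min: float, levels: int = 256) -> float:
    """Equation 9: I_UpperBound = I_max + (levels - I_min)."""
    return i_max + (levels - i_min)


def upper_bound_from_sigma(intensities, levels: int = 256) -> float:
    """Outlier-tolerant form: substitute I_max - I_min = 4 * sigma into Equation 9."""
    sigma = statistics.pstdev(intensities)
    return levels + 4.0 * sigma


if __name__ == "__main__":
    patch = [90, 110, 95, 105, 100, 100]
    print(upper_bound_from_extremes(max(patch), min(patch)))
    print(upper_bound_from_sigma(patch))
```

The second function ignores the raw extremes entirely, which is the point of the Gaussian approximation: a single saturated or dead pixel shifts max and min but barely moves σ.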
The Gaussian assumption is reasonable for a rough surface, but for a flat surface σ is near zero, and therefore we use IUpperBound = 256 + 100 as an arbitrary value. Note that IUpperBound reduces the range of the search for the best Rwhite and not the quality of the results. We use the largest value of IUpperBound computed for each of the RGB channels for all searches. Imax may be subject to automatic gain amplification during acquisition. Therefore, the range of values for Rwhite is expanded to be from 0.75 ∗ Imax to IUpperBound. The choice of 0.75 is arbitrary since it assumes that the gain is limited to 33% of the true values, and one could choose a different value. Given a pixel from a dry patch, Pd, we can convert its value to a wet pixel

$$P_w(x,y) = P_d(x,y) + \left((1 - a_d) - (1 - a_w)\right) \cdot R_{white} \qquad (10)$$

where aw is calculated using Equation 5 given a specific ad. Equation 10 is applied to each of the RGB channels using the respective parameters.

3.3. Liquid Spectral Absorption

The model described so far assumed that the spectral absorption of the liquid film itself is near zero across all wavelengths. This is a reasonable assumption for water since it can be treated as translucent given the negligible thickness of the liquid present at the surface. We next consider water-based liquids that have different absorption rates across wavelengths, such as coffee and wine (even at negligible thickness). We assume a refractive index equal to that of water, and let qr, qg, qb represent corrective absorption rates in RGB, respectively. These corrective rates modify the darkening due to water-based wetness. The real liquid absorption rates are computed as

$$L_r = q_r,\qquad L_g = a_{wg} - a_{wr} + q_g,\qquad L_b = a_{wb} - a_{wr} + q_b \qquad (11)$$

where awr, awg, awb are the wet surface absorptions for red, green and blue, respectively (for water).
Equation 10 is modified to account for the liquid absorption rates:

$$P_w(x,y) = P_d(x,y) + \left((1 - a_d) - (1 - a_w) - (1 - q)\right) \cdot R_{white} \qquad (12)$$

where the respective parameters for each of the RGB channels are used. Note that Equation 11 computes relative absorption rates with respect to qr, so that we recover only the differences in absorptions between the RGB channels. Nevertheless, these relative absorptions are informative and sufficient since the absolute values are intertwined with the intensity of the illuminant. For example, adding a constant absorption of 0.1 to each of Lr, Lg, Lb is equivalent to a decrease in reflected light equal to a 10% loss of illuminant intensity. Absent prior information, we search the full range of possible values between 0 and 1.0 for each variable. In practice, we can, in most cases, limit the search to values between 0.0 and 0.5 since higher values are likely, when combined with the increased absorption due to wetting, to drive total light absorption to 1.0, which represents a black object. In cases where Pw shows complete absorption of a wavelength (e.g., a thick layer of wine or coffee), the 0..1 range is searched. Moreover, values that represent equal absorptions, qr ≈ qg ≈ qb, are unnecessary to consider since they are functionally equivalent to water (but they do contribute uniform darkening in all channels that is automatically captured in the computation of the absorption values of the material). The search is conducted in small increments of 0.02.

3.4. Similarity Metric

The synthesized wet patch Ps is scored against Pw. A useful similarity metric is the well-known Earth Mover's Distance [7] (EMD). The distance is computed between the size-normalized histograms of the two patches. The smaller the distance, the closer the appearance between the synthesized and true wet patches.
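To make the histogram comparison concrete, here is a minimal sketch of the Earth Mover's Distance for the special case of two one-dimensional, size-normalized intensity histograms with unit bin spacing, where the distance reduces to the summed absolute difference of the cumulative histograms. Treating each color channel separately in this way is a simplifying assumption of the sketch; the paper does not state whether EMD is applied per channel or on a joint color histogram.

```python
def normalized_histogram(values, bins: int = 256):
    """Histogram of integer-binned intensities, scaled so the bins sum to 1."""
    hist = [0.0] * bins
    for v in values:
        index = min(max(int(v), 0), bins - 1)  # clamp out-of-range intensities
        hist[index] += 1.0
    total = float(len(values))
    return [h / total for h in hist]


def emd_1d(hist_a, hist_b) -> float:
    """EMD between two equal-mass 1-D histograms with unit distance between bins."""
    carried = 0.0   # mass that still has to be moved to later bins
    distance = 0.0
    for a, b in zip(hist_a, hist_b):
        carried += a - b
        distance += abs(carried)
    return distance


if __name__ == "__main__":
    dry = normalized_histogram([120, 121, 122, 123])
    wet = normalized_histogram([100, 101, 102, 103])
    print(emd_1d(dry, wet))
```

Normalizing by patch size is what allows a small wet patch to be compared with a large dry one, as the text requires.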
Given that these patches are typically taken from different parts of the same image, we assume that the dry and wet patches are of the same material and have similar surface normal distributions. If the distributions of surface normals between the two patches violate this assumption, we have a suboptimal similarity metric. Devising a metric that accounts for different and unknown distributions of surface normals remains an open problem. Note that EMD is not suitable for comparing different materials (e.g., if the wet and dry material are of two different wood species).

4. Search Space

We summarize the search parameters to determine the best synthesis, Ps, of Pd given Pw. The refractive index of the material, nr, is unknown. Refractive indices of materials vary widely, with air being near 1.0 and the highest measured value (for a synthetic material) being 38.6. Common materials, however, tend to fall between 1 and 5.0. As a result, we perform a search on all values of nr between 1.1 and 5.0 in increments of 0.1 (note that if we assume the material to have a higher refractive index than water, the search can be made between 1.5 and 5.0). Note that nr is dependent on light wavelengths (i.e., light wavelengths have slightly different speeds in the same medium), but accounting for this variation in the search process is computationally expensive. Therefore, we use the same nr for the three channels. We assume the liquid to be water-like, so that nl is known. Specifically, we assume that nl = 1.331 for the red channel, nl = 1.336 for the green channel, and nl = 1.343 for the blue channel. This assumption is suitable for most water-based liquids such as coffee, wine, etc. (in practice, the ethanol in wine increases the refractive index slightly, and coffee particles increase it up to 1.5). Other liquids, such as oil, have different refractive indices, but since we assume no prior information, we employ the water refractive indices even when oil may be involved.
The absorption rate of the dry material, ad, is unknown and falls in the range 0 to 1.0. The discussion in subsection 3.2 uses the albedo AR as a variable and derives the green and blue albedo values, and thus their absorptions, accordingly. Therefore, we perform a search over all values between 0.05 and 0.95 in 0.05 increments for adR. The values Imin, Imax and IUpperBound are pre-computed and then a search for the optimal Rwhite is computed in increments of 20 units over the range from 0.75 ∗ Imax to IUpperBound. Depending on the expected liquid, we can limit the search to water, or search in a reduced 3D space of liquid correction absorption rates, qr, qg, qb, as discussed in section 3.3. Algorithm 1, below, is for the case of water, but can be adjusted for an unknown liquid.

Algorithm 1 Dry-to-Wet algorithm
procedure DRY2WET(Pd, Pw)
  for nr ← 1.1 : 5.0 do
    for adR ← 0.05 : 0.95 do
      for Rwhite ← 0.75 ∗ Imax : IUpperBound do
        for all pairs in Sd do
          Compute adG, adB
          Compute awR, awG, awB
          Compute Ps using Eq. (10)
          d = EMD(Pw, Ps)
          dmin = min(dmin, d)
        end for
      end for
    end for
  end for
  return dmin and Ps corresponding to dmin
end procedure

5. Experiments

We conducted experiments on three data sets: images collected by us, images collected from the web, and a controlled set of drying objects collected and described in Gu et al. [6]. The experiments answer the question: given a dry patch, Pd, and a patch likely to be wet, Pw, what are the best parameters that make Pd look most similar to Pw? The answer allows uncovering physical information about the liquid and the material which is valuable for computer vision. The answer may also indicate that no wetting process can make Pd look like Pw, which is also valuable since it suggests that the two patches differ in more significant ways. Note that we focus on applying a physically-motivated model to the problem and not an image-based appearance transformation.
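The loop structure of Algorithm 1 can be sketched in Python as an exhaustive grid search over the increments given in Section 4. This is an illustrative skeleton only: the patch synthesis and the distance are passed in as callables (hypothetical `synthesize` and `distance` arguments) so that the sketch does not commit to a particular implementation of Equations 5 and 10 or of the EMD, and the rule for skipping absorption values outside [0, 1] is an assumption.

```python
def frange(start: float, stop: float, step: float):
    """Inclusive float range computed from an integer index to limit rounding drift."""
    count = int(round((stop - start) / step))
    return [round(start + i * step, 10) for i in range(count + 1)]


def dry_to_wet_search(ratio_pairs, i_max, i_upper, synthesize, distance):
    """Grid search over n_r, a_dR, R_white and the (r_GR, r_BR) pairs in S_d.

    synthesize(params) -> synthesized wet patch (caller-supplied, hypothetical)
    distance(patch)    -> score against the observed wet patch (e.g. an EMD)
    Returns (best_distance, best_params).
    """
    best_distance, best_params = float("inf"), None
    for n_r in frange(1.1, 5.0, 0.1):
        for a_d_red in frange(0.05, 0.95, 0.05):
            r_white = 0.75 * i_max
            while r_white <= i_upper:
                for r_gr, r_br in ratio_pairs:
                    # Albedo is A_R * (1, r_GR, r_BR) and absorption is 1 - albedo.
                    a_d_green = 1.0 - (1.0 - a_d_red) * r_gr
                    a_d_blue = 1.0 - (1.0 - a_d_red) * r_br
                    if not (0.0 <= a_d_green <= 1.0 and 0.0 <= a_d_blue <= 1.0):
                        continue  # assumption: discard physically invalid combinations
                    params = {
                        "n_r": n_r,
                        "a_d": (a_d_red, a_d_green, a_d_blue),
                        "r_white": r_white,
                    }
                    d = distance(synthesize(params))
                    if d < best_distance:
                        best_distance, best_params = d, params
                r_white += 20.0
    return best_distance, best_params
```

With 40 values of nr, 19 values of adR, and a handful of Rwhite steps and ratio pairs, the search evaluates a few thousand candidates per patch pair, which is why the text bounds the Rwhite range instead of letting IUpperBound grow arbitrarily.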
One could pose the problem differently by computing a transformation (that has nothing to do with wetting) that maximizes the similarity between a transformed Pd and Pw. But such a transformation does not uncover information about the physical process that is involved and is ultimately less insightful. The patches Pd and Pw are manually delineated. The border area between the patches is neither fully dry nor fully wet. Therefore, the border area is rarely synthesized properly. We exclude these boundary pixels from the EMD computation between Ps and Pw. Empirically, we observed that EMD distances below 20 indicate close resemblance and distances below 10 indicate near-identical images. Note that EMD does not capture the spatial color variations (i.e., texture differences). In all figures below, the numeric values show the EMD distance, followed by (nr, Rwhite); the next row shows the respective albedo values AR, AG, AB. In the images of the colored liquids, the third row shows the albedo of the liquid ALR, ALG, ALB.

Figure 3 shows the results of the closest synthetic wetting of a dry material (images taken from [6]). These images were taken under controlled illumination but at different times, as the initially wet material dried. The top row shows the dry materials and the middle row shows the real wet materials; both are provided by [6]. The bottom row of images shows the computed wet materials using our algorithm. Below each image we provide the physical parameters that our algorithm uncovered, assuming the liquid is water. Note that most of the true wet images have some specular reflections that are not generated by our model. The materials are (left to right): rock, wood, cloth, wood, felt, paper, cardboard, brick, wood, cloth, cloth and granite. The results indicate that wood is the least successfully analyzed material. The wet wood has increased spectral divergence in colors beyond what the dry material exhibits and therefore does not appear to be correctly captured by the model.
Specifically, the wet wood appears to absorb more of the blue and green light relative to red, and therefore the wood is tinted brown-red. We discuss this issue further in Section 6.

Figure 4 shows images we acquired of different wet materials. From left to right, all images have a darker wet patch: yellow paper (wet on the right side), paper towel, large area of a cap, a smaller part of the same cap, blue paper, orange fleece material, grey/blue paper, green paper, orange fabric, and grey/blue fabric. The distances are largest for the complete green cap and blue paper. The reason is that the surface normal distributions vary between the wet and dry patches, and therefore the EMD is not a suitable metric (see discussion in subsection 3.4). The smaller part of the cap shows very good synthesis of the dry patch.

Figure 5 shows a collection of images of water-based wetting of different materials downloaded from the web. From left to right, raster scan, partially wet: two cardboard images, concrete, yellow brick, three types of wood, blue fabric, two images of different types of sand, red tile, red brick, blue/green brick, striped shirt and grey pants. Two of the wood images show the largest distances and a discussion of likely reasons is provided in Section 6. The rest of the images are close to the real wet areas in each image, ignoring the borders between patches.

Figure 6 shows a collection of images of non-water wetting downloaded from the web. From left to right, raster scan, partially wet: coffee on carpet, coffee on wood, wine on carpet, olive oil on hummus, olive oil on wood, tea on fabric, coffee on fabric, two images of coffee on carpet, wine on tile, wine on carpet, wine on granite, same image but applying a water model, wine on carpet, coffee on plastic table cloth, coffee on carpet, coffee on shirt, same image but applying a water model, wine on yellow napkin, and soy sauce on yellow napkin (the last two images were acquired by us).
The liquid color is rendered with intensity that is close to the wet area. The wine on granite and coffee on shirt are also used to demonstrate the results of the water model as opposed to accounting for different spectral absorptions. Overall the distances are low, with the exception of the olive oil on wood and wine on white carpet (middle of the bottom group). The olive oil on wood may be related to the explanations in Section 6, while the wine on carpet shows a marked difference in surface normals between the dry and wet patches (the wet patches are in focus while the dry patch is blurred).

6. Open Challenges

The experiments indicated that in some images of wet wood, the model is not accurate. Figure 7 shows an image of an outdoor deck, a part of a wetted area used for an experiment, and the synthesized dry patch using our model. The dry wood appears nearly perfectly grey, while the wet wood is brown. The wet pixels show high absorption of green relative to red, and even higher absorption of blue relative to green and red. The model does not predict this result given that the liquid is water. A similar phenomenon was observed in some experiments in Figures 3 and 5. We suggest two conjectures as to why this occurs. The first has to do with image acquisition, and suggests that perhaps the camera is overstating the amount of blue and green light reflected at the dry patch. The second is that these woods and their resultant images have a more complex wetting process. Specifically, it is possible that this wood is composed of two layers: the first is very thin and tends to have only a hint of the spectral properties of the wood, and the second layer reflects the full spectral attributes of the wood. The top layer may come to exist due to environmental degradation or dust, but may not exist in freshly cut wood.
For the dry wood in Figure 7 the reflectance is mostly the result of reflection from the top layer, while upon wetting, the second layer is reached by the water and thus it becomes the dominant source of reflectance. Unfortunately it remains an open challenge to explain these deviations from the model. Differences in the distributions of the surface normals between the dry and wet patches make it harder to determine similarity (even if a different metric than EMD is used). This is a general computer vision problem that is not specific to wetting, but is made more challenging by the complexity of the wetting process.

Figure 3. Top row, images of dry material, middle row, images of wet materials (water), and bottom row the synthesized wet images.

Figure 4. Top row, input images with wet patches. Bottom row, dry patches synthesized into wet patches assuming water. From left to right, yellow paper, brown paper towel, large area over a cap, small area of the cap, blue paper, orange fleece, grey/blue paper, green paper, orange fabric and grey/blue fabric.

Figure 7. Left to right, footprints on dry deck, input for our algorithm, and synthesized output.

7. Summary

In this paper we investigated the problem of visual appearance change as liquids and rough surfaces interact.
The problem assumes that two patches are given: the first is known to be dry and the second is possibly wet. Liquid attributes that are close to those of water, but that also allow for varying absorption rates across spectral wavelengths, make it possible to account for unknown liquids such as coffee, wine and oil. Our experiments indicate an ability to explain wetting effects in different materials and under unknown imaging conditions.

References

[1] A. Angstrom. The Albedo of Various Surfaces of Ground. Geographic Annals, vol. 7, 1925, 323-342.
[2] T. Teshima, H. Saito, M. Shimizu, and A. Taguchi. Classification of Wet/Dry Area Based on the Mahalanobis Distance of Feature from Time Space Image Analysis. IAPR Conference on Machine Vision Applications, 2009, 467-470.
[3] J. Lekner and M. C. Dorf. Why some things are darker when wet. Applied Optics, (27)7, 1988, 1278-1280.
[4] H. Mall and N. da Vitoria Lobo. Determining Wet Surfaces from Dry. ICCV, Boston, 1995, 963-968.
[5] H. Jensen, J. Legakis, J. Dorsey. Rendering of Wet Materials. Rendering Techniques 99. Eds. D. Lischinski and G. Larson. Springer-Verlag, 1999, 273-282.
[6] J. Gu, C. Tu, R. Ramamoorthi, P. Belhumeur, W. Matusik and S. K. Nayar. Time-varying Surface Appearance: Acquisition, Modeling, and Rendering. ACM Trans. on Graphics (also Proc. of ACM SIGGRAPH), Jul, 2006, (25)3, 762-771.
[7] Y. Rubner, C. Tomasi, L. J. Guibas. A Metric for Distributions with Applications to Image Databases. Proceedings ICCV, 1998, 59-66.

Figure 5. Web images, top row is input, and second row is synthetic wetting.
Figure 6. Web images, top to bottom rows: input, synthetic wetting, and liquid albedo.

same-paper 2 0.74688989 346 iccv-2013-Rectangling Stereographic Projection for Wide-Angle Image Visualization

Author: Che-Han Chang, Min-Chun Hu, Wen-Huang Cheng, Yung-Yu Chuang

3 0.66150087 419 iccv-2013-To Aggregate or Not to aggregate: Selective Match Kernels for Image Search

Author: Giorgos Tolias, Yannis Avrithis, Hervé Jégou

Abstract: This paper considers a family of metrics to compare images based on their local descriptors. It encompasses the VLAD descriptor and matching techniques such as Hamming Embedding. Making the bridge between these approaches leads us to propose a match kernel that takes the best of existing techniques by combining an aggregation procedure with a selective match kernel. Finally, the representation underpinning this kernel is approximated, providing a large scale image search both precise and scalable, as shown by our experiments on several benchmarks.

4 0.64443702 245 iccv-2013-Learning a Dictionary of Shape Epitomes with Applications to Image Labeling

Author: Liang-Chieh Chen, George Papandreou, Alan L. Yuille

Abstract: The first main contribution of this paper is a novel method for representing images based on a dictionary of shape epitomes. These shape epitomes represent the local edge structure of the image and include hidden variables to encode shift and rotations. They are learnt in an unsupervised manner from groundtruth edges. This dictionary is compact but is also able to capture the typical shapes of edges in natural images. In this paper, we illustrate the shape epitomes by applying them to the image labeling task. In other work, described in the supplementary material, we apply them to edge detection and image modeling. We apply shape epitomes to image labeling by using Conditional Random Field (CRF) Models. They are alternatives to the superpixel or pixel representations used in most CRFs. In our approach, the shape of an image patch is encoded by a shape epitome from the dictionary. Unlike the superpixel representation, our method avoids making early decisions which cannot be reversed. Our resulting hierarchical CRFs efficiently capture both local and global class co-occurrence properties. We demonstrate the quantitative and qualitative properties of our approach with image labeling experiments on two standard datasets: MSRC-21 and Stanford Background.

5 0.60607743 66 iccv-2013-Building Part-Based Object Detectors via 3D Geometry

Author: Abhinav Shrivastava, Abhinav Gupta

Abstract: This paper proposes a novel part-based representation for modeling object categories. Our representation combines the effectiveness of deformable part-based models with the richness of geometric representation by defining parts based on consistent underlying 3D geometry. Our key hypothesis is that while the appearance and the arrangement of parts might vary across the instances of object categories, the constituent parts will still have consistent underlying 3D geometry. We propose to learn this geometry-driven deformable part-based model (gDPM) from a set of labeled RGBD images. We also demonstrate how the geometric representation of gDPM can help us leverage depth data during training and constrain the latent model learning problem. But most importantly, a joint geometric and appearance based representation not only allows us to achieve state-of-the-art results on object detection but also allows us to tackle the grand challenge of understanding 3D objects from 2D images.

6 0.59659177 444 iccv-2013-Viewing Real-World Faces in 3D

7 0.59312558 376 iccv-2013-Scene Text Localization and Recognition with Oriented Stroke Detection

8 0.5836066 137 iccv-2013-Efficient Salient Region Detection with Soft Image Abstraction

9 0.58238149 315 iccv-2013-PhotoOCR: Reading Text in Uncontrolled Conditions

10 0.57864296 275 iccv-2013-Motion-Aware KNN Laplacian for Video Matting

11 0.57751858 415 iccv-2013-Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors

12 0.5765788 269 iccv-2013-Modeling Occlusion by Discriminative AND-OR Structures

13 0.57642162 349 iccv-2013-Regionlets for Generic Object Detection

14 0.5759505 156 iccv-2013-Fast Direct Super-Resolution by Simple Functions

15 0.57556099 196 iccv-2013-Hierarchical Data-Driven Descent for Efficient Optimal Deformation Estimation

16 0.57454515 73 iccv-2013-Class-Specific Simplex-Latent Dirichlet Allocation for Image Classification

17 0.57397699 420 iccv-2013-Topology-Constrained Layered Tracking with Latent Flow

18 0.5739767 60 iccv-2013-Bayesian Robust Matrix Factorization for Image and Video Processing

19 0.57326734 389 iccv-2013-Shortest Paths with Curvature and Torsion

20 0.57283354 180 iccv-2013-From Where and How to What We See