iccv iccv2013 iccv2013-250 knowledge-graph by maker-knowledge-mining

250 iccv-2013-Lifting 3D Manhattan Lines from a Single Image

Source: pdf

Author: Srikumar Ramalingam, Matthew Brand

Abstract: We propose a novel and efficient method for reconstructing the 3D arrangement of lines extracted from a single image, using vanishing points, orthogonal structure, and an optimization procedure that considers all plausible connectivity constraints between lines. Line detection identifies a large number of salient lines that intersect or nearly intersect in an image, but relatively few of these apparent junctions correspond to real intersections in the 3D scene. We use linear programming (LP) to identify a minimal set of least-violated connectivity constraints that are sufficient to unambiguously reconstruct the 3D lines. In contrast to prior solutions that primarily focused on well-behaved synthetic line drawings with severely restricting assumptions, we develop an algorithm that can work on real images. The algorithm produces line reconstructions by identifying 95% correct connectivity constraints in the York Urban database, with a total computation time of 1 second per image.

Reference: text

Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 We propose a novel and efficient method for reconstructing the 3D arrangement of lines extracted from a single image, using vanishing points, orthogonal structure, and an optimization procedure that considers all plausible connectivity constraints between lines. [sent-2, score-0.79]

2 Line detection identifies a large number of salient lines that intersect or nearly intersect in an image, but relatively few of these apparent junctions correspond to real intersections in the 3D scene. [sent-3, score-1.264]

3 In contrast to prior solutions that primarily focused on well-behaved synthetic line drawings with severely restricting assumptions, we develop an algorithm that can work on real images. [sent-5, score-0.429]

4 The algorithm produces line reconstruction by identifying 95% correct connectivity constraints in York Urban database, with a total computation time of 1 second per image. [sent-6, score-0.532]

5 Such man-made structures predominantly consist of lines in three orthogonal directions. [sent-10, score-0.422]

6 It is easy to observe that several pairs of lines intersect. [sent-11, score-0.383]

7 However, this does not necessarily mean that the corresponding 3D lines in the world also intersect. [sent-12, score-0.464]

8 Lines that share a common vanishing point are a trivial counterexample; all appear to intersect at the vanishing point but none intersect in world space, where they are parallel. [sent-13, score-0.549]

9 Knowing which apparent intersections correspond to real 3D intersections would be highly informative about the scene’s 3D geometry. [sent-14, score-0.444]

10 In fact, by identifying a minimal set of such intersections we can completely reconstruct the 3D lines up to an unknown global scale. [sent-15, score-0.652]

11 Line detection algorithms in real images often miss important lines and produce spurious ones. [sent-18, score-0.475]

12 The detected Manhattan lines and their intersections are shown on the image. [sent-21, score-0.598]

13 In all the figures shown in this paper, red, green and blue lines denote the lines oriented along x, y and z directions respectively. [sent-22, score-0.766]

14 The pair of 3D lines corresponding to each one of these also intersect in 3D space. [sent-24, score-0.494]

15 We automatically detect such intersections and use them as connectivity constraints to lift the 2D lines to 3D space as shown in two different perspective views (c) and (d). [sent-25, score-0.94]

16 1(b) and use them as connectivity constraints to lift the 2D lines to 3D space. [sent-28, score-0.762]

17 We show two perspective views of the reconstructed 3D lines in Fig. [sent-29, score-0.383]

18 The most popular approach is based on labeling lines into convex, concave or occluding lines. [sent-37, score-0.576]

19 The line labeling problem is in general NP-hard and several challenging line drawings were studied and novel constraint satisfaction algorithms were developed to solve the single view reconstruction problem. [sent-38, score-0.823]

20 The algorithms were primarily tested on synthetic line drawings and ultimately proved too brittle for inputs derived from real images. [sent-39, score-0.429]

21 Example-based approaches have been used to reconstruct 3D scenes from line drawings [2]. [sent-59, score-0.444]

22 [15] used connectivity constraints for reconstructing lines from multiple images. [sent-61, score-0.628]

23 We use junctions for designing penalty terms in a linear programming (LP) formulation. [sent-63, score-0.44]

24 Contributions: The main contributions of this paper are as follows: • We build a graph that models connectivity constraints between nearby lines in real images. [sent-67, score-0.686]

25 We use LP to lift the line segments to 3D space using the Manhattan world assumption [3]. [sent-68, score-0.515]

26 We show automatic single view line reconstruction for challenging real images. [sent-71, score-0.359]

27 Lifting 2D lines to 3D: Early approaches to the interpretation of line drawings centered on labeling lines and detecting junctions [27, 29, 30]. [sent-73, score-1.542]

28 Based on the orientations of the projected line segments in the image, junctions can be classified into an “alphabet” {L, T, Y, W, X}. [sent-74, score-0.665]

29 The points A, B and C correspond to W junctions and the points D, E and F correspond to Y junctions. [sent-77, score-0.477]

30 It is straightforward to detect and classify junctions in synthetic line drawings. [sent-78, score-0.616]

31 Once the junctions are detected, the incident lines are labeled using four labels (+,−,←,→). [sent-79, score-0.383]

32 Although every line can take four possible labels, the adjacent junctions restrict certain labeling. [sent-83, score-0.656]

33 Based on this restriction, Huffman [14] and Clowes [1] produced catalogs that provide all possible labeling for lines adjoining a given junction. [sent-84, score-0.421]

34 Existing constraint satisfaction algorithms for the line labeling problem are generally successful on most synthetic line drawings, except a few pathological cases. [sent-85, score-0.569]

35 Given the labeling, Sugihara proposed a lifting procedure to decide whether a line drawing represents a physically realizable polyhedral object or not. [sent-86, score-0.488]

36 Such constraints can be written in the form of linear inequalities and if the linear program has a feasible solution, then the line drawing is physically realizable. [sent-91, score-0.47]

37 Whiteley extended Sugihara’s approach to general line drawings using combinatorial structures like matroids [31]. [sent-96, score-0.385]

38 Due to missing and spurious lines, we can not detect junctions with very high confidence. [sent-100, score-0.471]

39 Instead, evidence about junctions is used to inform the penalty terms in an LP to obtain a consistent 3D structure that “breaks” the smallest number of infeasible apparent junctions. [sent-104, score-0.535]

40 Right: This line drawing is physically not realizable because the edges joining the two triangles do not concur at a point. [sent-108, score-0.418]

41 It is convenient to work in the world coordinate system where every 3D line is aligned along one of the three orthogonal axes. [sent-113, score-0.383]

42 This rotation is used to orient the camera rays such that the 3D line segments we reconstruct are aligned with the world. [sent-117, score-0.4]

43 Constraint Graph: We consider a connectivity constraint between two line segments if the shortest distance between them is less than a threshold. [sent-125, score-0.44]

44 We consider two types of connectivity: • Two orthogonal lines can (be extended to) intersect at a point, which we refer to as the intersection. [sent-126, score-0.533]

45 Two collinear lines can (be extended to) meet at a point, which we refer to as incidence. [sent-127, score-0.507]

46 3, we can choose any point collinear to the lines as the incidence. [sent-129, score-0.507]

47 The intersections and incidences basically provide the necessary coupling constraints to lift the 2D lines to 3D space. [sent-130, score-0.877]

48 In addition to merging broken line segments, the incidence relationship can also connect two lines coming from two different objects that are collinear in the world. [sent-131, score-0.82]

49 This is not a problem because our LP formulation is designed to handle constraints that may not be true, by using penalty terms based on the junction features. [sent-134, score-0.406]

50 Linear Program: We encode the scene interpretation problem in an LP as follows: Let n be the number of lines in an image and let l_i, i ≤ n, denote the i-th line. [sent-137, score-0.442]

51 , n} represent the lines and the edges (i, j) ∈ E represent possible intersections or incidences between lines li and lj . [sent-141, score-1.076]

52 These lines lead to three intersections I12, I13 and I23. [sent-144, score-0.561]

53 Note that the intersection can happen between any two lines that belong to two different orientations. [sent-145, score-0.438]

54 Accordingly, the lines l4 and l2 can intersect if the shortest distance between them is less than a threshold. [sent-146, score-0.494]

55 (a) We consider four lines in the image space denoted by l1,l2,l3 and l4 that produce three intersection points I12, I13 and I23. [sent-151, score-0.48]

56 For a pair of nearby collinear lines, we use a point that is collinear to both the lines as a point of incidence I14. [sent-152, score-0.721]

57 (b) We build a constraint graph where all the lines in the image are the vertices and all the intersections and incidences are edges. [sent-153, score-0.749]

58 The projection rays for two image pixels pi and pj, lying on lines li and lj respectively, are also shown. [sent-156, score-0.557]

59 The distance between the two 3D lines is given by sij. [sent-161, score-0.383]

60 Accordingly, the 3D point λ_i d_i encodes the position of the 3D line corresponding to line l_i. [sent-167, score-0.446]

61 One can see that by varying the λi parameter we can slide the 3D line along the projection ray using one of its end points while maintaining the Manhattan direction Di. [sent-171, score-0.378]
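The sliding behaviour described in the sentence above can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' code: the function name `lift_segment` is hypothetical, rays are assumed to be given in the world frame already, and the second endpoint is recovered by intersecting its ray with the axis-aligned 3D line through the first endpoint.

```python
def lift_segment(d_start, d_end, axis, lam):
    """Lift a 2D segment to 3D for a given depth parameter `lam`.

    d_start, d_end: ray directions (3-tuples, world frame) through the
                    two image endpoints of the segment (assumed inputs).
    axis:           0, 1 or 2, the Manhattan direction of the line.
    lam:            depth parameter along the ray of the first endpoint.
    """
    # First endpoint: slide along its projection ray.
    p = tuple(lam * c for c in d_start)
    # The second endpoint lies on its own ray and on the 3D line
    # p + t * e_axis, so it agrees with p on the two other coordinates.
    others = [a for a in range(3) if a != axis]
    # Use the better-conditioned coordinate to solve mu * d_end[a] = p[a].
    a = max(others, key=lambda k: abs(d_end[k]))
    mu = p[a] / d_end[a]  # degenerate if the ray is parallel to the axis
    q = tuple(mu * c for c in d_end)
    return p, q


p, q = lift_segment((0.0, 0.0, 1.0), (0.5, 0.0, 1.0), axis=0, lam=2.0)
print(p, q)
```

Doubling `lam` doubles both endpoints, which matches the statement that the reconstruction is recovered only up to an unknown global scale.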

62 Every intersection or incidence relationship indicates that the two lines should intersect or be collinear. [sent-173, score-0.639]

63 The constraint λ_i ≥ 1 ensures that the line segments are at least unit distance from the camera and also in front of the camera. [sent-179, score-0.392]

64 4, minimizing the L0 norm of the slack variable sij is equivalent to depth-shifting and depth-bridging the smallest subset of 3D lines to make the scene graph connected and consistent in 3D. [sent-184, score-0.631]

65 (2) The weight parameters wij in the objective function present an opportunity to incorporate more evidence from the image, which is obtained from junction features and will be explained in Sec. [sent-188, score-0.362]

66 For every edge (i, j) the slack variables sij give us the minimum depth separation between line i and line j needed to obtain the lowest-cost globally consistent 3D interpretation of the scene. [sent-192, score-0.788]
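To make the role of the slack concrete, the following sketch computes the coordinate gap that keeps two axis-aligned 3D lines from meeting, given their depth parameters. It is derived from the axis-aligned geometry rather than quoted from the paper: the name `residual` and the argument layout are hypothetical, and the rays are assumed to pass through one pixel on each line. Coordinates along a line's own direction are free, so an intersection of two orthogonal lines constrains one coordinate and an incidence of two collinear lines constrains two.

```python
def residual(lam_i, d_i, axis_i, lam_j, d_j, axis_j):
    """Largest coordinate gap preventing lines i and j from meeting in 3D."""
    p_i = [lam_i * c for c in d_i]  # 3D point on line i
    p_j = [lam_j * c for c in d_j]  # 3D point on line j
    # A line extends freely along its own Manhattan direction, so only
    # the remaining coordinates have to agree.
    free = {axis_i, axis_j}
    constrained = [a for a in range(3) if a not in free]
    return max(abs(p_i[a] - p_j[a]) for a in constrained)


# Line i runs along x, line j along y; equal depths make them meet.
print(residual(2.0, (0.0, 0.0, 1.0), 0, 2.0, (0.5, 0.5, 1.0), 1))
# Pushing line j further away opens a gap along z.
print(residual(2.0, (0.0, 0.0, 1.0), 0, 3.0, (0.5, 0.5, 1.0), 1))
```

In an LP of the kind described here, a quantity like this would be bounded by a slack variable and penalised in the objective; the sketch only evaluates it for fixed depths.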

67 (a) Given an input image, we extract lines, compute vanishing points and cluster the lines according to three principal orientations. [sent-202, score-0.548]

68 In (b), we show the intersections of the Manhattan lines in the constraint graph . [sent-203, score-0.65]

69 These depth parameters are sufficient to lift the 2D lines to 3D space in the world coordinate system. [sent-207, score-0.676]

70 Since junctions play an important role in our work, we show Fig. [sent-214, score-0.393]

71 6 to illustrate the idea behind junctions and how to compute them. [sent-215, score-0.383]

72 These orientations are denoted by the set S = {→x, →y, →z, …}. Every subset A ⊆ S with |A| ≥ 2 denotes a junction, and one can compute a function F(A, p) for every possible subset A and pixel p. [sent-218, score-0.353]

73 The distribution of junctions depends on the scene geometry. [sent-226, score-0.393]

74 It is easy to observe the following behavior from junctions on Manhattan scenes. [sent-227, score-0.393]

75 • L and X junctions occur on planar surfaces. [sent-228, score-0.393]

76 T junctions occur on both planar surfaces and occluding boundaries. [sent-229, score-0.497]

77 Y and W junctions are common on convex and concave edges. [sent-230, score-0.444]

78 Depending on the presence of lines in each of these 6 regions, we can detect different junctions. [sent-236, score-0.383]

79 We show a Y junction at pixel p and a T junction at pixel q. [sent-237, score-0.626]

80 We did not give any preference to L junctions because T can sometimes be detected as L due to missing lines. [sent-240, score-0.46]

81 We considered intersection points between two line segments if the minimum distance between the line segments is less than 40 pixels. [sent-244, score-0.667]

82 The collinearity constraint is considered for lines that are separated by as much as 1/4 of the image. [sent-245, score-0.492]
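The proximity test behind the candidate connections can be sketched as below. This is a simplified illustration, not the authors' implementation: the helper names are hypothetical, only the 40-pixel distance test is shown, and the orientation labels and the separate collinearity threshold are deliberately left out.

```python
import math


def point_segment_distance(p, a, b):
    """Distance from point p to the segment a-b (all 2D tuples)."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    length_sq = dx * dx + dy * dy
    if length_sq == 0.0:
        return math.hypot(p[0] - a[0], p[1] - a[1])
    # Project p onto the segment and clamp to its extent.
    t = ((p[0] - a[0]) * dx + (p[1] - a[1]) * dy) / length_sq
    t = max(0.0, min(1.0, t))
    return math.hypot(p[0] - (a[0] + t * dx), p[1] - (a[1] + t * dy))


def segments_cross(a, b, c, d):
    """True if segments a-b and c-d properly cross each other."""
    def orient(p, q, r):
        return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    return (orient(a, b, c) * orient(a, b, d) < 0
            and orient(c, d, a) * orient(c, d, b) < 0)


def segment_distance(s, t):
    """Shortest distance between two 2D segments."""
    (a, b), (c, d) = s, t
    if segments_cross(a, b, c, d):
        return 0.0
    return min(point_segment_distance(a, c, d),
               point_segment_distance(b, c, d),
               point_segment_distance(c, a, b),
               point_segment_distance(d, a, b))


def candidate_edges(segments, threshold=40.0):
    """Index pairs of segments closer than `threshold` pixels."""
    return [(i, j)
            for i in range(len(segments))
            for j in range(i + 1, len(segments))
            if segment_distance(segments[i], segments[j]) < threshold]


segments = [((0, 0), (100, 0)), ((110, -50), (110, 50)), ((0, 200), (100, 200))]
print(candidate_edges(segments))
```

The pairwise loop is quadratic in the number of segments, which is acceptable for a sketch; a spatial index would be the usual choice for large line sets.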

83 After the line detection, we find candidate connections between lines by finding intersections and incidences. [sent-246, score-0.784]

84 We then solve the LP and compute its MST to find the largest connected component in the constraint graph and reconstruct only the lines in this component. [sent-247, score-0.57]

85 Based on the detected junctions we used Ch = 10 and Cm = 5. [sent-255, score-0.43]

86 We identify a minimum number of edges (constraints) to unambiguously lift the lines to 3D space using a minimum spanning tree (MST) algorithm. [sent-259, score-0.757]
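The edge-selection step can be illustrated with Kruskal's algorithm. The sketch below is a generic spanning-forest routine, not the authors' code: the function name is hypothetical, and the edge weights are assumed to be some per-constraint cost (for example, a slack value returned by the LP), which the source does not spell out here.

```python
def minimum_spanning_forest(n, weighted_edges):
    """Kruskal's algorithm over n lines.

    weighted_edges: iterable of (weight, i, j) tuples, one per candidate
                    constraint between lines i and j (weights are assumed).
    Returns the chosen (i, j) pairs, at most n - 1 of them.
    """
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    chosen = []
    for _, i, j in sorted(weighted_edges):
        root_i, root_j = find(i), find(j)
        if root_i != root_j:  # the edge joins two components, keep it
            parent[root_i] = root_j
            chosen.append((i, j))
    return chosen


edges = [(0.0, 0, 1), (0.0, 1, 2), (0.5, 0, 2), (0.1, 2, 3)]
print(minimum_spanning_forest(5, edges))
```

In this toy input, line 4 has no candidate constraint and stays disconnected, so only the component containing lines 0 to 3 would be reconstructed.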

87 Recall that our goal is to identify a maximal set of pairwise constraints that spans the detected lines and supports a 3D reconstruction without error (ignoring small VP, line, and calibration errors). [sent-265, score-0.596]

88 Quantitative Evaluation: Using the minimum spanning tree (MST) algorithm, we identify a minimum number of constraints to unambiguously lift the lines to 3D space. [sent-283, score-0.813]

89 Otherwise, our problem formulation is also general and applies to any connected set of lines and these could come from non-planar regions like lampposts or even cables running between buildings. [sent-292, score-0.422]

90 The first column displays the original images with the Manhattan lines marked. [sent-299, score-0.436]

91 In the first row, missing lines along the z direction led to the collapse of the two parallel planes to a single plane. [sent-305, score-0.44]

92 In the second row, spurious lines from the reflections on the shiny floor are mistaken for vertical structure in the scene. [sent-306, score-0.431]

93 In the third row, missing horizontal lines on the background building deprive the LP of evidence that it is a separate structure. [sent-307, score-0.464]

94 by considering all plausible connectivity constraints between lines and try to find a consistent interpretation of the scene. [sent-309, score-0.659]

95 Note that the line reconstruction recovers the second door behind the right wall. [sent-334, score-0.343]

96 Automatic single-image 3d reconstructions of indoor manhattan world scenes. [sent-367, score-0.443]

97 Efficient edge-based methods for estimating manhattan frames in urban imagery. [sent-373, score-0.339]

98 Exploiting global connectivity constraints for reconstruction of 3d line segment from images. [sent-435, score-0.532]

99 Manhattan junction catalogue for spatial reasoning of indoor scenes. [sent-474, score-0.365]

100 Generating semantic descriptions from line drawings of scenes with shadows. [sent-517, score-0.385]

similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('junctions', 0.393), ('lines', 0.383), ('manhattan', 0.298), ('junction', 0.275), ('line', 0.223), ('intersections', 0.178), ('lp', 0.17), ('drawings', 0.162), ('lift', 0.162), ('connectivity', 0.133), ('mst', 0.127), ('collinear', 0.124), ('vanishing', 0.123), ('intersect', 0.111), ('sij', 0.108), ('occluding', 0.104), ('reconstruction', 0.092), ('incidence', 0.09), ('constraints', 0.084), ('world', 0.081), ('incidences', 0.07), ('sugihara', 0.07), ('rays', 0.069), ('indoor', 0.064), ('lifting', 0.064), ('drawing', 0.063), ('collinearity', 0.062), ('interpretation', 0.059), ('reconstruct', 0.059), ('slack', 0.059), ('realizable', 0.057), ('intersection', 0.055), ('unambiguously', 0.054), ('dj', 0.053), ('displays', 0.053), ('inequalities', 0.053), ('concave', 0.051), ('evidence', 0.051), ('layout', 0.051), ('depth', 0.05), ('spanning', 0.05), ('segments', 0.049), ('di', 0.049), ('spurious', 0.048), ('penalty', 0.047), ('physically', 0.047), ('delage', 0.047), ('idia', 0.047), ('jdja', 0.047), ('vpx', 0.047), ('vpy', 0.047), ('constraint', 0.047), ('geometrical', 0.045), ('apparent', 0.044), ('projection', 0.044), ('real', 0.044), ('graph', 0.042), ('points', 0.042), ('ray', 0.041), ('urban', 0.041), ('pillai', 0.041), ('vrml', 0.041), ('every', 0.04), ('connected', 0.039), ('orthogonal', 0.039), ('satisfaction', 0.038), ('jain', 0.038), ('labeling', 0.038), ('pixel', 0.038), ('detected', 0.037), ('camera', 0.036), ('wij', 0.036), ('brand', 0.036), ('efros', 0.036), ('hoiem', 0.036), ('polyhedral', 0.034), ('idi', 0.034), ('lj', 0.034), ('planarity', 0.032), ('ramalingam', 0.032), ('unknown', 0.032), ('schwing', 0.031), ('incorrect', 0.03), ('missing', 0.03), ('york', 0.029), ('saxena', 0.029), ('svr', 0.029), ('vertices', 0.029), ('edges', 0.028), ('door', 0.028), ('slide', 0.028), ('mit', 0.028), ('reconstructing', 0.028), ('hedau', 0.028), ('tree', 0.028), ('geometry', 0.027), ('led', 0.027), ('pi', 0.027), ('minimum', 0.026), ('reasoning', 0.026)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 1.0000006 250 iccv-2013-Lifting 3D Manhattan Lines from a Single Image

Author: Srikumar Ramalingam, Matthew Brand

2 0.19280016 84 iccv-2013-Complex 3D General Object Reconstruction from Line Drawings

Author: Linjie Yang, Jianzhuang Liu, Xiaoou Tang

Abstract: An important topic in computer vision is 3D object reconstruction from line drawings. Previous algorithms either deal with simple general objects or are limited to only manifolds (a subset of solids). In this paper, we propose a novel approach to 3D reconstruction of complex general objects, including manifolds, non-manifold solids, and non-solids. Through developing some 3D object properties, we use the degree of freedom of objects to decompose a complex line drawing into multiple simpler line drawings that represent meaningful building blocks of a complex object. After 3D objects are reconstructed from the decomposed line drawings, they are merged to form a complex object from their touching faces, edges, and vertices. Our experiments show a number of reconstruction examples from both complex line drawings and images with line drawings superimposed. Comparisons are also given to indicate that our algorithm can deal with much more complex line drawings of general objects than previous algorithms. 1. Introduction and Related Work A 2D line drawing is the most straightforward way of illustrating a 3D object. Given a line drawing representing a 3D object, our visual system can understand the 3D structure easily. For example, we can interpret without difficulty the line drawing shown in Fig. 1(a) as a castle with four walls and one door. Imitating this ability has been a longstanding and challenging topic in computer vision when a line drawing is as complex as this example. The applications of this work include 3D object design in CAD and for 3D printers, 3D query generation for 3D object retrieval, and 3D modeling from images. In this paper, as in the majority of related work, a line drawing is defined as the orthogonal projection of the edges and vertices of a 3D object in a generic view, and objects with planar surfaces are considered. Fig. 1. (a) A line drawing representing a castle. (b) The 3D model of the line drawing.
A line drawing is represented by an edge-vertex graph. It can be obtained by the user/designer who draws on the screen with a tablet pen, a mouse, or a finger (on a touch sensitive screen), with all, with some, or without hidden edges and vertices. Line labeling is the earliest work to interpret line drawings [1], [17]. It searches for a set of consistent labels such as convex, concave, and occluding from a line drawing to test its correctness and/or realizability. Line labeling itself cannot recover 3D shape from a line drawing. Later, 3D reconstruction from the contours (line drawings) of objects in images is studied [19], [14], [13], which handles simple objects only. Model-based 3D reconstruction [2], [3], [20] can deal with more complex objects, but these methods require pre-defining a set of parametric models. Recently, popular methods of 3D reconstruction from line drawings are optimization based, which are most related to our work and are reviewed next. These methods can be classified into two categories: one dealing with manifolds and the other dealing with general objects. A general object can be a manifold, non-manifold solid, or non-solid. Manifolds are a subset of solids, defined as follows: A manifold, or more rigorously 2D manifold, is a solid where every point on its surface has a neighborhood topologically equivalent to an open disk in the 2D Euclidean space. In this paper, a solid is a portion of 3D space bounded by planar faces, and a manifold is also bounded by planar faces. Each edge of such manifolds is shared exactly by two faces [4]. Most 3D reconstruction methods from a line drawing assume that the face topology of the line drawing is known in advance. This information can reduce the reconstruction complexity greatly. Algorithms have been developed to find faces from a line drawing in [16], [10], and [9], where [16] and [10] are for general objects and [9] for manifolds.
Optimization-based 3D reconstruction depends on some criteria (also called image regularities) that simulate our visual perception. Marill proposes a very simple but effective criterion to reconstruct a simple object: minimizing the standard deviation of the angles (MSDA) in the object [11]. Later, other regularities are proposed to deal with more complex objects such as face planarity, line parallelism, isometry, and corner orthogonality [5], [6], [15], [18]. In these methods, an objective function $\Phi(z_1, z_2, \ldots, z_{N_v}) = \sum_{i=1}^{C} \omega_i \phi_i(z_1, z_2, \ldots, z_{N_v})$ (1) is minimized to derive the depths z1, z2, ..., zNv, where Nv is the number of vertices in the line drawing, φi, i = 1, 2, ..., C, are the regularities, and ωi, i = 1, 2, ..., C, are the weights. The main problem in this approach is that these algorithms easily get trapped in local minima (obtaining failed results) when a line drawing is complex with many vertices, due to the search in a high-dimensional space (Nv dimensions) with the non-convex objective function. For example, the search space is of 56 dimensions for the object in Fig. 1(a). To alleviate this problem, Liu et al. formulate 3D reconstruction in a lower dimensional space so that the optimization procedure has a better chance to find desired results [7]. For the complex object in Fig. 1(a), however, the search in a space with 18 dimensions is still too difficult for it to obtain a satisfactory 3D object (see Section 3). The methods in [5], [6], [15], [18], and [7] reconstruct general objects, and the one in [7] can deal with more complex objects than the other four. But these algorithms cannot avoid the local minimum problem in a high dimensional search space when a line drawing is complex. In [8], a divide-and-conquer (D&C) strategy is used to tackle this problem.
It first separates a complex line drawing into multiple simpler ones, then independently recovers the 3D objects from these line drawings, and finally merges them to form a complete object. Since the separated line drawings are much simpler than the original one, the 3D reconstruction from each of them is an easy task. This D&C approach handles manifolds only. Based on known faces found by the face identification algorithm in [9], it uses manifold properties to find internal faces from a line drawing and then separates the line drawing from the internal faces. Fig. 2. (a) A simple manifold with nine faces and one internal face (a, b, c, d). (b) Decomposition result from the internal face. Fig. 3. (a) A non-manifold solid. (b) Expected decomposition of (a). (c) A sheet object. (d) Expected decomposition of (c). An internal face is defined as an imaginary face lying inside a manifold with only its edges visible on the surface [8]. It is not a real face but can be considered as two coincident real faces of identical shape belonging to two manifolds. For example, Fig. 2(a) shows a manifold with nine faces. The D&C first finds the internal face (a, b, c, d) and then decomposes the line drawing from this internal face (Fig. 2(b)). However, handling manifolds only limits the applications of [8]. In many applications in computer vision and graphics such as 3D object matching, retrieval, and rendering, it is unnecessary to represent objects as manifolds, in order to facilitate data processing and reduce data storage. For example, a flat ground can be represented by a sheet (one face), but if it is represented by a manifold, a thin box with six faces has to be used. Fig. 1(a), Fig. 3(a), and Fig. 3(c) are line drawings of three non-manifolds. In this paper, we propose a novel approach to 3D reconstruction of complex general objects based on visual perception, object properties, and new line drawing decomposition.
Compared with previous methods, ours can deal with much more complex line drawings of general objects. It can handle not only manifolds but also non-manifold solids and non-solids, and is insensitive to sketching errors. 2. General Object Reconstruction We also use the D&C strategy to deal with 3D reconstruction from a line drawing representing a complex general object. The key is how to decompose a complex line drawing of any object into multiple simpler line drawings. These decomposed line drawings should represent objects that are in accordance with our visual perception, which makes the 3D reconstruction from these line drawings easier and better because the regularities used to build an objective function for reconstruction follow human perception of common objects [11], [5], [6], [15], [18]. Before the decomposition of a line drawing, we assume that all the real and internal faces of the object have been obtained from the line drawing using a face identification algorithm. For example, the algorithm in [10] finds 10 faces from the line drawing in Fig. 2(a) (including the internal face), and obtains 12 and 7 faces from the line drawings in Figs. 3(a) and (c), respectively. 2.1. Decomposing line drawings of solids In this subsection, we consider the line drawings of solids first. The decomposition method will be extended to the line drawings of general objects in the next subsection. It is not difficult to see that in general, a complex object, especially a manmade complex object, can be considered as the combination of multiple smaller objects. The most common combination is the touch of two faces from two different objects such as the one in Fig. 2. Other combinations are the touches among lines, faces, and vertices. Our target is to decompose a complex solid into multiple primitive solids. Before the definition of a primitive solid, we introduce a term called the degree of freedom of a solid. Definition 1.
The degree of freedom (DoF) of a 3D solid represented by a line drawing is the minimal number of z-coordinates that can uniquely determine this 3D solid. This is the first time that the concept of DoF is used to decompose line drawings. Now let us consider a simple object in Fig. 4(a). The cube has six faces: (v1, v2, v3, v4), (v1, v2, v6, v5), (v1, v4, v8, v5), (v2, v3, v7, v6), (v4, v3, v7, v8), and (v6, v7, v8, v5). We can show that the cube is determined if the z-coordinates of its four non-coplanar vertices are known. Without loss of generality, suppose z1, z2, z4, and z5 are known. Since the 3D coordinates of v1, v2, and v4 are fixed (recall that the x- and y-coordinates of all the vertices are known under the orthogonal projection), the 3D plane passing through the face (v1, v2, v3, v4) is determined, and thus z3 can be calculated. Similarly, z6 and z8 can be obtained. Finally, z7 can be computed with the 3D coordinates of v3, v4, and v8 known, which determine the plane passing through the face (v4, v3, v7, v8). So the 3D cube can be determined by the known four z-coordinates, z1, z2, z4, and z5. Further, it can be verified that three 3D vertices cannot determine this object uniquely because they can only define one face in 3D space. Therefore, the DoF of the cube is 4. Similar analysis allows us to know that the solids in Fig. 2(b), Fig. 3(b), and Fig. 4(b) all have DoF 4, while the two solids in Fig. 2(a) and Fig. 3(a) have DoFs 5 and 6, respectively. From these analyses, we can have the intuition that solids with DoF 4 serve as the building blocks of more complex solids whose DoFs are more than 4. Besides, we have the following property: Property 1. There is no solid with DoF less than 4. Fig. 4. (a) A cube whose DoF is 4. (b) Another solid whose DoF is also 4. (c) Part of a line drawing with each vertex of degree 3. This property is easy to verify. A solid with fewest faces is a tetrahedron.
Every two of its four faces are not co-planar. Three 3D vertices of a tetrahedron can only determine one 3D face. Next, we define primitive solids. Definition 2. A 3D solid represented by a line drawing is called a primitive solid if its DoF is 4. Property 2. If every vertex of a 3D solid represented by a line drawing has degree 3 (see footnote 1), then it is a primitive solid. Proof. Let part of such a line drawing be the one as shown in Fig. 4(c). At each vertex, every two of the three edges form a face, because a solid is bounded by faces without dangling faces and edges. Let the three paths fA, fB, and fC in Fig. 4(c) denote the three faces at vertex a. Without loss of generality, suppose that the four z-coordinates (and thus the four 3D coordinates) of vertices a, b, c, and d are known. Then the three planes passing through fA, fB, and fC are determined in 3D space. With the two known 3D planes passing through fA and fB at vertex b, the 3D coordinates of vertices g and h connected to b can be computed. Similarly, the 3D coordinates of vertices e and f connected to d and the 3D coordinates of vertices i and j connected to c can be obtained. Furthermore, all the 3D coordinates of the other vertices connected to e, f, g, h, i, and j can be derived in the same way. This derivation can propagate to all the vertices of this solid. Therefore, the DoF of this solid is 4 and it is a primitive solid. Property 3. The DoF of a solid is 5 which is obtained by gluing two faces of two primitive solids. Proof. Let the two primitive solids be PS1 and PS2 and their corresponding gluing faces be f1 and f2, respectively. The DoFs of PS1 and PS2 are both 4. Suppose that PS1 is determined in 3D space, which requires four z-coordinates. Then f1 and f2 are also determined in 3D space. When the z-coordinates of three vertices on PS2 are known based on f2, one more z-coordinate of a vertex not coplanar with f2 on PS2 can determine PS2 in 3D space. Therefore, the DoF of the combined solid is 5. Fig.
2 is a typical example of two primitive solids gluing together along faces. Fig. 3(a) is an example of two primitive solids gluing together along edges. Two primitive solids may also connect at one vertex. The following property is easy to verify. 1The degree of a vertex is the number of edges connected to this vertex. 1435 Property 4. The DoF of a solid is 6 which is obtained by gluing two edges of two primitive solids. The DoF of a solid is 7 which is obtained by gluing two vertices oftwoprimitive solids. From the above properties, we can see that primitive solids are indeed the “smallest” solids in terms of DoF and they can serve as the building blocks to construct more complex solids. Therefore, our next target is to decompose a line drawing representing a complex solid into multiple line drawings representing primitive solids. Before giving Definition 3, we define some terms first. Vertex set of a face. The vertex set V er(f) of a face f is the set of all the vertices of f. Fixed vertex. A fixed vertex is one with its z-coordinate (thus its 3D coordinate) known. Unfixed vertex. An unfixed vertex is one with its zcoordinate unknown. Fixed face. A fixed face is one with its 3D position determined by its three fixed vertices. Unfixed face. An unfixed face is one with its 3D position undetermined. Definition 3. Let the vertex set and the face set of a line drawing be V = {v1, v2 , ..., vn} and F = {f1, f2 , ..., fm}, respectively, w =he {rve n and m are Fthe = n{fumbers of th}e, vertices and the faces, respectively. Also let Vfixed, Ffixed, Vunfixed, and Funfixed be the sets of fixed vertices, fixed faces, unfixed vertices, and unfixed faces, respectively. Suppose that an initial set of two fixed neighboring faces sharing an edge is Finitial with all their fixed vertices in Vinitial. The final Ffixed in Algorithm 1 is called the maximum extended face set (MEFS) from Finitial. 
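The notion of a fixed face can be made concrete with a small worked example. The sketch below is only an illustration, not code from the paper: it fits the plane through three fixed, non-collinear vertices and evaluates it at the image position of a fourth vertex of the same face, which is the step used to obtain z3 for the cube. The function name `z_on_fixed_face` and the sample coordinates are hypothetical.

```python
def z_on_fixed_face(p1, p2, p3, xy):
    """Return the z-coordinate at image point xy on the plane through p1, p2, p3.

    p1, p2, p3 are (x, y, z) tuples of fixed vertices; xy is the (x, y) image
    position of an unfixed vertex assumed to lie on the same planar face.
    Raises ValueError when the three fixed vertices are collinear in the image.
    """
    (x1, y1, z1), (x2, y2, z2), (x3, y3, z3) = p1, p2, p3
    # Twice the signed area of the image triangle; zero means collinear points.
    det = (x2 - x1) * (y3 - y1) - (x3 - x1) * (y2 - y1)
    if det == 0:
        raise ValueError("the three fixed vertices are collinear")
    # Plane written as z - z1 = alpha*(x - x1) + beta*(y - y1), by Cramer's rule.
    alpha = ((z2 - z1) * (y3 - y1) - (z3 - z1) * (y2 - y1)) / det
    beta = ((x2 - x1) * (z3 - z1) - (x3 - x1) * (z2 - z1)) / det
    x, y = xy
    return z1 + alpha * (x - x1) + beta * (y - y1)


# Hypothetical face: three fixed vertices and the image position of the fourth.
v1, v2, v4 = (0, 0, 0), (2, 0, 1), (0, 2, 3)
z3 = z_on_fixed_face(v1, v2, v4, (2, 2))
```

Here the face is a parallelogram, so the fourth vertex equals v2 + v4 - v1, which gives an easy way to check the returned value.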
In Algorithm 1, a face f that satisfies the condition in step 3 is a face that has been determined in 3D space by the current fixed vertices in Ffixed. When this face is found, it becomes a fixed face and all its vertices become fixed vertices. The DoF of the initial two fixed faces combined is 4. It is not difficult to see that the algorithm does not increase the initial DoF, and thus the final object represented by the MEFS also has DoF 4. Next, let us consider a simple example shown in Fig. 2(a) with the following three cases:

Case 1. Suppose that Finitial = {(e, f, g, h), (e, f, b, a)}, Vinitial = {e, f, g, h, b, a}, and the algorithm adds the faces into Ffixed in this order: (f, g, c, b) → (a, b, c, d) → (e, h, d, a) → (g, h, d, c). Then the final object found by the algorithm is the cube. Note that the algorithm does not add any triangular faces into Ffixed because they do not satisfy the condition in step 3.

Case 2. If Finitial = {(b, i, a), (b, i, c)}, then the final object found is the pyramid, and the algorithm does not add any rectangular faces except (a, b, c, d) into Ffixed.

Case 3. If Finitial = {(b, a, i), (e, f, b, a)}, the algorithm cannot find any other faces to add to Ffixed. Thus, it fails to find the cube or pyramid.

Algorithm 1 Face extending procedure
Initialization: Funfixed = F \ Finitial, Ffixed = Finitial, Vfixed = Vinitial, Vunfixed = V \ Vinitial.
1. Do the following steps until no face satisfies the condition in step 3:
2.   Find a face f ∈ Funfixed that satisfies
3.   the number of non-collinear vertices in Ver(f) ∩ Vfixed is more than 2;
4.   Add face f into Ffixed and delete it from Funfixed;
5.   For each vertex v ∈ Ver(f), if v ∈ Vunfixed, add v into Vfixed and delete it from Vunfixed;
Return the final Ffixed.
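The face extending procedure can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' C++ implementation: faces are tuples of vertex labels, the condition in step 3 is read as "some triple of already fixed vertices of the face is non-collinear", and collinearity is tested on the 2D image coordinates (which assumes that no face is seen edge-on). The function names and the coordinates in `xy` are hypothetical; the face lists follow the cube-plus-pyramid example of Cases 1 to 3.

```python
from itertools import combinations


def collinear(p, q, r, eps=1e-9):
    """True if three 2D image points lie on one line."""
    cross = (q[0] - p[0]) * (r[1] - p[1]) - (r[0] - p[0]) * (q[1] - p[1])
    return abs(cross) <= eps


def satisfies_step3(face, fixed_vertices, xy):
    """Step 3: the face has at least three fixed, non-collinear vertices."""
    known = [v for v in face if v in fixed_vertices]
    return any(not collinear(xy[a], xy[b], xy[c])
               for a, b, c in combinations(known, 3))


def extend_faces(faces, initial, xy):
    """Return the final fixed face set grown from the faces in `initial`."""
    fixed_faces = list(initial)
    unfixed = [f for f in faces if f not in initial]
    fixed_vertices = {v for f in initial for v in f}
    found = True
    while found:                      # step 1: repeat until nothing qualifies
        found = False
        for f in unfixed:             # step 2: look for a qualifying face
            if satisfies_step3(f, fixed_vertices, xy):
                fixed_faces.append(f)         # step 4
                unfixed.remove(f)
                fixed_vertices.update(f)      # step 5
                found = True
                break
    return fixed_faces


# Hypothetical image coordinates: a cube with a pyramid glued on (a, b, c, d).
xy = {"e": (0, 0), "f": (4, 0), "g": (6, 2), "h": (2, 2),
      "a": (0, 4), "b": (4, 4), "c": (6, 6), "d": (2, 6), "i": (3, 9)}
cube = [("e", "f", "g", "h"), ("e", "f", "b", "a"), ("f", "g", "c", "b"),
        ("a", "b", "c", "d"), ("e", "h", "d", "a"), ("g", "h", "d", "c")]
triangles = [("b", "i", "a"), ("b", "i", "c"), ("c", "i", "d"), ("d", "i", "a")]
faces = cube + triangles

case1 = extend_faces(faces, [cube[0], cube[1]], xy)            # two cube faces
case2 = extend_faces(faces, [triangles[0], triangles[1]], xy)  # two pyramid faces
case3 = extend_faces(faces, [triangles[0], cube[1]], xy)       # one of each
```

With these inputs, `case1` contains the six cube faces, `case2` the four triangles plus (a, b, c, d), and `case3` only the two initial faces, mirroring the three cases discussed in the text.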
Fig. 5. (a) A complex line drawing of a non-manifold solid. (b) The decomposition result by our algorithm.

In Case 3, the object represented by the MEFS has only two initial faces and this object is discarded. In order not to miss a primitive solid, we run Algorithm 1 multiple times, each with a different pair of neighboring faces in Finitial. Then, we can always have Finitial with its two faces from one primitive solid. For the object in Fig. 2(a), we can always find the cube and the pyramid. Note that the same primitive solid may be found multiple times from different Finitial, and finally we keep only one copy of each different object (cube and pyramid in this example).

When a complex solid is formed by more than two primitive solids, Algorithm 1 can still be used to find the primitive solids, which is the decomposition result of the complex line drawing. More complex examples are given in Section 3. Besides, Algorithm 1 can also deal with complex solids formed by gluing primitive solids between edges and vertices. Fig. 5(a) is a solid constructed by gluing eight primitive solids between faces, edges, and vertices. Running Algorithm 1 multiple times with different pairs of neighboring faces in Finitial generates the primitive solids as shown in Fig. 5(b).

2.2. Decomposing line drawings of general objects

A general object can be a manifold, non-manifold solid, or non-solid. Given a line drawing representing a general object, it is unknown whether this object consists of only primitive solids. However, we can always apply Algorithm 1 to the line drawing multiple times, each with a different pair of neighboring faces in Finitial, generating a set SMEFS of MEFSs (recall that an MEFS with only two initial neighboring faces is discarded). In what follows, we also call an MEFS an object, which is represented by the MEFS. Note that an MEFS generated from a general line drawing may not be a primitive solid, but its DoF must be 4. Objects of DoF 4 have relatively simple structures and are easy to reconstruct. A number of decomposition examples of complex general line drawings can be seen in the experimental section.

Fig. 6. Illustration of our decomposition method. (a) A line drawing. (b) The set of MEFSs from (a). (c) The weighted object-coexistence graph where the maximum weight clique is shown in bold. (d) The decomposition of (a).

One issue with this decomposition method is that two different MEFSs may share many faces. For example, from the line drawing in Fig. 6(a), all different MEFSs found by running Algorithm 1 multiple times are shown in Fig. 6(b), where Obj 1 and Obj 5 share four faces, and so do Obj 2 and Obj 6. Obviously, Obj 5 and Obj 6 are not necessary. Next we define object coexistence and a rule to choose objects.

Definition 4. Two objects are called coexistent if they share no face or share only coplanar faces.

Rule 1. Choose a subset of SMEFS such that in the subset, all the objects are coexistent and the number of total faces is maximized.

From Definition 4, Obj 1 and Obj 5 are not coexistent in Fig. 6, and neither are Obj 2 and Obj 6. If Obj 5 and Obj 6 are kept with Obj 1 and Obj 2 discarded, many faces in the original object will be missing. Rule 1 guarantees that Obj 1 and Obj 2 are kept but not Obj 5 and Obj 6.

Algorithm 2 Decomposition of a general line drawing
Input: A line drawing G = (V, E, F).
Initialization: SMEFS = ∅, SMWC = ∅.
1. for each pair of neighboring faces {fa, fb} in F do
2.   Call Algorithm 1 with Finitial = {fa, fb} and Vinitial = Ver(fa) ∪ Ver(fb);
3.   if the returned Ffixed from Algorithm 1 contains more than two faces do
4.     SMEFS ← SMEFS ∪ {Ffixed};
5. Construct the object-coexistence graph Gobj with SMEFS;
6. SMWC ← the maximum weight clique found from Gobj;
7. for each face f not contained in SMWC do
8.   Attach f to the object in SMWC that contains the maximum number of the vertices of f;
Return SMWC.

Fig. 7. (a) A sheet object with 23 faces. (b) Decomposition result by Algorithm 2 with the modification in Algorithm 1.

We formulate Rule 1 as a maximum weight clique problem (MWCP), which is to find a clique of the maximum weight from a weighted graph (a clique is a subgraph of a graph such that every two vertices in the subgraph are connected by an edge). First, we construct a weighted graph, called the object-coexistence graph, in which a vertex denotes an object in SMEFS and there is an edge connecting two vertices if the two objects represented by the two vertices are coexistent. Besides, each vertex is assigned a weight equal to the number of the faces of the corresponding object. The MWCP is a well-known NP-hard problem. In our application, however, solving this problem is fast enough since an object-coexistence graph usually has fewer than 20 objects (vertices). We use the algorithm in [12] to deal with this problem.

Fig. 6(c) is the object-coexistence graph constructed from the six objects in Fig. 6(b), where the weights of the vertices are denoted by the numbers in the parentheses. The maximum weight clique is shown in bold. From Fig. 6, we see that the face (14, 13, 26, 25) is not contained in SMWC, which is used to store the objects in the maximum weight clique. This face is finally attached to Obj 3. In general, each of the faces not in SMWC is attached to an object that contains the maximum number of the vertices of this face. If there are two or more objects that contain the same number of the vertices of this face, this face is assigned to any of them.

Algorithm 2 shows the complete algorithm to decompose a general line drawing. Steps 7 and 8 attach the faces not in SMWC to some objects in SMWC.
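Steps 5 to 8 of Algorithm 2 can be sketched as follows. This is a hedged illustration only. The paper solves the MWCP with the algorithm in [12], whereas the code below uses a plain branch-and-bound search that is adequate for the small graphs mentioned above. The coexistence test treats two objects as coexistent when they share no face, and delegates the "share only coplanar faces" case to a caller-supplied predicate `coplanar`, which is an assumption of this sketch. All function names and the toy data are hypothetical.

```python
from itertools import combinations


def coexistence_graph(objects, coplanar=lambda shared: False):
    """Build vertex weights and adjacency sets for the object-coexistence graph."""
    n = len(objects)
    adj = [set() for _ in range(n)]
    for i, j in combinations(range(n), 2):
        shared = set(objects[i]) & set(objects[j])
        if not shared or coplanar(shared):   # Definition 4
            adj[i].add(j)
            adj[j].add(i)
    weights = [len(o) for o in objects]      # weight = number of faces
    return weights, adj


def max_weight_clique(weights, adj):
    """Branch-and-bound search for a maximum weight clique (not the method of [12])."""
    best = {"w": 0, "clique": []}

    def grow(clique, weight, candidates):
        if weight > best["w"]:
            best["w"], best["clique"] = weight, list(clique)
        # Bound: even adding every remaining candidate cannot beat the best.
        if weight + sum(weights[v] for v in candidates) <= best["w"]:
            return
        for k, v in enumerate(candidates):
            grow(clique + [v], weight + weights[v],
                 [u for u in candidates[k + 1:] if u in adj[v]])

    grow([], 0, list(range(len(weights))))
    return best["clique"]


def attach_leftover_faces(all_faces, clique_objects):
    """Steps 7-8: give each uncovered face to the object sharing most vertices."""
    objects = [list(o) for o in clique_objects]
    if not objects:
        return objects
    covered = {f for o in objects for f in o}
    vertex_sets = [{v for f in o for v in f} for o in objects]
    for f in all_faces:
        if f not in covered:
            k = max(range(len(objects)),
                    key=lambda i: len(vertex_sets[i] & set(f)))
            objects[k].append(f)             # ties go to the first such object
    return objects


# Toy data: faces are plain labels here; objects 0 and 2 overlap, as do 2 and 3.
toy_objects = [{1, 2, 3, 4, 5}, {6, 7, 8, 9}, {2, 3, 4, 10}, {10, 11, 12}]
toy_weights, toy_adj = coexistence_graph(toy_objects)
toy_clique = max_weight_clique(toy_weights, toy_adj)

# Toy attachment: the leftover face shares two vertices with the first object.
kept = [[("a", "b", "c"), ("a", "c", "d")], [("e", "f", "g")]]
merged = attach_leftover_faces(kept[0] + kept[1] + [("c", "d", "e")], kept)
```

An exhaustive search like this is exponential in the worst case, so it is only a stand-in; for larger graphs a dedicated solver such as the one cited in the text would be the natural choice.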
A common complex object usually consists of primitive solids and sheets, and Algorithm 2 works well for the decomposition of most complex line drawings. However, there are still some line drawings the algorithm cannot deal with. Such an example is shown in Fig. 7(a), which is a sheet object with 23 faces. In Algorithm 1, with any pair of initial neighboring faces, no other face satisfies the condition in step 3, thus no object of DoF 4 will be found.

The following scheme can solve this problem. Given a line drawing, steps 1–6 in Algorithm 2 are used to decompose it into multiple objects of DoF 4. If there are separate groups of faces not in SMWC, where the faces in each group are connected, then attach the groups each with fewer than four faces to some objects in SMWC (footnote 3); the attachment method is similar to steps 7 and 8 in Algorithm 2. For a group with four or more connected faces, Algorithm 2 is applied to it with a minor modification in Algorithm 1. The modification is to set Finitial to contain three connected faces whose combined DoF is 5. This modification allows the search of objects of DoF 5. Suppose the object in Fig. 7(a) is such a group. Applying Algorithm 2 to it with the minor modification generates the decomposition result as shown in Fig. 7(b).

2.3. 3D Reconstruction

A complex line drawing can be decomposed into several simpler ones using the method proposed in Sections 2.1 and 2.2. The next step is to reconstruct a 3D object from each of them, which is an easy task because the decomposed line drawings are simple. The method in [6] or [7] can carry out this task very well. We use the one in [6] for our work with the objective function Φ(z1, z2, ..., zNv) constructed from these five image regularities: MSDA (minimum standard deviation of angles), face planarity, line parallelism, isometry, and corner orthogonality. The details of the regularities can be found in [6].
After obtaining the 3D objects from all the decomposed line drawings, the next step is to merge them to form one complex object. When merging two 3D objects, since they are reconstructed separately, their gluing parts (faces or edges) are usually not of the same size. One object is therefore automatically rescaled according to the sizes of the two gluing parts, and the vertices of the gluing part of this object are also adjusted so that the two parts are the same. After merging all the 3D objects, the whole object is fine-tuned by minimizing the objective function Φ on the object.

We can also apply our method to reconstruct 3D shapes from objects in images. First, the user draws a line drawing along the visible edges of an object and he/she can also guess (draw) the hidden edges. Then from this line drawing, our approach described above reconstructs the 3D geometry of the object in the image.

Footnote 3: The reason to attach a group with fewer than four faces to an object in SMWC is that this group is small and does not need to be reconstructed as an independent object.

3. Experimental Results

In this section, we show a number of complex 3D reconstruction examples from both line drawings and images to demonstrate the performance of our approach. The first set of experiments in Fig. 8 has nine complex line drawings. Fig. 8(a) is a manifold, and the others are non-manifold solids or non-solids. The decompositions of the line drawings are also given in the figure, from which we can see that the results accord well with our visual perception. All the primitive solids are found by our algorithm. It is the successful decompositions that make the 3D reconstructions from these complex line drawings possible. The reconstruction results, which are satisfactory as expected, are also shown in Fig. 8, each in two views. Fig. 9 shows another set of 3D reconstructions from objects in images with line drawings drawn on the objects.
The decomposition results are omitted due to the space limitation. Each reconstruction result obtained by our algorithm is shown in two views with the texture from the image mapped onto the surface. We can see that the results are very good. The details of the objects and the line drawings can be seen by enlarging the figures on the screen.

Fig. 8. Nine complex line drawings, their decompositions, and 3D reconstruction results in two views where different colors are used to denote the faces (better viewed on the screen).

Fig. 9. Four images, the corresponding line drawings, and the reconstructed 3D objects with texture mapped, each shown in two views. The details can be seen by enlarging the figures on the screen.

Among all the previous algorithms for general object reconstruction, the one in [7] can deal with the most complex objects. Due to the local minimum problem in a high dimensional search space, however, this algorithm cannot handle line drawings as complex as those in Figs. 8 and 9. For example, Fig. 10(a) shows its reconstruction result from the line drawing in Fig. 8(c), which is a failure.

The reader may wonder what happens if the 3D reconstruction is based on an arbitrary decomposition of a complex line drawing, instead of the proposed one. Fig. 10(b) shows such a decomposition from Fig. 8(c). Based on this decomposition, the 3D reconstruction result obtained by the scheme described in Section 2.3 is given in Fig. 10(c), which is a failure. The failure has two causes: (i) An arbitrary decomposition usually does not generate common objects, which makes the image regularities less meaningful for the 3D reconstruction. (ii) The gluing of 3D objects from the decomposition in Fig. 10(b) is difficult because of the irregular touches between the objects. The fine-tuning process (see Section 2.3) cannot reduce the large distortion to an acceptable result.

Fig. 10. (a) A failed reconstruction by the algorithm in [7]. (b) An arbitrary decomposition of the line drawing in Fig. 8(c) without using our decomposition method. (c) Failed 3D reconstruction based on the decomposition in (b).

Fig. 11. (a) A line drawing with strong sketching errors. (b)(c) Two views of the successful reconstruction result by our algorithm.

Note that since our algorithm is not limited to manifolds, it can deal with line drawings with some or no hidden lines. The third line drawing in Fig. 9 is an example where some hidden lines are not drawn.

Most of the line drawings in this paper look tidy. This is for easy observation of the objects. In fact, our algorithm is not sensitive to sketching errors. Take Fig. 8(a) as an example and assume it is an accurate projection of the 3D object. Then, random variations are generated with the Gaussian distribution N(0, σ²) on the 2D locations of the vertices. Fig. 11(a) is a resulting noisy line drawing with σ = W/200, where W is the width of the line drawing in Fig. 8(a). From Fig. 11, we see that even for this line drawing with strong sketching errors, our algorithm can still obtain a good reconstruction result.

Our algorithm is implemented in C++. The computational time includes two parts: line drawing decomposition and 3D reconstruction. Most of the computation time is spent on the second part. On average, a common PC takes about one minute to obtain the reconstruction from each of the line drawings in Figs. 8 and 9.

4. Conclusion

Previous algorithms of 3D object reconstruction from line drawings either deal with simple general objects or are limited to only manifolds (a subset of solids). In this paper, we have proposed a novel approach that can handle complex general objects, including manifolds, non-manifold solids, and non-solids. It decomposes a complex line drawing into simpler ones according to the degree of freedom of objects, which is based on the developed 3D object properties. After 3D objects are reconstructed from the decomposed line drawings, they are merged to form a complex object.
We have shown a number of reconstruction examples with comparison to the best previous algorithm. The results indicate that our algorithm can tackle much more complex line drawings of general objects and is insensitive to sketching errors. The future work includes (i) the correction of the distortions of 3D objects reconstructed from images caused by the perspective projection, and (ii) the extension of this work to objects with curved faces.

Acknowledgements

This work was supported by grants from Natural Science Foundation of China (No. 61070148), Science, Industry, Trade, and Information Technology Commission of Shenzhen Municipality, China (No. JC201005270378A), and Guangdong Innovative Research Team Program (No. 201001D0104648280). Jianzhuang Liu is the corresponding author.

References

[1] M. Clowes. On seeing things. Artificial Intelligence, 2:79–116, 1971.
[2] P. Debevec, C. Taylor, and J. Malik. Modeling and rendering architecture from photographs: A hybrid geometry- and image-based approach. Proc. ACM SIGGRAPH, pages 11–20, 1996.
[3] D. Jelinek and C. Taylor. Reconstruction of linearly parameterized models from single images with a camera of unknown focal length. IEEE T-PAMI, 23(7):767–773, 2001.
[4] D. E. LaCourse. Handbook of Solid Modeling. McGraw-Hill, 1995.
[5] Y. Leclerc and M. Fischler. An optimization-based approach to the interpretation of single line drawings as 3D wire frames. IJCV, 9(2):113–136, 1992.
[6] H. Lipson and M. Shpitalni. Optimization-based reconstruction of a 3D object from a single freehand line drawing. Computer-Aided Design, 28(7):651–663, 1996.
[7] J. Liu, L. Cao, Z. Li, and X. Tang. Plane-based optimization for 3D object reconstruction from single line drawings. IEEE T-PAMI, 30(2):315–327, 2008.
[8] J. Liu, Y. Chen, and X. Tang. Decomposition of complex line drawings with hidden lines for 3D planar-faced manifold object reconstruction. IEEE T-PAMI, 33(1):3–15, 2011.
[9] J. Liu, Y. Lee, and W. Cham. Identifying faces in a 2D line drawing representing a manifold object. IEEE T-PAMI, 24(12):1579–1593, 2002.
[10] J. Liu and X. Tang. Evolutionary search for faces from line drawings. IEEE T-PAMI, 27(6):861–872, 2005.
[11] T. Marill. Emulating the human interpretation of line-drawings as three-dimensional objects. IJCV, 6(2):147–161, 1991.
[12] P. R. J. Östergård. A new algorithm for the maximum-weight clique problem. Nordic J. of Computing, 8(4):424–436, Dec. 2001.
[13] H. Shimodaira. A shape-from-shading method of polyhedral objects using prior information. IEEE T-PAMI, 28(4):612–624, 2006.
[14] I. Shimshoni and J. Ponce. Recovering the shape of polyhedra using line-drawing analysis and complex reflectance models. Computer Vision and Image Understanding, 65(2):296–310, 1997.
[15] K. Shoji, K. Kato, and F. Toyama. 3-D interpretation of single line drawings based on entropy minimization principle. CVPR, 2001.
[16] M. Shpitalni and H. Lipson. Identification of faces in a 2D line drawing projection of a wireframe object. IEEE T-PAMI, 18(10), 1996.
[17] K. Sugihara. Machine interpretation of line drawings. MIT Press, 1986.
[18] A. Turner, D. Chapman, and A. Penn. Sketching space. Computer and Graphics, 24:869–879, 2000.
[19] F. Ulupinar and R. Nevatia. Shape from contour: straight homogeneous generalized cylinders and constant cross-section generalized cylinders. IEEE T-PAMI, 17(2):120–135, 1995.
[20] T. Xue, J. Liu, and X. Tang. Example-based 3D object reconstruction from line drawings. CVPR, 2012.

3 0.16413675 343 iccv-2013-Real-World Normal Map Capture for Nearly Flat Reflective Surfaces

Author: Bastien Jacquet, Christian Häne, Kevin Köser, Marc Pollefeys

Abstract: Although specular objects have gained interest in recent years, virtually no approaches exist for markerless reconstruction of reflective scenes in the wild. In this work, we present a practical approach to capturing normal maps in real-world scenes using video only. We focus on nearly planar surfaces such as windows, facades from glass or metal, or frames, screens and other indoor objects and show how normal maps of these can be obtained without the use of an artificial calibration object. Rather, we track the reflections of real-world straight lines, while moving with a hand-held or vehicle-mounted camera in front of the object. In contrast to error-prone local edge tracking, we obtain the reflections by a robust, global segmentation technique of an ortho-rectified 3D video cube that also naturally allows efficient user interaction. Then, at each point of the reflective surface, the resulting 2D-curve to 3D-line correspondence provides a novel quadratic constraint on the local surface normal. This allows to globally solve for the shape by integrability and smoothness constraints and easily supports the usage of multiple lines. We demonstrate the technique on several objects and facades.

4 0.15725809 436 iccv-2013-Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach

Author: R. Melo, M. Antunes, J.P. Barreto, G. Falcão, N. Gonçalves

Abstract: Estimating the amount and center of distortion from lines in the scene has been addressed in the literature by the so-called “plumb-line” approach. In this paper we propose a new geometric method to estimate not only the distortion parameters but the entire camera calibration (up to an “angular” scale factor) using a minimum of 3 lines. We propose a new framework for the unsupervised simultaneous detection of natural image of lines and camera parameters estimation, enabling a robust calibration from a single image. Comparative experiments with existing automatic approaches for the distortion estimation and with ground truth data are presented.

5 0.13124637 144 iccv-2013-Estimating the 3D Layout of Indoor Scenes and Its Clutter from Depth Sensors

Author: Jian Zhang, Chen Kan, Alexander G. Schwing, Raquel Urtasun

Abstract: In this paper we propose an approach to jointly estimate the layout of rooms as well as the clutter present in the scene using RGB-D data. Towards this goal, we propose an effective model that is able to exploit both depth and appearance features, which are complementary. Furthermore, our approach is efficient as we exploit the inherent decomposition of additive potentials. We demonstrate the effectiveness of our approach on the challenging NYU v2 dataset and show that employing depth reduces the layout error by 6% and the clutter estimation by 13%.

6 0.1283669 90 iccv-2013-Content-Aware Rotation

7 0.12327666 323 iccv-2013-Pose Estimation with Unknown Focal Length Using Points, Directions and Lines

8 0.1215737 346 iccv-2013-Rectangling Stereographic Projection for Wide-Angle Image Visualization

9 0.12146043 64 iccv-2013-Box in the Box: Joint 3D Layout and Object Reasoning from Single Images

10 0.11469999 397 iccv-2013-Space-Time Tradeoffs in Photo Sequencing

11 0.10416757 429 iccv-2013-Tree Shape Priors with Connectivity Constraints Using Convex Relaxation on General Graphs

12 0.10055158 292 iccv-2013-Non-convex P-Norm Projection for Robust Sparsity

13 0.099394888 68 iccv-2013-Camera Alignment Using Trajectory Intersections in Unsynchronized Videos

14 0.099010386 104 iccv-2013-Decomposing Bag of Words Histograms

15 0.098507978 17 iccv-2013-A Global Linear Method for Camera Pose Registration

16 0.089502163 319 iccv-2013-Point-Based 3D Reconstruction of Thin Objects

17 0.089354813 387 iccv-2013-Shape Anchors for Data-Driven Multi-view Reconstruction

18 0.088020161 1 iccv-2013-3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding

19 0.082193285 252 iccv-2013-Line Assisted Light Field Triangulation and Stereo Matching

20 0.079745211 57 iccv-2013-BOLD Features to Detect Texture-less Objects

similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.172), (1, -0.126), (2, -0.025), (3, 0.012), (4, 0.013), (5, 0.025), (6, -0.01), (7, -0.101), (8, 0.019), (9, -0.058), (10, 0.032), (11, 0.034), (12, -0.105), (13, 0.067), (14, 0.037), (15, 0.007), (16, 0.012), (17, 0.069), (18, -0.068), (19, -0.007), (20, -0.059), (21, -0.109), (22, 0.066), (23, -0.032), (24, 0.028), (25, 0.024), (26, 0.003), (27, -0.032), (28, -0.06), (29, 0.002), (30, -0.081), (31, 0.13), (32, -0.047), (33, -0.023), (34, -0.079), (35, -0.008), (36, -0.07), (37, -0.048), (38, 0.008), (39, -0.073), (40, 0.046), (41, -0.054), (42, 0.01), (43, 0.033), (44, 0.027), (45, 0.035), (46, -0.011), (47, 0.024), (48, -0.01), (49, 0.104)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.96437746 250 iccv-2013-Lifting 3D Manhattan Lines from a Single Image

Author: Srikumar Ramalingam, Matthew Brand

2 0.7971288 346 iccv-2013-Rectangling Stereographic Projection for Wide-Angle Image Visualization

Author: Che-Han Chang, Min-Chun Hu, Wen-Huang Cheng, Yung-Yu Chuang

Abstract: This paper proposes a new projection model for mapping a hemisphere to a plane. Such a model can be useful for viewing wide-angle images. Our model consists of two steps. In the first step, the hemisphere is projected onto a swung surface constructed by a circular profile and a rounded rectangular trajectory. The second step maps the projected image on the swung surface onto the image plane through the perspective projection. We also propose a method for automatically determining proper parameters for the projection model based on image content. The proposed model has several advantages. It is simple, efficient and easy to control. Most importantly, it makes a better compromise between distortion minimization and line preserving than popular projection models, such as stereographic and Pannini projections. Experiments and analysis demonstrate the effectiveness of our model.

3 0.77210712 436 iccv-2013-Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach

Author: R. Melo, M. Antunes, J.P. Barreto, G. Falcão, N. Gonçalves

4 0.74224412 84 iccv-2013-Complex 3D General Object Reconstruction from Line Drawings

Author: Linjie Yang, Jianzhuang Liu, Xiaoou Tang

5 0.68835795 348 iccv-2013-Refractive Structure-from-Motion on Underwater Images

Author: Anne Jordt-Sedlazeck, Reinhard Koch

Abstract: In underwater environments, cameras need to be confined in an underwater housing, viewing the scene through a piece of glass. In case of flat port underwater housings, light rays entering the camera housing are refracted twice, due to different medium densities of water, glass, and air. This causes the usually linear rays of light to bend and the commonly used pinhole camera model to be invalid. When using the pinhole camera model without explicitly modeling refraction in Structure-from-Motion (SfM) methods, a systematic model error occurs. Therefore, in this paper, we propose a system for computing camera path and 3D points with explicit incorporation of refraction using new methods for pose estimation. Additionally, a new error function is introduced for non-linear optimization, especially bundle adjustment. The proposed method allows to increase reconstruction accuracy and is evaluated in a set of experiments, where the proposed method’s performance is compared to SfM with the perspective camera model.

6 0.67399377 49 iccv-2013-An Enhanced Structure-from-Motion Paradigm Based on the Absolute Dual Quadric and Images of Circular Points

7 0.66713351 343 iccv-2013-Real-World Normal Map Capture for Nearly Flat Reflective Surfaces

8 0.66314131 152 iccv-2013-Extrinsic Camera Calibration without a Direct View Using Spherical Mirror

9 0.64128864 280 iccv-2013-Multi-view 3D Reconstruction from Uncalibrated Radially-Symmetric Cameras

10 0.60838842 90 iccv-2013-Content-Aware Rotation

11 0.59507102 410 iccv-2013-Support Surface Prediction in Indoor Scenes

12 0.58491504 323 iccv-2013-Pose Estimation with Unknown Focal Length Using Points, Directions and Lines

13 0.576846 397 iccv-2013-Space-Time Tradeoffs in Photo Sequencing

14 0.56290728 102 iccv-2013-Data-Driven 3D Primitives for Single Image Understanding

15 0.55890614 1 iccv-2013-3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding

16 0.55748814 57 iccv-2013-BOLD Features to Detect Texture-less Objects

17 0.556539 132 iccv-2013-Efficient 3D Scene Labeling Using Fields of Trees

18 0.54610974 79 iccv-2013-Coherent Object Detection with 3D Geometric Context from a Single Image

19 0.54075277 17 iccv-2013-A Global Linear Method for Camera Pose Registration

20 0.53526884 319 iccv-2013-Point-Based 3D Reconstruction of Thin Objects

similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(2, 0.052), (3, 0.245), (7, 0.039), (12, 0.019), (13, 0.015), (26, 0.085), (27, 0.021), (31, 0.04), (42, 0.086), (48, 0.017), (55, 0.011), (64, 0.047), (73, 0.028), (78, 0.02), (89, 0.19)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.7797277 365 iccv-2013-SIFTpack: A Compact Representation for Efficient SIFT Matching

Author: Alexandra Gilinsky, Lihi Zelnik Manor

Abstract: Computing distances between large sets of SIFT descriptors is a basic step in numerous algorithms in computer vision. When the number of descriptors is large, as is often the case, computing these distances can be extremely time consuming. In this paper we propose the SIFTpack: a compact way of storing SIFT descriptors, which enables significantly faster calculations between sets of SIFTs than the current solutions. SIFTpack can be used to represent SIFTs densely extracted from a single image or sparsely from multiple different images. We show that the SIFTpack representation saves both storage space and run time, for both finding nearest neighbors and for computing all distances between all descriptors. The usefulness of SIFTpack is also demonstrated as an alternative implementation for K-means dictionaries of visual words.

same-paper 2 0.77871621 250 iccv-2013-Lifting 3D Manhattan Lines from a Single Image

Author: Srikumar Ramalingam, Matthew Brand

3 0.75577891 286 iccv-2013-NYC3DCars: A Dataset of 3D Vehicles in Geographic Context

Author: Kevin Matzen, Noah Snavely

Abstract: Geometry and geography can play an important role in recognition tasks in computer vision. To aid in studying connections between geometry and recognition, we introduce NYC3DCars, a rich dataset for vehicle detection in urban scenes built from Internet photos drawn from the wild, focused on densely trafficked areas of New York City. Our dataset is augmented with detailed geometric and geographic information, including full camera poses derived from structure from motion, 3D vehicle annotations, and geographic information from open resources, including road segmentations and directions of travel. NYC3DCars can be used to study new questions about using geometric information in detection tasks, and to explore applications of Internet photos in understanding cities. To demonstrate the utility of our data, we evaluate the use of the geographic information in our dataset to enhance a parts-based detection method, and suggest other avenues for future exploration.

4 0.74721622 141 iccv-2013-Enhanced Continuous Tabu Search for Parameter Estimation in Multiview Geometry

Author: Guoqing Zhou, Qing Wang

Abstract: Optimization using the L∞ norm has been becoming an effective way to solve parameter estimation problems in multiview geometry. But the computational cost increases rapidly with the size of measurement data. Although some strategies have been presented to improve the efficiency of L∞ optimization, it is still an open issue. In the paper, we propose a novel approach under the framework of enhanced continuous tabu search (ECTS) for generic parameter estimation in multiview geometry. ECTS is an optimization method in the domain of artificial intelligence, which has an interesting ability of covering a wide solution space by promoting the search far away from current solution and consecutively decreasing the possibility of trapping in the local minima. Taking the triangulation as an example, we propose the corresponding ways in the key steps of ECTS, diversification and intensification. We also present theoretical proof to guarantee the global convergence of search with probability one. Experimental results have validated that the ECTS based approach can obtain global optimum efficiently, especially for large scale dimension of parameter. Potentially, the novel ECTS based algorithm can be applied in many applications of multiview geometry.

5 0.71665585 26 iccv-2013-A Practical Transfer Learning Algorithm for Face Verification

Author: Xudong Cao, David Wipf, Fang Wen, Genquan Duan, Jian Sun

Abstract: Face verification involves determining whether a pair of facial images belongs to the same or different subjects. This problem can prove to be quite challenging in many important applications where labeled training data is scarce, e.g., family album photo organization software. Herein we propose a principled transfer learning approach for merging plentiful source-domain data with limited samples from some target domain of interest to create a classifier that ideally performs nearly as well as if rich target-domain data were present. Based upon a surprisingly simple generative Bayesian model, our approach combines a KL-divergence-based regularizer/prior with a robust likelihood function leading to a scalable implementation via the EM algorithm. As justification for our design choices, we later use principles from convex analysis to recast our algorithm as an equivalent structured rank minimization problem leading to a number of interesting insights related to solution structure and feature-transform invariance. These insights help to both explain the effectiveness of our algorithm as well as elucidate a wide variety of related Bayesian approaches. Experimental testing with challenging datasets validates the utility of the proposed algorithm.

6 0.69101143 150 iccv-2013-Exemplar Cut

7 0.69097167 107 iccv-2013-Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction

8 0.69045627 349 iccv-2013-Regionlets for Generic Object Detection

9 0.69008923 196 iccv-2013-Hierarchical Data-Driven Descent for Efficient Optimal Deformation Estimation

10 0.6896345 376 iccv-2013-Scene Text Localization and Recognition with Oriented Stroke Detection

11 0.68852663 379 iccv-2013-Semantic Segmentation without Annotating Segments

12 0.68823922 420 iccv-2013-Topology-Constrained Layered Tracking with Latent Flow

13 0.68797958 411 iccv-2013-Symbiotic Segmentation and Part Localization for Fine-Grained Categorization

14 0.68792903 78 iccv-2013-Coherent Motion Segmentation in Moving Camera Videos Using Optical Flow Orientations

15 0.68764234 128 iccv-2013-Dynamic Probabilistic Volumetric Models

16 0.68732995 127 iccv-2013-Dynamic Pooling for Complex Event Recognition

17 0.68700504 204 iccv-2013-Human Attribute Recognition by Rich Appearance Dictionary

18 0.68695384 436 iccv-2013-Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach

19 0.68679464 146 iccv-2013-Event Detection in Complex Scenes Using Interval Temporal Constraints

20 0.68669301 65 iccv-2013-Breaking the Chain: Liberation from the Temporal Markov Assumption for Tracking Human Poses