iccv iccv2013 iccv2013-17 knowledge-graph by maker-knowledge-mining

17 iccv-2013-A Global Linear Method for Camera Pose Registration


Source: pdf

Author: Nianjuan Jiang, Zhaopeng Cui, Ping Tan

Abstract: We present a linear method for global camera pose registration from pairwise relative poses encoded in essential matrices. Our method minimizes an approximate geometric error to enforce the triangular relationship in camera triplets. This formulation does not suffer from the typical ‘unbalanced scale ’ problem in linear methods relying on pairwise translation direction constraints, i.e. an algebraic error; nor the system degeneracy from collinear motion. In the case of three cameras, our method provides a good linear approximation of the trifocal tensor. It can be directly scaled up to register multiple cameras. The results obtained are accurate for point triangulation and can serve as a good initialization for final bundle adjustment. We evaluate the algorithm performance with different types of data and demonstrate its effectiveness. Our system produces good accuracy, robustness, and outperforms some well-known systems on efficiency.

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Our method minimizes an approximate geometric error to enforce the triangular relationship in camera triplets. [sent-2, score-0.274]

2 an algebraic error; nor the system degeneracy from collinear motion. [sent-5, score-0.402]

3 Introduction Structure-from-motion (SfM) methods simultaneously estimate scene structure and camera motion from multiple images. [sent-12, score-0.319]

4 First, relative poses between camera pairs or triplets are computed from matched image feature points, e. [sent-14, score-0.376]

5 Second, all camera poses (including orientations and positions) and scene point coordinates are recovered in a global coordinate system according to these relative poses. [sent-17, score-0.475]

6 Some well-known systems, such as [36, 2], compute camera poses in an incremental fashion, ∗These authors contributed equally to this work. [sent-25, score-0.298]

7 Thus, it is highly desirable that all camera poses are solved simultaneously for efficiency and accuracy. [sent-32, score-0.298]

8 [3] derived a novel linear algorithm that is robust to different camera baseline lengths. [sent-39, score-0.274]

9 Yet it still suffers from the same degeneracy as [13] for collinear cameras (e. [sent-40, score-0.448]

10 Unlike earlier algebraic methods, we compute the camera positions (translations) by minimizing a geometric error the Euclidean distance between the camera centers and the lines collinear with their corresponding baselines. [sent-47, score-0.882]

11 This novel approach generates more precise results, and does not degenerate with collinear camera motion. [sent-48, score-0.506]

12 We want to stress that the robustness with collinear motion is an important advantage, since collinear motion is common (e. [sent-49, score-0.662]

13 Furthermore, our estimation of camera poses does not involve reconstructing any 3D point. [sent-52, score-0.298]

14 Effectively, we first solve the ‘motion’ camera poses, and then solve the ‘structure’ scene points. [sent-53, score-0.329]

15 Once the camera poses are recovered, the scene points can be reconstructed from – – – nearby cameras. [sent-56, score-0.369]

16 In the special case of three cameras, our algorithm effectively computes the trifocal tensor from three essential matrices. [sent-57, score-0.362]

17 In our experiment, we find that our method is more robust than the four-point algorithm [26] which solves trifocal tensor from three calibrated images. [sent-58, score-0.365]

18 Many well-known SfM systems take a sequential [3 1, 36, 2] or hierarchical [11, 22, 17] approach to register cameras incrementally to a global coordinate system from their pairwise relative poses. [sent-72, score-0.282]

19 Factorization based 3D reconstruction was proposed by Tomasi and Kanade [39] to recover all camera poses and 3D points simultaneously under weak perspective projection. [sent-75, score-0.373]

20 Some global methods solve all camera poses together in two steps. [sent-79, score-0.368]

21 Typically, they first compute camera rotations and solve translations in the next step. [sent-80, score-0.407]

22 While global rotations can be computed robustly and accurately by rotation averaging [15], translations are difficult because the input pairwise relative translations are only known up to a scale. [sent-82, score-0.441]

23 The pioneer works [13, 4] solved translations according to linear equations derived from pairwise relative translation directions. [sent-83, score-0.365]

24 These earlier methods suffer from degeneracy of collinear camera motion and unbalanced constraint weighting caused by different camera baseline length. [sent-84, score-0.91]

25 [7] computed homographies to glue individual triplet reconstructions by loop analysis and nonlinear optimization. [sent-87, score-0.321]

26 [35] registered individual pairwise reconstructions by solving their individual global scaling and translation in a robust linear system. [sent-89, score-0.291]

27 However, it still suffers from the degeneracy of collinear camera motion like [13, 4]. [sent-94, score-0.636]

28 Other global methods solve all camera poses and 3D scene points at once. [sent-96, score-0.404]

29 In the special case of three cameras, the camera geometry is fully captured by a trifocal tensor. [sent-104, score-0.485]

30 Effectively, our method provides a linear solution for trifocal tensor from three essential matrices (i. [sent-107, score-0.362]

31 An essential matrix Eij between two images i,j provides the relative rotation Rij and the translation direction tij . [sent-114, score-0.366]

32 i sO au 3r goal rist toon recover aaltlr tihxe a nabds toliust ea camera poses itno a global ocaolo isrd itnoa rteec system. [sent-116, score-0.336]

33 tWhee use a rotation matrix Ri and a translation vector ci to denote the orientation and position of the i-th camera (1 ≤ i ≤ N). [sent-117, score-0.697]

34 In real data, these equations mweail n sno etq uhoalldit precisely sacnadl we nne reeda lto d afitand, a esseet of Ri, ci that best satisfy these equations. [sent-122, score-0.303]

35 Secondly, the camera poses should be solved separately from the scene points. [sent-126, score-0.334]

36 There are often much more scene points than cameras so that solving camera poses without scene points will significantly reduce the number of unknowns. [sent-127, score-0.465]

37 We first apply the linear method described in [24] to compute the global camera rotations Ri. [sent-128, score-0.337]

38 Once all rotations are fixed, we then solve all camera centers (ci, 1 ≤ i ≤ N) without reconstructing any 3D point. [sent-131, score-0.384]

39 Translation Registration Given the global camera rotations computed in the previous section, we first transform each tij to the global rotation reference frame as cij = −R? [sent-133, score-0.746]

40 The constraint on camera centers in Equation =(1) − can be written as in [13], cij (cj − ci) = 0. [sent-135, score-0.485]

41 Hhoisw iesv ae lri, equations oobnt aaibnoeudt this way degenerate for collinear camera motion. [sent-139, score-0.57]

42 In fact, Equation (2) minimizes the cross product between cij and the baseline direction cj −ci. [sent-142, score-0.584]

43 The × relative translation cij , cik , and cjk between camera pairs are known. [sent-148, score-1.041]

44 We need to estimate camera centers ci, cj , and ck. [sent-149, score-0.618]

45 Ideally, the three unit vectors cij , cik , and cjk should be coplanar. [sent-150, score-0.661]

46 Here, | |ci − cj | | is the distance between ci and cj . [sent-168, score-0.911]

47 ) = | |cj ck −| | /c| ||c|i/ cj −| −| are effectively the baseline length ) ra =tio ||sc. [sent-173, score-0.56]

48 To linearize it, we observe that sjikj −c | |ci − cj | |cik = | |ci − cj −||c | |Ri (θ? [sent-179, score-0.761]

49 (4) Here, Ri (φ) is the rotation matrix around the axis cij × for an angle φ (counter-clockwise). [sent-182, score-0.345]

50 Thus we obtain th×e following linear equation, cik 2ck − ci − cj = Ri(θi? [sent-183, score-0.816]

51 (5) Note Rj (·) is a rotation matrix around the direction cij cjk. [sent-186, score-0.312]

52 Similarly, we can oonb mtaiant rtihxe following et dwior elictnieoanr equations of camera centers by assuming cik and cjk are free from error respectively, 2cj − ci − ck 2ci −cj −ck Ri(−θi? [sent-187, score-1.231]

53 = − − (7) Solving these three linear equations can determine the camera centers. [sent-192, score-0.293]

54 Note that Equation (5) does not require the orientation cj − ci to be the same as cij . [sent-193, score-0.778]

55 This introduces a rotation ambiguity in the plane defined by the camera centers. [sent-194, score-0.383]

56 We can solve it by computing the average rotation to align cj ci, ck ci and ck cj with the projection of cij , cik and cjk in the− camera plane, respectively, after the initial registration. [sent-195, score-2.228]

57 Calculating baseline length ratios by the sine angles as described earlier is only valid when cij, cik and cjk are not collinear. [sent-197, score-0.539]

58 In order to be robust regardless of the type of camera motion, we compute − − − 483 all baseline length ratios from locally reconstructed scene points. [sent-198, score-0.381]

59 The translation registration does not involve reconstructing any scene point in the global coordinate system. [sent-206, score-0.314]

60 Given a triplet graph (see definition in Section 5), we collect all equations (i. [sent-210, score-0.332]

61 the rotation ambiguity in all triplets share the same rotation axis), there is a global in-plane rotation ambiguity similar to the three-camera case. [sent-220, score-0.502]

62 Therefore, the unknown camera centers are implicitly given different weights depending on the number of constraints containing that particular camera when we solve for Ac = 0. [sent-223, score-0.543]

63 Thus, for every camera i, we count the number of triplet constraints containing its center, denoted by Ki. [sent-224, score-0.453]

64 Each triplet constraint involving camera i, j, k is re-weighted by min(Ki1,Kj,Kk). [sent-225, score-0.453]

65 1) We verify every triplet in the match graph, and remove EGs which participate in no triplet that passes the verification. [sent-243, score-0.504]

66 Specifically, we apply our translation registration to each triplet and calculate the average difference between the relative translation directions before and after the registration. [sent-244, score-0.572]

67 We further require that at least one good point (with reprojection error smaller than 4 pixels) can be triangulated by the registered triplet cameras. [sent-246, score-0.374]

68 2) Among the edges of the match graph, we extract a subset of ‘reliable edges’ to compute the global camera orientations as described in Section 3. [sent-247, score-0.352]

69 3) We further use these camera orientations to verify the match graph edges, and discard an edge if the geodesic distance [15] between the loop rotation matrix [45] and the identity matrix is greater than δ2. [sent-252, score-0.504]

70 We further extract connected triplet graphs from the match graph, where each triplet is represented by a vertex. [sent-259, score-0.569]

71 A single connected component of the match graph could generate multiple connected triplet graphs, as illustrated in Figure 2. [sent-261, score-0.454]

72 We then apply our method in Section 4 to compute the positions of cameras in each triplet graph respectively. [sent-262, score-0.363]

73 Our method is much more stable in translation estimation for near collinear camera motions. [sent-267, score-0.626]

74 When there are multiple triplet graphs, their reconstructions are merged to obtain the final result. [sent-269, score-0.284]

75 Camera 0 is placed at the world origin and camera 2 is placed at a random location away from camera 0 by 0. [sent-280, score-0.458]

76 The location of camera 1is sampled randomly in the sphere centered at the middle point between camera 0 and 2, and passing through their camera centers. [sent-282, score-0.687]

77 The scene points are generated randomly within the viewing volume of the first camera and the distance between the nearest scene point and the furthest scene point is about 0. [sent-285, score-0.337]

78 The error of camera orientations Rerr is the mean geodesic distance (in degrees) between the estimated and the true camera rotation matrix. [sent-293, score-0.641]

79 Absolute camera location error cerr is the mean Euclidean distance between the estimated and the true camera centers. [sent-295, score-0.548]

80 We compare with the four-point algorithm [26], which is the only practical algorithm to compute trifocal tensor from three calibrated images as far as we know. [sent-298, score-0.365]

81 We also compare with the recent method [3] to demonstrate the robustness of our method on near collinear camera motions. [sent-303, score-0.506]

82 It is clear that our method produces more stable results for near collinear motion. [sent-311, score-0.309]

83 Multi-view Reconstruction We test the performance of our method with some standard benchmark datasets with known ground-truth camera motion to quantitatively evaluate the reconstruction accuracy. [sent-314, score-0.358]

84 On average our method produces error in ci about 0. [sent-329, score-0.316]

85 The absolute camera rotation error Rerr and camera location error cerr are measured in degrees and meters, respectively. [sent-360, score-0.738]

86 Each internet image collection is reconstructed as one single connected triplet graph by our algorithm. [sent-366, score-0.4]

87 We further manually identify the set of common cameras registered by our method and VisualSFM, respectively, for the Notre Dame example, and compute the difference between the estimated camera motion. [sent-374, score-0.356]

88 007 (when the distance between the two farthest camera is 1). [sent-377, score-0.266]

89 Conclusion We present a novel linear solution for the global camera pose registration problem. [sent-389, score-0.344]

90 It is free from the common degeneration of linear methods on collinear motion, and is robust to different baseline lengths between cameras. [sent-391, score-0.322]

91 For the case of three cameras, it produces more accurate results than prior trifocal tensor estimation method on calibrated images. [sent-392, score-0.397]

92 Spectral solution of large-scale extrinsic camera calibration as a graph embedding problem. [sent-428, score-0.273]

93 Exploiting loops in the graph of trifocal tensors for calibrating a network of cameras. [sent-448, score-0.342]

94 Randomized structure from motion based on atomic 3d models from camera triplets. [sent-513, score-0.283]

95 Self-calibration and [31] [32] [33] [34] [35] [36] [37] [38] [39] metric reconstruction inspite of varying and unknown intrinsic camera parameters. [sent-591, score-0.304]

96 Derivation of Equation (3) We first show that the length of the line segments ciA, cjB are approx- siikj sjikj imately | |ci − cj | | and | |ci − cj | | respectively. [sent-706, score-0.886]

97 The three vector|s| cij , cik a|n adn cjk sh|o|culd− b ce |c|lo rsees ptoe coplanar, so the angle ∠Acick is close to zero, and the length of ciA is close to that of cick. [sent-707, score-0.73]

98 ≈ θk because the three vectors cij , cik and cjk are ≈clo θse to coplanar. [sent-713, score-0.661]

99 The 3D coordinate of A is then approximated by ci + siikj | |ci − cj | |cik. [sent-714, score-0.707]

100 Similarly, we sjikj can obtain the coordinate of B as cj + | |ci −cj | |cjk. [sent-715, score-0.468]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('cj', 0.336), ('collinear', 0.277), ('trifocal', 0.256), ('cik', 0.241), ('ci', 0.239), ('camera', 0.229), ('triplet', 0.224), ('cjk', 0.217), ('cij', 0.203), ('egs', 0.183), ('ck', 0.143), ('visualsfm', 0.138), ('translation', 0.12), ('ba', 0.116), ('rotation', 0.109), ('cameras', 0.095), ('siikj', 0.089), ('sjikj', 0.089), ('exif', 0.082), ('registration', 0.077), ('translations', 0.076), ('degeneracy', 0.076), ('reconstruction', 0.075), ('reprojection', 0.073), ('rotations', 0.07), ('poses', 0.069), ('sfm', 0.067), ('siijk', 0.067), ('connected', 0.065), ('equations', 0.064), ('rj', 0.063), ('ri', 0.06), ('reconstructions', 0.06), ('tij', 0.059), ('unordered', 0.059), ('tensor', 0.059), ('match', 0.056), ('motion', 0.054), ('sinha', 0.053), ('centers', 0.053), ('factorization', 0.053), ('pages', 0.052), ('uncalibrated', 0.05), ('calibrated', 0.05), ('algebraic', 0.049), ('notre', 0.047), ('triplets', 0.047), ('essential', 0.047), ('eg', 0.046), ('ambiguity', 0.045), ('snavely', 0.045), ('error', 0.045), ('baseline', 0.045), ('cerr', 0.045), ('rerr', 0.045), ('sijjk', 0.045), ('ssiinn', 0.045), ('trevi', 0.045), ('geometries', 0.044), ('graph', 0.044), ('incorrect', 0.043), ('coordinate', 0.043), ('bundle', 0.043), ('tensors', 0.042), ('pairwise', 0.041), ('cmvs', 0.04), ('courchay', 0.04), ('global', 0.038), ('agarwal', 0.037), ('loop', 0.037), ('dalalyan', 0.037), ('farthest', 0.037), ('suspicious', 0.037), ('scene', 0.036), ('equation', 0.036), ('length', 0.036), ('sin', 0.036), ('degrees', 0.036), ('seitz', 0.035), ('reconstructed', 0.035), ('cia', 0.034), ('coplanar', 0.034), ('register', 0.034), ('zach', 0.034), ('arising', 0.034), ('angle', 0.033), ('pioneer', 0.033), ('internet', 0.032), ('registered', 0.032), ('produces', 0.032), ('solve', 0.032), ('bundler', 0.032), ('dame', 0.032), ('eriksson', 0.032), ('midpoint', 0.032), ('relative', 0.031), ('pisa', 0.03), ('singapore', 0.03), ('olsson', 0.029), ('orientations', 0.029)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99999964 17 iccv-2013-A Global Linear Method for Camera Pose Registration

Author: Nianjuan Jiang, Zhaopeng Cui, Ping Tan

Abstract: We present a linear method for global camera pose registration from pairwise relative poses encoded in essential matrices. Our method minimizes an approximate geometric error to enforce the triangular relationship in camera triplets. This formulation does not suffer from the typical ‘unbalanced scale ’ problem in linear methods relying on pairwise translation direction constraints, i.e. an algebraic error; nor the system degeneracy from collinear motion. In the case of three cameras, our method provides a good linear approximation of the trifocal tensor. It can be directly scaled up to register multiple cameras. The results obtained are accurate for point triangulation and can serve as a good initialization for final bundle adjustment. We evaluate the algorithm performance with different types of data and demonstrate its effectiveness. Our system produces good accuracy, robustness, and outperforms some well-known systems on efficiency.

2 0.37041491 184 iccv-2013-Global Fusion of Relative Motions for Robust, Accurate and Scalable Structure from Motion

Author: Pierre Moulon, Pascal Monasse, Renaud Marlet

Abstract: Multi-view structure from motion (SfM) estimates the position and orientation of pictures in a common 3D coordinate frame. When views are treated incrementally, this external calibration can be subject to drift, contrary to global methods that distribute residual errors evenly. We propose a new global calibration approach based on the fusion of relative motions between image pairs. We improve an existing method for robustly computing global rotations. We present an efficient a contrario trifocal tensor estimation method, from which stable and precise translation directions can be extracted. We also define an efficient translation registration method that recovers accurate camera positions. These components are combined into an original SfM pipeline. Our experiments show that, on most datasets, it outperforms in accuracy other existing incremental and global pipelines. It also achieves strikingly good running times: it is about 20 times faster than the other global method we could compare to, and as fast as the best incremental method. More importantly, it features better scalability properties.

3 0.14931607 115 iccv-2013-Direct Optimization of Frame-to-Frame Rotation

Author: Laurent Kneip, Simon Lynen

Abstract: This work makes use of a novel, recently proposed epipolar constraint for computing the relative pose between two calibrated images. By enforcing the coplanarity of epipolar plane normal vectors, it constrains the three degrees of freedom of the relative rotation between two camera views directly—independently of the translation. The present paper shows how the approach can be extended to n points, and translated into an efficient eigenvalue minimization over the three rotational degrees of freedom. Each iteration in the non-linear optimization has constant execution time, independently of the number of features. Two global optimization approaches are proposed. The first one consists of an efficient Levenberg-Marquardt scheme with randomized initial value, which already leads to stable and accurate results. The second scheme consists of a globally optimal branch-and-bound algorithm based on a bound on the eigenvalue variation derived from symmetric eigenvalue-perturbation theory. Analysis of the cost function reveals insights into the nature of a specific relative pose problem, and outlines the complexity under different conditions. The algorithm shows state-of-the-art performance w.r.t. essential-matrix based solutions, and a frameto-frame application to a video sequence immediately leads to an alternative, real-time visual odometry solution. Note: All algorithms in this paper are made available in the OpenGV library. Please visit http : / / l aurent kne ip .github . i / opengv o

4 0.13793468 280 iccv-2013-Multi-view 3D Reconstruction from Uncalibrated Radially-Symmetric Cameras

Author: Jae-Hak Kim, Yuchao Dai, Hongdong Li, Xin Du, Jonghyuk Kim

Abstract: We present a new multi-view 3D Euclidean reconstruction method for arbitrary uncalibrated radially-symmetric cameras, which needs no calibration or any camera model parameters other than radial symmetry. It is built on the radial 1D camera model [25], a unified mathematical abstraction to different types of radially-symmetric cameras. We formulate the problem of multi-view reconstruction for radial 1D cameras as a matrix rank minimization problem. Efficient implementation based on alternating direction continuation is proposed to handle scalability issue for real-world applications. Our method applies to a wide range of omnidirectional cameras including both dioptric and catadioptric (central and non-central) cameras. Additionally, our method deals with complete and incomplete measurements under a unified framework elegantly. Experiments on both synthetic and real images from various types of cameras validate the superior performance of our new method, in terms of numerical accuracy and robustness.

5 0.13430548 432 iccv-2013-Uncertainty-Driven Efficiently-Sampled Sparse Graphical Models for Concurrent Tumor Segmentation and Atlas Registration

Author: Sarah Parisot, William Wells_III, Stéphane Chemouny, Hugues Duffau, Nikos Paragios

Abstract: Graph-based methods have become popular in recent years and have successfully addressed tasks like segmentation and deformable registration. Their main strength is optimality of the obtained solution while their main limitation is the lack of precision due to the grid-like representations and the discrete nature of the quantized search space. In this paper we introduce a novel approach for combined segmentation/registration of brain tumors that adapts graph and sampling resolution according to the image content. To this end we estimate the segmentation and registration marginals towards adaptive graph resolution and intelligent definition of the search space. This information is considered in a hierarchical framework where uncertainties are propagated in a natural manner. State of the art results in the joint segmentation/registration of brain images with low-grade gliomas demonstrate the potential of our approach.

6 0.13369334 367 iccv-2013-SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels

7 0.12913498 323 iccv-2013-Pose Estimation with Unknown Focal Length Using Points, Directions and Lines

8 0.12394218 402 iccv-2013-Street View Motion-from-Structure-from-Motion

9 0.11854788 138 iccv-2013-Efficient and Robust Large-Scale Rotation Averaging

10 0.1129659 111 iccv-2013-Detecting Dynamic Objects with Multi-view Background Subtraction

11 0.10923178 49 iccv-2013-An Enhanced Structure-from-Motion Paradigm Based on the Absolute Dual Quadric and Images of Circular Points

12 0.098507978 250 iccv-2013-Lifting 3D Manhattan Lines from a Single Image

13 0.097592972 314 iccv-2013-Perspective Motion Segmentation via Collaborative Clustering

14 0.094422601 387 iccv-2013-Shape Anchors for Data-Driven Multi-view Reconstruction

15 0.093828216 273 iccv-2013-Monocular Image 3D Human Pose Estimation under Self-Occlusion

16 0.092640668 397 iccv-2013-Space-Time Tradeoffs in Photo Sequencing

17 0.090111159 27 iccv-2013-A Robust Analytical Solution to Isometric Shape-from-Template with Focal Length Calibration

18 0.088210665 353 iccv-2013-Revisiting the PnP Problem: A Fast, General and Optimal Solution

19 0.088147148 228 iccv-2013-Large-Scale Multi-resolution Surface Reconstruction from RGB-D Sequences

20 0.087713048 289 iccv-2013-Network Principles for SfM: Disambiguating Repeated Structures with Local Context


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.192), (1, -0.155), (2, -0.059), (3, 0.028), (4, -0.033), (5, 0.05), (6, 0.044), (7, -0.113), (8, 0.048), (9, 0.026), (10, 0.02), (11, -0.027), (12, -0.135), (13, 0.013), (14, 0.024), (15, 0.123), (16, 0.103), (17, 0.104), (18, -0.031), (19, 0.025), (20, 0.008), (21, -0.154), (22, -0.068), (23, 0.082), (24, 0.021), (25, -0.005), (26, 0.046), (27, 0.002), (28, -0.121), (29, 0.002), (30, 0.035), (31, 0.018), (32, -0.014), (33, -0.073), (34, -0.093), (35, -0.002), (36, 0.032), (37, 0.08), (38, -0.11), (39, 0.14), (40, -0.111), (41, 0.071), (42, 0.029), (43, 0.003), (44, 0.036), (45, -0.098), (46, -0.049), (47, 0.162), (48, 0.008), (49, 0.056)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.94934958 17 iccv-2013-A Global Linear Method for Camera Pose Registration

Author: Nianjuan Jiang, Zhaopeng Cui, Ping Tan

Abstract: We present a linear method for global camera pose registration from pairwise relative poses encoded in essential matrices. Our method minimizes an approximate geometric error to enforce the triangular relationship in camera triplets. This formulation does not suffer from the typical ‘unbalanced scale ’ problem in linear methods relying on pairwise translation direction constraints, i.e. an algebraic error; nor the system degeneracy from collinear motion. In the case of three cameras, our method provides a good linear approximation of the trifocal tensor. It can be directly scaled up to register multiple cameras. The results obtained are accurate for point triangulation and can serve as a good initialization for final bundle adjustment. We evaluate the algorithm performance with different types of data and demonstrate its effectiveness. Our system produces good accuracy, robustness, and outperforms some well-known systems on efficiency.

2 0.92713505 184 iccv-2013-Global Fusion of Relative Motions for Robust, Accurate and Scalable Structure from Motion

Author: Pierre Moulon, Pascal Monasse, Renaud Marlet

Abstract: Multi-view structure from motion (SfM) estimates the position and orientation of pictures in a common 3D coordinate frame. When views are treated incrementally, this external calibration can be subject to drift, contrary to global methods that distribute residual errors evenly. We propose a new global calibration approach based on the fusion of relative motions between image pairs. We improve an existing method for robustly computing global rotations. We present an efficient a contrario trifocal tensor estimation method, from which stable and precise translation directions can be extracted. We also define an efficient translation registration method that recovers accurate camera positions. These components are combined into an original SfM pipeline. Our experiments show that, on most datasets, it outperforms in accuracy other existing incremental and global pipelines. It also achieves strikingly good running times: it is about 20 times faster than the other global method we could compare to, and as fast as the best incremental method. More importantly, it features better scalability properties.

3 0.82817984 138 iccv-2013-Efficient and Robust Large-Scale Rotation Averaging

Author: Avishek Chatterjee, Venu Madhav Govindu

Abstract: In this paper we address the problem of robust and efficient averaging of relative 3D rotations. Apart from having an interesting geometric structure, robust rotation averaging addresses the need for a good initialization for largescale optimization used in structure-from-motion pipelines. Such pipelines often use unstructured image datasets harvested from the internet thereby requiring an initialization method that is robust to outliers. Our approach works on the Lie group structure of 3D rotations and solves the problem of large-scale robust rotation averaging in two ways. Firstly, we use modern ?1 optimizers to carry out robust averaging of relative rotations that is efficient, scalable and robust to outliers. In addition, we also develop a twostep method that uses the ?1 solution as an initialisation for an iteratively reweighted least squares (IRLS) approach. These methods achieve excellent results on large-scale, real world datasets and significantly outperform existing methods, i.e. the state-of-the-art discrete-continuous optimization method of [3] as well as the Weiszfeld method of [8]. We demonstrate the efficacy of our method on two large- scale real world datasets and also provide the results of the two aforementioned methods for comparison.

4 0.73363721 115 iccv-2013-Direct Optimization of Frame-to-Frame Rotation

Author: Laurent Kneip, Simon Lynen

Abstract: This work makes use of a novel, recently proposed epipolar constraint for computing the relative pose between two calibrated images. By enforcing the coplanarity of epipolar plane normal vectors, it constrains the three degrees of freedom of the relative rotation between two camera views directly—independently of the translation. The present paper shows how the approach can be extended to n points, and translated into an efficient eigenvalue minimization over the three rotational degrees of freedom. Each iteration in the non-linear optimization has constant execution time, independently of the number of features. Two global optimization approaches are proposed. The first one consists of an efficient Levenberg-Marquardt scheme with randomized initial value, which already leads to stable and accurate results. The second scheme consists of a globally optimal branch-and-bound algorithm based on a bound on the eigenvalue variation derived from symmetric eigenvalue-perturbation theory. Analysis of the cost function reveals insights into the nature of a specific relative pose problem, and outlines the complexity under different conditions. The algorithm shows state-of-the-art performance w.r.t. essential-matrix based solutions, and a frameto-frame application to a video sequence immediately leads to an alternative, real-time visual odometry solution. Note: All algorithms in this paper are made available in the OpenGV library. Please visit http : / / l aurent kne ip .github . i / opengv o

5 0.68650275 348 iccv-2013-Refractive Structure-from-Motion on Underwater Images

Author: Anne Jordt-Sedlazeck, Reinhard Koch

Abstract: In underwater environments, cameras need to be confined in an underwater housing, viewing the scene through a piece of glass. In case of flat port underwater housings, light rays entering the camera housing are refracted twice, due to different medium densities of water, glass, and air. This causes the usually linear rays of light to bend and the commonly used pinhole camera model to be invalid. When using the pinhole camera model without explicitly modeling refraction in Structure-from-Motion (SfM) methods, a systematic model error occurs. Therefore, in this paper, we propose a system for computing camera path and 3D points with explicit incorporation of refraction using new methods for pose estimation. Additionally, a new error function is introduced for non-linear optimization, especially bundle adjustment. The proposed method allows to increase reconstruction accuracy and is evaluated in a set of experiments, where the proposed method’s performance is compared to SfM with the perspective camera model.

6 0.67952532 353 iccv-2013-Revisiting the PnP Problem: A Fast, General and Optimal Solution

7 0.6651991 90 iccv-2013-Content-Aware Rotation

8 0.65523303 152 iccv-2013-Extrinsic Camera Calibration without a Direct View Using Spherical Mirror

9 0.64488906 49 iccv-2013-An Enhanced Structure-from-Motion Paradigm Based on the Absolute Dual Quadric and Images of Circular Points

10 0.63668782 323 iccv-2013-Pose Estimation with Unknown Focal Length Using Points, Directions and Lines

11 0.63552701 185 iccv-2013-Go-ICP: Solving 3D Registration Efficiently and Globally Optimally

12 0.62636656 280 iccv-2013-Multi-view 3D Reconstruction from Uncalibrated Radially-Symmetric Cameras

13 0.60852015 141 iccv-2013-Enhanced Continuous Tabu Search for Parameter Estimation in Multiview Geometry

14 0.58592433 367 iccv-2013-SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels

15 0.56966239 397 iccv-2013-Space-Time Tradeoffs in Photo Sequencing

16 0.55725729 402 iccv-2013-Street View Motion-from-Structure-from-Motion

17 0.53243387 436 iccv-2013-Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach

18 0.52483219 342 iccv-2013-Real-Time Solution to the Absolute Pose Problem with Unknown Radial Distortion and Focal Length

19 0.52085739 27 iccv-2013-A Robust Analytical Solution to Isometric Shape-from-Template with Focal Length Calibration

20 0.5142771 407 iccv-2013-Subpixel Scanning Invariant to Indirect Lighting Using Quadratic Code Length


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(2, 0.058), (7, 0.037), (16, 0.206), (26, 0.044), (31, 0.04), (34, 0.017), (40, 0.011), (42, 0.215), (64, 0.025), (73, 0.033), (89, 0.185), (95, 0.019), (98, 0.028)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.9180764 177 iccv-2013-From Point to Set: Extend the Learning of Distance Metrics

Author: Pengfei Zhu, Lei Zhang, Wangmeng Zuo, David Zhang

Abstract: Most of the current metric learning methods are proposed for point-to-point distance (PPD) based classification. In many computer vision tasks, however, we need to measure the point-to-set distance (PSD) and even set-to-set distance (SSD) for classification. In this paper, we extend the PPD based Mahalanobis distance metric learning to PSD and SSD based ones, namely point-to-set distance metric learning (PSDML) and set-to-set distance metric learning (SSDML), and solve them under a unified optimization framework. First, we generate positive and negative sample pairs by computing the PSD and SSD between training samples. Then, we characterize each sample pair by its covariance matrix, and propose a covariance kernel based discriminative function. Finally, we tackle the PSDML and SSDMLproblems by using standard support vector machine solvers, making the metric learning very efficient for multiclass visual classification tasks. Experiments on gender classification, digit recognition, object categorization and face recognition show that the proposed metric learning methods can effectively enhance the performance of PSD and SSD based classification.

same-paper 2 0.85718 17 iccv-2013-A Global Linear Method for Camera Pose Registration

Author: Nianjuan Jiang, Zhaopeng Cui, Ping Tan

Abstract: We present a linear method for global camera pose registration from pairwise relative poses encoded in essential matrices. Our method minimizes an approximate geometric error to enforce the triangular relationship in camera triplets. This formulation does not suffer from the typical ‘unbalanced scale ’ problem in linear methods relying on pairwise translation direction constraints, i.e. an algebraic error; nor the system degeneracy from collinear motion. In the case of three cameras, our method provides a good linear approximation of the trifocal tensor. It can be directly scaled up to register multiple cameras. The results obtained are accurate for point triangulation and can serve as a good initialization for final bundle adjustment. We evaluate the algorithm performance with different types of data and demonstrate its effectiveness. Our system produces good accuracy, robustness, and outperforms some well-known systems on efficiency.

3 0.83512068 304 iccv-2013-PM-Huber: PatchMatch with Huber Regularization for Stereo Matching

Author: Philipp Heise, Sebastian Klose, Brian Jensen, Alois Knoll

Abstract: Most stereo correspondence algorithms match support windows at integer-valued disparities and assume a constant disparity value within the support window. The recently proposed PatchMatch stereo algorithm [7] overcomes this limitation of previous algorithms by directly estimating planes. This work presents a method that integrates the PatchMatch stereo algorithm into a variational smoothing formulation using quadratic relaxation. The resulting algorithm allows the explicit regularization of the disparity and normal gradients using the estimated plane parameters. Evaluation of our method in the Middlebury benchmark shows that our method outperforms the traditional integer-valued disparity strategy as well as the original algorithm and its variants in sub-pixel accurate disparity estimation.

4 0.82095885 188 iccv-2013-Group Sparsity and Geometry Constrained Dictionary Learning for Action Recognition from Depth Maps

Author: Jiajia Luo, Wei Wang, Hairong Qi

Abstract: Human action recognition based on the depth information provided by commodity depth sensors is an important yet challenging task. The noisy depth maps, different lengths of action sequences, and free styles in performing actions, may cause large intra-class variations. In this paper, a new framework based on sparse coding and temporal pyramid matching (TPM) is proposed for depthbased human action recognition. Especially, a discriminative class-specific dictionary learning algorithm isproposed for sparse coding. By adding the group sparsity and geometry constraints, features can be well reconstructed by the sub-dictionary belonging to the same class, and the geometry relationships among features are also kept in the calculated coefficients. The proposed approach is evaluated on two benchmark datasets captured by depth cameras. Experimental results show that the proposed algorithm repeatedly hqi } @ ut k . edu GB ImagesR epth ImagesD setkonlSy0 896.5170d4ept.3h021 .x02y 19.876504.dep3th02.1 x02. achieves superior performance to the state of the art algorithms. Moreover, the proposed dictionary learning method also outperforms classic dictionary learning approaches.

5 0.81416845 101 iccv-2013-DCSH - Matching Patches in RGBD Images

Author: Yaron Eshet, Simon Korman, Eyal Ofek, Shai Avidan

Abstract: We extend patch based methods to work on patches in 3D space. We start with Coherency Sensitive Hashing [12] (CSH), which is an algorithm for matching patches between two RGB images, and extend it to work with RGBD images. This is done by warping all 3D patches to a common virtual plane in which CSH is performed. To avoid noise due to warping of patches of various normals and depths, we estimate a group of dominant planes and compute CSH on each plane separately, before merging the matching patches. The result is DCSH - an algorithm that matches world (3D) patches in order to guide the search for image plane matches. An independent contribution is an extension of CSH, which we term Social-CSH. It allows a major speedup of the k nearest neighbor (kNN) version of CSH - its runtime growing linearly, rather than quadratically, in k. Social-CSH is used as a subcomponent of DCSH when many NNs are required, as in the case of image denoising. We show the benefits ofusing depth information to image reconstruction and image denoising, demonstrated on several RGBD images.

6 0.81236875 184 iccv-2013-Global Fusion of Relative Motions for Robust, Accurate and Scalable Structure from Motion

7 0.8109985 231 iccv-2013-Latent Multitask Learning for View-Invariant Action Recognition

8 0.80800998 46 iccv-2013-Allocentric Pose Estimation

9 0.80646235 54 iccv-2013-Attribute Pivots for Guiding Relevance Feedback in Image Search

10 0.80567425 259 iccv-2013-Manifold Based Face Synthesis from Sparse Samples

11 0.80521178 14 iccv-2013-A Generalized Iterated Shrinkage Algorithm for Non-convex Sparse Coding

12 0.80380708 52 iccv-2013-Attribute Adaptation for Personalized Image Search

13 0.80271471 392 iccv-2013-Similarity Metric Learning for Face Recognition

14 0.8021946 213 iccv-2013-Implied Feedback: Learning Nuances of User Behavior in Image Search

15 0.80173898 26 iccv-2013-A Practical Transfer Learning Algorithm for Face Verification

16 0.80131888 45 iccv-2013-Affine-Constrained Group Sparse Coding and Its Application to Image-Based Classifications

17 0.80129266 93 iccv-2013-Correlation Adaptive Subspace Segmentation by Trace Lasso

18 0.79984093 438 iccv-2013-Unsupervised Visual Domain Adaptation Using Subspace Alignment

19 0.79918587 106 iccv-2013-Deep Learning Identity-Preserving Face Space

20 0.79844344 79 iccv-2013-Coherent Object Detection with 3D Geometric Context from a Single Image