cvpr cvpr2013 cvpr2013-155 knowledge-graph by maker-knowledge-mining

155 cvpr-2013-Exploiting the Power of Stereo Confidences


Source: pdf

Author: David Pfeiffer, Stefan Gehrig, Nicolai Schneider

Abstract: Applications based on stereo vision are becoming increasingly common, ranging from gaming over robotics to driver assistance. While stereo algorithms have been investigated heavily both on the pixel and the application level, far less attention has been dedicated to the use of stereo confidence cues. Mostly, a threshold is applied to the confidence values for further processing, which is essentially a sparsified disparity map. This is straightforward but it does not take full advantage of the available information. In this paper, we make full use of the stereo confidence cues by propagating all confidence values along with the measured disparities in a Bayesian manner. Before using this information, a mapping from confidence values to disparity outlier probability rate is performed based on gathered disparity statistics from labeled video data. We present an extension of the so called Stixel World, a generic 3D intermediate representation that can serve as input for many of the applications mentioned above. This scheme is modified to directly exploit stereo confidence cues in the underlying sensor model during a maximum a poste- riori estimation process. The effectiveness of this step is verified in an in-depth evaluation on a large real-world traffic data base of which parts are made publicly available. We show that using stereo confidence cues allows both reducing the number of false object detections by a factor of six while keeping the detection rate at a near constant level.

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 com Abstract Applications based on stereo vision are becoming increasingly common, ranging from gaming over robotics to driver assistance. [sent-7, score-0.442]

2 While stereo algorithms have been investigated heavily both on the pixel and the application level, far less attention has been dedicated to the use of stereo confidence cues. [sent-8, score-1.058]

3 Mostly, a threshold is applied to the confidence values for further processing, which is essentially a sparsified disparity map. [sent-9, score-0.644]

4 In this paper, we make full use of the stereo confidence cues by propagating all confidence values along with the measured disparities in a Bayesian manner. [sent-11, score-1.15]

5 Before using this information, a mapping from confidence values to disparity outlier probability rate is performed based on gathered disparity statistics from labeled video data. [sent-12, score-1.16]

6 This scheme is modified to directly exploit stereo confidence cues in the underlying sensor model during a maximum a poste- riori estimation process. [sent-14, score-0.842]

7 We show that using stereo confidence cues allows both reducing the number of false object detections by a factor of six while keeping the detection rate at a near constant level. [sent-16, score-0.882]

8 In recent years, stereo algorithms and applications have matured significantly spawning products in fields ranging from industrial automation over gaming up to driver assistance systems. [sent-19, score-0.476]

9 The underlying stereo algorithms and their properties are well understood, at least for the current real-time algorithms, typically approaches based on correlation [20] or semi-global matching (SGM) [10]. [sent-20, score-0.352]

10 that compare stereo algorithms on a 100 % density level are available [19], also for the automotive domain [8]. [sent-23, score-0.394]

11 The computation of stereo confidences has only recently been researched in more detail. [sent-24, score-0.551]

12 Hu and Mordohai [12] performed an excellent review ofknown stereo confidence metrics comparing them to ground truth scenes on a pixel level. [sent-25, score-0.872]

13 In related work on confidence estimation for stereo or optical flow computation, the so called sparsification plots are established as the main method to show the effectiveness of the considered confidence metric. [sent-26, score-1.187]

14 This procedure gives a good impression with respect to how well the confidence helps reducing the average error of the disparity map when the least confident values are removed. [sent-27, score-0.682]

15 However, no explicit use of both the disparity map and the confidence map in further processing has been reported so far. [sent-28, score-0.644]

16 The main objective is to robustly extract free space and obstacle information from dense disparity maps and to represent the results in a compact and simple fashion. [sent-30, score-0.33]

17 This paper extends our Bayesian approach [17] to use stereo confidence cues. [sent-36, score-0.706]

18 The idea is that each disparity mea222999777 surement is given an individual probability to be an outlier. [sent-37, score-0.328]

19 The effectiveness of this procedure is evaluated on a large sequence data base containing different adverse scenarios for the stereo sensor setup. [sent-40, score-0.564]

20 To round things off, the performance is also compared against the straightforward way of using sparsification on the disparity map. [sent-41, score-0.443]

21 The main contribution of this paper is the first-time fully probabilistic usage of stereo confidences along with the disparity map. [sent-42, score-0.813]

22 Moreover, we introduce modified stereo confidence metrics suited for global stereo algorithms, and link confidence values to disparity outlier probabilities. [sent-43, score-1.952]

23 The paper is organized as follows: Section 2 describes related work to the field of stereo confidence estimation. [sent-44, score-0.706]

24 We limit ourselves to references that inspired our confidence metrics. [sent-45, score-0.354]

25 In addition, work that makes use of stereo confidences in subsequent processing is analyzed. [sent-46, score-0.523]

26 Besides stereo confidence we also review work on 3D intermediate representations. [sent-47, score-0.706]

27 Section 3 encourages our selection of stereo confidence metrics and their modifications for our applica- tion. [sent-48, score-0.816]

28 The resulting confidence values are mapped to outlier probabilities which is described in Section 4. [sent-49, score-0.494]

29 In Section 5, the Stixel World is introduced, followed by the extension to use stereo confidences, also for further applications, in the subsequent Sections 6 and 7. [sent-50, score-0.352]

30 Related Work Stereo confidence computation has recently attracted rising attention [4, 9, 12]. [sent-54, score-0.354]

31 So far, most work on stereo confidences focused on local stereo approaches. [sent-55, score-0.875]

32 [9] applied some of these confidence metrics to SGM. [sent-57, score-0.464]

33 In [12], Hu and Mordohai provide an extensive review of existing stereo confidence metrics, again using local correlation as the underlying stereo method. [sent-58, score-1.058]

34 To this end, the disparity values are sorted according to their confidence values. [sent-60, score-0.644]

35 Subsequently, those depth measurements with the lowest confidence are dropped and a new error metric is calculated for the remaining pixels. [sent-61, score-0.418]

36 Milella and Siegwart [15] explicitly compute stereo confidence and eliminate less confident matches for the use in an iterated closest point (ICP) algorithm for ego-motion estimation. [sent-66, score-0.706]

37 [26] also compute stereo confidence and eliminate less reliable matches by thresholding on the confidence. [sent-68, score-0.706]

38 In addition, the confidence value is used as a weight in plane fitting for 3D reconstruction. [sent-69, score-0.354]

39 the precision of a stereo measurement) has been incorporated several times into occupancy grid approaches where obstacles are mapped onto a grid structure (e. [sent-72, score-0.455]

40 and our approach make explicit use of a detailed sensor model by pre- cisely taking the measurement noise and outlier characteristic of the particular sensor setup into account, till now, no approach has taken advantage of stereo confidence cues. [sent-86, score-1.09]

41 Stereo Confidence Metrics The stereo confidence metrics introduced in [12] have been investigated in conjunction with a local stereo method. [sent-88, score-1.168]

42 In addition, we perform the well-known leftright consistency (LRC) check that was shown to be very effective for high stereo densities [9]. [sent-93, score-0.352]

43 The lower rows visualize the particular stereo confidence cues which is b) LC, c) PKRN, and d) MLM. [sent-109, score-0.749]

44 The brighter a pixel is, the higher is the confidence that the depth measurement is correct. [sent-110, score-0.437]

45 We exclude very similar, adjacent costs to not penalize disparity results around half integer values. [sent-116, score-0.315]

46 Although the PKRN modifications slightly violate the confidence ordering of the original metric, they have the following advantages over the original counterpart: The rare case of a singularity with a denominator of zero is avoided. [sent-120, score-0.354]

47 (3) The costs C+ and C− are adjacent to the optimal disparity Cmin. [sent-127, score-0.315]

48 The results of both the stereo matching and the confidence metrics are illustrated in Figure 2. [sent-129, score-0.816]

49 For this purpose, two different situations are shown that pose quite a challenge for the stereo matching. [sent-130, score-0.352]

50 The first one exhibits strong textural patterns caused by scattered sunlight in the windshield which clearly misleads the stereo estimation. [sent-131, score-0.409]

51 For practical reasons, all confidence metrics are scaled and bound to the interval [0 . [sent-133, score-0.464]

52 Using Confidences as Outlier Probabilities Before turning to the core topic ofthis section it is important to establish a common conception of confidence metrics as discussed in this work. [sent-139, score-0.464]

53 Instead of simply computing the disparity measurement d for a pixel, we assume the used stereo scheme to output pairs of values (d, c), with d ∈ D and c ∈ [0, 1] . [sent-140, score-0.7]

54 D = [0, 127] , and c is the corresponding confidence value of d. [sent-144, score-0.354]

55 d It is to be expected that this value c strongly depends on various aspects: Foremost the confidence metric itself (i. [sent-148, score-0.393]

56 PKRN, LC, or MLM), the used stereo scheme (in our case SGM), and the corresponding parameter choice for that particular stereo scheme. [sent-150, score-0.704]

57 Since we intend to use confidences in a probabilistic framework for Stixel computation, a mapping from the particular confidence metric to an actual outlier probability is required. [sent-151, score-0.741]

58 It works with the same type of sensor data and stereo algorithm that we later run our vision algorithms on. [sent-162, score-0.445]

59 Similar to using the ground truth data, the underlying idea is straightforward: a human inspector annotates regions in the stereo map using the binary labels “inlier” and ”outlier”. [sent-163, score-0.408]

60 ch eNmoete ( tohra stt tehreiso parameter c bheoi dcoen) e b outn liys independent of the used confidence metric. [sent-166, score-0.354]

61 The right side of Figure 3 shows the obtained confidence mapping p (o | c). [sent-179, score-0.391]

62 The dashed line is the expected ground profile and the disparity measurement vector is marked using purple. [sent-229, score-0.406]

63 8 Figure 3: The left figure shows the confidence distribution with respect to the manual labeling of the disparity values (training data) into “inlier” and ”outlier”. [sent-235, score-0.644]

64 The Stixel computation is formulated as a MAP estimation problem, this way ensuring to obtain the best segmentation result for the given stereo input. [sent-245, score-0.352]

65 Modeling all segments as piecewise planar surfaces simplifies the function set fn to linear functions: object segments are assumed to have a constant disparity while ground segments follow the disparity gradient of the ground surface. [sent-255, score-0.642]

66 Since this paper discusses how to efficiently take confidence cues into account, particular emphasis is put on the data term. [sent-264, score-0.397]

67 y for a single disparity mea|s surement dv at image row coordinate v to belong to a possible Stixel segment sn. [sent-291, score-0.411]

68 333000111 numerous false positives (mostly red) on the ground surface caused by stereo matching. [sent-298, score-0.527]

69 Using Confidences for Stixel Computation For obtaining a more measurement specific outlier model, stereo confidence cues are used. [sent-300, score-0.947]

70 As discussed in Section 4, these confidence cues are not used directly but are mapped to an outlier probability. [sent-301, score-0.537]

71 Equation 9) is straightforward: Instead of processing the plain disparity measurement dv in PD, the tuple (dv ,pv) is used which is the disparity measurement dv along with the corresponding outlier probability pv. [sent-304, score-1.002]

72 That decision is for two reasons: Firstly, when not providing stereo confidence cues, using pv = 0 yields the original sensor model of Equation 9. [sent-313, score-0.832]

73 Secondly and more crucial for our application, in awareness that the stereo confidence cue might not always be correct (i. [sent-314, score-0.706]

74 No false positives are visible for the confidence version whereas many false positives occurred in the original version. [sent-322, score-0.642]

75 Using Confidences for Other Applications Confidences are helpful for further applications driven by stereo vision. [sent-324, score-0.352]

76 The following popular stereo-based tasks are easily extended to use confidence cues: Occupancy map generation: The disparities are triangulated and registered in a map. [sent-325, score-0.401]

77 com/ground-truth-stixel-dataset 333000222 stereo – – – confidence cues in the sensor model allows to remove nearly all false positives while the detection rate is kept high. [sent-346, score-1.035]

78 Base line: SGM stereo and Stixels are computed according to [17]. [sent-347, score-0.352]

79 Sparsification: SGM and the proposed confidence metrics are computed. [sent-349, score-0.464]

80 A manually optimized confidence threshold is applied for discarding all depth measurements with a lower confidence from the disparity map. [sent-350, score-1.023]

81 Stixels with confidences: SGM and the proposed confidence metrics are computed. [sent-352, score-0.464]

82 Subsequently, we transfer both the disparity and the confidence map to the Stixel engine as described in Section 6. [sent-353, score-0.644]

83 Any differences purely result from using the confidence metrics and the way how they are taken into account. [sent-356, score-0.464]

84 When using sparsification on the disparity input, the number of false positives is reduced to 637 with LC, 758 with PKRN, and 648 with MLM. [sent-362, score-0.561]

85 The results when using confidence cues as suggested are as follows: On the large data set we obtain 360 frames with false positives when using LC, 719 with PKRN, and 301 in case of MLM. [sent-372, score-0.574]

86 Problems with repetitive structures, a known shortcoming of local stereo methods that is detected with PKRN and MLM, are rarely observed when using SGM. [sent-380, score-0.352]

87 In conclusion, exploiting stereo confidences throughout the whole processing chain clearly proves to have a positive effect. [sent-381, score-0.523]

88 Conclusions and Outlook In this contribution, we presented an improvement of the state-of-the-art 3D Stixel intermediate representation by exploiting stereo confidence information in a probabilistic fashion. [sent-385, score-0.706]

89 It is shown that the intuitive approach to sparsify the disparity maps based on confidence allows to reduce the false positive rate by a factor of three. [sent-386, score-0.777]

90 The same holds true for integrating confidence information into the subsequent Stixel processing step. [sent-391, score-0.354]

91 Also, when using Stixels with motion information, the identical concept can be applied for using optical flow confidence information. [sent-394, score-0.354]

92 78 890 127415 Table 1: For evaluating our extension of the Stixel computation scheme, we considered three different stereo confidence metrics and compared against both the base line approach of Pfeiffer et al. [sent-400, score-0.912]

93 and the straight-forward way of using sparsification of the disparity map. [sent-401, score-0.417]

94 A stereo confidence metric using single view imagery with comparison to five alternative approaches. [sent-424, score-0.745]

95 Sensor integration for robot navigation: Combining sonar and stereo range data in a gridbased representation. [sent-430, score-0.352]

96 A real-time low-power stereo vision engine using semi-global matching. [sent-443, score-0.352]

97 Analysis of KITTI data for stereo analysis with stereo confidence measures. [sent-456, score-1.058]

98 Accurate and efficient stereo processing by semi-global matching and mutual information. [sent-460, score-0.352]

99 Calculating dense disparity maps from color stereo images, an efficient implementation. [sent-512, score-0.642]

100 A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. [sent-534, score-0.352]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('stixel', 0.457), ('confidence', 0.354), ('stereo', 0.352), ('disparity', 0.29), ('pkrn', 0.248), ('mlm', 0.191), ('confidences', 0.171), ('outlier', 0.14), ('sparsification', 0.127), ('stixels', 0.114), ('metrics', 0.11), ('lc', 0.106), ('sensor', 0.093), ('false', 0.084), ('dv', 0.083), ('gehrig', 0.078), ('vnt', 0.076), ('occupancy', 0.075), ('sgm', 0.074), ('base', 0.069), ('badino', 0.068), ('positives', 0.06), ('driver', 0.06), ('inlier', 0.059), ('measurement', 0.058), ('vnb', 0.057), ('windshield', 0.057), ('pd', 0.053), ('pfeiffer', 0.051), ('daimler', 0.051), ('franke', 0.051), ('pout', 0.051), ('world', 0.05), ('rate', 0.049), ('germany', 0.048), ('disparities', 0.047), ('cues', 0.043), ('gallup', 0.042), ('automotive', 0.042), ('icvs', 0.042), ('free', 0.04), ('sn', 0.039), ('metric', 0.039), ('clmcin', 0.038), ('cmmilnm', 0.038), ('cpmkinrn', 0.038), ('daimle', 0.038), ('ege', 0.038), ('fpframes', 0.038), ('milella', 0.038), ('minneapolis', 0.038), ('planetary', 0.038), ('pomuint', 0.038), ('rover', 0.038), ('sindelfingen', 0.038), ('surement', 0.038), ('utterly', 0.038), ('impression', 0.038), ('mapping', 0.037), ('june', 0.037), ('rain', 0.035), ('assistance', 0.034), ('lrc', 0.034), ('haeusler', 0.034), ('usa', 0.034), ('frames', 0.033), ('pv', 0.033), ('belgium', 0.031), ('aodha', 0.031), ('hirschm', 0.031), ('ground', 0.031), ('icp', 0.03), ('mac', 0.03), ('gaming', 0.03), ('traffic', 0.029), ('october', 0.029), ('researched', 0.028), ('providence', 0.028), ('indoors', 0.028), ('benenson', 0.028), ('obstacles', 0.028), ('weather', 0.027), ('mordohai', 0.027), ('corridor', 0.027), ('line', 0.027), ('straightforward', 0.026), ('san', 0.026), ('dl', 0.026), ('subsequently', 0.026), ('italy', 0.026), ('findings', 0.025), ('adverse', 0.025), ('unsolved', 0.025), ('francisco', 0.025), ('safe', 0.025), ('truth', 0.025), ('po', 0.025), ('scenarios', 0.025), ('costs', 0.025), ('depth', 0.025)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99999887 155 cvpr-2013-Exploiting the Power of Stereo Confidences

Author: David Pfeiffer, Stefan Gehrig, Nicolai Schneider

Abstract: Applications based on stereo vision are becoming increasingly common, ranging from gaming over robotics to driver assistance. While stereo algorithms have been investigated heavily both on the pixel and the application level, far less attention has been dedicated to the use of stereo confidence cues. Mostly, a threshold is applied to the confidence values for further processing, which is essentially a sparsified disparity map. This is straightforward but it does not take full advantage of the available information. In this paper, we make full use of the stereo confidence cues by propagating all confidence values along with the measured disparities in a Bayesian manner. Before using this information, a mapping from confidence values to disparity outlier probability rate is performed based on gathered disparity statistics from labeled video data. We present an extension of the so called Stixel World, a generic 3D intermediate representation that can serve as input for many of the applications mentioned above. This scheme is modified to directly exploit stereo confidence cues in the underlying sensor model during a maximum a poste- riori estimation process. The effectiveness of this step is verified in an in-depth evaluation on a large real-world traffic data base of which parts are made publicly available. We show that using stereo confidence cues allows both reducing the number of false object detections by a factor of six while keeping the detection rate at a near constant level.

2 0.50374079 147 cvpr-2013-Ensemble Learning for Confidence Measures in Stereo Vision

Author: Ralf Haeusler, Rahul Nair, Daniel Kondermann

Abstract: With the aim to improve accuracy of stereo confidence measures, we apply the random decision forest framework to a large set of diverse stereo confidence measures. Learning and testing sets were drawnfrom the recently introduced KITTI dataset, which currently poses higher challenges to stereo solvers than other benchmarks with ground truth for stereo evaluation. We experiment with semi global matching stereo (SGM) and a census dataterm, which is the best performing realtime capable stereo method known to date. On KITTI images, SGM still produces a significant amount of error. We obtain consistently improved area under curve values of sparsification measures in comparison to best performing single stereo confidence measures where numbers of stereo errors are large. More specifically, our method performs best in all but one out of 194 frames of the KITTI dataset.

3 0.20858274 181 cvpr-2013-Fusing Depth from Defocus and Stereo with Coded Apertures

Author: Yuichi Takeda, Shinsaku Hiura, Kosuke Sato

Abstract: In this paper we propose a novel depth measurement method by fusing depth from defocus (DFD) and stereo. One of the problems of passive stereo method is the difficulty of finding correct correspondence between images when an object has a repetitive pattern or edges parallel to the epipolar line. On the other hand, the accuracy of DFD method is inherently limited by the effective diameter of the lens. Therefore, we propose the fusion of stereo method and DFD by giving different focus distances for left and right cameras of a stereo camera with coded apertures. Two types of depth cues, defocus and disparity, are naturally integrated by the magnification and phase shift of a single point spread function (PSF) per camera. In this paper we give the proof of the proportional relationship between the diameter of defocus and disparity which makes the calibration easy. We also show the outstanding performance of our method which has both advantages of two depth cues through simulation and actual experiments.

4 0.19235854 431 cvpr-2013-The Variational Structure of Disparity and Regularization of 4D Light Fields

Author: Bastian Goldluecke, Sven Wanner

Abstract: Unlike traditional images which do not offer information for different directions of incident light, a light field is defined on ray space, and implicitly encodes scene geometry data in a rich structure which becomes visible on its epipolar plane images. In this work, we analyze regularization of light fields in variational frameworks and show that their variational structure is induced by disparity, which is in this context best understood as a vector field on epipolar plane image space. We derive differential constraints on this vector field to enable consistent disparity map regularization. Furthermore, we show how the disparity field is related to the regularization of more general vector-valued functions on the 4D ray space of the light field. This way, we derive an efficient variational framework with convex priors, which can serve as a fundament for a large class of inverse problems on ray space.

5 0.18122736 384 cvpr-2013-Segment-Tree Based Cost Aggregation for Stereo Matching

Author: Xing Mei, Xun Sun, Weiming Dong, Haitao Wang, Xiaopeng Zhang

Abstract: This paper presents a novel tree-based cost aggregation method for dense stereo matching. Instead of employing the minimum spanning tree (MST) and its variants, a new tree structure, ”Segment-Tree ”, is proposed for non-local matching cost aggregation. Conceptually, the segment-tree is constructed in a three-step process: first, the pixels are grouped into a set of segments with the reference color or intensity image; second, a tree graph is created for each segment; and in the final step, these independent segment graphs are linked to form the segment-tree structure. In practice, this tree can be efficiently built in time nearly linear to the number of the image pixels. Compared to MST where the graph connectivity is determined with local edge weights, our method introduces some ’non-local’ decision rules: the pixels in one perceptually consistent segment are more likely to share similar disparities, and therefore their connectivity within the segment should be first enforced in the tree construction process. The matching costs are then aggregated over the tree within two passes. Performance evaluation on 19 Middlebury data sets shows that the proposed method is comparable to previous state-of-the-art aggregation methods in disparity accuracy and processing speed. Furthermore, the tree structure can be refined with the estimated disparities, which leads to consistent scene segmentation and significantly better aggregation results.

6 0.14350784 345 cvpr-2013-Real-Time Model-Based Rigid Object Pose Estimation and Tracking Combining Dense and Sparse Visual Cues

7 0.12116648 318 cvpr-2013-Optimized Pedestrian Detection for Multiple and Occluded People

8 0.11973462 326 cvpr-2013-Patch Match Filter: Efficient Edge-Aware Filtering Meets Randomized Search for Fast Correspondence Field Estimation

9 0.11803633 219 cvpr-2013-In Defense of 3D-Label Stereo

10 0.11556447 362 cvpr-2013-Robust Monocular Epipolar Flow Estimation

11 0.11075393 373 cvpr-2013-SWIGS: A Swift Guided Sampling Method

12 0.098829016 352 cvpr-2013-Recovering Stereo Pairs from Anaglyphs

13 0.092864916 284 cvpr-2013-Mesh Based Semantic Modelling for Indoor and Outdoor Scenes

14 0.089287832 21 cvpr-2013-A New Perspective on Uncalibrated Photometric Stereo

15 0.085124105 188 cvpr-2013-Globally Consistent Multi-label Assignment on the Ray Space of 4D Light Fields

16 0.080172986 117 cvpr-2013-Detecting Changes in 3D Structure of a Scene from Multi-view Images Captured by a Vehicle-Mounted Camera

17 0.072653852 217 cvpr-2013-Improving an Object Detector and Extracting Regions Using Superpixels

18 0.072397709 430 cvpr-2013-The SVM-Minus Similarity Score for Video Face Recognition

19 0.07093899 365 cvpr-2013-Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities

20 0.068614602 245 cvpr-2013-Layer Depth Denoising and Completion for Structured-Light RGB-D Cameras


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.159), (1, 0.123), (2, 0.024), (3, 0.013), (4, 0.03), (5, -0.061), (6, -0.004), (7, 0.014), (8, 0.012), (9, 0.076), (10, -0.0), (11, 0.036), (12, 0.193), (13, 0.014), (14, -0.099), (15, 0.054), (16, -0.19), (17, -0.169), (18, 0.108), (19, -0.051), (20, -0.123), (21, 0.101), (22, 0.213), (23, 0.203), (24, -0.103), (25, -0.115), (26, -0.008), (27, -0.091), (28, 0.05), (29, 0.011), (30, 0.011), (31, 0.092), (32, -0.183), (33, 0.009), (34, 0.019), (35, 0.038), (36, -0.045), (37, 0.06), (38, 0.016), (39, -0.039), (40, 0.098), (41, 0.013), (42, 0.062), (43, 0.002), (44, -0.071), (45, 0.062), (46, 0.044), (47, 0.015), (48, -0.061), (49, -0.032)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.97182739 155 cvpr-2013-Exploiting the Power of Stereo Confidences

Author: David Pfeiffer, Stefan Gehrig, Nicolai Schneider

Abstract: Applications based on stereo vision are becoming increasingly common, ranging from gaming over robotics to driver assistance. While stereo algorithms have been investigated heavily both on the pixel and the application level, far less attention has been dedicated to the use of stereo confidence cues. Mostly, a threshold is applied to the confidence values for further processing, which is essentially a sparsified disparity map. This is straightforward but it does not take full advantage of the available information. In this paper, we make full use of the stereo confidence cues by propagating all confidence values along with the measured disparities in a Bayesian manner. Before using this information, a mapping from confidence values to disparity outlier probability rate is performed based on gathered disparity statistics from labeled video data. We present an extension of the so called Stixel World, a generic 3D intermediate representation that can serve as input for many of the applications mentioned above. This scheme is modified to directly exploit stereo confidence cues in the underlying sensor model during a maximum a poste- riori estimation process. The effectiveness of this step is verified in an in-depth evaluation on a large real-world traffic data base of which parts are made publicly available. We show that using stereo confidence cues allows both reducing the number of false object detections by a factor of six while keeping the detection rate at a near constant level.

2 0.93768519 147 cvpr-2013-Ensemble Learning for Confidence Measures in Stereo Vision

Author: Ralf Haeusler, Rahul Nair, Daniel Kondermann

Abstract: With the aim to improve accuracy of stereo confidence measures, we apply the random decision forest framework to a large set of diverse stereo confidence measures. Learning and testing sets were drawnfrom the recently introduced KITTI dataset, which currently poses higher challenges to stereo solvers than other benchmarks with ground truth for stereo evaluation. We experiment with semi global matching stereo (SGM) and a census dataterm, which is the best performing realtime capable stereo method known to date. On KITTI images, SGM still produces a significant amount of error. We obtain consistently improved area under curve values of sparsification measures in comparison to best performing single stereo confidence measures where numbers of stereo errors are large. More specifically, our method performs best in all but one out of 194 frames of the KITTI dataset.

3 0.76344782 384 cvpr-2013-Segment-Tree Based Cost Aggregation for Stereo Matching

Author: Xing Mei, Xun Sun, Weiming Dong, Haitao Wang, Xiaopeng Zhang

Abstract: This paper presents a novel tree-based cost aggregation method for dense stereo matching. Instead of employing the minimum spanning tree (MST) and its variants, a new tree structure, ”Segment-Tree ”, is proposed for non-local matching cost aggregation. Conceptually, the segment-tree is constructed in a three-step process: first, the pixels are grouped into a set of segments with the reference color or intensity image; second, a tree graph is created for each segment; and in the final step, these independent segment graphs are linked to form the segment-tree structure. In practice, this tree can be efficiently built in time nearly linear to the number of the image pixels. Compared to MST where the graph connectivity is determined with local edge weights, our method introduces some ’non-local’ decision rules: the pixels in one perceptually consistent segment are more likely to share similar disparities, and therefore their connectivity within the segment should be first enforced in the tree construction process. The matching costs are then aggregated over the tree within two passes. Performance evaluation on 19 Middlebury data sets shows that the proposed method is comparable to previous state-of-the-art aggregation methods in disparity accuracy and processing speed. Furthermore, the tree structure can be refined with the estimated disparities, which leads to consistent scene segmentation and significantly better aggregation results.

4 0.72265339 181 cvpr-2013-Fusing Depth from Defocus and Stereo with Coded Apertures

Author: Yuichi Takeda, Shinsaku Hiura, Kosuke Sato

Abstract: In this paper we propose a novel depth measurement method by fusing depth from defocus (DFD) and stereo. One of the problems of passive stereo method is the difficulty of finding correct correspondence between images when an object has a repetitive pattern or edges parallel to the epipolar line. On the other hand, the accuracy of DFD method is inherently limited by the effective diameter of the lens. Therefore, we propose the fusion of stereo method and DFD by giving different focus distances for left and right cameras of a stereo camera with coded apertures. Two types of depth cues, defocus and disparity, are naturally integrated by the magnification and phase shift of a single point spread function (PSF) per camera. In this paper we give the proof of the proportional relationship between the diameter of defocus and disparity which makes the calibration easy. We also show the outstanding performance of our method which has both advantages of two depth cues through simulation and actual experiments.

5 0.6582424 219 cvpr-2013-In Defense of 3D-Label Stereo

Author: Carl Olsson, Johannes Ulén, Yuri Boykov

Abstract: It is commonly believed that higher order smoothness should be modeled using higher order interactions. For example, 2nd order derivatives for deformable (active) contours are represented by triple cliques. Similarly, the 2nd order regularization methods in stereo predominantly use MRF models with scalar (1D) disparity labels and triple clique interactions. In this paper we advocate a largely overlooked alternative approach to stereo where 2nd order surface smoothness is represented by pairwise interactions with 3D-labels, e.g. tangent planes. This general paradigm has been criticized due to perceived computational complexity of optimization in higher-dimensional label space. Contrary to popular beliefs, we demonstrate that representing 2nd order surface smoothness with 3D labels leads to simpler optimization problems with (nearly) submodular pairwise interactions. Our theoretical and experimental re- sults demonstrate advantages over state-of-the-art methods for 2nd order smoothness stereo. 1

6 0.56806332 431 cvpr-2013-The Variational Structure of Disparity and Regularization of 4D Light Fields

7 0.56667352 326 cvpr-2013-Patch Match Filter: Efficient Edge-Aware Filtering Meets Randomized Search for Fast Correspondence Field Estimation

8 0.46428162 362 cvpr-2013-Robust Monocular Epipolar Flow Estimation

9 0.43840563 352 cvpr-2013-Recovering Stereo Pairs from Anaglyphs

10 0.42513514 373 cvpr-2013-SWIGS: A Swift Guided Sampling Method

11 0.41743386 188 cvpr-2013-Globally Consistent Multi-label Assignment on the Ray Space of 4D Light Fields

12 0.41317242 274 cvpr-2013-Lost! Leveraging the Crowd for Probabilistic Visual Self-Localization

13 0.37845036 345 cvpr-2013-Real-Time Model-Based Rigid Object Pose Estimation and Tracking Combining Dense and Sparse Visual Cues

14 0.36090457 21 cvpr-2013-A New Perspective on Uncalibrated Photometric Stereo

15 0.33089972 15 cvpr-2013-A Lazy Man's Approach to Benchmarking: Semisupervised Classifier Evaluation and Recalibration

16 0.31789681 318 cvpr-2013-Optimized Pedestrian Detection for Multiple and Occluded People

17 0.30168894 128 cvpr-2013-Discrete MRF Inference of Marginal Densities for Non-uniformly Discretized Variable Space

18 0.29733664 467 cvpr-2013-Wide-Baseline Hair Capture Using Strand-Based Refinement

19 0.28754336 284 cvpr-2013-Mesh Based Semantic Modelling for Indoor and Outdoor Scenes

20 0.2838324 112 cvpr-2013-Dense Segmentation-Aware Descriptors


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(10, 0.104), (16, 0.061), (26, 0.034), (28, 0.025), (33, 0.191), (67, 0.041), (69, 0.058), (86, 0.224), (87, 0.145), (97, 0.025)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.7923587 155 cvpr-2013-Exploiting the Power of Stereo Confidences

Author: David Pfeiffer, Stefan Gehrig, Nicolai Schneider

Abstract: Applications based on stereo vision are becoming increasingly common, ranging from gaming over robotics to driver assistance. While stereo algorithms have been investigated heavily both on the pixel and the application level, far less attention has been dedicated to the use of stereo confidence cues. Mostly, a threshold is applied to the confidence values for further processing, which is essentially a sparsified disparity map. This is straightforward but it does not take full advantage of the available information. In this paper, we make full use of the stereo confidence cues by propagating all confidence values along with the measured disparities in a Bayesian manner. Before using this information, a mapping from confidence values to disparity outlier probability rate is performed based on gathered disparity statistics from labeled video data. We present an extension of the so called Stixel World, a generic 3D intermediate representation that can serve as input for many of the applications mentioned above. This scheme is modified to directly exploit stereo confidence cues in the underlying sensor model during a maximum a poste- riori estimation process. The effectiveness of this step is verified in an in-depth evaluation on a large real-world traffic data base of which parts are made publicly available. We show that using stereo confidence cues allows both reducing the number of false object detections by a factor of six while keeping the detection rate at a near constant level.

2 0.76866931 441 cvpr-2013-Tracking Sports Players with Context-Conditioned Motion Models

Author: Jingchen Liu, Peter Carr, Robert T. Collins, Yanxi Liu

Abstract: We employ hierarchical data association to track players in team sports. Player movements are often complex and highly correlated with both nearby and distant players. A single model would require many degrees of freedom to represent the full motion diversity and could be difficult to use in practice. Instead, we introduce a set of Game Context Features extracted from noisy detections to describe the current state of the match, such as how the players are spatially distributed. Our assumption is that players react to the current situation in only a finite number of ways. As a result, we are able to select an appropriate simplified affinity model for each player and time instant using a random decisionforest based on current track and game contextfeatures. Our context-conditioned motion models implicitly incorporate complex inter-object correlations while remaining tractable. We demonstrate significant performance improvements over existing multi-target tracking algorithms on basketball and field hockey sequences several minutes in duration and containing 10 and 20 players respectively.

3 0.74386621 209 cvpr-2013-Hypergraphs for Joint Multi-view Reconstruction and Multi-object Tracking

Author: Martin Hofmann, Daniel Wolf, Gerhard Rigoll

Abstract: We generalize the network flow formulation for multiobject tracking to multi-camera setups. In the past, reconstruction of multi-camera data was done as a separate extension. In this work, we present a combined maximum a posteriori (MAP) formulation, which jointly models multicamera reconstruction as well as global temporal data association. A flow graph is constructed, which tracks objects in 3D world space. The multi-camera reconstruction can be efficiently incorporated as additional constraints on the flow graph without making the graph unnecessarily large. The final graph is efficiently solved using binary linear programming. On the PETS 2009 dataset we achieve results that significantly exceed the current state of the art.

4 0.74181581 337 cvpr-2013-Principal Observation Ray Calibration for Tiled-Lens-Array Integral Imaging Display

Author: Weiming Li, Haitao Wang, Mingcai Zhou, Shandong Wang, Shaohui Jiao, Xing Mei, Tao Hong, Hoyoung Lee, Jiyeun Kim

Abstract: Integral imaging display (IID) is a promising technology to provide realistic 3D image without glasses. To achieve a large screen IID with a reasonable fabrication cost, a potential solution is a tiled-lens-array IID (TLA-IID). However, TLA-IIDs are subject to 3D image artifacts when there are even slight misalignments between the lens arrays. This work aims at compensating these artifacts by calibrating the lens array poses with a camera and including them in a ray model used for rendering the 3D image. Since the lens arrays are transparent, this task is challenging for traditional calibration methods. In this paper, we propose a novel calibration method based on defining a set of principle observation rays that pass lens centers of the TLA and the camera ’s optical center. The method is able to determine the lens array poses with only one camera at an arbitrary unknown position without using any additional markers. The principle observation rays are automatically extracted using a structured light based method from a dense correspondence map between the displayed and captured . pixels. . com, Experiments show that lens array misalignments xme i nlpr . ia . ac . cn @ can be estimated with a standard deviation smaller than 0.4 pixels. Based on this, 3D image artifacts are shown to be effectively removed in a test TLA-IID with challenging misalignments.

5 0.73785013 274 cvpr-2013-Lost! Leveraging the Crowd for Probabilistic Visual Self-Localization

Author: Marcus A. Brubaker, Andreas Geiger, Raquel Urtasun

Abstract: In this paper we propose an affordable solution to selflocalization, which utilizes visual odometry and road maps as the only inputs. To this end, we present a probabilistic model as well as an efficient approximate inference algorithm, which is able to utilize distributed computation to meet the real-time requirements of autonomous systems. Because of the probabilistic nature of the model we are able to cope with uncertainty due to noisy visual odometry and inherent ambiguities in the map (e.g., in a Manhattan world). By exploiting freely available, community developed maps and visual odometry measurements, we are able to localize a vehicle up to 3m after only a few seconds of driving on maps which contain more than 2,150km of drivable roads.

6 0.73773718 39 cvpr-2013-Alternating Decision Forests

7 0.7372449 222 cvpr-2013-Incorporating User Interaction and Topological Constraints within Contour Completion via Discrete Calculus

8 0.73617363 107 cvpr-2013-Deformable Spatial Pyramid Matching for Fast Dense Correspondences

9 0.73471707 50 cvpr-2013-Augmenting CRFs with Boltzmann Machine Shape Priors for Image Labeling

10 0.73190027 230 cvpr-2013-Joint 3D Scene Reconstruction and Class Segmentation

11 0.72960252 104 cvpr-2013-Deep Convolutional Network Cascade for Facial Point Detection

12 0.72922081 298 cvpr-2013-Multi-scale Curve Detection on Surfaces

13 0.7286827 125 cvpr-2013-Dictionary Learning from Ambiguously Labeled Data

14 0.72779918 365 cvpr-2013-Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities

15 0.72771859 354 cvpr-2013-Relative Volume Constraints for Single View 3D Reconstruction

16 0.72131747 443 cvpr-2013-Uncalibrated Photometric Stereo for Unknown Isotropic Reflectances

17 0.72103727 71 cvpr-2013-Boundary Cues for 3D Object Shape Recovery

18 0.7193538 19 cvpr-2013-A Minimum Error Vanishing Point Detection Approach for Uncalibrated Monocular Images of Man-Made Environments

19 0.71917909 147 cvpr-2013-Ensemble Learning for Confidence Measures in Stereo Vision

20 0.71763074 61 cvpr-2013-Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics