iccv iccv2013 iccv2013-230 knowledge-graph by maker-knowledge-mining

230 iccv-2013-Latent Data Association: Bayesian Model Selection for Multi-target Tracking


Source: pdf

Author: Aleksandr V. Segal, Ian Reid

Abstract: We propose a novel parametrization of the data association problem for multi-target tracking. In our formulation, the number of targets is implicitly inferred together with the data association, effectively solving data association and model selection as a single inference problem. The novel formulation allows us to interpret data association and tracking as a single Switching Linear Dynamical System (SLDS). We compute an approximate posterior solution to this problem using a dynamic programming/message passing technique. This inference-based approach allows us to incorporate richer probabilistic models into the tracking system. In particular, we incorporate inference over inliers/outliers and track termination times into the system. We evaluate our approach on publicly available datasets and demonstrate results competitive with, and in some cases exceeding the state of the art.

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 uk Abstract We propose a novel parametrization of the data association problem for multi-target tracking. [sent-5, score-0.499]

2 In our formulation, the number of targets is implicitly inferred together with the data association, effectively solving data association and model selection as a single inference problem. [sent-6, score-0.521]

3 The novel formulation allows us to interpret data association and tracking as a single Switching Linear Dynamical System (SLDS). [sent-7, score-0.608]

4 This inference-based approach allows us to incorporate richer probabilistic models into the tracking system. [sent-9, score-0.336]

5 In particular, we incorporate inference over inliers/outliers and track termination times into the system. [sent-10, score-0.309]

6 Introduction Multi-target tracking is an important, but stubborn problem in Computer Vision as well as many related fields (notably robotics). [sent-13, score-0.238]

7 The first is the combinatorial space of possible associations between the observations and objects being tracked, and the second is model selection over the number of existing tracks. [sent-17, score-0.189]

8 In this paper we propose Latent Data Association as an alternative parametrization of the data association problem where the number of underlying target tracks is implicit in the data association. [sent-18, score-0.75]

9 We treat the new parametrization as a special case of a Switching Linear Dynamical System (SLDS) [19], and perform approximate inference using a Ian Reid Department of Computer Science, University of Adelaide i . [sent-19, score-0.191]

10 Nodes are numbered within each time slice and colored based on their global track membership. [sent-25, score-0.206]

11 Each node represents a single latent track state together with any observations (if they exist). [sent-26, score-0.634]

12 By treating multi-target tracking as an approximate hybrid inference problem, more complex reasoning about object classification can be incorporated into the same algorithm used for data association and tracking. [sent-28, score-0.67]

13 This is accomplished by adding discrete object category 2904 variables into the tracking model. [sent-30, score-0.385]

14 Using this model allows the classification and tracking problem to be naturally combined into a single system where statistical relationships between target motion (tracking) and target identity (detection and classification) can be exploited. [sent-32, score-0.455]

15 Previous Work Classical approaches to multi-target tracking were pioneered decades ago assuming point-like targets such as radar returns. [sent-34, score-0.372]

16 Most of these were progressive variations and generalizations of single target tracking in a cluttered environment. [sent-35, score-0.329]

17 The Probabilistic Data Association Filter (PDAF) [5] only deals with a single target at a time, but introduced the notion of soft data association based on a weighted mixture of measurements. [sent-36, score-0.461]

18 The Multiple Hypothesis Tracker (MHT) [22] keeps a list of all possible data association hypotheses and the resulting filter outputs for each target. [sent-38, score-0.408]

19 This technique re-frames multi-target tracking as the fusion of an object detector [10, 11, 21] with data association. [sent-40, score-0.297]

20 In contrast to classical methods focusing on radar data with point measurements, TBD literature has focused on tracking objects in video sequences. [sent-41, score-0.335]

21 The tracking question is formulated as linking compatible detections on the grid into consistent trajectories. [sent-44, score-0.349]

22 Berclaz et al [7] form a sparse graph over every hypothetical discrete object locations. [sent-45, score-0.205]

23 it is not easy to combine with a moving sensor platform) and forces a compromise between accuracy and the size of the tracking area. [sent-50, score-0.238]

24 All continuous variables are treated as such and smoothing of the output trajectories is done implicitly via the motion model without any postprocessing. [sent-52, score-0.189]

25 In this case, the set of discrete detections is partitioned into tracks without explicitly enumerating what happens to the target in between successive detections. [sent-54, score-0.439]

26 Jiang et al [14] formulates data association as a Linear Program (LP) over the sparse graph of detections. [sent-55, score-0.453]

27 Monte Carlo based approaches represent the distribution over the state space as a set of discrete samples. [sent-58, score-0.189]

28 In the case of Particle Filters (PF), these samples are manipulated so that their distribution tracks the posterior of the filter. [sent-60, score-0.245]

29 The JPDAF can be implemented as a PF [24, 25] in order to track people from a mobile platform using 2D laser range data. [sent-61, score-0.206]

30 Khan et al [15] use a Markov Chain Monte Carlo (MCMC) based particle filter to incorporate motion priors over target interactions. [sent-62, score-0.345]

31 Breitenstein et al [9] introduce the Detector Confidence Particle Filter (DCPF) to directly incorporate detector scores as a measure of confidence. [sent-63, score-0.183]

32 MCMC can also be used as an independent tracking algorithm by sampling over the joint posterior of the whole problem. [sent-65, score-0.323]

33 Recently, Benfold et al [6] proposed a real-time global MCMC strategy which simply ignores the continuous state variables of the targets and samples directly over groupings of observations. [sent-67, score-0.387]

34 This has the disadvantage of losing the latent/hidden state space of the targets and so requires postprocessing to recover smooth trajectories. [sent-68, score-0.194]

35 Andriyenko et al [3, 4] formulate tracking as a direct optimization problems over splines, and in the latter case discrete track labels. [sent-69, score-0.611]

36 We assume a fixed number of tracks and attempt to simultaneously find the target trajectories and the data association of observations to targets. [sent-78, score-0.775]

37 Depending on =the problem, each} }ob asnedrv tati doenn zti gco tuimld ei. [sent-86, score-0.205]

38 follows the trajectory Xm = The data association problem is classically formulated as finding a correspondence between the targets and observations at each point in time. [sent-96, score-0.569]

39 In this notation, di(t) = j ∈ {1, , M} indicates that the observation zi(t) is associated with the jth target, with the constraint that dDis(tc)r = · · · no two observations can be assigned to the same target. [sent-101, score-0.185]

40 While the classical approach attempts to assign observations to previously existing tracks, Latent Data Association starts by assuming that each detection is its own track (of length 1) with a permanently associated hidden state variable. [sent-119, score-0.473]

41 The problem of tracking then becomes a question of linking these singleton tracks into longer trajectories. [sent-120, score-0.443]

42 We do this by assigning each track at time t as the continuation of some track at t 1. [sent-121, score-0.412]

43 We refer to this form of data association as latteimnte eb tec −au 1s. [sent-124, score-0.37]

44 e W Wthee rdeifsecrr teote t hviasria fobrlmes now ctao natssroolc aiastsiooncia atsio lan-s between adjacent latent state variables. [sent-125, score-0.205]

45 Figure 1 illustrates this parametrization with the tracks being spliced between t = 3 and t = 4. [sent-126, score-0.289]

46 To define this model formally, we define a node as the set of hidden state variables associated with some track at a specific time instance, as well as any observations of this state. [sent-127, score-0.597]

47 For n = (t, i), we define xti as the unobserved state variables of the node and zti as the observations (if present). [sent-130, score-0.898]

48 − The binary indication matrix Li(tj) is used to control the Li(tj) latent data associations at time t; setting = 1 corresponds to linking node (t, i) with node (t 1, j). [sent-131, score-0.45]

49 In order to ensure track continuations are always onet? [sent-134, score-0.206]

50 This parametrization of the problem subsumes standard data association as well as model selection over the number of tracks; any number of tracks and any data association can be represented with a suitable value for L = {L(t)}. [sent-143, score-1.029]

51 By fixing the set of latent data association indicators, we partition the nodes into independent tracks. [sent-144, score-0.534]

52 Each observation zti is generated from the associated target state xti according to an observation model, P(zti |xti). [sent-146, score-0.853]

53 (5) If we assume linear motion and observation models, the model forms an SLDS [19] where the discrete L(t) variables control the relationships between continuous variables in the Markov Chain. [sent-149, score-0.367]

54 This SLDS can be used to implicitly 2906 (a) Classical data association Figure 2. [sent-150, score-0.37]

55 trolled by the data association variables D(t) or latent data association variables L(t) Dashed lines represent dependencies con- respectively. [sent-152, score-0.966]

56 solve the data association problem together with model selection over the number of targets. [sent-153, score-0.37]

57 For a node (t, i), we define prti as the index of the previous node (at t−1) in the same track and nxti as the index ofthe next node t(a−t 1t )+in 1th). [sent-160, score-0.545]

58 The forward and backward messages respectively can then be defined recursively as →μti(xti) =? [sent-162, score-0.189]

59 This quantity can be efficiently retrieved from any node along the track as mti=? [sent-167, score-0.319]

60 X −→μt−1,j· P(zti|xti) · P(xti|xt−1,j) ·←μ −t+1,nxti (12) Note that mtij is the hypothetical value of mti if we had torn the node (t, i) from its current assignment and attached it to node (t 1, j) instead. [sent-174, score-0.48]

61 11 does not affect any of the forward messages before time t or any of the backward messages after time t these only depend on values of for t? [sent-176, score-0.286]

62 We use the messages { −→μt−1 } and } to update L(t) , and subsequently use {the new avnadlue { of L(}t) t oto u compute the forward messages { −→μt}. [sent-181, score-0.244]

63 11 8: add virtual nodes at t 9: for all n = (t, i) do 10: update forward message −μ →ti using Eq. [sent-195, score-0.25]

64 Approximate message passing procedure used for inference in the forward direction. [sent-197, score-0.277]

65 Pedestrian Tracking by Detection with Latent Data Association Up to this point we have described the Latent Data Association parametrization and inference algorithm in general terms. [sent-199, score-0.191]

66 To this end we describe the observation and state space models for both 2D and 3D tracking, as well as extensions to handle false positive detections and track length priors. [sent-201, score-0.452]

67 Since every detection now corresponds to a track, outliers must correspond to outlier tracks, leading to an extra discrete state variable, cti ∈ {pedestrian, outlier}, representing the target class. [sent-204, score-0.461]

68 The pedestrian detectors we use are discriminative, so no generative model exists to explain the observations based on the target class. [sent-207, score-0.307]

69 In practice, a lot of information is contained in the missing detections a track with very few detections is more likely to be an outlier than one with many consistent detections. [sent-229, score-0.436]

70 To incorporate this negative information, we include detector failure into the observation model. [sent-230, score-0.175]

71 The indicator variable mti = 1is used to denote a missing observation at node n = (t, i). [sent-231, score-0.337]

72 In this case n is a virtual node and the zti and sti observation variables are ignored. [sent-232, score-0.563]

73 Because of the detector failure model, we cannot assume a track continues on indefinitely after its last observation doing so would imply a very large number of missing observations and make all tracks likely to be outliers. [sent-235, score-0.645]

74 Instead, we give each target track a fixed probability of terminating at every time instance after its last observation. [sent-236, score-0.336]

75 We introduce the indicator variable eti to mark that the track has ended. [sent-237, score-0.28]

76 If eti = 1, we require that mti = 1; once a track ends, it cannot have any additional observations. [sent-239, score-0.394]

77 9 do not depend on the Markov chain being continuous; analogous equations hold for a discrete chain if the marginalization integrals are replaced with sums. [sent-247, score-0.176]

78 We run discrete message passing over eti and cti and compute the track log-likelihood of the data by adding the log-likelihoods obtained from Eq. [sent-248, score-0.647]

79 11 with the cost of each assignment based on the combined track loglikelihood. [sent-251, score-0.253]

80 Evaluation Experimental validation was performed using four publicly available video sequences comprising over 1200 frames from two standard pedestrian tracking datasets (TUD [1] and PETS’09 [12]). [sent-253, score-0.344]

81 2D tracking was used for the TUD datasets and 3D tracking for the PETS sequence. [sent-254, score-0.476]

82 We ran 2D tracking on TUD-Stadtmitte despite the available camera calibration because the oblique viewing angle makes accurate estimation of ground plane positions difficult. [sent-255, score-0.238]

83 Raw detections, ground truth annotations, and tracking area specifications provided by Andriyenko et al [4] were used for all evaluations. [sent-256, score-0.321]

84 Results are presented in terms of the CLEAR MOT [8] metrics for tracking performance and precision-recall curves for classification accuracy. [sent-257, score-0.238]

85 In the 2D case, the continuous state space is composed of the bounding box center and the log of the dimensions. [sent-261, score-0.191]

86 Both the position, p, and logdimensions, d, have an associated velocity ( p˙ and resulting in an 8D state space: (px , py, dx , dy , p˙ x , p˙ y, d˙x , d˙y). [sent-263, score-0.232]

87 We again use a constant-velocity model for the ground plane position, but assume the dimensions follow a random walk with no velocity (unlike in the 2D tracking case, we expect the 3D dimensions to stay relatively constant). [sent-284, score-0.387]

88 Because our system keeps track of object sizes as well as location, the size of the bounding boxes output by the detector vs the size of the labeled ground truth plays an important role in the performance of the system. [sent-316, score-0.304]

89 In this case we assumed average 3D pedestrian dimensions and projected these into 2D bounding boxes. [sent-329, score-0.196]

90 These curves are possible because of the probabilistic nature of our approach where each output has an associated posterior pedestrian vs outlier probability. [sent-338, score-0.311]

91 While these figures convey the quantitative measures of performance, we encourage the reader to view the supplementary material to observe the qualitative tracking behavior and performance. [sent-339, score-0.238]

92 Conclusions and Future Work This paper has proposed a novel parametrization of the data association problem for multi-target tracking that has a number of very useful properties. [sent-341, score-0.737]

93 The key idea behind our formulation is the proposal to perform latent data association, in which we seek associations between latent state variables over time. [sent-342, score-0.447]

94 (6785VT609PiewI12D46–8)9SM1–87TF4512M–368 1 evaluated by PETS’09 workshop 2 cropped to tracking region of Andriyenko et al [3, 4] 3 our own 2D evaluations using authors’ provided output data 4 results as published by authors Table 1. [sent-362, score-0.404]

95 Sonar tracking of multiple targets using joint probabilistic data association. [sent-431, score-0.384]

96 An mcmc-based particle filter for tracking multiple interacting targets. [sent-443, score-0.333]

97 Coupled object detection and tracking from static cameras and moving vehicles. [sent-451, score-0.238]

98 Markov chain monte carlo [21] [22] [23] [24] [25] data association for general multiple-target tracking problems. [sent-474, score-0.767]

99 Gmcp-tracker: Global multi-object tracking using generalized minimum clique graphs. [sent-492, score-0.238]

100 Monte carlo filtering for multi target tracking and data association. [sent-507, score-0.385]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('association', 0.37), ('xti', 0.302), ('tracking', 0.238), ('track', 0.206), ('zti', 0.205), ('andriyenko', 0.171), ('tracks', 0.16), ('slds', 0.139), ('xptri', 0.139), ('parametrization', 0.129), ('pets', 0.121), ('cti', 0.118), ('mti', 0.114), ('node', 0.113), ('observations', 0.11), ('pedestrian', 0.106), ('state', 0.105), ('latent', 0.1), ('messages', 0.097), ('message', 0.095), ('target', 0.091), ('targets', 0.089), ('posterior', 0.085), ('discrete', 0.084), ('al', 0.083), ('xtpir', 0.083), ('associations', 0.079), ('conference', 0.078), ('tj', 0.077), ('observation', 0.075), ('eti', 0.074), ('passing', 0.07), ('mcmc', 0.067), ('detections', 0.066), ('sti', 0.066), ('xm', 0.065), ('nodes', 0.064), ('mot', 0.064), ('variables', 0.063), ('outlier', 0.063), ('inference', 0.062), ('detector', 0.059), ('tud', 0.059), ('monte', 0.057), ('probabilistic', 0.057), ('particle', 0.057), ('carlo', 0.056), ('jpdaf', 0.055), ('logdimensions', 0.055), ('mtij', 0.055), ('pdaf', 0.055), ('tbd', 0.055), ('xt', 0.054), ('berlin', 0.053), ('transition', 0.052), ('classical', 0.052), ('dimensions', 0.051), ('forward', 0.05), ('heidelberg', 0.049), ('markov', 0.049), ('bti', 0.049), ('lecture', 0.048), ('tracked', 0.048), ('assignment', 0.047), ('continuous', 0.047), ('velocity', 0.047), ('chain', 0.046), ('switching', 0.046), ('radar', 0.045), ('benfold', 0.045), ('firing', 0.045), ('mt', 0.045), ('pf', 0.045), ('linking', 0.045), ('trajectories', 0.044), ('published', 0.044), ('conditioned', 0.043), ('lap', 0.043), ('notes', 0.042), ('backward', 0.042), ('incorporate', 0.041), ('enumeration', 0.041), ('virtual', 0.041), ('volume', 0.04), ('dy', 0.04), ('dx', 0.04), ('pages', 0.04), ('berclaz', 0.039), ('terminating', 0.039), ('bounding', 0.039), ('evaluations', 0.039), ('enumerating', 0.038), ('hypothetical', 0.038), ('multitarget', 0.038), ('filter', 0.038), ('leibe', 0.037), ('width', 0.037), ('ieee', 0.037), ('motion', 0.035), ('missing', 0.035)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99999946 230 iccv-2013-Latent Data Association: Bayesian Model Selection for Multi-target Tracking

Author: Aleksandr V. Segal, Ian Reid

Abstract: We propose a novel parametrization of the data association problem for multi-target tracking. In our formulation, the number of targets is implicitly inferred together with the data association, effectively solving data association and model selection as a single inference problem. The novel formulation allows us to interpret data association and tracking as a single Switching Linear Dynamical System (SLDS). We compute an approximate posterior solution to this problem using a dynamic programming/message passing technique. This inference-based approach allows us to incorporate richer probabilistic models into the tracking system. In particular, we incorporate inference over inliers/outliers and track termination times into the system. We evaluate our approach on publicly available datasets and demonstrate results competitive with, and in some cases exceeding the state of the art.

2 0.44184953 58 iccv-2013-Bayesian 3D Tracking from Monocular Video

Author: Ernesto Brau, Jinyan Guan, Kyle Simek, Luca Del Pero, Colin Reimer Dawson, Kobus Barnard

Abstract: Jinyan Guan† j guan1 @ emai l ari z ona . edu . Kyle Simek† ks imek@ emai l ari z ona . edu . Colin Reimer Dawson‡ cdaws on@ emai l ari z ona . edu . ‡School of Information University of Arizona Kobus Barnard‡ kobus @ s i sta . ari z ona . edu ∗School of Informatics University of Edinburgh for tracking an unknown and changing number of people in a scene using video taken from a single, fixed viewpoint. We develop a Bayesian modeling approach for tracking people in 3D from monocular video with unknown cameras. Modeling in 3D provides natural explanations for occlusions and smoothness discontinuities that result from projection, and allows priors on velocity and smoothness to be grounded in physical quantities: meters and seconds vs. pixels and frames. We pose the problem in the context of data association, in which observations are assigned to tracks. A correct application of Bayesian inference to multitarget tracking must address the fact that the model’s dimension changes as tracks are added or removed, and thus, posterior densities of different hypotheses are not comparable. We address this by marginalizing out the trajectory parameters so the resulting posterior over data associations has constant dimension. This is made tractable by using (a) Gaussian process priors for smooth trajectories and (b) approximately Gaussian likelihood functions. Our approach provides a principled method for incorporating multiple sources of evidence; we present results using both optical flow and object detector outputs. Results are comparable to recent work on 3D tracking and, unlike others, our method requires no pre-calibrated cameras.

3 0.17762239 120 iccv-2013-Discriminative Label Propagation for Multi-object Tracking with Sporadic Appearance Features

Author: K.C. Amit Kumar, Christophe De_Vleeschouwer

Abstract: Given a set of plausible detections, detected at each time instant independently, we investigate how to associate them across time. This is done by propagating labels on a set of graphs that capture how the spatio-temporal and the appearance cues promote the assignment of identical or distinct labels to a pair of nodes. The graph construction is driven by the locally linear embedding (LLE) of either the spatio-temporal or the appearance features associated to the detections. Interestingly, the neighborhood of a node in each appearance graph is defined to include all nodes for which the appearance feature is available (except the ones that coexist at the same time). This allows to connect the nodes that share the same appearance even if they are temporally distant, which gives our framework the uncommon ability to exploit the appearance features that are available only sporadically along the sequence of detections. Once the graphs have been defined, the multi-object tracking is formulated as the problem of finding a label assignment that is consistent with the constraints captured by each of the graphs. This results into a difference of convex program that can be efficiently solved. Experiments are performed on a basketball and several well-known pedestrian datasets in order to validate the effectiveness of the proposed solution.

4 0.17230441 242 iccv-2013-Learning People Detectors for Tracking in Crowded Scenes

Author: Siyu Tang, Mykhaylo Andriluka, Anton Milan, Konrad Schindler, Stefan Roth, Bernt Schiele

Abstract: People tracking in crowded real-world scenes is challenging due to frequent and long-term occlusions. Recent tracking methods obtain the image evidence from object (people) detectors, but typically use off-the-shelf detectors and treat them as black box components. In this paper we argue that for best performance one should explicitly train people detectors on failure cases of the overall tracker instead. To that end, we first propose a novel joint people detector that combines a state-of-the-art single person detector with a detector for pairs of people, which explicitly exploits common patterns of person-person occlusions across multiple viewpoints that are a frequent failure case for tracking in crowded scenes. To explicitly address remaining failure modes of the tracker we explore two methods. First, we analyze typical failures of trackers and train a detector explicitly on these cases. And second, we train the detector with the people tracker in the loop, focusing on the most common tracker failures. We show that our joint multi-person detector significantly improves both de- tection accuracy as well as tracker performance, improving the state-of-the-art on standard benchmarks.

5 0.17030871 318 iccv-2013-PixelTrack: A Fast Adaptive Algorithm for Tracking Non-rigid Objects

Author: Stefan Duffner, Christophe Garcia

Abstract: In this paper, we present a novel algorithm for fast tracking of generic objects in videos. The algorithm uses two components: a detector that makes use of the generalised Hough transform with pixel-based descriptors, and a probabilistic segmentation method based on global models for foreground and background. These components are used for tracking in a combined way, and they adapt each other in a co-training manner. Through effective model adaptation and segmentation, the algorithm is able to track objects that undergo rigid and non-rigid deformations and considerable shape and appearance variations. The proposed tracking method has been thoroughly evaluated on challenging standard videos, and outperforms state-of-theart tracking methods designed for the same task. Finally, the proposed models allow for an extremely efficient implementation, and thus tracking is very fast.

6 0.15736738 442 iccv-2013-Video Segmentation by Tracking Many Figure-Ground Segments

7 0.15395571 65 iccv-2013-Breaking the Chain: Liberation from the Temporal Markov Assumption for Tracking Human Poses

8 0.14867362 395 iccv-2013-Slice Sampling Particle Belief Propagation

9 0.14851508 366 iccv-2013-STAR3D: Simultaneous Tracking and Reconstruction of 3D Objects Using RGB-D Data

10 0.14762104 87 iccv-2013-Conservation Tracking

11 0.14732841 425 iccv-2013-Tracking via Robust Multi-task Multi-view Joint Sparse Representation

12 0.14269298 303 iccv-2013-Orderless Tracking through Model-Averaged Posterior Estimation

13 0.14010495 298 iccv-2013-Online Robust Non-negative Dictionary Learning for Visual Tracking

14 0.14005744 359 iccv-2013-Robust Object Tracking with Online Multi-lifespan Dictionary Learning

15 0.13573022 424 iccv-2013-Tracking Revisited Using RGBD Camera: Unified Benchmark and Baselines

16 0.12669447 418 iccv-2013-The Way They Move: Tracking Multiple Targets with Similar Appearance

17 0.12614433 200 iccv-2013-Higher Order Matching for Consistent Multiple Target Tracking

18 0.12468198 289 iccv-2013-Network Principles for SfM: Disambiguating Repeated Structures with Local Context

19 0.10540982 320 iccv-2013-Pose-Configurable Generic Tracking of Elongated Objects

20 0.1034047 168 iccv-2013-Finding the Best from the Second Bests - Inhibiting Subjective Bias in Evaluation of Visual Tracking Algorithms


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.249), (1, -0.044), (2, 0.023), (3, 0.069), (4, 0.051), (5, -0.037), (6, -0.104), (7, 0.176), (8, -0.019), (9, 0.123), (10, -0.093), (11, -0.191), (12, 0.048), (13, 0.123), (14, 0.082), (15, 0.039), (16, 0.045), (17, 0.048), (18, -0.049), (19, -0.02), (20, -0.119), (21, -0.038), (22, 0.043), (23, -0.05), (24, 0.021), (25, -0.004), (26, 0.022), (27, -0.185), (28, -0.022), (29, 0.003), (30, 0.002), (31, 0.004), (32, 0.017), (33, -0.084), (34, 0.026), (35, 0.071), (36, 0.085), (37, 0.021), (38, 0.103), (39, 0.007), (40, 0.091), (41, 0.027), (42, 0.026), (43, -0.099), (44, -0.195), (45, -0.043), (46, -0.004), (47, 0.006), (48, -0.023), (49, -0.085)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.96153897 230 iccv-2013-Latent Data Association: Bayesian Model Selection for Multi-target Tracking

Author: Aleksandr V. Segal, Ian Reid

Abstract: We propose a novel parametrization of the data association problem for multi-target tracking. In our formulation, the number of targets is implicitly inferred together with the data association, effectively solving data association and model selection as a single inference problem. The novel formulation allows us to interpret data association and tracking as a single Switching Linear Dynamical System (SLDS). We compute an approximate posterior solution to this problem using a dynamic programming/message passing technique. This inference-based approach allows us to incorporate richer probabilistic models into the tracking system. In particular, we incorporate inference over inliers/outliers and track termination times into the system. We evaluate our approach on publicly available datasets and demonstrate results competitive with, and in some cases exceeding the state of the art.

2 0.87512618 58 iccv-2013-Bayesian 3D Tracking from Monocular Video

Author: Ernesto Brau, Jinyan Guan, Kyle Simek, Luca Del Pero, Colin Reimer Dawson, Kobus Barnard

Abstract: Jinyan Guan† j guan1 @ emai l ari z ona . edu . Kyle Simek† ks imek@ emai l ari z ona . edu . Colin Reimer Dawson‡ cdaws on@ emai l ari z ona . edu . ‡School of Information University of Arizona Kobus Barnard‡ kobus @ s i sta . ari z ona . edu ∗School of Informatics University of Edinburgh for tracking an unknown and changing number of people in a scene using video taken from a single, fixed viewpoint. We develop a Bayesian modeling approach for tracking people in 3D from monocular video with unknown cameras. Modeling in 3D provides natural explanations for occlusions and smoothness discontinuities that result from projection, and allows priors on velocity and smoothness to be grounded in physical quantities: meters and seconds vs. pixels and frames. We pose the problem in the context of data association, in which observations are assigned to tracks. A correct application of Bayesian inference to multitarget tracking must address the fact that the model’s dimension changes as tracks are added or removed, and thus, posterior densities of different hypotheses are not comparable. We address this by marginalizing out the trajectory parameters so the resulting posterior over data associations has constant dimension. This is made tractable by using (a) Gaussian process priors for smooth trajectories and (b) approximately Gaussian likelihood functions. Our approach provides a principled method for incorporating multiple sources of evidence; we present results using both optical flow and object detector outputs. Results are comparable to recent work on 3D tracking and, unlike others, our method requires no pre-calibrated cameras.

3 0.78072149 87 iccv-2013-Conservation Tracking

Author: Martin Schiegg, Philipp Hanslovsky, Bernhard X. Kausler, Lars Hufnagel, Fred A. Hamprecht

Abstract: The quality of any tracking-by-assignment hinges on the accuracy of the foregoing target detection / segmentation step. In many kinds of images, errors in this first stage are unavoidable. These errors then propagate to, and corrupt, the tracking result. Our main contribution is the first probabilistic graphical model that can explicitly account for over- and undersegmentation errors even when the number of tracking targets is unknown and when they may divide, as in cell cultures. The tracking model we present implements global consistency constraints for the number of targets comprised by each detection and is solved to global optimality on reasonably large 2D+t and 3D+t datasets. In addition, we empirically demonstrate the effectiveness of a postprocessing that allows to establish target identity even across occlusion / undersegmentation. The usefulness and efficiency of this new tracking method is demonstrated on three different and challenging 2D+t and 3D+t datasets from developmental biology.

4 0.72755623 303 iccv-2013-Orderless Tracking through Model-Averaged Posterior Estimation

Author: Seunghoon Hong, Suha Kwak, Bohyung Han

Abstract: We propose a novel offline tracking algorithm based on model-averaged posterior estimation through patch matching across frames. Contrary to existing online and offline tracking methods, our algorithm is not based on temporallyordered estimates of target state but attempts to select easyto-track frames first out of the remaining ones without exploiting temporal coherency of target. The posterior of the selected frame is estimated by propagating densities from the already tracked frames in a recursive manner. The density propagation across frames is implemented by an efficient patch matching technique, which is useful for our algorithm since it does not require motion smoothness assumption. Also, we present a hierarchical approach, where a small set of key frames are tracked first and non-key frames are handled by local key frames. Our tracking algorithm is conceptually well-suited for the sequences with abrupt motion, shot changes, and occlusion. We compare our tracking algorithm with existing techniques in real videos with such challenges and illustrate its superior performance qualitatively and quantitatively.

5 0.6596787 418 iccv-2013-The Way They Move: Tracking Multiple Targets with Similar Appearance

Author: Caglayan Dicle, Octavia I. Camps, Mario Sznaier

Abstract: We introduce a computationally efficient algorithm for multi-object tracking by detection that addresses four main challenges: appearance similarity among targets, missing data due to targets being out of the field of view or occluded behind other objects, crossing trajectories, and camera motion. The proposed method uses motion dynamics as a cue to distinguish targets with similar appearance, minimize target mis-identification and recover missing data. Computational efficiency is achieved by using a Generalized Linear Assignment (GLA) coupled with efficient procedures to recover missing data and estimate the complexity of the underlying dynamics. The proposed approach works with tracklets of arbitrary length and does not assume a dynamical model a priori, yet it captures the overall motion dynamics of the targets. Experiments using challenging videos show that this framework can handle complex target motions, non-stationary cameras and long occlusions, on scenarios where appearance cues are not available or poor.

6 0.65669107 120 iccv-2013-Discriminative Label Propagation for Multi-object Tracking with Sporadic Appearance Features

7 0.65245724 395 iccv-2013-Slice Sampling Particle Belief Propagation

8 0.60111296 393 iccv-2013-Simultaneous Clustering and Tracklet Linking for Multi-face Tracking in Videos

9 0.59935594 200 iccv-2013-Higher Order Matching for Consistent Multiple Target Tracking

10 0.59494638 433 iccv-2013-Understanding High-Level Semantics by Modeling Traffic Patterns

11 0.59150797 242 iccv-2013-Learning People Detectors for Tracking in Crowded Scenes

12 0.58611822 168 iccv-2013-Finding the Best from the Second Bests - Inhibiting Subjective Bias in Evaluation of Visual Tracking Algorithms

13 0.57383358 65 iccv-2013-Breaking the Chain: Liberation from the Temporal Markov Assumption for Tracking Human Poses

14 0.55014372 289 iccv-2013-Network Principles for SfM: Disambiguating Repeated Structures with Local Context

15 0.54859006 318 iccv-2013-PixelTrack: A Fast Adaptive Algorithm for Tracking Non-rigid Objects

16 0.54082555 425 iccv-2013-Tracking via Robust Multi-task Multi-view Joint Sparse Representation

17 0.5232842 420 iccv-2013-Topology-Constrained Layered Tracking with Latent Flow

18 0.52210569 320 iccv-2013-Pose-Configurable Generic Tracking of Elongated Objects

19 0.51203847 128 iccv-2013-Dynamic Probabilistic Volumetric Models

20 0.50478441 274 iccv-2013-Monte Carlo Tree Search for Scheduling Activity Recognition


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(2, 0.047), (7, 0.019), (26, 0.079), (31, 0.045), (34, 0.25), (40, 0.013), (42, 0.09), (64, 0.116), (73, 0.085), (89, 0.16), (98, 0.011)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.85928267 202 iccv-2013-How Do You Tell a Blackbird from a Crow?

Author: Thomas Berg, Peter N. Belhumeur

Abstract: How do you tell a blackbirdfrom a crow? There has been great progress toward automatic methods for visual recognition, including fine-grained visual categorization in which the classes to be distinguished are very similar. In a task such as bird species recognition, automatic recognition systems can now exceed the performance of non-experts – most people are challenged to name a couple dozen bird species, let alone identify them. This leads us to the question, “Can a recognition system show humans what to look for when identifying classes (in this case birds)? ” In the context of fine-grained visual categorization, we show that we can automatically determine which classes are most visually similar, discover what visual features distinguish very similar classes, and illustrate the key features in a way meaningful to humans. Running these methods on a dataset of bird images, we can generate a visual field guide to birds which includes a tree of similarity that displays the similarity relations between all species, pages for each species showing the most similar other species, and pages for each pair of similar species illustrating their differences.

2 0.83319867 53 iccv-2013-Attribute Dominance: What Pops Out?

Author: Naman Turakhia, Devi Parikh

Abstract: When we look at an image, some properties or attributes of the image stand out more than others. When describing an image, people are likely to describe these dominant attributes first. Attribute dominance is a result of a complex interplay between the various properties present or absent in the image. Which attributes in an image are more dominant than others reveals rich information about the content of the image. In this paper we tap into this information by modeling attribute dominance. We show that this helps improve the performance of vision systems on a variety of human-centric applications such as zero-shot learning, image search and generating textual descriptions of images.

same-paper 3 0.82433617 230 iccv-2013-Latent Data Association: Bayesian Model Selection for Multi-target Tracking

Author: Aleksandr V. Segal, Ian Reid

Abstract: We propose a novel parametrization of the data association problem for multi-target tracking. In our formulation, the number of targets is implicitly inferred together with the data association, effectively solving data association and model selection as a single inference problem. The novel formulation allows us to interpret data association and tracking as a single Switching Linear Dynamical System (SLDS). We compute an approximate posterior solution to this problem using a dynamic programming/message passing technique. This inference-based approach allows us to incorporate richer probabilistic models into the tracking system. In particular, we incorporate inference over inliers/outliers and track termination times into the system. We evaluate our approach on publicly available datasets and demonstrate results competitive with, and in some cases exceeding the state of the art.

4 0.80588812 278 iccv-2013-Multi-scale Topological Features for Hand Posture Representation and Analysis

Author: Kaoning Hu, Lijun Yin

Abstract: In this paper, we propose a multi-scale topological feature representation for automatic analysis of hand posture. Such topological features have the advantage of being posture-dependent while being preserved under certain variations of illumination, rotation, personal dependency, etc. Our method studies the topology of the holes between the hand region and its convex hull. Inspired by the principle of Persistent Homology, which is the theory of computational topology for topological feature analysis over multiple scales, we construct the multi-scale Betti Numbers matrix (MSBNM) for the topological feature representation. In our experiments, we used 12 different hand postures and compared our features with three popular features (HOG, MCT, and Shape Context) on different data sets. In addition to hand postures, we also extend the feature representations to arm postures. The results demonstrate the feasibility and reliability of the proposed method.

5 0.76329458 31 iccv-2013-A Unified Probabilistic Approach Modeling Relationships between Attributes and Objects

Author: Xiaoyang Wang, Qiang Ji

Abstract: This paper proposes a unified probabilistic model to model the relationships between attributes and objects for attribute prediction and object recognition. As a list of semantically meaningful properties of objects, attributes generally relate to each other statistically. In this paper, we propose a unified probabilistic model to automatically discover and capture both the object-dependent and objectindependent attribute relationships. The model utilizes the captured relationships to benefit both attribute prediction and object recognition. Experiments on four benchmark attribute datasets demonstrate the effectiveness of the proposed unified model for improving attribute prediction as well as object recognition in both standard and zero-shot learning cases.

6 0.76065993 64 iccv-2013-Box in the Box: Joint 3D Layout and Object Reasoning from Single Images

7 0.75860858 399 iccv-2013-Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing

8 0.73757118 335 iccv-2013-Random Faces Guided Sparse Many-to-One Encoder for Pose-Invariant Face Recognition

9 0.71040583 380 iccv-2013-Semantic Transform: Weakly Supervised Semantic Inference for Relating Visual Attributes

10 0.705477 138 iccv-2013-Efficient and Robust Large-Scale Rotation Averaging

11 0.6980958 52 iccv-2013-Attribute Adaptation for Personalized Image Search

12 0.69596124 7 iccv-2013-A Deep Sum-Product Architecture for Robust Facial Attributes Analysis

13 0.6950829 107 iccv-2013-Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction

14 0.68545145 58 iccv-2013-Bayesian 3D Tracking from Monocular Video

15 0.68525624 449 iccv-2013-What Do You Do? Occupation Recognition in a Photo via Social Context

16 0.6800077 433 iccv-2013-Understanding High-Level Semantics by Modeling Traffic Patterns

17 0.68000317 359 iccv-2013-Robust Object Tracking with Online Multi-lifespan Dictionary Learning

18 0.67996228 420 iccv-2013-Topology-Constrained Layered Tracking with Latent Flow

19 0.67941183 215 iccv-2013-Incorporating Cloud Distribution in Sky Representation

20 0.67842174 442 iccv-2013-Video Segmentation by Tracking Many Figure-Ground Segments