iccv iccv2013 iccv2013-186 knowledge-graph by maker-knowledge-mining

186 iccv-2013-GrabCut in One Cut


Source: pdf

Author: Meng Tang, Lena Gorelick, Olga Veksler, Yuri Boykov

Abstract: Among image segmentation algorithms there are two major groups: (a) methods assuming known appearance models and (b) methods estimating appearance models jointly with segmentation. Typically, the first group optimizes appearance log-likelihoods in combination with some spatial regularization. This problem is relatively simple and many methods guarantee globally optimal results. The second group treats model parameters as additional variables transforming simple segmentation energies into high-order NP-hard functionals (Zhu-Yuille, Chan-Vese, GrabCut, etc). It is known that such methods indirectly minimize the appearance overlap between the segments. We propose a new energy term explicitly measuring L1 distance between the object and background appearance models that can be globally maximized in one graph cut. We show that in many applications our simple term makes NP-hard segmentation functionals unnecessary. Our one cut algorithm effectively replaces approximate iterative optimization techniques based on block coordinate descent.

Reference: text


Summary: the most important sentences generated by the tf-idf model

sentIndex sentText sentNum sentScore

1 GrabCut in One Cut. Meng Tang, Lena Gorelick, Olga Veksler, Yuri Boykov. University of Western Ontario, Canada. Abstract: Among image segmentation algorithms there are two major groups: (a) methods assuming known appearance models and (b) methods estimating appearance models jointly with segmentation. [sent-1, score-0.496]

2 The second group treats model parameters as additional variables, transforming simple segmentation energies into high-order NP-hard functionals (Zhu-Yuille, Chan-Vese, GrabCut, etc). [sent-4, score-0.296]

3 It is known that such methods indirectly minimize the appearance overlap between the segments. [sent-5, score-0.428]

4 We propose a new energy term explicitly measuring L1 distance between the object and background appearance models that can be globally maximized in one graph cut. [sent-6, score-0.615]

5 We show that in many applications our simple term makes NP-hard segmentation functionals unnecessary. [sent-7, score-0.36]

6 The most basic object segmentation energy [3, 20] combines boundary regularization with log-likelihood ratios for fixed foreground and background appearance models, e.g. [sent-11, score-0.663]

7 E(S) = Σ_{p∈Ω} −ln Pr(I_p | θ^{s_p}) + λ Σ_{{p,q}∈N} ω_pq |s_p − s_q|, (1) where s_p ∈ {0, 1} are binary indicator variables defining the segment S ⊂ Ω and N is the set of neighboring pixel pairs. [sent-18, score-0.201]

8 One important practical advantage of this basic energy is that there are efficient methods for its global minimization using graph cuts [4] or continuous relaxations [5, 18]. [sent-20, score-0.221]
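For concreteness, here is a minimal sketch of this global minimization, assuming the PyMaxflow library, a 4-connected grid, uniform ω_pq, and precomputed per-pixel negative log-likelihoods (all names are illustrative, not the authors' code):

```python
import numpy as np
import maxflow  # PyMaxflow; a sketch under the stated assumptions

def segment_fixed_models(nll_fg, nll_bg, lam=1.0):
    """One global mincut for energy (1): data t-links carry the per-pixel
    -log-likelihoods, n-links carry the smoothness term lam*|s_p - s_q|."""
    g = maxflow.Graph[float]()
    nodes = g.add_grid_nodes(nll_fg.shape)    # one graph node per pixel
    g.add_grid_edges(nodes, lam)              # 4-connected smoothness edges
    g.add_grid_tedges(nodes, nll_fg, nll_bg)  # unary terms as terminal edges
    g.maxflow()
    # Boolean labeling; which side means "foreground" depends on the
    # library's source/sink convention, so verify on a toy input.
    return g.get_grid_segments(nodes)
```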

9 Some well-known approaches to segmentation [25, 19, 6] consider model parameters as extra optimization variables in their segmentation energies. [sent-22, score-0.372]

10 Energy (2), E(S, θ^1, θ^0) = Σ_{p∈Ω} −ln Pr(I_p | θ^{s_p}) + λ|∂S|, which is known to be NP-hard [22], is used for interactive segmentation in GrabCut [19], where initial appearance models θ^1, θ^0 are computed from a given bounding box. [sent-27, score-0.493]

11 First, they fix the models θ^1, θ^0 and optimize the segmentation S using a graph cut algorithm for energy (1) as in [3]. [sent-31, score-0.351]

12 Second, they fix segmentation S and then optimize over model parameters θ1 and θ0. [sent-32, score-0.226]
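A sketch of this block-coordinate descent, reusing segment_fixed_models from the previous snippet (histogram fitting reduced to bin counting; names illustrative):

```python
import numpy as np

def grabcut_bcd(image_bins, init_mask, lam=1.0, n_bins=64, iters=5):
    """Alternate: (1) fix S and refit histogram models theta^1, theta^0;
    (2) fix the models and re-solve energy (1) with one graph cut.
    Each step decreases energy (2), but only a local minimum is reached."""
    mask, eps = init_mask, 1e-9
    for _ in range(iters):
        h_fg = np.bincount(image_bins[mask], minlength=n_bins) + eps
        h_bg = np.bincount(image_bins[~mask], minlength=n_bins) + eps
        nll_fg = -np.log(h_fg / h_fg.sum())[image_bins]  # -ln Pr(I_p | theta^1)
        nll_bg = -np.log(h_bg / h_bg.sum())[image_bins]  # -ln Pr(I_p | theta^0)
        mask = segment_fixed_models(nll_fg, nll_bg, lam)
    return mask
```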

13 Two well-known alternatives, dual decomposition [22] and branch-and-mincut [14], can find a global minimum of energy (2), but these methods are too slow in practice. [sent-33, score-0.29]

14 We observe that when the appearance models θ^1, θ^0 are represented by (non-parametric) color histograms, minimization of (2) is equivalent to minimization of the energy (3), E(S) = |S| · H(θ_S) + |S̄| · H(θ_S̄) + |∂S|, over S only. [sent-34, score-0.389]

15 Here H(·) denotes the entropy. This form of energy (2) can be obtained by replacing the sum over pixels in (2) by a sum over color bins. [sent-36, score-0.337]
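For intuition, the appearance part of this bin-counting form, |S| · H(θ_S) + |S̄| · H(θ_S̄), can be evaluated directly (a sketch; the boundary term is omitted, names illustrative):

```python
import numpy as np

def entropy_energy(image_bins, mask, n_bins=64):
    """Volume-weighted entropies of the two segment histograms, i.e. the
    appearance part of energy (3); lower when segments have peaked colors."""
    def vol_entropy(seg_bins):
        counts = np.bincount(seg_bins, minlength=n_bins).astype(float)
        p = counts / max(counts.sum(), 1.0)
        p = p[p > 0]
        return counts.sum() * (-(p * np.log(p)).sum())  # |segment| * H(theta)
    return vol_entropy(image_bins[mask]) + vol_entropy(image_bins[~mask])
```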

16 Interestingly, the global minimum of segmentation energy (3) does not depend on the initial color models provided by the user. [sent-40, score-0.42]

17 The entropy terms of this energy prefer segments with more peaked color distributions. [sent-43, score-0.378]

18 In general, the color separation bias in energy (3) is exposed by equivalently rewriting its two entropy terms as a volume-balancing term h_Ω(S) minus Jensen-Shannon color separation terms, see (4). [sent-49, score-0.518]

19 Energy (3): volume balancing (a) and Jensen-Shannon color separation terms (b). [sent-85, score-0.349]

20 The volume-balancing term h_Ω(S) in (4) explicitly biases image segmentation to two parts of equal size, see Fig. [sent-87, score-0.251]

21 The remaining terms in (4) show a bias toward color separation between the segments, see Fig. [sent-89, score-0.299]

22 Note that a similar analysis in [22] is used to motivate their convex-concave approximation algorithm for energy (2). [sent-91, score-0.237]

23 Volume balancing h_Ω(S) is the only term in (4) and (2) that is not submodular, and it is what makes optimization difficult. [sent-92, score-0.247]

24 Our observation is that in many applications this volume balancing term is simply unnecessary, see Sections 3. [sent-93, score-0.207]

25 Moreover, while it is known that the JS color separation term is submodular and can be optimized by graph cuts [11, 12, 22], we propose to replace it with a more basic L1 separation term, see Fig. [sent-99, score-0.744]

26 We show that it corresponds to a simpler construction with far fewer auxiliary nodes, leading to higher efficiency. [sent-101, score-0.338]

27 Interestingly, it also gives a better color separation effect, see Section 3. [sent-102, score-0.231]

28 We also observe one practical limitation of the block-coordinate approach to (2), as in GrabCut [19]: increasing sensitivity to local minima as the number of color bins for the models θ_S and θ_S̄ grows, see Table 2 and Fig. [sent-105, score-0.415]

29 In practice, however, finer bins better capture the information contained in the full dynamic range of color images (8 bits per channel or more). [sent-107, score-0.35]

30 This applies to JS, χ2, Bhattacharyya, and our L1 color separation terms in Figs. [sent-109, score-0.266]

31 The corresponding ROC curves show that the color separation between the object and background increases for finer bins. [sent-113, score-0.305]
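This effect is easy to check on any image with a ground-truth mask: quantize colors at several granularities and compare the resulting object/background histograms (a sketch; an intersection near 0 means nearly perfect separation):

```python
import numpy as np

def overlap_vs_bins(image, gt_mask, bins_per_channel=(8, 32, 128)):
    """Normalized-histogram intersection between object and background
    for increasingly fine joint RGB binning (b^3 bins in total)."""
    for b in bins_per_channel:
        q = (image.astype(np.int64) * b) // 256            # quantize channels to b levels
        idx = (q[..., 0] * b + q[..., 1]) * b + q[..., 2]  # joint bin index in [0, b^3)
        h_fg = np.bincount(idx[gt_mask], minlength=b**3).astype(float)
        h_bg = np.bincount(idx[~gt_mask], minlength=b**3).astype(float)
        inter = np.minimum(h_fg / h_fg.sum(), h_bg / h_bg.sum()).sum()
        print(f"{b}^3 bins: intersection = {inter:.3f}")
```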

32 The ROC curves show that even the difficult camouflage image in Figure 2 has a good separation of intensities between the object and background if a larger number of bins is used. [sent-114, score-0.485]

33 With only a few bins, however, the overlap between the “fish” and the background is too strong, making it hard to segment. [sent-115, score-0.347]

34 Our contributions are summarized below: • We propose a simple energy term penalizing the L1 measure of appearance overlap between segments. [sent-118, score-0.701]

35 Unlike the NP-hard multi-label problems discussed in [11, 12], we focus on binary segmentation, where such high-order constraints can be globally minimized. [sent-121, score-0.293]

36 Moreover, we show that our L1 term works better for separating colors than other concave separators (including JS, Bhattacharyya, and χ2). [sent-122, score-0.204]

37 We are the first to demonstrate a fast, globally optimal binary segmentation technique that explicitly minimizes the overlap between object/background color distributions. [sent-123, score-0.651]

38 In one graph cut we get similar or better results at faster running times than iterative techniques. [sent-124, score-0.248]

39 We show the general usefulness of the proposed appearance overlap penalty in different practical applications: binary segmentation, shape matching, etc. [sent-130, score-0.619]

40 Minimizing appearance overlap in One Cut. Let S ⊂ Ω be a segment and denote by θ_S and θ_S̄ the unnormalized color histograms of the foreground and background appearance, respectively. [sent-132, score-0.827]

41 The n_k vertices corresponding to the pixels in bin k are connected to the auxiliary node A_k using undirected links. [sent-135, score-0.434]

42 The capacity of these links is the weight β > 0 of the appearance overlap term. [sent-136, score-0.659]

43 Let n_k be the number of pixels in the image that belong to bin k, and let n_k^S and n_k^S̄ be the corresponding numbers of foreground and background pixels in bin k. [sent-137, score-0.566]

44 Our appearance overlap term penalizes the intersection between the foreground and background bin counts by incorporating the simple yet effective high-order L1 term into the energy function: E_L1(θ_S, θ_S̄) = −‖θ_S − θ_S̄‖_{L1} = −Σ_k |n_k^S − n_k^S̄|. (5) [sent-138, score-1.107]
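Evaluated directly, (5) is a one-line bin-count computation (a sketch; names illustrative):

```python
import numpy as np

def appearance_overlap_L1(image_bins, mask, n_bins):
    """E_L1(theta_S, theta_Sbar) = -sum_k |n_k^S - n_k^Sbar| on raw counts."""
    n_fg = np.bincount(image_bins[mask], minlength=n_bins).astype(np.int64)
    n_bg = np.bincount(image_bins[~mask], minlength=n_bins).astype(np.int64)
    return -np.abs(n_fg - n_bg).sum()
```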

45 Below we explain how to incorporate and optimize the term E_L1(θ_S, θ_S̄) using one graph cut. [sent-141, score-0.236]

46 The details of the graph construction for the above term are shown in Fig. [sent-145, score-0.289]

47 We add auxiliary nodes A_1, ..., A_K into the graph and connect the kth auxiliary node to all the pixels that belong to the kth bin. [sent-150, score-0.364]

48 Assume that bin k is split into foreground and background, resulting in n_k^S and n_k^S̄ pixels, respectively. [sent-153, score-0.279]

49 Then any cut separating the foreground and background pixels must cut either n_k^S or n_k^S̄ of the links that connect the pixels in bin k to the auxiliary node A_k; the minimum cut pays for min(n_k^S, n_k^S̄) of them. [sent-154, score-0.971]
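This per-bin cost is exactly the L1 term up to constants, a one-line identity since n_k^S + n_k^S̄ = n_k is fixed:

```latex
\beta \min\!\big(n_k^S,\, n_k^{\bar S}\big)
  = \frac{\beta}{2}\Big(n_k - \big|n_k^S - n_k^{\bar S}\big|\Big)
  = \underbrace{\tfrac{\beta}{2}\, n_k}_{\text{constant}}
    + \frac{\beta}{2}\Big(-\big|n_k^S - n_k^{\bar S}\big|\Big).
```

Summing over bins, the total β-link cost of any cut equals a constant plus (β/2)·E_L1, so one mincut globally optimizes the L1 term (the factor 1/2 can be absorbed into β).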

50 A similar graph construction with auxiliary nodes is proposed in [11, 12] to minimize higher-order pseudo-boolean functions of the following form: f(X_c) = min{θ_0 + Σ_{i∈c} w_i^0 x_i, θ_1 + Σ_{i∈c} w_i^1 (1 − x_i), θ_max}. [sent-156, score-0.443]

51 The construction in [11, 12] can be used to minimize E_L1 as follows: consider each color bin as a clique and set parameters w_i^0 = 1, w_i^1 = 1 (see Figure 4). [sent-166, score-0.357]

52 Further, set θ_0 = 0, θ_1 = 0 and θ_max = n_k/2, where n_k is the number of pixels in bin k. [sent-168, score-0.247]

53 Our graph construction needs only one auxiliary node per bin, in contrast to the two auxiliary nodes in [11, 12]. [sent-171, score-0.468]
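A sketch of the whole One-Cut graph under the same assumptions as before (PyMaxflow; contrast weights and the exact unary terms simplified; names illustrative, not the authors' implementation):

```python
import numpy as np
import maxflow  # PyMaxflow; a sketch under the stated assumptions

def one_cut(image_bins, nll_fg, nll_bg, beta, lam, n_bins):
    """Unary + smoothness terms as in energy (1) plus one auxiliary node A_k
    per color bin; cutting min(n_k^S, n_k^Sbar) beta-links realizes (5)."""
    g = maxflow.Graph[float]()
    pix = g.add_grid_nodes(image_bins.shape)
    g.add_grid_edges(pix, lam)                # smoothness n-links
    g.add_grid_tedges(pix, nll_fg, nll_bg)    # data terms / ballooning / seeds
    aux = g.add_nodes(n_bins)                 # one auxiliary node per bin
    for p, k in zip(pix.ravel(), image_bins.ravel()):  # vectorizable in practice
        g.add_edge(int(p), int(aux[k]), beta, beta)    # undirected beta-link
    g.maxflow()
    return g.get_grid_segments(pix)
```

A single maxflow call then jointly and globally optimizes the smoothness, unary, and L1 overlap terms.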

54 To see how it works we consider four possible label assignments for the auxiliary nodes Ak1 and Ak0. [sent-174, score-0.245]

55 Label assignments to the binary auxiliary nodes A_k^1 and A_k^0. [sent-177, score-0.311]

56 Applications. In this section we apply our appearance overlap penalty term in a number of different practical applications, including interactive binary segmentation in Sec. [sent-189, score-0.941]

57 Interactive segmentation. First, we discuss interactive segmentation with several standard user interfaces: a bounding box [19] in Section 3. [sent-198, score-0.608]

58 We compare different color separation terms in Section 3. [sent-203, score-0.266]

59 Binary segmentation with a bounding box. First, we use the appearance overlap penalty in a binary segmentation experiment à la GrabCut [19]. [sent-234, score-1.144]

60 A user provides a bounding box around an object of interest and the goal is to perform binary image segmentation within the box. [sent-235, score-0.424]

61 The pixels outside the bounding box are assigned to the background using hard constraints. [sent-236, score-0.317]

62 Let R ⊆ Ω denote the bounding box, let S_GT ⊆ Ω be the ground-truth segmentation, and let S ⊆ Ω be a segment. [sent-238, score-0.274]

63 The segmentation energy function E(S) is given by E(S) = |S̄ ∩ R| − β‖θ_S − θ_S̄‖_{L1} + λ|∂S|, (9) [sent-241, score-0.335]

64 where the first term is a standard ballooning inside the bounding box R, the second term is the appearance overlap penalty as in (5), and the last term is a contrast-sensitive smoothness term. [sent-243, score-1.11]

65 It is common to tune the relative weight of each energy term for a given dataset [22]. [sent-251, score-0.337]

66 We use the measure of appearance overlap between the box R and its background R̄ to automatically find an image-specific relative weight β of the appearance overlap term, see (10). [sent-253, score-1.138]

67 The error rate is defined as the number of misclassified pixels within the bounding box R divided by the size of the box |R|. [sent-270, score-0.494]
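In code, this measure is simply (a sketch):

```python
import numpy as np

def box_error_rate(seg, gt, box):
    """Misclassified pixels inside the bounding box R divided by |R|;
    seg, gt, box are boolean masks of the same shape."""
    return np.logical_and(seg != gt, box).sum() / box.sum()
```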

68 We test with different numbers of color bins and provide a quantitative comparison with the GrabCut method [19] (our implementation, modified to work with histograms as in [22]) and the dual decomposition method [22] (results reported by the authors). [sent-271, score-0.231]

69 As we increase the number of color bins, the error rate for the GrabCut method increases, while the error rate of One-Cut decreases. [sent-277, score-0.285]

70 When using 128^3 bins, One-Cut runs twice as fast while obtaining a much lower error rate. [sent-278, score-0.31]

71 This is because with more bins, more auxiliary nodes are used, but each auxiliary node is connected to fewer pixels. [sent-279, score-0.466]

72 Finally, Figures 6-7 show examples of input bounding boxes and segmentation results with GrabCut [19], Dual Decomposition [22] and our One-Cut method. [sent-283, score-0.274]

73 Comparison of Appearance Overlap Terms. Below we discuss additional variants of the appearance overlap penalty term. [sent-286, score-0.501]

74 We compare their performance in the task of binary segmentation applied to the GrabCut dataset [19]. [sent-288, score-0.252]

75 We consider four appearance overlap terms based on the L1 norm, χ2 distance, Bhattacharyya coefficient and Jensen-Shannon divergence. (Footnote: there are 50 images in the dataset, but we exclude the cross image for the sake of comparison with [22].) [sent-289, score-0.463]

76 All four terms above are concave functions of n_k^S attaining their maximum at n_k/2. [sent-313, score-0.407]
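The four separators can be compared on normalized histograms p = θ_R, q = θ_R̄ using their standard definitions (a sketch; not necessarily the exact truncated per-bin forms used in the experiments):

```python
import numpy as np

def separation_scores(p, q, eps=1e-12):
    """Standard histogram distances. For L1, chi^2, and JS larger means
    better separated; for the Bhattacharyya coefficient smaller does."""
    l1 = np.abs(p - q).sum()                        # L1 distance
    chi2 = ((p - q) ** 2 / (p + q + eps)).sum()     # chi-squared distance
    bc = np.sqrt(p * q).sum()                       # Bhattacharyya coefficient
    m = 0.5 * (p + q)
    kl = lambda a, b: (a * np.log((a + eps) / (b + eps))).sum()
    js = 0.5 * kl(p, m) + 0.5 * kl(q, m)            # Jensen-Shannon divergence
    return l1, chi2, bc, js
```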

77 These truncated components can be incorporated into our graph using the construction shown in Fig. [sent-319, score-0.234]

78 All three appearance overlap terms above can be optimized with one graph cut. [sent-322, score-0.535]

79 We wish to find out which term and what level of approximation accuracy are optimal for the task of binary segmentation. [sent-323, score-0.246]

80 In the first experiment, we use an adaptive image-specific weight β_Img for the appearance overlap term, as in (10). [sent-331, score-0.552]

81 In the second experiment, we choose the optimal β_Img separately for each appearance overlap term by replacing the denominator of (10) with either D_χ2(θ_R, θ_R̄), D_Bha(θ_R, θ_R̄), or D_JS(θ_R, θ_R̄), according to the appearance overlap term used. [sent-336, score-1.154]

82 Comparison of appearance overlap terms: (top-left) shows the functions plotted for one bin k, (top-right) shows segmentation error rates using the same β_Img as in (10) for all overlap terms, and (bottom-left) shows segmentation results when using a term-specific β_Img. [sent-343, score-1.369]

83 Fig. 10 reports the error rates of the segmentation as a function of the parameter t. [sent-355, score-0.272]

84 This further proves the benefit of our proposed D_L1 appearance overlap term. [sent-357, score-0.428]

85 Interactive Segmentation with Seeds. Unlike interactive segmentation with a bounding box, using seeds à la Boykov-Jolly [3] makes volumetric balancing unnecessary due to hard constraints enforced by the user. [sent-371, score-0.432]

86 Therefore, the segmentation energy is quite simple: E_seeds(S) = −β‖θ_S − θ_S̄‖_{L1} + λ|∂S|. [sent-372, score-0.335]
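Seeds plug into the same graph as very-large-capacity terminal links, e.g. extending the one_cut sketch above (g and pix as there; BIG is any capacity exceeding the sum of all other capacities at a pixel):

```python
def add_seed_constraints(g, pix, fg_seeds, bg_seeds, BIG=1e9):
    """Hard constraints a la Boykov-Jolly [3]: seed pixels are pinned to their
    terminal, so no volume balancing is needed. fg_seeds/bg_seeds: boolean masks.
    Which t-edge pins which label depends on the library's source/sink convention."""
    for p in pix[fg_seeds]:
        g.add_tedge(int(p), BIG, 0)
    for p in pix[bg_seeds]:
        g.add_tedge(int(p), 0, BIG)
```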

87 Template Shape Matching. Below we discuss how the appearance overlap penalty term can be used for template shape matching. [sent-378, score-0.784]

88 Several prior methods rely on graph-cut based segmentation with a shape prior [21, 8, 14, 23]. [sent-379, score-0.238]

89 Most commonly, these methods use a binary template mask M and combine the shape matching cue with a contrast-sensitive smoothness term via the energy E_1(S) = min_{ρ∈Φ} E_Shape(S, M_ρ) + λ|∂S|. [sent-380, score-0.539]

90 We further incorporate the appearance overlap into the energy: E_2(S) = E_1(S) − β‖θ_S − θ_S̄‖_{L1}. [sent-384, score-0.428]

91 Without the appearance overlap term, shape matching yields erroneous segmentations due to the clutter edges in the background. [sent-389, score-0.831]

92 We down-scaled all images to 320 × 240 and used 128^3 color bins per image. [sent-392, score-0.35]

93 For the template matching, we scan the image in a sliding-window fashion and compute maxflow/mincut with dynamic graph cut [13]. [sent-394, score-0.309]
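A sketch of that scan, with the dynamic-graph-cut reuse of [13] omitted (each placement is solved from scratch here; solve_energy is a hypothetical callback that runs a One-Cut-style segmentation with the placed mask as shape cue and returns the segmentation and its energy E_2):

```python
import numpy as np

def match_template(shape_hw, template_mask, solve_energy, stride=8):
    """Slide the binary template mask M over the image and keep the placement
    rho with the lowest segmentation energy E2(S)."""
    h, w = shape_hw
    th, tw = template_mask.shape
    best_energy, best_seg = np.inf, None
    for y in range(0, h - th + 1, stride):
        for x in range(0, w - tw + 1, stride):
            placed = np.zeros((h, w), dtype=bool)
            placed[y:y + th, x:x + tw] = template_mask
            seg, energy = solve_energy(placed)
            if energy < best_energy:
                best_energy, best_seg = energy, seg
    return best_seg
```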

94 Template shape matching examples: (a) images, (b) contrast-sensitive edge weights, (c-d) results with and without the appearance overlap penalty. [sent-400, score-0.521]

95 Table 3 provides a quantitative comparison of the results obtained with and without incorporating the appearance overlap term, namely using E_2(S) and E_1(S). [sent-404, score-0.428]

96 The results are reported with respect to manually outlined ground truth segmentations and clearly point to the benefit of incorporating the overlap term EL1 into the segmentation energy. [sent-405, score-0.583]

97 Template shape matching: comparing E_1(S) and E_2(S) in terms of TP, FP, misclassified pixels, and mean running time. [sent-418, score-0.2]

98 Salient object segmentation. Salient region detection and segmentation are an important preprocessing step for object recognition and adaptive compression. [sent-423, score-0.372]

99 Salient objects usually have an appearance that is distinct from the background [1, 7, 17]. [sent-424, score-0.229]

100 Below we show how our appearance overlap penalty term can be used for the task of salient object segmentation. [sent-425, score-0.67]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('grabcut', 0.348), ('overlap', 0.273), ('bins', 0.265), ('nks', 0.259), ('auxiliary', 0.186), ('segmentation', 0.186), ('img', 0.18), ('appearance', 0.155), ('energy', 0.149), ('dbha', 0.146), ('nsk', 0.146), ('separation', 0.146), ('bin', 0.142), ('cut', 0.13), ('djs', 0.13), ('term', 0.124), ('template', 0.107), ('construction', 0.093), ('bhattacharyya', 0.09), ('bounding', 0.088), ('sp', 0.085), ('color', 0.085), ('box', 0.084), ('balancing', 0.083), ('concave', 0.08), ('background', 0.074), ('penalty', 0.073), ('lnpr', 0.073), ('js', 0.072), ('xc', 0.072), ('graph', 0.072), ('pixels', 0.071), ('entropy', 0.07), ('truncated', 0.069), ('misclassified', 0.067), ('binary', 0.066), ('foreground', 0.066), ('links', 0.066), ('seeds', 0.065), ('ballooning', 0.065), ('tune', 0.064), ('max', 0.064), ('interactive', 0.064), ('dual', 0.062), ('energies', 0.06), ('nodes', 0.059), ('dd', 0.057), ('cated', 0.056), ('approximation', 0.056), ('pq', 0.055), ('rate', 0.055), ('shape', 0.052), ('kk', 0.051), ('functionals', 0.05), ('denominator', 0.05), ('sq', 0.05), ('replace', 0.047), ('ln', 0.046), ('running', 0.046), ('error', 0.045), ('salient', 0.045), ('decomposition', 0.044), ('min', 0.041), ('matching', 0.041), ('saliency', 0.041), ('rates', 0.041), ('capacity', 0.041), ('globally', 0.041), ('submodular', 0.04), ('si', 0.04), ('histograms', 0.04), ('optimize', 0.04), ('segments', 0.039), ('clique', 0.037), ('fn', 0.037), ('terms', 0.035), ('node', 0.035), ('minimum', 0.035), ('unnecessary', 0.034), ('nk', 0.034), ('bias', 0.033), ('minima', 0.033), ('ratios', 0.033), ('ak', 0.033), ('functions', 0.033), ('roc', 0.033), ('experiment', 0.033), ('blockcoordinate', 0.032), ('plicitly', 0.032), ('nmda', 0.032), ('trhme', 0.032), ('olga', 0.032), ('dbee', 0.032), ('rrte', 0.032), ('jen', 0.032), ('royf', 0.032), ('thoete', 0.032), ('bce', 0.032), ('convexconcave', 0.032), ('dore', 0.032)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 1.0000005 186 iccv-2013-GrabCut in One Cut

Author: Meng Tang, Lena Gorelick, Olga Veksler, Yuri Boykov

Abstract: Among image segmentation algorithms there are two major groups: (a) methods assuming known appearance models and (b) methods estimating appearance models jointly with segmentation. Typically, the first group optimizes appearance log-likelihoods in combination with some spatial regularization. This problem is relatively simple and many methods guarantee globally optimal results. The second group treats model parameters as additional variables transforming simple segmentation energies into high-order NP-hard functionals (Zhu-Yuille, Chan-Vese, GrabCut, etc). It is known that such methods indirectly minimize the appearance overlap between the segments. We propose a new energy term explicitly measuring L1 distance between the object and background appearance models that can be globally maximized in one graph cut. We show that in many applications our simple term makes NP-hard segmentation functionals unnecessary. Our one cut algorithm effectively replaces approximate iterative optimization techniques based on block coordinate descent.

2 0.20026901 379 iccv-2013-Semantic Segmentation without Annotating Segments

Author: Wei Xia, Csaba Domokos, Jian Dong, Loong-Fah Cheong, Shuicheng Yan

Abstract: Numerous existing object segmentation frameworks commonly utilize the object bounding box as a prior. In this paper, we address semantic segmentation assuming that object bounding boxes are provided by object detectors, but no training data with annotated segments are available. Based on a set of segment hypotheses, we introduce a simple voting scheme to estimate shape guidance for each bounding box. The derived shape guidance is used in the subsequent graph-cut-based figure-ground segmentation. The final segmentation result is obtained by merging the segmentation results in the bounding boxes. We conduct an extensive analysis of the effect of object bounding box accuracy. Comprehensive experiments on both the challenging PASCAL VOC object segmentation dataset and GrabCut50 image segmentation dataset show that the proposed approach achieves competitive results compared to previous detection or bounding box prior based methods, as well as other state-of-the-art semantic segmentation methods.

3 0.17914358 411 iccv-2013-Symbiotic Segmentation and Part Localization for Fine-Grained Categorization

Author: Yuning Chai, Victor Lempitsky, Andrew Zisserman

Abstract: We propose a new method for the task of fine-grained visual categorization. The method builds a model of the base-level category that can be fitted to images, producing highquality foreground segmentation and mid-level part localizations. The model can be learnt from the typical datasets available for fine-grained categorization, where the only annotation provided is a loose bounding box around the instance (e.g. bird) in each image. Both segmentation and part localizations are then used to encode the image content into a highly-discriminative visual signature. The model is symbiotic in that part discovery/localization is helped by segmentation and, conversely, the segmentation is helped by the detection (e.g. part layout). Our model builds on top of the part-based object category detector of Felzenszwalb et al., and also on the powerful GrabCut segmentation algorithm of Rother et al., and adds a simple spatial saliency coupling between them. In our evaluation, the model improves the categorization accuracy over the state-of-the-art. It also improves over what can be achieved with an analogous system that runs segmentation and part-localization independently.

4 0.15468848 326 iccv-2013-Predicting Sufficient Annotation Strength for Interactive Foreground Segmentation

Author: Suyog Dutt Jain, Kristen Grauman

Abstract: The mode of manual annotation used in an interactive segmentation algorithm affects both its accuracy and ease-of-use. For example, bounding boxes are fast to supply, yet may be too coarse to get good results on difficult images; freehand outlines are slower to supply and more specific, yet they may be overkill for simple images. Whereas existing methods assume a fixed form of input no matter the image, we propose to predict the tradeoff between accuracy and effort. Our approach learns whether a graph cuts segmentation will succeed if initialized with a given annotation mode, based on the image's visual separability and foreground uncertainty. Using these predictions, we optimize the mode of input requested on new images a user wants segmented. Whether given a single image that should be segmented as quickly as possible, or a batch of images that must be segmented within a specified time budget, we show how to select the easiest modality that will be sufficiently strong to yield high quality segmentations. Extensive results with real users and three datasets demonstrate the impact.

5 0.14716443 282 iccv-2013-Multi-view Object Segmentation in Space and Time

Author: Abdelaziz Djelouah, Jean-Sébastien Franco, Edmond Boyer, François Le Clerc, Patrick Pérez

Abstract: In this paper, we address the problem of object segmentation in multiple views or videos when two or more viewpoints of the same scene are available. We propose a new approach that propagates segmentation coherence information in both space and time, hence allowing evidences in one image to be shared over the complete set. To this aim the segmentation is cast as a single efficient labeling problem over space and time with graph cuts. In contrast to most existing multi-view segmentation methods that rely on some form of dense reconstruction, ours only requires a sparse 3D sampling to propagate information between viewpoints. The approach is thoroughly evaluated on standard multiview datasets, as well as on videos. With static views, results compete with state of the art methods but they are achieved with significantly fewer viewpoints. With multiple videos, we report results that demonstrate the benefit of segmentation propagation through temporal cues.

6 0.13924639 150 iccv-2013-Exemplar Cut

7 0.12406189 160 iccv-2013-Fast Object Segmentation in Unconstrained Video

8 0.12276065 238 iccv-2013-Learning Graphs to Match

9 0.12254714 225 iccv-2013-Joint Segmentation and Pose Tracking of Human in Natural Videos

10 0.11387544 442 iccv-2013-Video Segmentation by Tracking Many Figure-Ground Segments

11 0.11352478 120 iccv-2013-Discriminative Label Propagation for Multi-object Tracking with Sporadic Appearance Features

12 0.11345831 429 iccv-2013-Tree Shape Priors with Connectivity Constraints Using Convex Relaxation on General Graphs

13 0.11287697 318 iccv-2013-PixelTrack: A Fast Adaptive Algorithm for Tracking Non-rigid Objects

14 0.10721019 208 iccv-2013-Image Co-segmentation via Consistent Functional Maps

15 0.10410839 111 iccv-2013-Detecting Dynamic Objects with Multi-view Background Subtraction

16 0.10297545 71 iccv-2013-Category-Independent Object-Level Saliency Detection

17 0.10208889 448 iccv-2013-Weakly Supervised Learning of Image Partitioning Using Decision Trees with Structured Split Criteria

18 0.10087863 95 iccv-2013-Cosegmentation and Cosketch by Unsupervised Learning

19 0.10079422 171 iccv-2013-Fix Structured Learning of 2013 ICCV paper k2opt.pdf

20 0.096594572 377 iccv-2013-Segmentation Driven Object Detection with Fisher Vectors


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.224), (1, -0.053), (2, 0.092), (3, -0.034), (4, 0.057), (5, 0.028), (6, -0.12), (7, 0.111), (8, 0.026), (9, -0.102), (10, -0.019), (11, 0.13), (12, 0.028), (13, -0.002), (14, -0.012), (15, 0.037), (16, 0.043), (17, -0.008), (18, -0.068), (19, -0.067), (20, 0.098), (21, 0.021), (22, -0.073), (23, 0.008), (24, 0.001), (25, 0.072), (26, -0.071), (27, -0.009), (28, -0.003), (29, 0.002), (30, 0.008), (31, 0.033), (32, 0.042), (33, 0.018), (34, -0.025), (35, 0.046), (36, -0.059), (37, 0.102), (38, -0.008), (39, 0.062), (40, 0.008), (41, 0.049), (42, 0.004), (43, 0.033), (44, 0.044), (45, 0.061), (46, 0.049), (47, -0.043), (48, -0.078), (49, -0.054)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.981893 186 iccv-2013-GrabCut in One Cut

Author: Meng Tang, Lena Gorelick, Olga Veksler, Yuri Boykov

Abstract: Among image segmentation algorithms there are two major groups: (a) methods assuming known appearance models and (b) methods estimating appearance models jointly with segmentation. Typically, the first group optimizes appearance log-likelihoods in combination with some spatial regularization. This problem is relatively simple and many methods guarantee globally optimal results. The second group treats model parameters as additional variables transforming simple segmentation energies into high-order NP-hard functionals (Zhu-Yuille, Chan-Vese, GrabCut, etc). It is known that such methods indirectly minimize the appearance overlap between the segments. We propose a new energy term explicitly measuring L1 distance between the object and background appearance models that can be globally maximized in one graph cut. We show that in many applications our simple term makes NP-hard segmentation functionals unnecessary. Our one cut algorithm effectively replaces approximate iterative optimization techniques based on block coordinate descent.

2 0.87664235 63 iccv-2013-Bounded Labeling Function for Global Segmentation of Multi-part Objects with Geometric Constraints

Author: Masoud S. Nosrati, Shawn Andrews, Ghassan Hamarneh

Abstract: The inclusion of shape and appearance priors have proven useful for obtaining more accurate and plausible segmentations, especially for complex objects with multiple parts. In this paper, we augment the popular MumfordShah model to incorporate two important geometrical constraints, termed containment and detachment, between different regions with a specified minimum distance between their boundaries. Our method is able to handle multiple instances of multi-part objects defined by these geometrical hamarneh} @ s fu . ca (a)Standar laΩb ehlingΩfuhnctionseting(Ωb)hΩOuirseΩtijng Figure 1: The inside vs. outside ambiguity in (a) is resolved by our containment constraint in (b). constraints using a single labeling function while maintaining global optimality. We demonstrate the utility and advantages of these two constraints and show that the proposed convex continuous method is superior to other state-of-theart methods, including its discrete counterpart, in terms of memory usage, and metrication errors.

3 0.84860617 150 iccv-2013-Exemplar Cut

Author: Jimei Yang, Yi-Hsuan Tsai, Ming-Hsuan Yang

Abstract: We present a hybrid parametric and nonparametric algorithm, exemplar cut, for generating class-specific object segmentation hypotheses. For the parametric part, we train a pylon model on a hierarchical region tree as the energy function for segmentation. For the nonparametric part, we match the input image with each exemplar by using regions to obtain a score which augments the energy function from the pylon model. Our method thus generates a set of highly plausible segmentation hypotheses by solving a series of exemplar augmented graph cuts. Experimental results on the Graz and PASCAL datasets show that the proposed algorithm achievesfavorable segmentationperformance against the state-of-the-art methods in terms of visual quality and accuracy.

4 0.82664138 379 iccv-2013-Semantic Segmentation without Annotating Segments

Author: Wei Xia, Csaba Domokos, Jian Dong, Loong-Fah Cheong, Shuicheng Yan

Abstract: Numerous existing object segmentation frameworks commonly utilize the object bounding box as a prior. In this paper, we address semantic segmentation assuming that object bounding boxes are provided by object detectors, but no training data with annotated segments are available. Based on a set of segment hypotheses, we introduce a simple voting scheme to estimate shape guidance for each bounding box. The derived shape guidance is used in the subsequent graph-cut-based figure-ground segmentation. The final segmentation result is obtained by merging the segmentation results in the bounding boxes. We conduct an extensive analysis of the effect of object bounding box accuracy. Comprehensive experiments on both the challenging PASCAL VOC object segmentation dataset and GrabCut50 image segmentation dataset show that the proposed approach achieves competitive results compared to previous detection or bounding box prior based methods, as well as other state-of-the-art semantic segmentation methods.

5 0.81007254 208 iccv-2013-Image Co-segmentation via Consistent Functional Maps

Author: Fan Wang, Qixing Huang, Leonidas J. Guibas

Abstract: Joint segmentation of image sets has great importance for object recognition, image classification, and image retrieval. In this paper, we aim to jointly segment a set of images starting from a small number of labeled images or none at all. To allow the images to share segmentation information with each other, we build a network that contains segmented as well as unsegmented images, and extract functional maps between connected image pairs based on image appearance features. These functional maps act as general property transporters between the images and, in particular, are used to transfer segmentations. We define and operate in a reduced functional space optimized so that the functional maps approximately satisfy cycle-consistency under composition in the network. A joint optimization framework is proposed to simultaneously generate all segmentation functions over the images so that they both align with local segmentation cues in each particular image, and agree with each other under network transportation. This formulation allows us to extract segmentations even with no training data, but can also exploit such data when available. The collective effect of the joint processing using functional maps leads to accurate information sharing among images and yields superior segmentation results, as shown on the iCoseg, MSRC, and PASCAL data sets.

6 0.80392814 326 iccv-2013-Predicting Sufficient Annotation Strength for Interactive Foreground Segmentation

7 0.78093928 330 iccv-2013-Proportion Priors for Image Sequence Segmentation

8 0.74388427 282 iccv-2013-Multi-view Object Segmentation in Space and Time

9 0.72800905 429 iccv-2013-Tree Shape Priors with Connectivity Constraints Using Convex Relaxation on General Graphs

10 0.71006387 432 iccv-2013-Uncertainty-Driven Efficiently-Sampled Sparse Graphical Models for Concurrent Tumor Segmentation and Atlas Registration

11 0.69833517 411 iccv-2013-Symbiotic Segmentation and Part Localization for Fine-Grained Categorization

12 0.68726456 245 iccv-2013-Learning a Dictionary of Shape Epitomes with Applications to Image Labeling

13 0.68627656 33 iccv-2013-A Unified Video Segmentation Benchmark: Annotation, Metrics and Analysis

14 0.67344701 8 iccv-2013-A Deformable Mixture Parsing Model with Parselets

15 0.66649276 329 iccv-2013-Progressive Multigrid Eigensolvers for Multiscale Spectral Segmentation

16 0.65778959 95 iccv-2013-Cosegmentation and Cosketch by Unsupervised Learning

17 0.65036005 74 iccv-2013-Co-segmentation by Composition

18 0.62369668 377 iccv-2013-Segmentation Driven Object Detection with Fisher Vectors

19 0.62135482 318 iccv-2013-PixelTrack: A Fast Adaptive Algorithm for Tracking Non-rigid Objects

20 0.61837792 160 iccv-2013-Fast Object Segmentation in Unconstrained Video


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(2, 0.071), (7, 0.013), (8, 0.19), (12, 0.013), (13, 0.011), (26, 0.103), (27, 0.017), (31, 0.051), (40, 0.013), (42, 0.094), (48, 0.017), (64, 0.072), (73, 0.054), (89, 0.193), (95, 0.011), (98, 0.012)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.86812621 246 iccv-2013-Learning the Visual Interpretation of Sentences

Author: C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende

Abstract: Sentences that describe visual scenes contain a wide variety of information pertaining to the presence of objects, their attributes and their spatial relations. In this paper we learn the visual features that correspond to semantic phrases derived from sentences. Specifically, we extract predicate tuples that contain two nouns and a relation. The relation may take several forms, such as a verb, preposition, adjective or their combination. We model a scene using a Conditional Random Field (CRF) formulation where each node corresponds to an object, and the edges to their relations. We determine the potentials of the CRF using the tuples extracted from the sentences. We generate novel scenes depicting the sentences’ visual meaning by sampling from the CRF. The CRF is also used to score a set of scenes for a text-based image retrieval task. Our results show we can generate (retrieve) scenes that convey the desired semantic meaning, even when scenes (queries) are described by multiple sentences. Significant improvement is found over several baseline approaches.

2 0.85851306 3 iccv-2013-3D Sub-query Expansion for Improving Sketch-Based Multi-view Image Retrieval

Author: Yen-Liang Lin, Cheng-Yu Huang, Hao-Jeng Wang, Winston Hsu

Abstract: We propose a 3D sub-query expansion approach for boosting sketch-based multi-view image retrieval. The core idea of our method is to automatically convert two (guided) 2D sketches into an approximated 3D sketch model, and then generate multi-view sketches as expanded sub-queries to improve the retrieval performance. To learn the weights among synthesized views (sub-queries), we present a new multi-query feature to model the similarity between subqueries and dataset images, and formulate it into a convex optimization problem. Our approach shows superior performance compared with the state-of-the-art approach on a public multi-view image dataset. Moreover, we also conduct sensitivity tests to analyze the parameters of our approach based on the gathered user sketches.

same-paper 3 0.85599905 186 iccv-2013-GrabCut in One Cut

Author: Meng Tang, Lena Gorelick, Olga Veksler, Yuri Boykov

Abstract: Among image segmentation algorithms there are two major groups: (a) methods assuming known appearance models and (b) methods estimating appearance models jointly with segmentation. Typically, the first group optimizes appearance log-likelihoods in combination with some spacial regularization. This problem is relatively simple and many methods guarantee globally optimal results. The second group treats model parameters as additional variables transforming simple segmentation energies into highorder NP-hard functionals (Zhu-Yuille, Chan-Vese, GrabCut, etc). It is known that such methods indirectly minimize the appearance overlap between the segments. We propose a new energy term explicitly measuring L1 distance between the object and background appearance models that can be globally maximized in one graph cut. We show that in many applications our simple term makes NP-hard segmentation functionals unnecessary. Our one cut algorithm effectively replaces approximate iterative optimization techniques based on block coordinate descent.

4 0.85227889 272 iccv-2013-Modifying the Memorability of Face Photographs

Author: Aditya Khosla, Wilma A. Bainbridge, Antonio Torralba, Aude Oliva

Abstract: Contemporary life bombards us with many new images of faces every day, which poses non-trivial constraints on human memory. The vast majority of face photographs are intended to be remembered, either because of personal relevance, commercial interests or because the pictures were deliberately designed to be memorable. Can we make aportrait more memorable or more forgettable automatically? Here, we provide a method to modify the memorability of individual face photographs, while keeping the identity and other facial traits (e.g. age, attractiveness, and emotional magnitude) of the individual fixed. We show that face photographs manipulated to be more memorable (or more forgettable) are indeed more often remembered (or forgotten) in a crowd-sourcing experiment with an accuracy of 74%. Quantifying and modifying the ‘memorability ’ of a face lends itself to many useful applications in computer vision and graphics, such as mnemonic aids for learning, photo editing applications for social networks and tools for designing memorable advertisements.

5 0.81203544 62 iccv-2013-Bird Part Localization Using Exemplar-Based Models with Enforced Pose and Subcategory Consistency

Author: Jiongxin Liu, Peter N. Belhumeur

Abstract: In this paper, we propose a novel approach for bird part localization, targeting fine-grained categories with wide variations in appearance due to different poses (including aspect and orientation) and subcategories. As it is challenging to represent such variations across a large set of diverse samples with tractable parametric models, we turn to individual exemplars. Specifically, we extend the exemplarbased models in [4] by enforcing pose and subcategory consistency at the parts. During training, we build posespecific detectors scoring part poses across subcategories, and subcategory-specific detectors scoring part appearance across poses. At the testing stage, likely exemplars are matched to the image, suggesting part locations whose pose and subcategory consistency are well-supported by the image cues. From these hypotheses, part configuration can be predicted with very high accuracy. Experimental results demonstrate significantperformance gainsfrom our method on an extensive dataset: CUB-200-2011 [30], for both localization and classification tasks.

6 0.80136389 238 iccv-2013-Learning Graphs to Match

7 0.79896766 95 iccv-2013-Cosegmentation and Cosketch by Unsupervised Learning

8 0.79882312 414 iccv-2013-Temporally Consistent Superpixels

9 0.79716206 150 iccv-2013-Exemplar Cut

10 0.79554343 196 iccv-2013-Hierarchical Data-Driven Descent for Efficient Optimal Deformation Estimation

11 0.79431498 428 iccv-2013-Translating Video Content to Natural Language Descriptions

12 0.79418921 379 iccv-2013-Semantic Segmentation without Annotating Segments

13 0.79372847 420 iccv-2013-Topology-Constrained Layered Tracking with Latent Flow

14 0.79185212 349 iccv-2013-Regionlets for Generic Object Detection

15 0.79152811 396 iccv-2013-Space-Time Robust Representation for Action Recognition

16 0.79134798 338 iccv-2013-Randomized Ensemble Tracking

17 0.79111689 359 iccv-2013-Robust Object Tracking with Online Multi-lifespan Dictionary Learning

18 0.78975797 361 iccv-2013-Robust Trajectory Clustering for Motion Segmentation

19 0.78967369 411 iccv-2013-Symbiotic Segmentation and Part Localization for Fine-Grained Categorization

20 0.78963125 160 iccv-2013-Fast Object Segmentation in Unconstrained Video