cvpr cvpr2013 cvpr2013-418 knowledge-graph by maker-knowledge-mining

418 cvpr-2013-Submodular Salient Region Detection


Source: pdf

Author: Zhuolin Jiang, Larry S. Davis

Abstract: The problem of salient region detection is formulated as the well-studied facility location problem from operations research. High-level priors are combined with low-level features to detect salient regions. Salient region detection is achieved by maximizing a submodular objective function, which maximizes the total similarities (i.e., total profits) between the hypothesized salient region centers (i.e., facility locations) and their region elements (i.e., clients), and penalizes the number of potential salient regions (i.e., the number of open facilities). The similarities are efficiently computedbyfinding a closed-form harmonic solution on the constructed graph for an input image. The saliency of a selected region is modeled in terms of appearance and spatial location. By exploiting the submodularity properties of the objectivefunction, a highly efficient greedy-based optimization algorithm can be employed. This algorithm is guaranteed to be at least a (e − 1)/e ≈ 0.632-approximation to t heeed optimum. lEeaxpster aim (een −tal 1 r)e/seult ≈s d 0e.m63o2n-satrpaptero txhimata our approach outperforms several recently proposed saliency detection approaches.

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 edu Abstract The problem of salient region detection is formulated as the well-studied facility location problem from operations research. [sent-4, score-1.027]

2 High-level priors are combined with low-level features to detect salient regions. [sent-5, score-0.323]

3 Salient region detection is achieved by maximizing a submodular objective function, which maximizes the total similarities (i. [sent-6, score-0.231]

4 , total profits) between the hypothesized salient region centers (i. [sent-8, score-0.519]

5 , clients), and penalizes the number of potential salient regions (i. [sent-12, score-0.384]

6 The saliency of a selected region is modeled in terms of appearance and spatial location. [sent-16, score-0.511]

7 m63o2n-satrpaptero txhimata our approach outperforms several recently proposed saliency detection approaches. [sent-21, score-0.311]

8 Introduction Visual saliency modeling is relevant to a variety of computer vision problems including object detection and recognition [29, 26], image editing [13, 4, 6] and image segmentation [16]. [sent-23, score-0.311]

9 Most saliency models [2, 20, 4, 6, 8] are based on a contrast prior between salient objects and backgrounds. [sent-24, score-0.648]

10 Saliency models map natural images into saliency maps, in which each image element (pixel, superpixel, region) is assigned a saliency strength or probability. [sent-25, score-0.7]

11 For example, Figure 1 illustrates saliency detection results using four state-of-art algorithms [2, 6, 4, 26]. [sent-30, score-0.311]

12 However, given the ground truth salient regions in Figure 1(b), even for the first simple example, these approaches either fail to separate the object from the background, as in Figures 1(c) and 1(e), or mostly outline the object but miss the interior as in Figure 1(d). [sent-32, score-0.317]

13 (a) Input images; (b) Ground truth salient regions; (c)∼(e): Saliency maps using ;[ (2,b )6 ,G r4o] wnidth t cuothnt sraalsiet priors; (f) Saliency map using [26] with a low-rank prior. [sent-35, score-0.337]

14 [26, 28] represent an image as a low-rank matrix plus sparse noise, where the background is modeled by the low-rank matrix and the salient regions are indicated by the sparse noise (i. [sent-39, score-0.457]

15 For example, the poor saliency detection results in Figure 1(f) using the low-rank prior are due to the cluttered background. [sent-43, score-0.391]

16 We present a submodular objective function for effi- ciently creating saliency maps from natural images; these maps can then be used to detect multiple salient regions within a single image. [sent-44, score-0.767]

17 The diminishing return property of submodularity has been successfully applied in various applications including sensor placement [18], facility location [24] and image segmentations [15]. [sent-45, score-0.716]

18 Our objective function consists oftwo terms: a similarity term (between the selected centers of salient regions and image elements (superpixels) assigned to that center), and the ‘facility’ costs for the selected region centers. [sent-46, score-0.717]

19 The first term encourages the selected centers to represent the region elements well. [sent-47, score-0.351]

20 Hence it favors the extraction of high-quality potential salient regions. [sent-48, score-0.353]

21 The second term penalizes the number of selected potential salient region centers, so it avoids oversegmentation of salient regions. [sent-49, score-0.757]

22 It reduces the redundancy among selected salient region centers because the small gain obtained by splitting a region through the introduction of an extrane222000444311 ous region center is offset by the facility cost. [sent-50, score-1.538]

23 This high level prior is integrated with low level feature information into a unified objective function to identify salient regions. [sent-51, score-0.441]

24 This is in contrast to previous approaches based on low level features [2, 4] or high level information only [29, 5], or heuristic integration approaches [13, 6] based on weighted averages on the saliency maps from low level features and high level priors. [sent-52, score-0.504]

25 In contrast to some approaches [28, 27] which use uniform image patches to represent an image, our representation is based on super-pixels, which are less likely to cross object boundaries and lead to more accurately segmented salient regions. [sent-53, score-0.257]

26 Unlike approaches that identify only one salient region in an image [7], our approach identifies multiple salient regions simultaneously without any strong assumptions about the statistics of the backgrounds [28]. [sent-54, score-0.773]

27 The main contributions of our paper are: • Salient region selection is modeled as the facility locSaalti oenn tp rreogbiloenm, s ewlehcictioh nis sso mlvoedde e bleyd dm aasx tihmeiz faincgil a ysu lbo-modular objective function. [sent-55, score-0.749]

28 This provides a new perspective using submodularity for salient region detection, and it achieves state-of-art performance on two public saliency detection benchmarks. [sent-56, score-0.788]

29 • The similarities between hypothesized region centers aTnhde sthimeiril region eeltewmeeennts h are tfhoersmizueldate redg as a elanbteelr-s ing problem on the vertices of a graph. [sent-57, score-0.428]

30 • We naturally integrate high-level priors with low-level saliency rianltlyo a nutneigfireadte f hraigmh-elwevoerlk fporiro srasl iwenitth region dvee-l tection. [sent-60, score-0.514]

31 Related Work Existing salient region detection approaches can be roughly divided into two categories: bottom-up and topdown approaches. [sent-63, score-0.423]

32 Recently, [26, 28] decompose an image into a low-rank matrix representing the background (low-rank prior) and a sparse noise matrix indicating the salient regions by low-rank matrix recovery. [sent-68, score-0.428]

33 [27] proposes to use the boundary prior, which assumes the image boundary is mostly background for saliency detection. [sent-69, score-0.353]

34 Top-down approaches make use of high level knowledge about ‘interesting’ objects to identify salient regions [29, 5, 14]. [sent-70, score-0.382]

35 [29] learns interesting region features by dictionary learning and then generates the saliency map by modeling spatial consistency via a CRF model. [sent-71, score-0.491]

36 [5] proposes a topdown saliency algorithm by selecting discriminant features from a pre-defined filter bank. [sent-72, score-0.34]

37 In addition, some approaches integrate multiple saliency maps generated from different features or priors to detect salient regions. [sent-73, score-0.671]

38 The saliency maps are combined by weighted averaging, where the weights are predefined [6, 8], learned by a SVM [13] or estimated by a CRF [20]. [sent-74, score-0.348]

39 Unlike previous approaches that are purely top-down or bottom-up, we integrate high level priors with low level information into a unified framework, which is graph-based and is optimized in a submodular framework. [sent-75, score-0.234]

40 Preliminaries Facility Location: [17, 22] We solve a facility location problem to generate candidate regions for saliency-based segmentation. [sent-77, score-0.693]

41 fj is the cost of opening a facility at location j and cij denotes the profit made by satisfying the demand of client iby facility j. [sent-89, score-1.943]

42 y˜j = 1 if facility j is open and y˜j = 0 otherwise; ˜x ij = 1 if the demand of client iis satisfied from facility j and ˜x ij = 0 otherwise. [sent-91, score-1.414]

43 h client to an open facility to maximize the overall profit. [sent-100, score-0.71]

44 Submodular Saliency There are three main steps in our approach: First, a set of potential region centers are extracted from an image. [sent-161, score-0.329]

45 They serve as a set of potential facility locations (denoted by J). [sent-162, score-0.697]

46 Second, given that set of potential region centers, we identify the final region centers and cluster superpixels into regions by solving the facility location problem. [sent-163, score-1.285]

47 Finally, the saliencies ofthe potential salient regions and their constituent superpixels are computed from color and spatial location information. [sent-166, score-0.554]

48 Identifying A Set of Potential Region Centers It is computationally too expensive to use the whole set V as the set, J, of potential region centers to identify the final region centers, denoted by A. [sent-183, score-0.492]

49 chosen as a region center, the region extracted is more or less the same. [sent-185, score-0.274]

50 Extraction of Potential Salient Regions We model the problem of identifying high quality potential salient regions as selecting a subset, A, of J as the final region centers. [sent-192, score-0.521]

51 A is regarded as the set of locations for opening facilities, and the similarities between elements of A and superpixels eventually assigned to elements of A as the profits made by satisfying the demand of clients by facilities from A. [sent-193, score-0.654]

52 As discussed previously, this problem can be modeled as the facility location problem [22]. [sent-194, score-0.657]

53 With the constraint NA = |A| ≤ K, the combinatorial formulation of the facility lo=ca |tAio|n ≤ problem cino m[2b2i]n can biael applied ttioo our problem: mAaxH(A) =i? [sent-196, score-0.588]

54 A ⊆ J ⊆ V, NA ≤ K (3) where cij denotes the similarity between a vertex vi (considered as clients) and its potential region center vj (considered as facilities), and the cost fj of facility opening is fixed to The overall profit H : 2J → R on the graph G is a tsoub λm. [sent-200, score-1.549]

55 2 The first term encourages the similarity between vi and its assigned region center to have the greatest value. [sent-202, score-0.31]

56 The optimization favors region centers that are visually similar to their ‘clients’ . [sent-203, score-0.291]

57 It mitigates against fragmentation of visually homogenous regions, since the small gain in visual similarity to marginally ‘productive’ region centers is more than offset by the cost of opening such a facility. [sent-205, score-0.404]

58 K is the maximum number of salient regions that the algorithm might identify, and is a parameter specified by the user. [sent-211, score-0.317]

59 Generally, fewer than K locations are chosen because the marginal gain does not outweigh the facility cost. [sent-212, score-0.8]

60 1 Computation of cij cij serves as the profit made by satisfying the demand of client ifrom a facility at location j ∈ J. [sent-215, score-1.577]

61 Iitt sish an input cvoamri-able for the facility location problem. [sent-217, score-0.633]

62 Since not all nodes in G should be assigned to any j, we add a background node vg to G with label 0 so vi ∈ U can also be assigned to background. [sent-222, score-0.326]

63 Examples of facility location and facility assignment results (clustering) on two synthetic datasets. [sent-273, score-1.269]

64 The selected region centers (facility locations) are marked as circles. [sent-274, score-0.301]

65 (b)∼(e): a1, a2, a3 are yse l oecctatedio nb as)se adre on athrkeierd marginal gains ri na H(A c∪h {a}) H(A) ainp t uhrreese hiteer asttriuoncstu. [sent-277, score-0.255]

66 (f) Facility assignment d re osnul t hs by using ahal grmaionnsic in so Hl(utAio∪n {toa compute cij . [sent-281, score-0.409]

67 e (g) Facility assignment − results by simply using naive weight wij as cij . [sent-284, score-0.425]

68 (a) Input image; (b) Center prior map; (c) Face prior map; (d) Color prior map; (e) Final combined and smoothed prior map. [sent-287, score-0.32]

69 Hence the possibility of a potential region center close to A being selected increases during the subsequent iteration of the optimization. [sent-290, score-0.278]

70 cij can be computed by finding the harmonic function on the graph G with the labeled nodes L set to nodej with label 1 and the background node with label 0, while the other nodes in G are the unlabeled nodes U. [sent-291, score-0.737]

71 cij is the probability that a random walker starting from vi, will reach j before reaching the background node [9, 32]. [sent-293, score-0.416]

72 With cgj = 0 and cjj = 1, we can also obtain cij = hU ∈ Ru×1 for ∀i ∈ U. [sent-295, score-0.414]

73 cij is fixed during the subsequent optimization o anf (d3∀). [sent-298, score-0.328]

74 Here, the detected face regions Λ are assigned higher probabilities to generate the face prior map pf (x) = σ1 , for x ∈ Λ; otherwise pf (x) = 0. [sent-304, score-0.288]

75 We introduce an ‘assignment cost’ 1− PH for each superpixel tarnodd incorporate int minentot tchoes computation of cij as follows. [sent-327, score-0.375]

76 Given a labeled set L (comprised of a region center node vj and the background node vg), we augment the graph G to include a set of labeled nodes, by attaching a labeled node vqi to each unlabeled node vi (i ∈ U) as its prior. [sent-328, score-0.713]

77 l data points having high gains are aggregated in the area with high prior values; (i) Final selected facility locations with prior. [sent-343, score-0.772]

78 uth |A s|a =lien 5t region; (l) Saliency map without prior; (m) Saliency map with prior; (n) Salient region mask based on the saliency map in (m). [sent-346, score-0.577]

79 We can compute cij = hU ∈ Ru×1while cjj = 1 and cgj = 0. [sent-368, score-0.414]

80 The region center with the largest marginal gain is the location with the lowest assignment cost. [sent-371, score-0.435]

81 Hence this computation of cij encourages the selected facility locations to be close to low cost areas (i. [sent-373, score-1.022]

82 Figures 3(e) and 3(h) show the marginal gain for each point in J in the first iteration of the facility location optimization without and with the high level prior. [sent-378, score-0.842]

83 After highlevel prior integration, the points with large marginal gains are more concentrated in the perceptually important areas (indicated by high-level prior map) such as the flower and the flower leaf. [sent-379, score-0.27]

84 Compared to the selected region centers A without the prior in Figure 3(f), our approach with priors will select most of the potential region centers for A from the high prior areas as shown in Figure 3(i). [sent-380, score-0.856]

85 3 Potential Salient Region Extraction Given a set of selected facility locations A, let the current maximal profit from vi be ρicur = maxj∈A cij, and the facility assignment for vi be xicur = arg maxj∈A cij . [sent-383, score-1.99]

86 Hence, we ccolursrteesrp tohned image eeplesm 1e0nt −s t 1ha4t nsh Aarelg tohrei same facility elo, cwaetion as the most profitable to obtain potential salient regions {ri}i=1. [sent-389, score-0.972]

87 Figure 2 show two examples of facility location and facility assignment results (i. [sent-393, score-1.269]

88 The results in Figure 2(f) using a harmonic function to compute cij are better than the results in Figure 2(g) that simply uses the edge weight wij as cij . [sent-396, score-0.799]

89 The reason is that harmonic solution for cij enforces that nearby points have similar harmonic function values; this better models the geometry of the data induced by the graph structure (edges and weights W). [sent-397, score-0.549]

90 Figures 3(g) and 3(j) show the region extraction results for the two sets of selected region centers A shown in Figure 3(f) and Figure 3(i), respectively. [sent-402, score-0.438]

91 |A| , we next compute the saliency o efx ri icnt tnegrm {sr of} its color and spatial information. [sent-422, score-0.434]

92 The color saliency of ri is defined as: fc(ri) = ? [sent-425, score-0.434]

93 A region which has a wider spatial distribution is typically less salient than regions which have small spatial spread [20, 8]. [sent-430, score-0.454]

94 The spatial saliency of ri is computed as fs(ri) = 1 −maxVi (Vri) (ri). [sent-431, score-0.409]

95 ances of superpixels from ri to the spatial mean μk of region rk . [sent-434, score-0.374]

96 After fc and fs are maximum normalized to [0, 1], the saliency of ri is computed as: f(ri) = fc(ri)fs (ri). [sent-436, score-0.468]

97 We generate the final saliency map S by weighted averaging over superpixels, nwahle sarlei etnhec weights are computed by pair-wise feature distances between superpixels using a Gaussian kernel to enforce that similar superpixels should have similar visual saliency. [sent-437, score-0.554]

98 Figures 3(l) and 3(m) present the saliency maps using our approach without and with high-level priors, respectively. [sent-438, score-0.348]

99 Compared to the ground truth region in Figure 3(k), the saliency maps with priors are better than the result without priors. [sent-439, score-0.551]

100 (a) Input images; (b) Ground truth salient regions; (c) High-level prior map; (d) Saliency map without high level prior; (e) Saliency map with high level prior; (f) Salient Region extraction based on (e) by simple thresholding. [sent-443, score-0.501]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('facility', 0.588), ('cij', 0.328), ('saliency', 0.311), ('salient', 0.257), ('region', 0.137), ('icur', 0.129), ('centers', 0.125), ('profit', 0.115), ('clients', 0.108), ('superpixels', 0.1), ('ri', 0.098), ('harmonic', 0.094), ('facilities', 0.091), ('marginal', 0.087), ('wuu', 0.086), ('xicur', 0.086), ('submodularity', 0.083), ('gain', 0.083), ('hu', 0.082), ('prior', 0.08), ('vi', 0.078), ('client', 0.076), ('potential', 0.067), ('priors', 0.066), ('submodular', 0.065), ('iuu', 0.065), ('puu', 0.065), ('regions', 0.06), ('opening', 0.059), ('demand', 0.05), ('maxj', 0.05), ('wij', 0.049), ('hl', 0.048), ('assignment', 0.048), ('cia', 0.048), ('yq', 0.048), ('fj', 0.047), ('superpixel', 0.047), ('na', 0.047), ('satisfying', 0.047), ('node', 0.046), ('open', 0.046), ('vg', 0.046), ('du', 0.046), ('location', 0.045), ('nodes', 0.044), ('map', 0.043), ('centrality', 0.043), ('cgj', 0.043), ('cjj', 0.043), ('jhe', 0.043), ('profits', 0.043), ('wul', 0.043), ('locations', 0.042), ('background', 0.042), ('hb', 0.039), ('uu', 0.039), ('selected', 0.039), ('rk', 0.039), ('level', 0.039), ('vqi', 0.038), ('vj', 0.038), ('maps', 0.037), ('backgrounds', 0.036), ('xc', 0.035), ('assigned', 0.035), ('center', 0.035), ('pf', 0.035), ('hq', 0.033), ('labeled', 0.033), ('ij', 0.033), ('graph', 0.033), ('hs', 0.033), ('xl', 0.032), ('lazy', 0.032), ('minx', 0.032), ('fc', 0.031), ('yl', 0.029), ('topdown', 0.029), ('oiofn', 0.029), ('similarities', 0.029), ('favors', 0.029), ('unlabeled', 0.029), ('indicated', 0.028), ('fs', 0.028), ('greedy', 0.026), ('identify', 0.026), ('purely', 0.025), ('comprised', 0.025), ('figures', 0.025), ('encourages', 0.025), ('elements', 0.025), ('color', 0.025), ('dth', 0.024), ('vertex', 0.024), ('modeled', 0.024), ('ru', 0.024), ('gains', 0.023), ('eh', 0.023), ('matrix', 0.023)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 1.0000017 418 cvpr-2013-Submodular Salient Region Detection

Author: Zhuolin Jiang, Larry S. Davis

Abstract: The problem of salient region detection is formulated as the well-studied facility location problem from operations research. High-level priors are combined with low-level features to detect salient regions. Salient region detection is achieved by maximizing a submodular objective function, which maximizes the total similarities (i.e., total profits) between the hypothesized salient region centers (i.e., facility locations) and their region elements (i.e., clients), and penalizes the number of potential salient regions (i.e., the number of open facilities). The similarities are efficiently computedbyfinding a closed-form harmonic solution on the constructed graph for an input image. The saliency of a selected region is modeled in terms of appearance and spatial location. By exploiting the submodularity properties of the objectivefunction, a highly efficient greedy-based optimization algorithm can be employed. This algorithm is guaranteed to be at least a (e − 1)/e ≈ 0.632-approximation to t heeed optimum. lEeaxpster aim (een −tal 1 r)e/seult ≈s d 0e.m63o2n-satrpaptero txhimata our approach outperforms several recently proposed saliency detection approaches.

2 0.36559552 375 cvpr-2013-Saliency Detection via Graph-Based Manifold Ranking

Author: Chuan Yang, Lihe Zhang, Huchuan Lu, Xiang Ruan, Ming-Hsuan Yang

Abstract: Most existing bottom-up methods measure the foreground saliency of a pixel or region based on its contrast within a local context or the entire image, whereas a few methods focus on segmenting out background regions and thereby salient objects. Instead of considering the contrast between the salient objects and their surrounding regions, we consider both foreground and background cues in a different way. We rank the similarity of the image elements (pixels or regions) with foreground cues or background cues via graph-based manifold ranking. The saliency of the image elements is defined based on their relevances to the given seeds or queries. We represent the image as a close-loop graph with superpixels as nodes. These nodes are ranked based on the similarity to background and foreground queries, based on affinity matrices. Saliency detection is carried out in a two-stage scheme to extract background regions and foreground salient objects efficiently. Experimental results on two large benchmark databases demonstrate the proposed method performs well when against the state-of-the-art methods in terms of accuracy and speed. We also create a more difficult bench- mark database containing 5,172 images to test the proposed saliency model and make this database publicly available with this paper for further studies in the saliency field.

3 0.33659682 202 cvpr-2013-Hierarchical Saliency Detection

Author: Qiong Yan, Li Xu, Jianping Shi, Jiaya Jia

Abstract: When dealing with objects with complex structures, saliency detection confronts a critical problem namely that detection accuracy could be adversely affected if salient foreground or background in an image contains small-scale high-contrast patterns. This issue is common in natural images and forms a fundamental challenge for prior methods. We tackle it from a scale point of view and propose a multi-layer approach to analyze saliency cues. The final saliency map is produced in a hierarchical model. Different from varying patch sizes or downsizing images, our scale-based region handling is by finding saliency values optimally in a tree model. Our approach improves saliency detection on many images that cannot be handled well traditionally. A new dataset is also constructed. –

4 0.33091545 376 cvpr-2013-Salient Object Detection: A Discriminative Regional Feature Integration Approach

Author: Huaizu Jiang, Jingdong Wang, Zejian Yuan, Yang Wu, Nanning Zheng, Shipeng Li

Abstract: Salient object detection has been attracting a lot of interest, and recently various heuristic computational models have been designed. In this paper, we regard saliency map computation as a regression problem. Our method, which is based on multi-level image segmentation, uses the supervised learning approach to map the regional feature vector to a saliency score, and finally fuses the saliency scores across multiple levels, yielding the saliency map. The contributions lie in two-fold. One is that we show our approach, which integrates the regional contrast, regional property and regional backgroundness descriptors together to form the master saliency map, is able to produce superior saliency maps to existing algorithms most of which combine saliency maps heuristically computed from different types of features. The other is that we introduce a new regional feature vector, backgroundness, to characterize the background, which can be regarded as a counterpart of the objectness descriptor [2]. The performance evaluation on several popular benchmark data sets validates that our approach outperforms existing state-of-the-arts.

5 0.30314684 322 cvpr-2013-PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures with Spatial Priors

Author: Keyang Shi, Keze Wang, Jiangbo Lu, Liang Lin

Abstract: Driven by recent vision and graphics applications such as image segmentation and object recognition, assigning pixel-accurate saliency values to uniformly highlight foreground objects becomes increasingly critical. More often, such fine-grained saliency detection is also desired to have a fast runtime. Motivated by these, we propose a generic and fast computational framework called PISA Pixelwise Image Saliency Aggregating complementary saliency cues based on color and structure contrasts with spatial priors holistically. Overcoming the limitations of previous methods often using homogeneous superpixel-based and color contrast-only treatment, our PISA approach directly performs saliency modeling for each individual pixel and makes use of densely overlapping, feature-adaptive observations for saliency measure computation. We further impose a spatial prior term on each of the two contrast measures, which constrains pixels rendered salient to be compact and also centered in image domain. By fusing complementary contrast measures in such a pixelwise adaptive manner, the detection effectiveness is significantly boosted. Without requiring reliable region segmentation or post– relaxation, PISA exploits an efficient edge-aware image representation and filtering technique and produces spatially coherent yet detail-preserving saliency maps. Extensive experiments on three public datasets demonstrate PISA’s superior detection accuracy and competitive runtime speed over the state-of-the-arts approaches.

6 0.27578643 273 cvpr-2013-Looking Beyond the Image: Unsupervised Learning for Object Saliency and Detection

7 0.27168509 374 cvpr-2013-Saliency Aggregation: A Data-Driven Approach

8 0.20988998 258 cvpr-2013-Learning Video Saliency from Human Gaze Using Candidate Selection

9 0.17766005 12 cvpr-2013-A Global Approach for the Detection of Vanishing Points and Mutually Orthogonal Vanishing Directions

10 0.17103313 411 cvpr-2013-Statistical Textural Distinctiveness for Salient Region Detection in Natural Images

11 0.16324389 450 cvpr-2013-Unsupervised Joint Object Discovery and Segmentation in Internet Images

12 0.12044604 205 cvpr-2013-Hollywood 3D: Recognizing Actions in 3D Natural Scenes

13 0.10684686 325 cvpr-2013-Part Discovery from Partial Correspondence

14 0.10076997 464 cvpr-2013-What Makes a Patch Distinct?

15 0.094545834 200 cvpr-2013-Harvesting Mid-level Visual Concepts from Large-Scale Internet Images

16 0.093731649 20 cvpr-2013-A New Model and Simple Algorithms for Multi-label Mumford-Shah Problems

17 0.090751044 217 cvpr-2013-Improving an Object Detector and Extracting Regions Using Superpixels

18 0.083489269 370 cvpr-2013-SCALPEL: Segmentation Cascades with Localized Priors and Efficient Learning

19 0.079414375 263 cvpr-2013-Learning the Change for Automatic Image Cropping

20 0.071065091 264 cvpr-2013-Learning to Detect Partially Overlapping Instances


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.176), (1, -0.123), (2, 0.352), (3, 0.196), (4, -0.032), (5, -0.019), (6, 0.01), (7, -0.043), (8, 0.001), (9, 0.016), (10, 0.076), (11, 0.013), (12, -0.046), (13, 0.014), (14, -0.034), (15, 0.022), (16, 0.014), (17, 0.007), (18, -0.008), (19, 0.026), (20, -0.0), (21, 0.012), (22, -0.086), (23, 0.033), (24, 0.007), (25, 0.019), (26, -0.041), (27, -0.034), (28, -0.007), (29, 0.044), (30, 0.032), (31, 0.06), (32, 0.043), (33, 0.039), (34, -0.034), (35, 0.027), (36, -0.024), (37, -0.017), (38, 0.083), (39, -0.047), (40, 0.047), (41, -0.032), (42, 0.03), (43, -0.023), (44, 0.008), (45, 0.007), (46, -0.04), (47, 0.026), (48, -0.028), (49, -0.072)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.9413482 418 cvpr-2013-Submodular Salient Region Detection

Author: Zhuolin Jiang, Larry S. Davis

Abstract: The problem of salient region detection is formulated as the well-studied facility location problem from operations research. High-level priors are combined with low-level features to detect salient regions. Salient region detection is achieved by maximizing a submodular objective function, which maximizes the total similarities (i.e., total profits) between the hypothesized salient region centers (i.e., facility locations) and their region elements (i.e., clients), and penalizes the number of potential salient regions (i.e., the number of open facilities). The similarities are efficiently computedbyfinding a closed-form harmonic solution on the constructed graph for an input image. The saliency of a selected region is modeled in terms of appearance and spatial location. By exploiting the submodularity properties of the objectivefunction, a highly efficient greedy-based optimization algorithm can be employed. This algorithm is guaranteed to be at least a (e − 1)/e ≈ 0.632-approximation to t heeed optimum. lEeaxpster aim (een −tal 1 r)e/seult ≈s d 0e.m63o2n-satrpaptero txhimata our approach outperforms several recently proposed saliency detection approaches.

2 0.88843089 375 cvpr-2013-Saliency Detection via Graph-Based Manifold Ranking

Author: Chuan Yang, Lihe Zhang, Huchuan Lu, Xiang Ruan, Ming-Hsuan Yang

Abstract: Most existing bottom-up methods measure the foreground saliency of a pixel or region based on its contrast within a local context or the entire image, whereas a few methods focus on segmenting out background regions and thereby salient objects. Instead of considering the contrast between the salient objects and their surrounding regions, we consider both foreground and background cues in a different way. We rank the similarity of the image elements (pixels or regions) with foreground cues or background cues via graph-based manifold ranking. The saliency of the image elements is defined based on their relevances to the given seeds or queries. We represent the image as a close-loop graph with superpixels as nodes. These nodes are ranked based on the similarity to background and foreground queries, based on affinity matrices. Saliency detection is carried out in a two-stage scheme to extract background regions and foreground salient objects efficiently. Experimental results on two large benchmark databases demonstrate the proposed method performs well when against the state-of-the-art methods in terms of accuracy and speed. We also create a more difficult bench- mark database containing 5,172 images to test the proposed saliency model and make this database publicly available with this paper for further studies in the saliency field.

3 0.8736372 374 cvpr-2013-Saliency Aggregation: A Data-Driven Approach

Author: Long Mai, Yuzhen Niu, Feng Liu

Abstract: A variety of methods have been developed for visual saliency analysis. These methods often complement each other. This paper addresses the problem of aggregating various saliency analysis methods such that the aggregation result outperforms each individual one. We have two major observations. First, different methods perform differently in saliency analysis. Second, the performance of a saliency analysis method varies with individual images. Our idea is to use data-driven approaches to saliency aggregation that appropriately consider the performance gaps among individual methods and the performance dependence of each method on individual images. This paper discusses various data-driven approaches and finds that the image-dependent aggregation method works best. Specifically, our method uses a Conditional Random Field (CRF) framework for saliency aggregation that not only models the contribution from individual saliency map but also the interaction between neighboringpixels. To account for the dependence of aggregation on an individual image, our approach selects a subset of images similar to the input image from a training data set and trains the CRF aggregation model only using this subset instead of the whole training set. Our experiments on public saliency benchmarks show that our aggregation method outperforms each individual saliency method and is robust with the selection of aggregated methods.

4 0.87285626 322 cvpr-2013-PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures with Spatial Priors

Author: Keyang Shi, Keze Wang, Jiangbo Lu, Liang Lin

Abstract: Driven by recent vision and graphics applications such as image segmentation and object recognition, assigning pixel-accurate saliency values to uniformly highlight foreground objects becomes increasingly critical. More often, such fine-grained saliency detection is also desired to have a fast runtime. Motivated by these, we propose a generic and fast computational framework called PISA Pixelwise Image Saliency Aggregating complementary saliency cues based on color and structure contrasts with spatial priors holistically. Overcoming the limitations of previous methods often using homogeneous superpixel-based and color contrast-only treatment, our PISA approach directly performs saliency modeling for each individual pixel and makes use of densely overlapping, feature-adaptive observations for saliency measure computation. We further impose a spatial prior term on each of the two contrast measures, which constrains pixels rendered salient to be compact and also centered in image domain. By fusing complementary contrast measures in such a pixelwise adaptive manner, the detection effectiveness is significantly boosted. Without requiring reliable region segmentation or post– relaxation, PISA exploits an efficient edge-aware image representation and filtering technique and produces spatially coherent yet detail-preserving saliency maps. Extensive experiments on three public datasets demonstrate PISA’s superior detection accuracy and competitive runtime speed over the state-of-the-arts approaches.

5 0.85352737 376 cvpr-2013-Salient Object Detection: A Discriminative Regional Feature Integration Approach

Author: Huaizu Jiang, Jingdong Wang, Zejian Yuan, Yang Wu, Nanning Zheng, Shipeng Li

Abstract: Salient object detection has been attracting a lot of interest, and recently various heuristic computational models have been designed. In this paper, we regard saliency map computation as a regression problem. Our method, which is based on multi-level image segmentation, uses the supervised learning approach to map the regional feature vector to a saliency score, and finally fuses the saliency scores across multiple levels, yielding the saliency map. The contributions lie in two-fold. One is that we show our approach, which integrates the regional contrast, regional property and regional backgroundness descriptors together to form the master saliency map, is able to produce superior saliency maps to existing algorithms most of which combine saliency maps heuristically computed from different types of features. The other is that we introduce a new regional feature vector, backgroundness, to characterize the background, which can be regarded as a counterpart of the objectness descriptor [2]. The performance evaluation on several popular benchmark data sets validates that our approach outperforms existing state-of-the-arts.

6 0.84820598 202 cvpr-2013-Hierarchical Saliency Detection

7 0.79697114 411 cvpr-2013-Statistical Textural Distinctiveness for Salient Region Detection in Natural Images

8 0.71183443 273 cvpr-2013-Looking Beyond the Image: Unsupervised Learning for Object Saliency and Detection

9 0.67553073 258 cvpr-2013-Learning Video Saliency from Human Gaze Using Candidate Selection

10 0.56605393 263 cvpr-2013-Learning the Change for Automatic Image Cropping

11 0.53448534 464 cvpr-2013-What Makes a Patch Distinct?

12 0.51185203 450 cvpr-2013-Unsupervised Joint Object Discovery and Segmentation in Internet Images

13 0.41952288 280 cvpr-2013-Maximum Cohesive Grid of Superpixels for Fast Object Localization

14 0.41061333 26 cvpr-2013-A Statistical Model for Recreational Trails in Aerial Images

15 0.3875173 458 cvpr-2013-Voxel Cloud Connectivity Segmentation - Supervoxels for Point Clouds

16 0.37239781 366 cvpr-2013-Robust Region Grouping via Internal Patch Statistics

17 0.36344206 12 cvpr-2013-A Global Approach for the Detection of Vanishing Points and Mutually Orthogonal Vanishing Directions

18 0.34745857 193 cvpr-2013-Graph Transduction Learning with Connectivity Constraints with Application to Multiple Foreground Cosegmentation

19 0.34606534 468 cvpr-2013-Winding Number for Region-Boundary Consistent Salient Contour Extraction

20 0.34089953 29 cvpr-2013-A Video Representation Using Temporal Superpixels


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(10, 0.095), (16, 0.031), (26, 0.047), (29, 0.269), (33, 0.244), (67, 0.083), (69, 0.032), (87, 0.101)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.84384954 391 cvpr-2013-Sensing and Recognizing Surface Textures Using a GelSight Sensor

Author: Rui Li, Edward H. Adelson

Abstract: Sensing surface textures by touch is a valuable capability for robots. Until recently it wwas difficult to build a compliant sensor with high sennsitivity and high resolution. The GelSight sensor is coompliant and offers sensitivity and resolution exceeding that of the human fingertips. This opens the possibility of measuring and recognizing highly detailed surface texxtures. The GelSight sensor, when pressed against a surfacce, delivers a height map. This can be treated as an image, aand processed using the tools of visual texture analysis. WWe have devised a simple yet effective texture recognitioon system based on local binary patterns, and enhanced it by the use of a multi-scale pyramid and a Hellinger ddistance metric. We built a database with 40 classes of taactile textures using materials such as fabric, wood, and sanndpaper. Our system can correctly categorize materials fromm this database with high accuracy. This suggests that the GGelSight sensor can be useful for material recognition by roobots.

2 0.83981949 140 cvpr-2013-Efficient Color Boundary Detection with Color-Opponent Mechanisms

Author: Kaifu Yang, Shaobing Gao, Chaoyi Li, Yongjie Li

Abstract: Color information plays an important role in better understanding of natural scenes by at least facilitating discriminating boundaries of objects or areas. In this study, we propose a new framework for boundary detection in complex natural scenes based on the color-opponent mechanisms of the visual system. The red-green and blue-yellow color opponent channels in the human visual system are regarded as the building blocks for various color perception tasks such as boundary detection. The proposed framework is a feedforward hierarchical model, which has direct counterpart to the color-opponent mechanisms involved in from the retina to the primary visual cortex (V1). Results show that our simple framework has excellent ability to flexibly capture both the structured chromatic and achromatic boundaries in complex scenes.

same-paper 3 0.81462765 418 cvpr-2013-Submodular Salient Region Detection

Author: Zhuolin Jiang, Larry S. Davis

Abstract: The problem of salient region detection is formulated as the well-studied facility location problem from operations research. High-level priors are combined with low-level features to detect salient regions. Salient region detection is achieved by maximizing a submodular objective function, which maximizes the total similarities (i.e., total profits) between the hypothesized salient region centers (i.e., facility locations) and their region elements (i.e., clients), and penalizes the number of potential salient regions (i.e., the number of open facilities). The similarities are efficiently computedbyfinding a closed-form harmonic solution on the constructed graph for an input image. The saliency of a selected region is modeled in terms of appearance and spatial location. By exploiting the submodularity properties of the objectivefunction, a highly efficient greedy-based optimization algorithm can be employed. This algorithm is guaranteed to be at least a (e − 1)/e ≈ 0.632-approximation to t heeed optimum. lEeaxpster aim (een −tal 1 r)e/seult ≈s d 0e.m63o2n-satrpaptero txhimata our approach outperforms several recently proposed saliency detection approaches.

4 0.80367196 35 cvpr-2013-Adaptive Compressed Tomography Sensing

Author: Oren Barkan, Jonathan Weill, Amir Averbuch, Shai Dekel

Abstract: One of the main challenges in Computed Tomography (CT) is how to balance between the amount of radiation the patient is exposed to during scan time and the quality of the CT image. We propose a mathematical model for adaptive CT acquisition whose goal is to reduce dosage levels while maintaining high image quality at the same time. The adaptive algorithm iterates between selective limited acquisition and improved reconstruction, with the goal of applying only the dose level required for sufficient image quality. The theoretical foundation of the algorithm is nonlinear Ridgelet approximation and a discrete form of Ridgelet analysis is used to compute the selective acquisition steps that best capture the image edges. We show experimental results where for the same number of line projections, the adaptive model produces higher image quality, when compared with standard limited angle, non-adaptive acquisition algorithms.

5 0.77120119 338 cvpr-2013-Probabilistic Elastic Matching for Pose Variant Face Verification

Author: Haoxiang Li, Gang Hua, Zhe Lin, Jonathan Brandt, Jianchao Yang

Abstract: Pose variation remains to be a major challenge for realworld face recognition. We approach this problem through a probabilistic elastic matching method. We take a part based representation by extracting local features (e.g., LBP or SIFT) from densely sampled multi-scale image patches. By augmenting each feature with its location, a Gaussian mixture model (GMM) is trained to capture the spatialappearance distribution of all face images in the training corpus. Each mixture component of the GMM is confined to be a spherical Gaussian to balance the influence of the appearance and the location terms. Each Gaussian component builds correspondence of a pair of features to be matched between two faces/face tracks. For face verification, we train an SVM on the vector concatenating the difference vectors of all the feature pairs to decide if a pair of faces/face tracks is matched or not. We further propose a joint Bayesian adaptation algorithm to adapt the universally trained GMM to better model the pose variations between the target pair of faces/face tracks, which consistently improves face verification accuracy. Our experiments show that our method outperforms the state-ofthe-art in the most restricted protocol on Labeled Face in the Wild (LFW) and the YouTube video face database by a significant margin.

6 0.76599443 215 cvpr-2013-Improved Image Set Classification via Joint Sparse Approximated Nearest Subspaces

7 0.73081797 365 cvpr-2013-Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities

8 0.72922611 248 cvpr-2013-Learning Collections of Part Models for Object Recognition

9 0.72634822 242 cvpr-2013-Label Propagation from ImageNet to 3D Point Clouds

10 0.72610867 72 cvpr-2013-Boundary Detection Benchmarking: Beyond F-Measures

11 0.72606587 322 cvpr-2013-PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures with Spatial Priors

12 0.72605824 71 cvpr-2013-Boundary Cues for 3D Object Shape Recovery

13 0.72579074 119 cvpr-2013-Detecting and Aligning Faces by Image Retrieval

14 0.72534478 12 cvpr-2013-A Global Approach for the Detection of Vanishing Points and Mutually Orthogonal Vanishing Directions

15 0.72510934 15 cvpr-2013-A Lazy Man's Approach to Benchmarking: Semisupervised Classifier Evaluation and Recalibration

16 0.7247858 98 cvpr-2013-Cross-View Action Recognition via a Continuous Virtual Path

17 0.72476381 254 cvpr-2013-Learning SURF Cascade for Fast and Accurate Object Detection

18 0.7246471 225 cvpr-2013-Integrating Grammar and Segmentation for Human Pose Estimation

19 0.72396183 339 cvpr-2013-Probabilistic Graphlet Cut: Exploiting Spatial Structure Cue for Weakly Supervised Image Segmentation

20 0.72360349 277 cvpr-2013-MODEC: Multimodal Decomposable Models for Human Pose Estimation