iccv iccv2013 iccv2013-288 knowledge-graph by maker-knowledge-mining

288 iccv-2013-Nested Shape Descriptors

Source: pdf

Author: Jeffrey Byrne, Jianbo Shi

Abstract: In this paper, we propose a new family of binary local feature descriptors called nested shape descriptors. These descriptors are constructed by pooling oriented gradients over a large geometric structure called the Hawaiian earring, which is constructed with a nested correlation structure that enables a new robust local distance function called the nesting distance. This distance function is unique to the nested descriptor and provides robustness to outliers from order statistics. In this paper, we define the nested shape descriptor family and introduce a specific member called the seed-of-life descriptor. We perform a trade study to determine optimal descriptor parameters for the task of image matching. Finally, we evaluate performance compared to state-of-the-art local feature descriptors on the VGGAffine image matching benchmark, showing significant performance gains. Our descriptor is thefirst binary descriptor to outperform SIFT on this benchmark.

Reference: text

Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 edu Abstract In this paper, we propose a new family of binary local feature descriptors called nested shape descriptors. [sent-3, score-0.836]

2 These descriptors are constructed by pooling oriented gradients over a large geometric structure called the Hawaiian earring, which is constructed with a nested correlation structure that enables a new robust local distance function called the nesting distance. [sent-4, score-1.59]

3 This distance function is unique to the nested descriptor and provides robustness to outliers from order statistics. [sent-5, score-0.828]

4 In this paper, we define the nested shape descriptor family and introduce a specific member called the seed-of-life descriptor. [sent-6, score-0.908]

5 We perform a trade study to determine optimal descriptor parameters for the task of image matching. [sent-7, score-0.247]

6 Our descriptor is thefirst binary descriptor to outperform SIFT on this benchmark. [sent-9, score-0.407]

7 It is well known that for the task ofimage matching, descriptors constructed with larger support outperform descriptors with smaller support [20, 8, 3, 17, 15]. [sent-16, score-0.33]

8 (left) Hawaiian earrings with k-fold rotational symmetry define a member of the nested shape descriptor family called the seed-oflife descriptor (right) Two Hawaiian earrings substructures in the seed-of-life descriptor are highlighted in grey. [sent-18, score-1.576]

9 For example, there may be arbitrarily large outliers in the descriptor due to occlusions and geometric variation effects far from the descriptor center. [sent-20, score-0.417]

10 In this paper, we introduce nested shape descriptors to address this tradeoff. [sent-22, score-0.699]

11 A nested shape descriptor (NSD) is a family of binary local feature descriptors constructed by pooling oriented and scaled gradients over a large geometric structure called an Hawaiian earring. [sent-23, score-1.211]

12 An example of the nested shape descriptor is shown in figure 1. [sent-24, score-0.789]

13 Each descrip- tor has global support covering the entire image, and the structure of the descriptor exhibits fractal self-similarity in scale. [sent-25, score-0.252]

14 This correlated nested structure enables new a robust distance function called the nesting distance. [sent-26, score-1.25]

15 The nesting distance uses order statistics for robustness to outliers while maintaining a descriptor with global support. [sent-27, score-0.881]

16 1201 • • Binary: NSDs are binary, which enables for compact storage :a NndS Dalslo awres bthinea nesting hdi estnaanbclee st foo use a pfaacstt Hamming distance, without sacrificing matching performance. [sent-30, score-0.646]

17 Robust local distance function: The nesting distance iRs a quadratic dloisctaaln dcies tfaunnccet ifounn:cTt iohne tnheastt nisg r odibsutsant ctoe corruption of the descriptor due to occlusions, geometric variations or lighting. [sent-31, score-0.942]

18 In this paper, we provide sufficient conditions for construction of a nested shape descriptor using key concepts of cumulative nested pooling and log spiral normalization. [sent-32, score-1.538]

19 We perform a trade study to determine optimal descriptor parameters for the task of image matching. [sent-33, score-0.247]

20 Recent work has focused on introducing binary features from local comparison tests [3, 8, 17, 15] which enables fast distance metric based on Hamming distance and faster derivatives [13]. [sent-40, score-0.218]

21 A taxonomy for comparing and contrasting local feature descriptors can be described in terms of five criteria: preprocessing, support, pooling, normalization and descriptor distance. [sent-42, score-0.334]

22 Preprocessing refers to the filtering performed on the input image, support patterns are the geometric struc- ture used for constructing the descriptor and pooling is the aggregation of filter responses over the support structure. [sent-43, score-0.383]

23 Using this taxonomy, the nested shape descriptor is most closely related to DAISY, BRISK and FREAK. [sent-45, score-0.789]

24 In the taxonomy of [16], the nesting distance is per-exemplar (“where”), online (“when”) using order statistics (“how”) without requiring any offline training. [sent-51, score-0.719]

25 Nested Shape Descriptors In this section, we describe the construction of nested shape descriptors. [sent-53, score-0.615]

26 NSD are constructed by first defining the nested pooling structure (section 3. [sent-54, score-0.672]

27 We provide definitions for this construction and show how the nested shape descriptor is constructed from these pieces (section 3. [sent-56, score-0.816]

28 4), which uses the properties of the nested descriptor to provide robust distance function. [sent-59, score-0.801]

29 Finally, we define a specific member of the nested shape descriptor family called the seed-of-life descriptor (section 3. [sent-60, score-1.082]

30 The nested descriptor and nesting distance are compared to a generic grid descriptor (e. [sent-64, score-1.581]

31 The red X’s and green checkmarks show where a grid descriptor is corrupted due to the scene variation, which leads to poor matching performance. [sent-67, score-0.261]

32 For these cases, the NSD and nesting distance are able to select the best subset of supports during matching to provide robustness to these scene variations. [sent-68, score-0.789]

33 Given a pair of descriptors, the nesting distance computes a weighted sum of the best k coordinate matches. [sent-71, score-0.661]

34 The nesting distance relies on nesting, such that all supports are linked by exactly one point in the center of the descriptor. [sent-75, score-0.797]

35 Hawaiian Earrings and Nested Pooling Nested shape descriptors represent shape using cumulative pooling of oriented gradients within Hawaiian earrings. [sent-80, score-0.335]

36 Figure 1 (right) shows an example of the Hawaiian 1202 distance selects the best subset of supports in the nested descriptor that cover only the object (green checkmarks). [sent-81, score-0.9]

37 (middle) Viewpoint changes for long and thin foreground structures introduce errors in grid descriptor matching due to large changes in the background. [sent-82, score-0.219]

38 The nesting distance selects the subset of supports during matching that cover the foreground and are the correct scale to allow for background variation. [sent-83, score-0.789]

39 (right) Scale changes without scale invariant detectors introduce errors in grid descriptor matching due to changes in local support. [sent-84, score-0.235]

40 The nesting distance uses a subset of both large and small scale supports, ignoring intermediate scale supports with corruption. [sent-85, score-0.76]

41 earring substructure formed by a nested set of circles all intersecting at exactly one point at the center. [sent-86, score-0.703]

42 The Hawaiian earring is a nested structure analogous to Matryoshka or Russian nesting dolls, where each smaller doll fits neatly inside the next larger doll. [sent-87, score-1.258]

43 Hawaiian earrings may be combined into sets such that each earring is called a lobe. [sent-88, score-0.27]

44 Each lobe exhibits scale symmetry and all earrings intersect at exactly one point in the center. [sent-89, score-0.232]

45 For example, in figure 1(right), the two lobes highlighted in grey are Hawaiian earrings K6 (1) and K6 (4) and the two largest circles are referenced as supports K6 (1, 4) and K6 (4, 4). [sent-97, score-0.327]

46 Nested Shape Descriptors A nested shape descriptor D at interest point p is defined by nested pooling, logarithmic spiral normalization and binarization of oriented gradients B over a nested support Kn. [sent-100, score-2.2]

47 01 iofthdˆe(ir,wjis,ek) > 0 (3) Equation (1) is nested pooling. [sent-102, score-0.556]

48 The descriptor d(i, j,k) is the pooled response for orientation subband i, lobe j and lobe scale k. [sent-104, score-0.319]

49 Observe that the bandpass octave scale s is equal to the Hawaiian earring support radius k. [sent-105, score-0.253]

50 As the support radius increases, the pooling support contains the next smaller support, resulting in nested pooling within a lobe. [sent-107, score-0.865]

51 A nested support set Kn exhibits a logarithmic spiral when considering neighboring supports. [sent-113, score-0.791]

52 A nested shape descriptor can be binarized by computing the sign of (2). [sent-120, score-0.789]

53 This constructs a nested shape descriptor with binary entries. [sent-121, score-0.829]

54 (top) Logarithmic spiral property of the nested shape descriptor provides normalization and binarization. [sent-124, score-0.914]

55 (bottom) An NSD is formed at each interest point by (left) nested pooling of scaled and oriented gradients and (right) logspiral difference and binarization. [sent-126, score-0.71]

56 pooling is equivalent to pooling of fixed radius over scales × of a steerable pyramid [19], which is analogous to a “flattening” of a pyramid representation of scaled and oriented gradients. [sent-127, score-0.325]

58 The Seed-of-Life Descriptor The nested shape descriptors in section 3. [sent-132, score-0.699]

59 In this section, we define a specific member of this family called the seed-of-life nested shape descriptor or simply the seedof-life descriptor. [sent-135, score-0.908]

60 The seed-of-life descriptor is a nested shape descriptor such that the nested pooling Kn is defined using a rotationally symmetric geometric structure called the seed-oflife. [sent-136, score-1.661]

61 ber of the nested shape descriptor family since it exhibits rotational symmetry where Hawaiian earring lobes are spaced uniformly in angle. [sent-143, score-1.095]

62 Nesting Distance The nesting distance is a robust quadratic local distance function [16] unique to NSDs based on order statistics. [sent-147, score-0.748]

63 Given two nested descriptors p and q, the nesting distance d(p, q) uses order statistics to partition the supports of two nested descriptors into inliers and outliers by sorting the squared differences up to a given maximum order k. [sent-148, score-2.145]

64 Then, × the nesting distance is equivalent to computing the conditional Gaussian distribution of inliers given outliers. [sent-149, score-0.722]

65 Let p and q be two nested descriptors of length n. [sent-163, score-0.64]

66 dLifefet trhenisc partition )b e represented by selection − 1204 points between the reference image (middle) and the observed image using the nesting distance (left) and Euclidean distance (right). [sent-165, score-0.749]

67 The Euclidean distance is affected by occlusions at the image boundary (left ellipse) resulting in local misalignments, while the nested distance is more robust to these occlusion effects. [sent-166, score-0.736]

68 Then, the nesting distance d is d(p, q, Λ, k) = (p −q)T(I −S(k+1,n))ΛS(1,k)(p −q) (6) where Lambda is an optional quadratic weighting matrix. [sent-168, score-0.661]

69 Furthermore, if k = n and Λ = I then the nesting distance is equivalent to the Euclidean distance. [sent-170, score-0.68]

70 If the nesting distance is of the form (6), then it is equivalent to an unnormalizednegative log likelihood ofa conditional Gaussian distribution for inliers given outliers. [sent-173, score-0.722]

71 The nesting distance was designed specifically for the structure of the nested shape descriptor. [sent-182, score-1.276]

72 Therefore, this enables the use of order statistics to partition the supports into inliers and outliers, since all supports have one point in common. [sent-185, score-0.276]

73 The nesting distance cannot be used for descriptors with support constructed on a log-polar or Cartesian grid. [sent-186, score-0.822]

74 Figure 4 shows an example of the benefits of the nesting distance for image matching. [sent-191, score-0.661]

75 We extract interest points using an edge based detector, compute nested descriptors at each point, then perform greedy minimum distance assignment from the reference to the observation using either the nesting distance or Euclidean distance. [sent-192, score-1.372]

76 This example shows that the nested distance is more robust to occlusions at the image border than the Euclidean distance. [sent-193, score-0.649]

77 Finally, The nesting distance has two useful properties that are proven in the supplementary material. [sent-194, score-0.661]

78 First, the nesting distance is non-metric, since it does not satisfy identity or the triangle inequality properties. [sent-195, score-0.661]

79 Second, the nesting distance is robust up to corruption of n k coordinates. [sent-197, score-0.661]

80 Experimental Results In this section, we provide experimental results for the nested shape descriptor and nesting distance for the task of image matching. [sent-199, score-1.45]

81 First, we perform a trade study using the new experimental protocol of similarity stereo matching to determine an optimal set of descriptor parameters for the seed-of-life descriptor. [sent-200, score-0.343]

82 Next, we compare results for the seed-of-life and binary seed-of-life descriptor for the standard VGG-Affine benchmark [12] against SIFT [9] and BRISK [8]. [sent-201, score-0.214]

83 Finally, we show results on a challeng1205 Both SOL and BSOL outperform SIFT and BRISK, and Binary-SOL is the first binary descriptor to outperform SIFT on this benchmark. [sent-202, score-0.252]

84 VGG-Affine We show comparative performance for local feature descriptor matching on the VGG-Affine benchmark [12]. [sent-208, score-0.219]

85 We compare the performance of seed-of-life (SOL) and binary SOL descriptor (section 3. [sent-213, score-0.214]

86 Both SOL and Binary SOL use the Euclidean (and Hamming) distance, as we evaluate the effect of the nesting distance separately in section 4. [sent-217, score-0.661]

87 Furthermore, the binary SOL and SOL descriptor perform equally, which shows that the binarization provides a more compact descriptor without sacrificing performance. [sent-227, score-0.442]

88 VGG-Affine and Local Distance Functions Next, we performed a comparison of the nesting distance vs. [sent-230, score-0.661]

89 This evaluation was proposed to demonstrate the relative benefit of the nesting distance over the Euclidean distance baseline. [sent-232, score-0.732]

90 All distortion classes showed improved performance of the nesting distance over Euclidean. [sent-234, score-0.69]

91 This result summarizes the known tradeoff between descriptor support and matching performance as was discussed in section 1. [sent-264, score-0.253]

92 Automated Helicopter Landing In this section, we describe an application of the nested shape descriptors to the problem of visual landing of a ro- tary wing platform. [sent-274, score-0.789]

93 Seed-of-life descriptors are used to estimate the position and orientation of a candidate landing zone during approach and landing. [sent-275, score-0.276]

94 Visual pose estimation for landing is the problem of estimating the 6-DOF position and orientation of a moving landing zone relative to a vehicle with suitable accuracy for safe landing. [sent-276, score-0.282]

95 Application of the nested shape descriptors to visual landing zone pose estimation. [sent-280, score-0.858]

96 We com- pared the estimated landing zone position to differential GPS ground truth and results show that the nested shape descriptors achieve 2σ position errors in X, Y and Z of less than 1ft during the descent and landing. [sent-282, score-0.858]

97 Conclusions In this paper, we introduced the nested shape descriptor family and the associated nesting distance, and showed performance of the seed-of-life descriptor for the task of image matching. [sent-284, score-1.601]

98 Results show that this is the first binary descriptor to outperform SIFT on the standard VGG-Affine benchmark. [sent-285, score-0.233]

99 Furthermore, the NSD binary descriptor significantly outperforms BRISK, a state-of-the-art binary descriptor. [sent-286, score-0.254]

100 Evaluation of the nesting distance on VGG-Affine dataset. [sent-314, score-0.661]

similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('nesting', 0.59), ('nested', 0.556), ('hawaiian', 0.281), ('descriptor', 0.174), ('nsd', 0.161), ('earrings', 0.125), ('earring', 0.112), ('spiral', 0.104), ('brisk', 0.104), ('supports', 0.099), ('landing', 0.09), ('pooling', 0.089), ('descriptors', 0.084), ('sol', 0.074), ('distance', 0.071), ('lobes', 0.069), ('zone', 0.069), ('shape', 0.059), ('logarithmic', 0.053), ('support', 0.05), ('trade', 0.05), ('family', 0.048), ('stereo', 0.046), ('checkmarks', 0.042), ('lobe', 0.042), ('navair', 0.042), ('nsds', 0.042), ('inliers', 0.042), ('daisy', 0.04), ('binary', 0.04), ('taxonomy', 0.039), ('sift', 0.039), ('member', 0.038), ('freak', 0.037), ('octave', 0.035), ('called', 0.033), ('orientation', 0.033), ('radius', 0.031), ('euclidean', 0.031), ('hamming', 0.03), ('distortion', 0.029), ('matching', 0.029), ('rotational', 0.029), ('exhibits', 0.028), ('ebyrne', 0.028), ('subband', 0.028), ('steerable', 0.028), ('binarization', 0.027), ('sacrificing', 0.027), ('constructed', 0.027), ('outliers', 0.027), ('eight', 0.025), ('bandpass', 0.025), ('approved', 0.025), ('chog', 0.025), ('helicopter', 0.025), ('bark', 0.025), ('subbands', 0.025), ('oriented', 0.025), ('analyzed', 0.024), ('kn', 0.024), ('scales', 0.023), ('diminishing', 0.023), ('study', 0.023), ('life', 0.022), ('occlusions', 0.022), ('flattening', 0.022), ('orb', 0.022), ('bands', 0.021), ('substructures', 0.021), ('normalization', 0.021), ('protocol', 0.021), ('scaled', 0.021), ('surf', 0.021), ('geometric', 0.02), ('mikolajczyk', 0.02), ('symmetry', 0.02), ('uth', 0.02), ('faster', 0.02), ('release', 0.02), ('center', 0.02), ('darpa', 0.019), ('statistics', 0.019), ('outperform', 0.019), ('equivalent', 0.019), ('gradients', 0.019), ('tuytelaars', 0.018), ('circles', 0.018), ('partition', 0.017), ('exactly', 0.017), ('april', 0.017), ('grid', 0.016), ('contract', 0.016), ('ofimage', 0.016), ('elegant', 0.016), ('middlebury', 0.016), ('local', 0.016), ('developments', 0.016), ('grey', 0.016), ('aperture', 0.016)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 1.0 288 iccv-2013-Nested Shape Descriptors

Author: Jeffrey Byrne, Jianbo Shi

2 0.058985963 400 iccv-2013-Stable Hyper-pooling and Query Expansion for Event Detection

Author: Matthijs Douze, Jérôme Revaud, Cordelia Schmid, Hervé Jégou

Abstract: This paper makes two complementary contributions to event retrieval in large collections of videos. First, we propose hyper-pooling strategies that encode the frame descriptors into a representation of the video sequence in a stable manner. Our best choices compare favorably with regular pooling techniques based on k-means quantization. Second, we introduce a technique to improve the ranking. It can be interpreted either as a query expansion method or as a similarity adaptation based on the local context of the query video descriptor. Experiments on public benchmarks show that our methods are complementary and improve event retrieval results, without sacrificing efficiency.

3 0.056296561 48 iccv-2013-An Adaptive Descriptor Design for Object Recognition in the Wild

Author: Zhenyu Guo, Z. Jane Wang

Abstract: Digital images nowadays show large appearance variabilities on picture styles, in terms of color tone, contrast, vignetting, and etc. These ‘picture styles’ are directly related to the scene radiance, image pipeline of the camera, and post processing functions (e.g., photography effect filters). Due to the complexity and nonlinearity of these factors, popular gradient-based image descriptors generally are not invariant to different picture styles, which could degrade the performance for object recognition. Given that images shared online or created by individual users are taken with a wide range of devices and may be processed by various post processing functions, to find a robust object recognition system is useful and challenging. In this paper, we investigate the influence of picture styles on object recognition by making a connection between image descriptors and a pixel mapping function g, and accordingly propose an adaptive approach based on a g-incorporated kernel descriptor and multiple kernel learning, without estimating or specifying the image styles used in training and testing. We conduct experiments on the Domain Adaptation data set, the Oxford Flower data set, and several variants of the Flower data set by introducing popular photography effects through post-processing. The results demonstrate that theproposedmethod consistently yields recognition improvements over standard descriptors in all studied cases.

4 0.055950116 57 iccv-2013-BOLD Features to Detect Texture-less Objects

Author: Federico Tombari, Alessandro Franchi, Luigi Di_Stefano

Abstract: Object detection in images withstanding significant clutter and occlusion is still a challenging task whenever the object surface is characterized by poor informative content. We propose to tackle this problem by a compact and distinctive representation of groups of neighboring line segments aggregated over limited spatial supports and invariant to rotation, translation and scale changes. Peculiarly, our proposal allows for leveraging on the inherent strengths of descriptor-based approaches, i.e. robustness to occlusion and clutter and scalability with respect to the size of the model library, also when dealing with scarcely textured objects.

5 0.055447385 127 iccv-2013-Dynamic Pooling for Complex Event Recognition

Author: Weixin Li, Qian Yu, Ajay Divakaran, Nuno Vasconcelos

Abstract: The problem of adaptively selecting pooling regions for the classification of complex video events is considered. Complex events are defined as events composed of several characteristic behaviors, whose temporal configuration can change from sequence to sequence. A dynamic pooling operator is defined so as to enable a unified solution to the problems of event specific video segmentation, temporal structure modeling, and event detection. Video is decomposed into segments, and the segments most informative for detecting a given event are identified, so as to dynamically determine the pooling operator most suited for each sequence. This dynamic pooling is implemented by treating the locations of characteristic segments as hidden information, which is inferred, on a sequence-by-sequence basis, via a large-margin classification rule with latent variables. Although the feasible set of segment selections is combinatorial, it is shown that a globally optimal solution to the inference problem can be obtained efficiently, through the solution of a series of linear programs. Besides the coarselevel location of segments, a finer model of video struc- ture is implemented by jointly pooling features of segmenttuples. Experimental evaluation demonstrates that the re- sulting event detector has state-of-the-art performance on challenging video datasets.

6 0.055346273 105 iccv-2013-DeepFlow: Large Displacement Optical Flow with Deep Matching

7 0.055309098 233 iccv-2013-Latent Task Adaptation with Large-Scale Hierarchies

8 0.055229124 12 iccv-2013-A General Dense Image Matching Framework Combining Direct and Feature-Based Costs

9 0.050655339 197 iccv-2013-Hierarchical Joint Max-Margin Learning of Mid and Top Level Representations for Visual Recognition

10 0.048495378 39 iccv-2013-Action Recognition with Improved Trajectories

11 0.04701487 396 iccv-2013-Space-Time Robust Representation for Action Recognition

12 0.046199635 368 iccv-2013-SYM-FISH: A Symmetry-Aware Flip Invariant Sketch Histogram Shape Descriptor

13 0.046129532 198 iccv-2013-Hierarchical Part Matching for Fine-Grained Visual Categorization

14 0.041960772 392 iccv-2013-Similarity Metric Learning for Face Recognition

15 0.041187067 74 iccv-2013-Co-segmentation by Composition

16 0.040447708 107 iccv-2013-Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction

17 0.040224079 140 iccv-2013-Elastic Net Constraints for Shape Matching

18 0.038889408 131 iccv-2013-EVSAC: Accelerating Hypotheses Generation by Modeling Matching Scores with Extreme Value Theory

19 0.038626026 283 iccv-2013-Multiple Non-rigid Surface Detection and Registration

20 0.038502317 304 iccv-2013-PM-Huber: PatchMatch with Huber Regularization for Stereo Matching

similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.096), (1, -0.014), (2, -0.006), (3, -0.01), (4, 0.005), (5, 0.035), (6, 0.005), (7, -0.015), (8, -0.027), (9, -0.02), (10, 0.005), (11, 0.021), (12, 0.019), (13, -0.003), (14, 0.004), (15, -0.013), (16, 0.031), (17, 0.03), (18, 0.062), (19, 0.007), (20, 0.057), (21, -0.001), (22, -0.017), (23, -0.013), (24, 0.001), (25, 0.008), (26, 0.014), (27, 0.023), (28, -0.001), (29, 0.033), (30, 0.004), (31, -0.015), (32, -0.037), (33, 0.0), (34, -0.002), (35, 0.034), (36, 0.003), (37, -0.08), (38, 0.03), (39, 0.003), (40, -0.014), (41, 0.051), (42, -0.021), (43, 0.034), (44, -0.01), (45, 0.032), (46, 0.003), (47, -0.024), (48, -0.001), (49, -0.036)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.908014 288 iccv-2013-Nested Shape Descriptors

Author: Jeffrey Byrne, Jianbo Shi

2 0.65328121 365 iccv-2013-SIFTpack: A Compact Representation for Efficient SIFT Matching

Author: Alexandra Gilinsky, Lihi Zelnik Manor

Abstract: Computing distances between large sets of SIFT descriptors is a basic step in numerous algorithms in computer vision. When the number of descriptors is large, as is often the case, computing these distances can be extremely time consuming. In this paper we propose the SIFTpack: a compact way of storing SIFT descriptors, which enables significantly faster calculations between sets of SIFTs than the current solutions. SIFTpack can be used to represent SIFTs densely extracted from a single image or sparsely from multiple different images. We show that the SIFTpack representation saves both storage space and run time, for both finding nearest neighbors and for computing all distances between all descriptors. The usefulness of SIFTpack is also demonstrated as an alternative implementation for K-means dictionaries of visual words.

3 0.64990878 77 iccv-2013-Codemaps - Segment, Classify and Search Objects Locally

Author: Zhenyang Li, Efstratios Gavves, Koen E.A. van_de_Sande, Cees G.M. Snoek, Arnold W.M. Smeulders

Abstract: In this paper we aim for segmentation and classification of objects. We propose codemaps that are a joint formulation of the classification score and the local neighborhood it belongs to in the image. We obtain the codemap by reordering the encoding, pooling and classification steps over lattice elements. Other than existing linear decompositions who emphasize only the efficiency benefits for localized search, we make three novel contributions. As a preliminary, we provide a theoretical generalization of the sufficient mathematical conditions under which image encodings and classification becomes locally decomposable. As first novelty we introduce ℓ2 normalization for arbitrarily shaped image regions, which is fast enough for semantic segmentation using our Fisher codemaps. Second, using the same lattice across images, we propose kernel pooling which embeds nonlinearities into codemaps for object classification by explicit or approximate feature mappings. Results demonstrate that ℓ2 normalized Fisher codemaps improve the state-of-the-art in semantic segmentation for PAS- CAL VOC. For object classification the addition of nonlinearities brings us on par with the state-of-the-art, but is 3x faster. Because of the codemaps ’ inherent efficiency, we can reach significant speed-ups for localized search as well. We exploit the efficiency gain for our third novelty: object segment retrieval using a single query image only.

4 0.6396454 388 iccv-2013-Shape Index Descriptors Applied to Texture-Based Galaxy Analysis

Author: Kim Steenstrup Pedersen, Kristoffer Stensbo-Smidt, Andrew Zirm, Christian Igel

Abstract: A texture descriptor based on the shape index and the accompanying curvedness measure is proposed, and it is evaluated for the automated analysis of astronomical image data. A representative sample of images of low-redshift galaxies from the Sloan Digital Sky Survey (SDSS) serves as a testbed. The goal of applying texture descriptors to these data is to extract novel information about galaxies; information which is often lost in more traditional analysis. In this study, we build a regression model for predicting a spectroscopic quantity, the specific star-formation rate (sSFR). As texture features we consider multi-scale gradient orientation histograms as well as multi-scale shape index histograms, which lead to a new descriptor. Our results show that we can successfully predict spectroscopic quantities from the texture in optical multi-band images. We successfully recover the observed bi-modal distribution of galaxies into quiescent and star-forming. The state-ofthe-art for predicting the sSFR is a color-based physical model. We significantly improve its accuracy by augmenting the model with texture information. This study is thefirst step towards enabling the quantification of physical galaxy properties from imaging data alone.

5 0.63803732 419 iccv-2013-To Aggregate or Not to aggregate: Selective Match Kernels for Image Search

Author: Giorgos Tolias, Yannis Avrithis, Hervé Jégou

Abstract: This paper considers a family of metrics to compare images based on their local descriptors. It encompasses the VLAD descriptor and matching techniques such as Hamming Embedding. Making the bridge between these approaches leads us to propose a match kernel that takes the best of existing techniques by combining an aggregation procedure with a selective match kernel. Finally, the representation underpinning this kernel is approximated, providing a large scale image search both precise and scalable, as shown by our experiments on several benchmarks.

6 0.63125145 287 iccv-2013-Neighbor-to-Neighbor Search for Fast Coding of Feature Vectors

7 0.59872317 22 iccv-2013-A New Adaptive Segmental Matching Measure for Human Activity Recognition

8 0.58566493 57 iccv-2013-BOLD Features to Detect Texture-less Objects

9 0.58245724 131 iccv-2013-EVSAC: Accelerating Hypotheses Generation by Modeling Matching Scores with Extreme Value Theory

10 0.58127815 258 iccv-2013-Low-Rank Sparse Coding for Image Classification

11 0.57843179 48 iccv-2013-An Adaptive Descriptor Design for Object Recognition in the Wild

12 0.5724526 140 iccv-2013-Elastic Net Constraints for Shape Matching

13 0.57024503 193 iccv-2013-Heterogeneous Auto-similarities of Characteristics (HASC): Exploiting Relational Information for Classification

14 0.56427985 400 iccv-2013-Stable Hyper-pooling and Query Expansion for Event Detection

15 0.56043291 169 iccv-2013-Fine-Grained Categorization by Alignments

16 0.55746448 12 iccv-2013-A General Dense Image Matching Framework Combining Direct and Feature-Based Costs

17 0.54684293 198 iccv-2013-Hierarchical Part Matching for Fine-Grained Visual Categorization

18 0.54660165 11 iccv-2013-A Fully Hierarchical Approach for Finding Correspondences in Non-rigid Shapes

19 0.53651577 104 iccv-2013-Decomposing Bag of Words Histograms

20 0.53250319 327 iccv-2013-Predicting an Object Location Using a Global Image Representation

similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(2, 0.074), (7, 0.03), (26, 0.05), (31, 0.029), (34, 0.011), (36, 0.233), (40, 0.012), (42, 0.068), (48, 0.015), (57, 0.041), (64, 0.04), (73, 0.051), (89, 0.175), (98, 0.02)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.82639813 388 iccv-2013-Shape Index Descriptors Applied to Texture-Based Galaxy Analysis

Author: Kim Steenstrup Pedersen, Kristoffer Stensbo-Smidt, Andrew Zirm, Christian Igel

same-paper 2 0.79371899 288 iccv-2013-Nested Shape Descriptors

Author: Jeffrey Byrne, Jianbo Shi

3 0.726717 449 iccv-2013-What Do You Do? Occupation Recognition in a Photo via Social Context

Author: Ming Shao, Liangyue Li, Yun Fu

Abstract: In this paper, we investigate the problem of recognizing occupations of multiple people with arbitrary poses in a photo. Previous work utilizing single person ’s nearly frontal clothing information and fore/background context preliminarily proves that occupation recognition is computationally feasible in computer vision. However, in practice, multiple people with arbitrary poses are common in a photo, and recognizing their occupations is even more challenging. We argue that with appropriately built visual attributes, co-occurrence, and spatial configuration model that is learned through structure SVM, we can recognize multiple people ’s occupations in a photo simultaneously. To evaluate our method’s performance, we conduct extensive experiments on a new well-labeled occupation database with 14 representative occupations and over 7K images. Results on this database validate our method’s effectiveness and show that occupation recognition is solvable in a more general case.

4 0.68115592 193 iccv-2013-Heterogeneous Auto-similarities of Characteristics (HASC): Exploiting Relational Information for Classification

Author: Marco San_Biagio, Marco Crocco, Marco Cristani, Samuele Martelli, Vittorio Murino

Abstract: Capturing the essential characteristics of visual objects by considering how their features are inter-related is a recent philosophy of object classification. In this paper, we embed this principle in a novel image descriptor, dubbed Heterogeneous Auto-Similarities of Characteristics (HASC). HASC is applied to heterogeneous dense features maps, encoding linear relations by covariances and nonlinear associations through information-theoretic measures such as mutual information and entropy. In this way, highly complex structural information can be expressed in a compact, scale invariant and robust manner. The effectiveness of HASC is tested on many diverse detection and classification scenarios, considering objects, textures and pedestrians, on widely known benchmarks (Caltech-101, Brodatz, Daimler Multi-Cue). In all the cases, the results obtained with standard classifiers demonstrate the superiority of HASC with respect to the most adopted local feature descriptors nowadays, such as SIFT, HOG, LBP and feature covariances. In addition, HASC sets the state-of-the-art on the Brodatz texture dataset and the Daimler Multi-Cue pedestrian dataset, without exploiting ad-hoc sophisticated classifiers.

5 0.67542541 358 iccv-2013-Robust Non-parametric Data Fitting for Correspondence Modeling

Author: Wen-Yan Lin, Ming-Ming Cheng, Shuai Zheng, Jiangbo Lu, Nigel Crook

Abstract: We propose a generic method for obtaining nonparametric image warps from noisy point correspondences. Our formulation integrates a huber function into a motion coherence framework. This makes our fitting function especially robust to piecewise correspondence noise (where an image section is consistently mismatched). By utilizing over parameterized curves, we can generate realistic nonparametric image warps from very noisy correspondence. We also demonstrate how our algorithm can be used to help stitch images taken from a panning camera by warping the images onto a virtual push-broom camera imaging plane.

6 0.67321903 47 iccv-2013-Alternating Regression Forests for Object Detection and Pose Estimation

7 0.66682714 404 iccv-2013-Structured Forests for Fast Edge Detection

8 0.66679084 426 iccv-2013-Training Deformable Part Models with Decorrelated Features

9 0.66663063 351 iccv-2013-Restoring an Image Taken through a Window Covered with Dirt or Rain

10 0.6664362 238 iccv-2013-Learning Graphs to Match

11 0.66561496 89 iccv-2013-Constructing Adaptive Complex Cells for Robust Visual Tracking

12 0.6652956 57 iccv-2013-BOLD Features to Detect Texture-less Objects

13 0.66468966 60 iccv-2013-Bayesian Robust Matrix Factorization for Image and Video Processing

14 0.66456652 448 iccv-2013-Weakly Supervised Learning of Image Partitioning Using Decision Trees with Structured Split Criteria

15 0.66394114 445 iccv-2013-Visual Reranking through Weakly Supervised Multi-graph Learning

16 0.66378939 229 iccv-2013-Large-Scale Video Hashing via Structure Learning

17 0.66316891 75 iccv-2013-CoDeL: A Human Co-detection and Labeling Framework

18 0.66295058 439 iccv-2013-Video Co-segmentation for Meaningful Action Extraction

19 0.66271079 297 iccv-2013-Online Motion Segmentation Using Dynamic Label Propagation

20 0.66261154 382 iccv-2013-Semi-dense Visual Odometry for a Monocular Camera