iccv iccv2013 iccv2013-257 knowledge-graph by maker-knowledge-mining

257 iccv-2013-Log-Euclidean Kernels for Sparse Representation and Dictionary Learning


Source: pdf

Author: Peihua Li, Qilong Wang, Wangmeng Zuo, Lei Zhang

Abstract: The symmetric positive de?nite (SPD) matrices have been widely used in image and vision problems. Recently there are growing interests in studying sparse representation (SR) of SPD matrices, motivated by the great success of SR for vector data. Though the space of SPD matrices is well-known to form a Lie group that is a Riemannian manifold, existing work fails to take full advantage of its geometric structure. This paper attempts to tackle this problem by proposing a kernel based method for SR and dictionary learning (DL) of SPD matrices. We disclose that the space of SPD matrices, with the operations of logarithmic multiplication and scalar logarithmic multiplication de?ned in the Log-Euclidean framework, is a complete inner product space. We can thus develop a broad family of kernels that satis?es Mercer’s condition. These kernels characterize the geodesic distance and can be computed ef?ciently. We also consider the geometric structure in the DL process by updating atom matrices in the Riemannian space instead of in the Euclidean space. The proposed method is evaluated with various vision problems and shows notable per- formance gains over state-of-the-arts.

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 nite (SPD) matrices have been widely used in image and vision problems. [sent-6, score-0.284]

2 Though the space of SPD matrices is well-known to form a Lie group that is a Riemannian manifold, existing work fails to take full advantage of its geometric structure. [sent-8, score-0.213]

3 This paper attempts to tackle this problem by proposing a kernel based method for SR and dictionary learning (DL) of SPD matrices. [sent-9, score-0.19]

4 We disclose that the space of SPD matrices, with the operations of logarithmic multiplication and scalar logarithmic multiplication de? [sent-10, score-0.397]

5 ned in the Log-Euclidean framework, is a complete inner product space. [sent-11, score-0.155]

6 We can thus develop a broad family of kernels that satis? [sent-12, score-0.149]

7 These kernels characterize the geodesic distance and can be computed ef? [sent-14, score-0.159]

8 We also consider the geometric structure in the DL process by updating atom matrices in the Riemannian space instead of in the Euclidean space. [sent-16, score-0.392]

9 nite (SPD) matrices can be introduced in the imaging, pre-processing, feature extraction and representation processes, and have been widely adopted in many computer vision applications [4, 29, 27]. [sent-20, score-0.305]

10 As to feature extraction, the covariance matrices that model the second-order statistics of image features also result in SPD matrices, which have been successfully applied to detection [27], recognition [18], and classi? [sent-29, score-0.21]

11 Due to their wide applications, the investigations on learning methods for SPD matrices have recently received considerable research interests. [sent-31, score-0.149]

12 It is known that the space of n n SPD matrices, denoted by Sknn+o,w isn n thoat a hlien esapra space n1 ×but n fo SPrmDs a aLtrieic group nthotaetd dis b a RSiemannian manifold [1]. [sent-33, score-0.141]

13 Hence, the mathematical modeling in this space is different from what is commonly done in the Euclidean space [6, 3 1], and new operators for SPD matrices should be introduced for SPD matrix-based learning. [sent-34, score-0.248]

14 In this work, we take sparse representation (SR) and dictionary learning (DL) [8] as examples, and focus on extending SR and DL to SPD matrix-based learning. [sent-35, score-0.164]

15 Since conventional SR and DL methods are proposed for vector data in the Euclidean space rather than the SPD matrices in the Riemannian manifold, in order to use SR and DL for SPD matrix-based learning, we should consider the following issues in developing new operators in Sn+ . [sent-36, score-0.209]

16 (1) In the Euicnlgide isasnu space tehvee l ionpeianrg cnoemwb oipneatriaotno rosfi anto Sm vectors can be naturally obtained using the conventional matrix operators; but it would be challenging to represent an SPD matrix as a linear combination of atom matrices since Sn+ is not a linear space. [sent-37, score-0.462]

17 (3) The updating of dictionary atoms involves solving a constrained optimization problem in Sn+ and it is more appro1S+n is not a linear space with the operations of conventional matrix additSion and scalar-matrix multiplication. [sent-40, score-0.2]

18 However, with the operations of logarithmic multiplication and scalar logarithmic multiplication Sn+ is not loonglyar a hlimneiacr m space [ic1]a tbiount a alnsod a ccaolmarp l oegtae i nthnmeri cpr moduultcitp space as Sshown in section 2. [sent-41, score-0.406]

19 In Tensor Spare Coding (TSC) [22], an SPD matrix is linearly decomposed as a set of of atom matrices and LogDet (or Bregman matrix) divergence [15] was adopted to measure the reconstruction error. [sent-51, score-0.433]

20 Based on this framework, dictionary learning methods are further proposed to learn atom matrices [23]. [sent-52, score-0.449]

21 In the generalized dictionary learning (GDL) algorithm [25], each SPD matrix is represented as a linear combination of rank-1 atom matrices; the error between one SPD matrix and its linear combination is evaluated by matrix Frobenius norm. [sent-53, score-0.405]

22 Second, we can explicitly or implicitly map SPD matrices to some Reproducing Kernel Hilbert Space (RKHS), and use the kernel SR or DL framework for SPD matrix- based learning. [sent-55, score-0.243]

23 They adopted Stein kernel tmo map dth feo rS SPDR manadtri DcLes tno higher dimensional RKHS. [sent-59, score-0.094]

24 This method is in contrast with those methods which directly embed SPD matrices into Euclidean space (LogE-SR) [11, 34] and achieves state-of-the-art performance compared to its counterparts. [sent-60, score-0.188]

25 The main difference is that we develop a novel family of kernel functions based on the Log-Euclidean framework [1]. [sent-72, score-0.125]

26 The proposed kernels characterize the geodesic distance and thus can accurately × measure the reconstruction error; they also satisfy the Mercer’s condition under broad conditions. [sent-73, score-0.193]

27 These are in contrast to the Stein kernel which is only an approximation of the geodesic distance and satis? [sent-74, score-0.147]

28 Our work differs from them in that we disclose the inner product structure of Sn+, by which we can develop a broad variety of kernel foufn Sctions and the Gaussian kernel is a special case of ours. [sent-78, score-0.378]

29 Log-Euclidean Kernel This section starts with a brief introduction of LogEuclidean framework [1]; subsequently, we show that Sn+ Efourcmlisd an ni fnrnaemr product space; ebqauseedn on twheis ,s we design a family of kernel functions. [sent-80, score-0.178]

30 In the Log-Euclidean framework, an operation of logarithmic multiplication ? [sent-87, score-0.171]

31 ee identity matrix and with the inverse operation the regular matrix inverse. [sent-92, score-0.12]

32 cTohme geodesics equipped wSith a bi-invariant metric are the left translates of the geodesics through the identity element, given by one-parameter subgroup exp(tV), where t ∈ R and V ∈ Sn. [sent-99, score-0.136]

33 nally the geodesic distance between two SPD matrices S and T as follows: ρgeo(S, T) = ? [sent-101, score-0.202]

34 Sn+ as a Complete Inner Product Space It is known that Sn+ is not a linear space with the operationIst iosf k tnhoew wconn tvheant Stional matrix addition and scalar-matrix multiplication but forms a Riemannian manifold [1, 3 1]. [sent-111, score-0.181]

35 However, as shown in [1], in the Log-Euclidean framework it is endowed with a linear space structure with the logarithmic multiplication (1) and the following scalar logarithmic multiplication [1]: × λ ⊗ S = exp(λ log(S)) = Sλ (3) where λ is a real number. [sent-112, score-0.337]

36 and ⊗ satisfy the conditions of a linear space, ewraithti tnhse identity m saattrisixfy being otnhed identity eale limneenart and regular matrix inversion operation as inverse mapping. [sent-114, score-0.113]

37 Indeed, not only a linear space, Sn+ is also an inner productI space as dte osnclryib aed li by rth sep following corollary which is not disclosed previously: Corollary 1 With two operations ? [sent-115, score-0.247]

38 log = tr(log(S) log(T)) (4) is an inner product, where tr denotes the matrix trace, and Sn+ is a complete inner product space (Hilbert space). [sent-120, score-0.423]

39 log = ≥0 0if, aanndd only bifv iSo iss thhaet identity matrix I. [sent-163, score-0.238]

40 nite dimension and therefore it is a complete inner product space (Hilbert space) [26]. [sent-165, score-0.3]

41 The norm induced by the inner product is expressed as S? [sent-166, score-0.126]

42 However, a linear space or a normed liner space is not necessarily an inner product space unless a function that satis? [sent-187, score-0.273]

43 r oacneds sicngala dri logarithmic multiplication ⊗, unlike [1] twiohnich ? [sent-194, score-0.149]

44 i annvodlv sceasl mapping hSmPiDc mmuatlrtiipcleics atotio logarithmic d [1o-] main, performing data processing therein and then mapping back to Sn+ again. [sent-195, score-0.1]

45 kee:r n(1el); T T(h2e) The tensor product φ1 ∗ φ? [sent-268, score-0.125]

46 kernels, such as polynomial, exponential, radial basis, B-Spline kernels, or Fourier kernel etc. [sent-284, score-0.094]

47 T Here we compare the proposed kernels with Stein kernel = >− [24]. [sent-335, score-0.178]

48 ne-Riemannian distance between the two matrices is dA (S, T) = ? [sent-342, score-0.149]

49 1 shows the histogram computed from the SPD matrices used in texture classi? [sent-368, score-0.178]

50 kernel under restricted co =nd eitxipo(n−, tβhdat is, β = 21, . [sent-375, score-0.094]

51 The logarithm of SPD matrices can be computed through the eigen-decomposition. [sent-385, score-0.206]

52 The logarithms of the involved SPD matrices can generally be computed beforehand because oftheir “decoupling” property either in the inner product or distance; in these cases, the complexity of the proposed kernels becomes O(n3) and is the same as that of the Stein kernel. [sent-394, score-0.359]

53 While this method outperforms state-of-the-arts, the symmetric Stein divergence only approximates the Riemannian metric and the Stein kernel only satis? [sent-400, score-0.206]

54 Let φ be th∈e fSunction that maps SPD matrices to RKHS, SR of Y can be formulated as the following kernelised LASSO problem [12]: xm∈RinN? [sent-409, score-0.149]

55 1 Minimaization of the above equation is similar to regular sparse coding in Euclidean space [10], and we use the method introduced in [12] for its solution. [sent-459, score-0.109]

56 , M, the atom matrices can be obtained by learning method so that they have more powerful representation capability. [sent-465, score-0.374]

57 First, suppose that the atom matrices Si ∈ Sn+ , i= 1, . [sent-498, score-0.353]

58 , M, we compute its sparse vector xj as described in the previ- ous section; then, let xj be ? [sent-505, score-0.097]

59 xed, we update dictionary atom matrices Si, i= 1, . [sent-506, score-0.449]

60 In the following, we illustrate the atom matrices update scheme using Gaussian kernel κg. [sent-510, score-0.447]

61 Re-writing (7) in kernel function κ, we have the partial derivative of f(·) w. [sent-513, score-0.094]

62 (8) One may update log Sr instead of Sr, which is equivalent to transforming by logarithm the SPD matrices to Euclidean space in which atoms are updated. [sent-520, score-0.395]

63 We thus instead update the atom matrices in the Lie group as follows: Sr = exp ? [sent-523, score-0.425]

64 rst evaluate the performance of the × proposed family of kernels on sparse representation without dictionary learning. [sent-532, score-0.321]

65 As in [30], the training samples are adopted as atom matrices and the reconstruction errors are used for classi? [sent-533, score-0.353]

66 Then we learn the atom matrices from the training data and the sparse codes obtained from the learned atom matrices are used for classi? [sent-535, score-0.753]

67 cation × mbye taho 43d ×in 4[330 c]o, vaanrdia tnhcee preprocessing ompetth thoed cilna s[1si2? [sent-552, score-0.296]

68 2 shows the recognition accuracy of the proposed kernels with the regularization parameter λ = 10−3, where the recognition rates of RSR using Stein kernel [12] are also shown as baseline (red dash-dotted). [sent-555, score-0.232]

69 ,Irte can ibneg seen thaat ta btoheu proposed kernels are clearly better than the Stein kernel on all datasets. [sent-560, score-0.178]

70 cation rates of RSR that uses the Stein kernel [12] are shown as baseline (red dash-dotted line). [sent-619, score-0.444]

71 TSC has unsatisfactory performance and we owe it to the linear representation of SPD matrices in the Euclidean space without use of the Riemannian metric. [sent-621, score-0.209]

72 By using the Riemannian metric, LogE-SR has improved recognition rates but the sparse decomposition is performed in the logarithm domain rather than in the original Riemannian manifold. [sent-622, score-0.158]

73 cation We employ the Brodatz dataset and follow the experimental setting in [22, 23, 12] for fair comparison. [sent-639, score-0.296]

74 cation algorithm [17] but to testify the proposed method with closely related work. [sent-641, score-0.326]

75 In the Brodatz dataset each class contains only one image and we use the mosaics of 5-texture (‘5c’ , ‘5m’, ‘5v’, ‘5v2’, ‘5v3’), 10texture (‘ 10’,‘ 10v’), and 16-texture (‘ 16c’, ‘ 16v’). [sent-642, score-0.09]

76 Among the 64 covariance matrices per class, 5 are randomly selected for training and the remaining ones are for testing. [sent-646, score-0.21]

77 Dictionary Learning To testify the effectiveness of the proposed dictionary learning method, we compare three methods: random sampling, K-Means clustering and dictionary learning. [sent-673, score-0.222]

78 The K-Means clustering is performed in the LogEuclidean framework [1]: the covariance matrices are ? [sent-675, score-0.21]

79 rst mapped to the linear space Sn by matrix logarithm, in which mthea clustering ilisn performed and the results are then mapped ×× back to Sn+ . [sent-676, score-0.116]

80 cation We use the Brodatz dataset and follow the experimental setting in [12]. [sent-678, score-0.296]

81 T phiexe 5ls-d,i frmoemns wiohnicahl afe 5at×u5re voevcatroirasn ctoe compute the covariance matrix comprise grayscale intensity, and the 1st and 2nd partial derivatives with respect to spatial coordinates. [sent-681, score-0.096]

82 We thus have 2200 covariance matrices in total for dictionary learning. [sent-683, score-0.306]

83 It can be seen that the dictionary learning method is consistently superior to random dictionary and Log-E K-Means, particularly when the number of atom matrices are small. [sent-690, score-0.545]

84 It is interesting to notice that the random dictionary is better than the learned dictionary via Log-E K-Means if the number of atom matrices are less than 80. [sent-691, score-0.545]

85 We also observe that the performance of both random sampling and Log-E K-Means improves with the increase of atom matrix number. [sent-693, score-0.239]

86 cation accuracy on the Brodatz dataset Scene Categorization We use the popular benchmark database Scene15 [16] for classi? [sent-696, score-0.296]

87 In each image, we extract 8 8 covariance matrices at dense grids with a stride of t8r a pcitxe 8l×s. [sent-699, score-0.21]

88 First, among covariance matrices ofall images, 50,000 ones are randomly chosen which are used to obtain atom matrices. [sent-705, score-0.414]

89 cation rates of the proposed method are over 18 percent higher than the random dictionary. [sent-712, score-0.383]

90 We can also observe that the proposed method has over 8 percent, 4 percent, and 2 percent advantages over Log-E K-Means Clustering for 32, 64, and 128 atom matrices, respectively. [sent-713, score-0.237]

91 From both of the above experiments, we observe that as atom matrix number grows, the performance gains of the dictionary learning over the other two methods gets smaller. [sent-714, score-0.357]

92 As the current dictionary is generative without discriminative information, more powerful representational capability does not necessarily mean better discriminability. [sent-715, score-0.096]

93 ndings and we think that the performance difference between the three methods will get smaller or even negligible as the atom matrix number becomes much larger. [sent-717, score-0.239]

94 Conclusion This paper presented a novel Riemannian metric based kernel method for SR and DL in Sn+. [sent-728, score-0.114]

95 We disclosed that the space of SPD matrices is a complete inner product space, and developed a broad 11660077 family of p. [sent-734, score-0.404]

96 Action recognition using sparse representation on covariance manifolds of optical ? [sent-828, score-0.129]

97 Sparse coding and dictionary learning for symmetric positive de? [sent-838, score-0.166]

98 Kernel methods on the riemannian manifold of symmetric positive de? [sent-854, score-0.236]

99 Gabor feature based sparse representation for face recognition with gabor occlusion dictionary. [sent-990, score-0.096]

100 Action recognition using sparse representation on covariance manifolds of optical ? [sent-998, score-0.129]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('spd', 0.452), ('classi', 0.428), ('cation', 0.296), ('sn', 0.283), ('atom', 0.204), ('stein', 0.188), ('sr', 0.16), ('riemannian', 0.151), ('log', 0.15), ('matrices', 0.149), ('nite', 0.135), ('rsr', 0.12), ('dl', 0.098), ('dictionary', 0.096), ('kernel', 0.094), ('feret', 0.09), ('mosaics', 0.09), ('pn', 0.085), ('kernels', 0.084), ('logarithmic', 0.08), ('corollary', 0.08), ('brodatz', 0.08), ('inner', 0.073), ('tensor', 0.072), ('multiplication', 0.069), ('satis', 0.066), ('covariance', 0.061), ('tsc', 0.058), ('logarithm', 0.057), ('rates', 0.054), ('geodesic', 0.053), ('harandi', 0.053), ('product', 0.053), ('mercer', 0.053), ('rkhs', 0.053), ('euclidean', 0.048), ('sparse', 0.047), ('exp', 0.047), ('symmetric', 0.047), ('divergence', 0.045), ('hilbert', 0.044), ('geodesics', 0.044), ('rst', 0.042), ('bregman', 0.04), ('space', 0.039), ('manifold', 0.038), ('si', 0.036), ('de', 0.036), ('matrix', 0.035), ('bd', 0.034), ('broad', 0.034), ('percent', 0.033), ('family', 0.031), ('en', 0.03), ('boley', 0.03), ('disclose', 0.03), ('gdl', 0.03), ('morellas', 0.03), ('normed', 0.03), ('sandwiching', 0.03), ('sivalingam', 0.03), ('testify', 0.03), ('operations', 0.03), ('texture', 0.029), ('ned', 0.029), ('identity', 0.028), ('gabor', 0.028), ('bf', 0.027), ('bg', 0.027), ('nine', 0.027), ('arsigny', 0.027), ('pennec', 0.027), ('commutative', 0.027), ('cients', 0.027), ('xj', 0.025), ('group', 0.025), ('iss', 0.025), ('fillard', 0.025), ('logeuclidean', 0.025), ('rinn', 0.025), ('disclosed', 0.025), ('dsr', 0.025), ('yj', 0.024), ('pages', 0.023), ('lie', 0.023), ('coding', 0.023), ('proposition', 0.023), ('gains', 0.022), ('geo', 0.022), ('xed', 0.022), ('ds', 0.022), ('characterize', 0.022), ('operation', 0.022), ('representation', 0.021), ('diffusion', 0.021), ('xm', 0.021), ('operators', 0.021), ('coef', 0.02), ('therein', 0.02), ('metric', 0.02)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.99999815 257 iccv-2013-Log-Euclidean Kernels for Sparse Representation and Dictionary Learning

Author: Peihua Li, Qilong Wang, Wangmeng Zuo, Lei Zhang

Abstract: The symmetric positive de?nite (SPD) matrices have been widely used in image and vision problems. Recently there are growing interests in studying sparse representation (SR) of SPD matrices, motivated by the great success of SR for vector data. Though the space of SPD matrices is well-known to form a Lie group that is a Riemannian manifold, existing work fails to take full advantage of its geometric structure. This paper attempts to tackle this problem by proposing a kernel based method for SR and dictionary learning (DL) of SPD matrices. We disclose that the space of SPD matrices, with the operations of logarithmic multiplication and scalar logarithmic multiplication de?ned in the Log-Euclidean framework, is a complete inner product space. We can thus develop a broad family of kernels that satis?es Mercer’s condition. These kernels characterize the geodesic distance and can be computed ef?ciently. We also consider the geometric structure in the DL process by updating atom matrices in the Riemannian space instead of in the Euclidean space. The proposed method is evaluated with various vision problems and shows notable per- formance gains over state-of-the-arts.

2 0.37045458 347 iccv-2013-Recursive Estimation of the Stein Center of SPD Matrices and Its Applications

Author: Hesamoddin Salehian, Guang Cheng, Baba C. Vemuri, Jeffrey Ho

Abstract: Symmetric positive-definite (SPD) matrices are ubiquitous in Computer Vision, Machine Learning and Medical Image Analysis. Finding the center/average of a population of such matrices is a common theme in many algorithms such as clustering, segmentation, principal geodesic analysis, etc. The center of a population of such matrices can be defined using a variety of distance/divergence measures as the minimizer of the sum of squared distances/divergences from the unknown center to the members of the population. It is well known that the computation of the Karcher mean for the space of SPD matrices which is a negativelycurved Riemannian manifold is computationally expensive. Recently, the LogDet divergence-based center was shown to be a computationally attractive alternative. However, the LogDet-based mean of more than two matrices can not be computed in closed form, which makes it computationally less attractive for large populations. In this paper we present a novel recursive estimator for center based on the Stein distance which is the square root of the LogDet di– vergence that is significantly faster than the batch mode computation of this center. The key theoretical contribution is a closed-form solution for the weighted Stein center of two SPD matrices, which is used in the recursive computation of the Stein center for a population of SPD matrices. Additionally, we show experimental evidence of the convergence of our recursive Stein center estimator to the batch mode Stein center. We present applications of our recursive estimator to K-means clustering and image indexing depicting significant time gains over corresponding algorithms that use the batch mode computations. For the latter application, we develop novel hashing functions using the Stein distance and apply it to publicly available data sets, and experimental results have shown favorable com– ∗This research was funded in part by the NIH grant NS066340 to BCV. †Corresponding author parisons to other competing methods.

3 0.28181154 25 iccv-2013-A Novel Earth Mover's Distance Methodology for Image Matching with Gaussian Mixture Models

Author: Peihua Li, Qilong Wang, Lei Zhang

Abstract: The similarity or distance measure between Gaussian mixture models (GMMs) plays a crucial role in contentbased image matching. Though the Earth Mover’s Distance (EMD) has shown its advantages in matching histogram features, its potentials in matching GMMs remain unclear and are not fully explored. To address this problem, we propose a novel EMD methodology for GMM matching. We ?rst present a sparse representation based EMD called SR-EMD by exploiting the sparse property of the underlying problem. SR-EMD is more ef?cient and robust than the conventional EMD. Second, we present two novel ground distances between component Gaussians based on the information geometry. The perspective from the Riemannian geometry distinguishes the proposed ground distances from the classical entropy- or divergence-based ones. Furthermore, motivated by the success of distance metric learning of vector data, we make the ?rst attempt to learn the EMD distance metrics between GMMs by using a simple yet effective supervised pair-wise based method. It can adapt the distance metrics between GMMs to speci?c classi?ca- tion tasks. The proposed method is evaluated on both simulated data and benchmark real databases and achieves very promising performance.

4 0.23913664 220 iccv-2013-Joint Deep Learning for Pedestrian Detection

Author: Wanli Ouyang, Xiaogang Wang

Abstract: Feature extraction, deformation handling, occlusion handling, and classi?cation are four important components in pedestrian detection. Existing methods learn or design these components either individually or sequentially. The interaction among these components is not yet well explored. This paper proposes that they should be jointly learned in order to maximize their strengths through cooperation. We formulate these four components into a joint deep learning framework and propose a new deep network architecture1. By establishing automatic, mutual interaction among components, the deep model achieves a 9% reduction in the average miss rate compared with the current best-performing pedestrian detection approaches on the largest Caltech benchmark dataset.

5 0.15128103 114 iccv-2013-Dictionary Learning and Sparse Coding on Grassmann Manifolds: An Extrinsic Solution

Author: Mehrtash Harandi, Conrad Sanderson, Chunhua Shen, Brian Lovell

Abstract: Recent advances in computer vision and machine learning suggest that a wide range of problems can be addressed more appropriately by considering non-Euclidean geometry. In this paper we explore sparse dictionary learning over the space of linear subspaces, which form Riemannian structures known as Grassmann manifolds. To this end, we propose to embed Grassmann manifolds into the space of symmetric matrices by an isometric mapping, which enables us to devise a closed-form solution for updating a Grassmann dictionary, atom by atom. Furthermore, to handle non-linearity in data, we propose a kernelised version of the dictionary learning algorithm. Experiments on several classification tasks (face recognition, action recognition, dynamic texture classification) show that the proposed approach achieves considerable improvements in discrimination accuracy, in comparison to state-of-the-art methods such as kernelised Affine Hull Method and graphembedding Grassmann discriminant analysis.

6 0.10901572 293 iccv-2013-Nonparametric Blind Super-resolution

7 0.10650744 10 iccv-2013-A Framework for Shape Analysis via Hilbert Space Embedding

8 0.1044926 35 iccv-2013-Accurate Blur Models vs. Image Priors in Single Image Super-resolution

9 0.099004522 384 iccv-2013-Semi-supervised Robust Dictionary Learning via Efficient l-Norms Minimization

10 0.098372929 161 iccv-2013-Fast Sparsity-Based Orthogonal Dictionary Learning for Image Restoration

11 0.097116306 188 iccv-2013-Group Sparsity and Geometry Constrained Dictionary Learning for Action Recognition from Depth Maps

12 0.083686993 51 iccv-2013-Anchored Neighborhood Regression for Fast Example-Based Super-Resolution

13 0.082918778 265 iccv-2013-Mining Motion Atoms and Phrases for Complex Action Recognition

14 0.081915572 295 iccv-2013-On One-Shot Similarity Kernels: Explicit Feature Maps and Properties

15 0.080649756 276 iccv-2013-Multi-attributed Dictionary Learning for Sparse Coding

16 0.079698123 119 iccv-2013-Discriminant Tracking Using Tensor Representation with Semi-supervised Improvement

17 0.076603033 354 iccv-2013-Robust Dictionary Learning by Error Source Decomposition

18 0.07624609 96 iccv-2013-Coupled Dictionary and Feature Space Learning with Applications to Cross-Domain Image Synthesis and Recognition

19 0.075985238 307 iccv-2013-Parallel Transport of Deformations in Shape Space of Elastic Surfaces

20 0.07373178 197 iccv-2013-Hierarchical Joint Max-Margin Learning of Mid and Top Level Representations for Visual Recognition


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, 0.132), (1, 0.046), (2, -0.06), (3, -0.023), (4, -0.152), (5, -0.016), (6, -0.033), (7, -0.007), (8, 0.0), (9, -0.085), (10, -0.026), (11, -0.056), (12, 0.02), (13, -0.052), (14, 0.039), (15, 0.042), (16, 0.027), (17, -0.022), (18, 0.031), (19, 0.019), (20, 0.013), (21, 0.066), (22, 0.105), (23, 0.006), (24, -0.083), (25, 0.09), (26, 0.061), (27, 0.099), (28, -0.183), (29, 0.184), (30, -0.124), (31, -0.097), (32, -0.066), (33, -0.008), (34, 0.161), (35, 0.222), (36, 0.097), (37, 0.126), (38, -0.169), (39, 0.066), (40, 0.036), (41, 0.04), (42, 0.07), (43, 0.025), (44, -0.042), (45, -0.023), (46, -0.114), (47, -0.055), (48, 0.042), (49, 0.066)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.93211144 257 iccv-2013-Log-Euclidean Kernels for Sparse Representation and Dictionary Learning

Author: Peihua Li, Qilong Wang, Wangmeng Zuo, Lei Zhang

Abstract: The symmetric positive de?nite (SPD) matrices have been widely used in image and vision problems. Recently there are growing interests in studying sparse representation (SR) of SPD matrices, motivated by the great success of SR for vector data. Though the space of SPD matrices is well-known to form a Lie group that is a Riemannian manifold, existing work fails to take full advantage of its geometric structure. This paper attempts to tackle this problem by proposing a kernel based method for SR and dictionary learning (DL) of SPD matrices. We disclose that the space of SPD matrices, with the operations of logarithmic multiplication and scalar logarithmic multiplication de?ned in the Log-Euclidean framework, is a complete inner product space. We can thus develop a broad family of kernels that satis?es Mercer’s condition. These kernels characterize the geodesic distance and can be computed ef?ciently. We also consider the geometric structure in the DL process by updating atom matrices in the Riemannian space instead of in the Euclidean space. The proposed method is evaluated with various vision problems and shows notable per- formance gains over state-of-the-arts.

2 0.8352347 347 iccv-2013-Recursive Estimation of the Stein Center of SPD Matrices and Its Applications

Author: Hesamoddin Salehian, Guang Cheng, Baba C. Vemuri, Jeffrey Ho

Abstract: Symmetric positive-definite (SPD) matrices are ubiquitous in Computer Vision, Machine Learning and Medical Image Analysis. Finding the center/average of a population of such matrices is a common theme in many algorithms such as clustering, segmentation, principal geodesic analysis, etc. The center of a population of such matrices can be defined using a variety of distance/divergence measures as the minimizer of the sum of squared distances/divergences from the unknown center to the members of the population. It is well known that the computation of the Karcher mean for the space of SPD matrices which is a negativelycurved Riemannian manifold is computationally expensive. Recently, the LogDet divergence-based center was shown to be a computationally attractive alternative. However, the LogDet-based mean of more than two matrices can not be computed in closed form, which makes it computationally less attractive for large populations. In this paper we present a novel recursive estimator for center based on the Stein distance which is the square root of the LogDet di– vergence that is significantly faster than the batch mode computation of this center. The key theoretical contribution is a closed-form solution for the weighted Stein center of two SPD matrices, which is used in the recursive computation of the Stein center for a population of SPD matrices. Additionally, we show experimental evidence of the convergence of our recursive Stein center estimator to the batch mode Stein center. We present applications of our recursive estimator to K-means clustering and image indexing depicting significant time gains over corresponding algorithms that use the batch mode computations. For the latter application, we develop novel hashing functions using the Stein distance and apply it to publicly available data sets, and experimental results have shown favorable com– ∗This research was funded in part by the NIH grant NS066340 to BCV. †Corresponding author parisons to other competing methods.

3 0.74440885 25 iccv-2013-A Novel Earth Mover's Distance Methodology for Image Matching with Gaussian Mixture Models

Author: Peihua Li, Qilong Wang, Lei Zhang

Abstract: The similarity or distance measure between Gaussian mixture models (GMMs) plays a crucial role in contentbased image matching. Though the Earth Mover’s Distance (EMD) has shown its advantages in matching histogram features, its potentials in matching GMMs remain unclear and are not fully explored. To address this problem, we propose a novel EMD methodology for GMM matching. We ?rst present a sparse representation based EMD called SR-EMD by exploiting the sparse property of the underlying problem. SR-EMD is more ef?cient and robust than the conventional EMD. Second, we present two novel ground distances between component Gaussians based on the information geometry. The perspective from the Riemannian geometry distinguishes the proposed ground distances from the classical entropy- or divergence-based ones. Furthermore, motivated by the success of distance metric learning of vector data, we make the ?rst attempt to learn the EMD distance metrics between GMMs by using a simple yet effective supervised pair-wise based method. It can adapt the distance metrics between GMMs to speci?c classi?ca- tion tasks. The proposed method is evaluated on both simulated data and benchmark real databases and achieves very promising performance.

4 0.52628148 119 iccv-2013-Discriminant Tracking Using Tensor Representation with Semi-supervised Improvement

Author: Jin Gao, Junliang Xing, Weiming Hu, Steve Maybank

Abstract: Visual tracking has witnessed growing methods in object representation, which is crucial to robust tracking. The dominant mechanism in object representation is using image features encoded in a vector as observations to perform tracking, without considering that an image is intrinsically a matrix, or a 2nd-order tensor. Thus approaches following this mechanism inevitably lose a lot of useful information, and therefore cannot fully exploit the spatial correlations within the 2D image ensembles. In this paper, we address an image as a 2nd-order tensor in its original form, and find a discriminative linear embedding space approximation to the original nonlinear submanifold embedded in the tensor space based on the graph embedding framework. We specially design two graphs for characterizing the intrinsic local geometrical structure of the tensor space, so as to retain more discriminant information when reducing the dimension along certain tensor dimensions. However, spatial correlations within a tensor are not limited to the elements along these dimensions. This means that some part of the discriminant information may not be encoded in the embedding space. We introduce a novel technique called semi-supervised improvement to iteratively adjust the embedding space to compensate for the loss of discriminant information, hence improving the performance of our tracker. Experimental results on challenging videos demonstrate the effectiveness and robustness of the proposed tracker.

5 0.51543099 114 iccv-2013-Dictionary Learning and Sparse Coding on Grassmann Manifolds: An Extrinsic Solution

Author: Mehrtash Harandi, Conrad Sanderson, Chunhua Shen, Brian Lovell

Abstract: Recent advances in computer vision and machine learning suggest that a wide range of problems can be addressed more appropriately by considering non-Euclidean geometry. In this paper we explore sparse dictionary learning over the space of linear subspaces, which form Riemannian structures known as Grassmann manifolds. To this end, we propose to embed Grassmann manifolds into the space of symmetric matrices by an isometric mapping, which enables us to devise a closed-form solution for updating a Grassmann dictionary, atom by atom. Furthermore, to handle non-linearity in data, we propose a kernelised version of the dictionary learning algorithm. Experiments on several classification tasks (face recognition, action recognition, dynamic texture classification) show that the proposed approach achieves considerable improvements in discrimination accuracy, in comparison to state-of-the-art methods such as kernelised Affine Hull Method and graphembedding Grassmann discriminant analysis.

6 0.481406 10 iccv-2013-A Framework for Shape Analysis via Hilbert Space Embedding

7 0.4708758 362 iccv-2013-Robust Tucker Tensor Decomposition for Effective Image Representation

8 0.44750217 212 iccv-2013-Image Set Classification Using Holistic Multiple Order Statistics Features and Localized Multi-kernel Metric Learning

9 0.43457428 220 iccv-2013-Joint Deep Learning for Pedestrian Detection

10 0.43373683 295 iccv-2013-On One-Shot Similarity Kernels: Explicit Feature Maps and Properties

11 0.42812276 227 iccv-2013-Large-Scale Image Annotation by Efficient and Robust Kernel Metric Learning

12 0.42642114 307 iccv-2013-Parallel Transport of Deformations in Shape Space of Elastic Surfaces

13 0.3960785 35 iccv-2013-Accurate Blur Models vs. Image Priors in Single Image Super-resolution

14 0.39316225 293 iccv-2013-Nonparametric Blind Super-resolution

15 0.38750204 55 iccv-2013-Automatic Kronecker Product Model Based Detection of Repeated Patterns in 2D Urban Images

16 0.37682685 177 iccv-2013-From Point to Set: Extend the Learning of Distance Metrics

17 0.35235864 134 iccv-2013-Efficient Higher-Order Clustering on the Grassmann Manifold

18 0.34479752 259 iccv-2013-Manifold Based Face Synthesis from Sparse Samples

19 0.34406993 29 iccv-2013-A Scalable Unsupervised Feature Merging Approach to Efficient Dimensionality Reduction of High-Dimensional Visual Data

20 0.3362 51 iccv-2013-Anchored Neighborhood Regression for Fast Example-Based Super-Resolution


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(2, 0.049), (7, 0.027), (13, 0.011), (26, 0.09), (31, 0.041), (41, 0.241), (42, 0.113), (48, 0.021), (64, 0.028), (73, 0.031), (89, 0.121), (95, 0.051), (97, 0.036), (98, 0.014)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.77257937 257 iccv-2013-Log-Euclidean Kernels for Sparse Representation and Dictionary Learning

Author: Peihua Li, Qilong Wang, Wangmeng Zuo, Lei Zhang

Abstract: The symmetric positive de?nite (SPD) matrices have been widely used in image and vision problems. Recently there are growing interests in studying sparse representation (SR) of SPD matrices, motivated by the great success of SR for vector data. Though the space of SPD matrices is well-known to form a Lie group that is a Riemannian manifold, existing work fails to take full advantage of its geometric structure. This paper attempts to tackle this problem by proposing a kernel based method for SR and dictionary learning (DL) of SPD matrices. We disclose that the space of SPD matrices, with the operations of logarithmic multiplication and scalar logarithmic multiplication de?ned in the Log-Euclidean framework, is a complete inner product space. We can thus develop a broad family of kernels that satis?es Mercer’s condition. These kernels characterize the geodesic distance and can be computed ef?ciently. We also consider the geometric structure in the DL process by updating atom matrices in the Riemannian space instead of in the Euclidean space. The proposed method is evaluated with various vision problems and shows notable per- formance gains over state-of-the-arts.

2 0.69413102 157 iccv-2013-Fast Face Detector Training Using Tailored Views

Author: Kristina Scherbaum, James Petterson, Rogerio S. Feris, Volker Blanz, Hans-Peter Seidel

Abstract: Face detection is an important task in computer vision and often serves as the first step for a variety of applications. State-of-the-art approaches use efficient learning algorithms and train on large amounts of manually labeled imagery. Acquiring appropriate training images, however, is very time-consuming and does not guarantee that the collected training data is representative in terms of data variability. Moreover, available data sets are often acquired under controlled settings, restricting, for example, scene illumination or 3D head pose to a narrow range. This paper takes a look into the automated generation of adaptive training samples from a 3D morphable face model. Using statistical insights, the tailored training data guarantees full data variability and is enriched by arbitrary facial attributes such as age or body weight. Moreover, it can automatically adapt to environmental constraints, such as illumination or viewing angle of recorded video footage from surveillance cameras. We use the tailored imagery to train a new many-core imple- mentation of Viola Jones ’ AdaBoost object detection framework. The new implementation is not only faster but also enables the use of multiple feature channels such as color features at training time. In our experiments we trained seven view-dependent face detectors and evaluate these on the Face Detection Data Set and Benchmark (FDDB). Our experiments show that the use of tailored training imagery outperforms state-of-the-art approaches on this challenging dataset.

3 0.6654321 34 iccv-2013-Abnormal Event Detection at 150 FPS in MATLAB

Author: Cewu Lu, Jianping Shi, Jiaya Jia

Abstract: Speedy abnormal event detection meets the growing demand to process an enormous number of surveillance videos. Based on inherent redundancy of video structures, we propose an efficient sparse combination learning framework. It achieves decent performance in the detection phase without compromising result quality. The short running time is guaranteed because the new method effectively turns the original complicated problem to one in which only a few costless small-scale least square optimization steps are involved. Our method reaches high detection rates on benchmark datasets at a speed of 140∼150 frames per soenc obnednc on average wsehtesn a computing on an ordinary desktop PC using MATLAB.

4 0.65899235 25 iccv-2013-A Novel Earth Mover's Distance Methodology for Image Matching with Gaussian Mixture Models

Author: Peihua Li, Qilong Wang, Lei Zhang

Abstract: The similarity or distance measure between Gaussian mixture models (GMMs) plays a crucial role in contentbased image matching. Though the Earth Mover’s Distance (EMD) has shown its advantages in matching histogram features, its potentials in matching GMMs remain unclear and are not fully explored. To address this problem, we propose a novel EMD methodology for GMM matching. We ?rst present a sparse representation based EMD called SR-EMD by exploiting the sparse property of the underlying problem. SR-EMD is more ef?cient and robust than the conventional EMD. Second, we present two novel ground distances between component Gaussians based on the information geometry. The perspective from the Riemannian geometry distinguishes the proposed ground distances from the classical entropy- or divergence-based ones. Furthermore, motivated by the success of distance metric learning of vector data, we make the ?rst attempt to learn the EMD distance metrics between GMMs by using a simple yet effective supervised pair-wise based method. It can adapt the distance metrics between GMMs to speci?c classi?ca- tion tasks. The proposed method is evaluated on both simulated data and benchmark real databases and achieves very promising performance.

5 0.63238573 204 iccv-2013-Human Attribute Recognition by Rich Appearance Dictionary

Author: Jungseock Joo, Shuo Wang, Song-Chun Zhu

Abstract: We present a part-based approach to the problem of human attribute recognition from a single image of a human body. To recognize the attributes of human from the body parts, it is important to reliably detect the parts. This is a challenging task due to the geometric variation such as articulation and view-point changes as well as the appearance variation of the parts arisen from versatile clothing types. The prior works have primarily focused on handling . edu . cn ???????????? geometric variation by relying on pre-trained part detectors or pose estimators, which require manual part annotation, but the appearance variation has been relatively neglected in these works. This paper explores the importance of the appearance variation, which is directly related to the main task, attribute recognition. To this end, we propose to learn a rich appearance part dictionary of human with significantly less supervision by decomposing image lattice into overlapping windows at multiscale and iteratively refining local appearance templates. We also present quantitative results in which our proposed method outperforms the existing approaches.

6 0.62911963 182 iccv-2013-GOSUS: Grassmannian Online Subspace Updates with Structured-Sparsity

7 0.62664789 20 iccv-2013-A Max-Margin Perspective on Sparse Representation-Based Classification

8 0.62417686 161 iccv-2013-Fast Sparsity-Based Orthogonal Dictionary Learning for Image Restoration

9 0.62077528 259 iccv-2013-Manifold Based Face Synthesis from Sparse Samples

10 0.62044632 330 iccv-2013-Proportion Priors for Image Sequence Segmentation

11 0.61918432 427 iccv-2013-Transfer Feature Learning with Joint Distribution Adaptation

12 0.61832821 156 iccv-2013-Fast Direct Super-Resolution by Simple Functions

13 0.61825228 227 iccv-2013-Large-Scale Image Annotation by Efficient and Robust Kernel Metric Learning

14 0.61728126 180 iccv-2013-From Where and How to What We See

15 0.61530441 150 iccv-2013-Exemplar Cut

16 0.61445576 326 iccv-2013-Predicting Sufficient Annotation Strength for Interactive Foreground Segmentation

17 0.61442 347 iccv-2013-Recursive Estimation of the Stein Center of SPD Matrices and Its Applications

18 0.61406296 220 iccv-2013-Joint Deep Learning for Pedestrian Detection

19 0.61400157 80 iccv-2013-Collaborative Active Learning of a Kernel Machine Ensemble for Recognition

20 0.61249793 114 iccv-2013-Dictionary Learning and Sparse Coding on Grassmann Manifolds: An Extrinsic Solution