
194 iccv-2013-Heterogeneous Image Features Integration via Multi-modal Semi-supervised Learning Model


Source: pdf

Author: Xiao Cai, Feiping Nie, Weidong Cai, Heng Huang

Abstract: Automatic image categorization has become increasingly important with the development of the Internet and the growth in the size of image databases. Although image categorization can be formulated as a typical multi-class classification problem, real-world images raise two major challenges. On one hand, although more labeled training data may improve prediction performance, obtaining image labels is a time-consuming and biased process. On the other hand, more and more visual descriptors have been proposed to describe the objects and scenes appearing in images, and different features describe different aspects of the visual characteristics. Therefore, how to integrate heterogeneous visual features for semi-supervised learning is crucial for categorizing large-scale image data. In this paper, we propose a novel approach that integrates heterogeneous features by performing multi-modal semi-supervised classification on unlabeled as well as unsegmented images. Treating each type of feature as one modality and taking advantage of the large amount of unlabeled data, our new adaptive multi-modal semi-supervised classification (AMMSS) algorithm simultaneously learns a commonly shared class indicator matrix and the weights for the different modalities (image features).
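
The abstract states that a shared class indicator matrix and per-modality weights are learned together, but it does not give the objective or the update rules. The sketch below is therefore only a generic graph-based illustration of that idea, not the authors' AMMSS formulation. The Gaussian affinity, the exponent `r`, the alternating update scheme, and the names `rbf_laplacian` and `multimodal_ssl` are all assumptions introduced here.

```python
import numpy as np

def rbf_laplacian(X, sigma=1.0):
    """Unnormalized graph Laplacian L = D - W from a dense Gaussian affinity (assumed)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    return np.diag(W.sum(axis=1)) - W

def multimodal_ssl(views, y_labeled, n_classes, r=2.0, n_iter=20):
    """Alternate between a shared class indicator F and modality weights alpha.

    views:     list of (n, d_v) feature arrays, one per modality; the labeled
               samples must occupy the first len(y_labeled) rows.
    y_labeled: integer class labels of those first rows.
    r:         hypothetical smoothing exponent (> 1) on the weights.
    """
    n = views[0].shape[0]
    l = len(y_labeled)
    Y = np.eye(n_classes)[y_labeled]            # one-hot labels, shape (l, c)
    laplacians = [rbf_laplacian(X) for X in views]
    alpha = np.full(len(views), 1.0 / len(views))
    F = np.zeros((n, n_classes))
    F[:l] = Y                                   # labeled rows stay fixed
    for _ in range(n_iter):
        # Step 1: with alpha fixed, solve the harmonic system for unlabeled rows.
        L = sum(a ** r * Lv for a, Lv in zip(alpha, laplacians))
        F[l:] = np.linalg.solve(L[l:, l:], -L[l:, :l] @ Y)
        # Step 2: with F fixed, give more weight to modalities on which F is smooth.
        costs = np.array([np.trace(F.T @ Lv @ F) for Lv in laplacians])
        inv = (1.0 / np.maximum(costs, 1e-12)) ** (1.0 / (r - 1.0))
        alpha = inv / inv.sum()
    return F, alpha
```

Calling `F, alpha = multimodal_ssl([X_a, X_b], y[:l], n_classes)` with two feature matrices over the same images returns soft class scores in `F` (predict with `F.argmax(axis=1)`) and weights in `alpha` that sum to one. The dense affinity is used only to keep the linear system well posed in a small example; a sparse k-nearest-neighbor graph would be the more usual choice at scale.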


reference text

[1] X. Cai, F. Nie, and H. Huang. Multi-view k-means clustering on big data. International Joint Conference on Artificial Intelligence, pages 2598–2604, 2013.

[2] X. Cai, F. Nie, H. Huang, and F. Kamangar. Heterogeneous image feature integration via multi-modal spectral clustering. In CVPR, pages 1977–1984, 2011.

[3] L. Cao, J. Luo, F. Liang, and T. Huang. Heterogeneous feature machines for visual recognition. In Computer Vision, 2009 IEEE 12th International Conference on, pages 1095–1102. IEEE, 2010.

[4] C.-C. Chang and C.-J. Lin. LIBSVM: A library for support vector machines. ACM TIST, 2(3):27, 2011.

[5] H. Chen, X. Cai, D. Zhu, F. Nie, T. Liu, and H. Huang. Group-wise consistent parcellation of gyri via adaptive multi-view spectral clustering of fiber shapes. International Conference on Medical Image Computing and Computer Assisted Intervention, pages 271–279, 2012.

[6] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR (1), pages 886–893, 2005.

[7] C. H. Q. Ding, R. Jin, T. Li, and H. D. Simon. A learning framework using function and kernel regularization with application to recommender system. In KDD, pages 260–269, 2007.

[8] D. Dueck and B. J. Frey. Non-metric affinity propagation for unsupervised image categorization. In ICCV, pages 1–8, 2007.

[9] L. Fei-Fei, R. Fergus, and P. Perona. Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In Workshop on Generative-Model Based Vision, 2004.

[10] A. Frank and A. Asuncion. UCI machine learning repository, 2010.

[11] K. Grauman and T. Darrell. The pyramid match kernel: Discriminative classification with sets of image features. In ICCV, pages 1458–1465, 2005.

[12] C. H. Lampert, H. Nickisch, and S. Harmeling. Learning to detect unseen object classes by between-class attribute transfer. In CVPR, pages 951–958, 2009.

[13] Y. J. Lee and K. Grauman. Foreground focus: Unsupervised learning from partially matching images. International Journal of Computer Vision, 85(2):143–166, 2009.

[14] F.-F. Li, R. Fergus, and P. Perona. Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. Computer Vision and Image Understanding, 106(1):59–70, 2007.

[15] D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91–110, 2004.

[16] T. Ojala, M. Pietikäinen, and T. Mäenpää. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell., 24(7):971–987, 2002.

[17] A. Oliva and A. B. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42(3):145–175, 2001.

[18] S. Sonnenburg, G. Rätsch, C. Schäfer, and B. Schölkopf. Large scale multiple kernel learning. Journal of Machine Learning Research, 7:1531–1565, 2006.

[19] H. Wang, F. Nie, and H. Huang. Multi-view clustering and feature learning via structured sparsity. International Conference on Machine Learning, pages 352–360, 2013.

[20] H. Wang, F. Nie, H. Huang, and C. Ding. Heterogeneous visual features fusion via sparse multimodal machine. IEEE Conference on Computer Vision and Pattern Recognition, pages 3097–3102, 2013.

[21] H. Wang, F. Nie, H. Huang, et al. Identifying disease sensitive and quantitative trait-relevant biomarkers from multidimensional heterogeneous imaging genetics data via sparse multimodal multitask learning. Bioinformatics, 28(12):i127–i136, 2012.

[22] J. M. Winn and N. Jojic. Locus: Learning object classes with unsupervised segmentation. In ICCV, pages 756–763, 2005.

[23] J. Wu and J. M. Rehg. Where am I: Place instance and category recognition using spatial PACT. In CVPR, 2008.

[24] H. Yu, M. Li, H. Zhang, and J. Feng. Color texture moments for content-based image retrieval. In ICIP (3), pages 929–932, 2002.

[25] L. Zelnik-Manor and P. Perona. Self-tuning spectral clustering. In NIPS, 2004.

[26] D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Schölkopf. Learning with local and global consistency. In NIPS, 2003.

[27] D. Zhou and B. Schölkopf. Learning from labeled and unlabeled data using random walks. In DAGM-Symposium, pages 237–244, 2004.

[28] X. Zhu, Z. Ghahramani, and J. D. Lafferty. Semi-supervised learning using Gaussian fields and harmonic functions. In ICML, pages 912–919, 2003.