iccv iccv2013 iccv2013-106 iccv2013-106-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Zhenyao Zhu, Ping Luo, Xiaogang Wang, Xiaoou Tang
Abstract: Face recognition with large pose and illumination variations is a challenging problem in computer vision. This paper addresses this challenge by proposing a new learning-based face representation: the face identity-preserving (FIP) features. Unlike conventional face descriptors, the FIP features can significantly reduce intra-identity variances, while maintaining discriminativeness between identities. Moreover, the FIP features extracted from an image under any pose and illumination can be used to reconstruct its face image in the canonical view. This property makes it possible to improve the performance of traditional descriptors, such as LBP [2] and Gabor [31], which can be extracted from our reconstructed images in the canonical view to eliminate variations. In order to learn the FIP features, we carefully design a deep network that combines the feature extraction layers and the reconstruction layer. The former encodes a face image into the FIP features, while the latter transforms them to an image in the canonical view. Extensive experiments on the large MultiPIE face database [7] demonstrate that the proposed method significantly outperforms the state-of-the-art face recognition methods.
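The two-part design described in the abstract (feature extraction layers followed by a reconstruction layer) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and not taken from the paper: the fully connected layers, the ReLU nonlinearity, the layer sizes, the random weights, the mean squared reconstruction error, and all function names.

```python
import random

def relu(v):
    """Element-wise rectified linear unit."""
    return [max(0.0, x) for x in v]

def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def random_matrix(rows, cols, rng, scale=0.1):
    """Small random weights; a stand-in for learned parameters."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

def extract_features(image, encoder_weights):
    """Feature extraction layers: flattened image -> feature vector."""
    h = image
    for W in encoder_weights:
        h = relu(matvec(W, h))
    return h

def reconstruct(features, W_rec):
    """Reconstruction layer: feature vector -> flattened canonical-view image."""
    return matvec(W_rec, features)

def reconstruction_loss(pred, target):
    """Mean squared error between reconstruction and canonical-view target (assumed loss)."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)

rng = random.Random(0)
n_pixels, n_hidden, n_features = 64, 32, 16  # hypothetical sizes

encoder_weights = [
    random_matrix(n_hidden, n_pixels, rng),
    random_matrix(n_features, n_hidden, rng),
]
W_rec = random_matrix(n_pixels, n_features, rng)

# Stand-ins for a face under arbitrary pose/illumination and for the
# canonical-view image of the same identity.
input_image = [rng.random() for _ in range(n_pixels)]
canonical_image = [rng.random() for _ in range(n_pixels)]

features = extract_features(input_image, encoder_weights)
recon = reconstruct(features, W_rec)
loss = reconstruction_loss(recon, canonical_image)
```

In a sketch like this, training would adjust the weights to reduce `loss`, so that `features` keeps the identity information needed to produce the canonical view. Conventional descriptors could then be computed on `recon` instead of on the raw input.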
Figure 4. Examples of face reconstruction. For each identity, we select its images with 6 poses and arbitrary illuminations. The reconstructed frontal face images under neutral illumination are visualized below. We clearly see that our method can remove the effects of both poses and illuminations, and retains the intrinsic face shapes and structures of the identity.
[1] H. Abdi. Discriminant correspondence analysis. Encyclopedia of Measurement and Statistics, 2007.
[2] T. Ahonen, A. Hadid, and M. Pietikainen. Face description with local binary patterns: Application to face recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(12):2037–2041, 2006.
[3] A. Asthana, T. K. Marks, M. J. Jones, K. H. Tieu, and M. Rohith. Fully automatic pose-invariant face recognition via 3d pose normalization. In ICCV, 2011.
[4] Z. Cao, Q. Yin, X. Tang, and J. Sun. Face recognition with learning-based descriptor. In CVPR, 2010.
[5] C. D. Castillo and D. W. Jacobs. Wide-baseline stereo for face recognition with large pose variation. In CVPR, 2011.
[6] S. Chopra, R. Hadsell, and Y. LeCun. Learning a similarity metric discriminatively, with application to face. In CVPR, 2005.
[7] R. Gross, I. Matthews, J. Cohn, T. Kanade, and S. Baker. Multi-pie. In International Conference on Automatic Face and Gesture Recognition, 2008.
[8] Y. Guo, G. Zhao, M. Pietikainen, and Z. Xu. Descriptor learning based on fisher separation criterion for texture classification. In ACCV, 2010.
[9] G. E. Hinton, S. Osindero, and Y.-W. Teh. A fast learning algorithm for deep belief nets. Neural Computation, 18(7):1527–1554, 2006.
[10] G. B. Huang, H. Lee, and E. Learned-Miller. Learning hierarchical representations for face verification with convolutional deep belief networks. In CVPR, 2012.
[11] I. T. Jolliffe. Principal component analysis, volume 487. 1986.
[12] A. Krizhevsky, I. Sutskever, and G. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
[13] Q. V. Le, J. Ngiam, Z. Chen, D. Chia, P. W. Koh, and A. Y. Ng. Tiled convolutional neural networks. In NIPS, 2010.
[14] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. In Proceedings of the IEEE, 1998.
[15] H. Lee, R. Grosse, R. Ranganath, and A. Y. Ng. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In Proc. 26th International Conference on Machine Learning, pages 609–616. ACM, 2009.
[16] Z. Lei, D. Yi, and S. Z. Li. Discriminant image filter learning for face recognition with local binary pattern like representation. In CVPR, 2012.
[17] A. Li, S. Shan, and W. Gao. Coupled bias–variance tradeoff for cross-pose face recognition. IEEE Transactions on Image Processing, 21(1):305–315, 2012.
[18] S. Li, X. Liu, X. Chai, H. Zhang, S. Lao, and S. Shan. Morphable displacement field based image matching for face recognition across pose. In ECCV, 2012.
[19] V. Nair and G. E. Hinton. Rectified linear units improve restricted boltzmann machines. In Proc. 27th International Conference on Machine Learning, 2010.
[20] N. Qian. On the momentum term in gradient descent learning algorithms. Neural Networks, 1999.
[21] M. Ranzato, J. Susskind, V. Mnih, and G. Hinton. On deep generative models with applications to recognition. In CVPR, 2011.
[22] R. Salakhutdinov and G. E. Hinton. Deep boltzmann machines. In Proceedings of the International Conference on Artificial Intelligence and Statistics, volume 5, pages 448–455, 2009.
[23] F. Schroff, T. Treibitz, D. Kriegman, and S. Belongie. Pose, illumination and expression invariant pairwise face-similarity measure via doppelgänger list comparison. In ICCV, 2011.
[24] Y. Sun, X. Wang, and X. Tang. Hybrid deep learning for face verification. In ICCV, 2013.
[25] X. Tang and X. Wang. Face sketch recognition. IEEE Transactions on Circuits and Systems for Video Technology, 14(1):50–57, 2004.
[26] A. Wagner, J. Wright, A. Ganesh, Z. Zhou, H. Mobahi, and Y. Ma. Toward a practical face recognition system: Robust alignment and illumination by sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(2):372–386, 2012.
[27] X. Wang and X. Tang. Dual-space linear discriminant analysis for face recognition. In CVPR, 2004.
[28] X. Wang and X. Tang. A unified framework for subspace face recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(9):1222–1228, 2004.
[29] X. Wang and X. Tang. Random sampling for subspace face recognition. International Journal of Computer Vision, 70(1):91–104, 2006.
[30] X. Wang and X. Tang. Face photo-sketch synthesis and recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(11):1955–1967, 2009.
[31] L. Wiskott, J.-M. Fellous, N. Krüger, and C. von der Malsburg. Face recognition by elastic bunch graph matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(7):775–779, 1997.
[32] J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma. Robust face recognition via sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(2):210–227, 2009.
[33] M. D. Zeiler, G. W. Taylor, and R. Fergus. Adaptive deconvolutional networks for mid and high level feature learning. In ICCV, 2011.
[34] W. Zhang, S. Shan, W. Gao, X. Chen, and H. Zhang. Local gabor binary pattern histogram sequence (lgbphs): A novel non-statistical model for face representation and recognition. In ICCV, 2005.
[35] W. Zhang, X. Wang, and X. Tang. Coupled information-theoretic encoding for face photo-sketch recognition. In CVPR, 2011.
[36] X. Zhang and Y. Gao. Face recognition across pose: A review. Pattern Recognition, 42(11):2876–2896, 2009.