jmlr jmlr2007 jmlr2007-80 jmlr2007-80-reference knowledge-graph by maker-knowledge-mining

80 jmlr-2007-Synergistic Face Detection and Pose Estimation with Energy-Based Models


Source: pdf

Author: Margarita Osadchy, Yann Le Cun, Matthew L. Miller

Abstract: We describe a novel method for simultaneously detecting faces and estimating their pose in real time. The method employs a convolutional network to map images of faces to points on a lowdimensional manifold parametrized by pose, and images of non-faces to points far away from that manifold. Given an image, detecting a face and estimating its pose is viewed as minimizing an energy function with respect to the face/non-face binary variable and the continuous pose parameters. The system is trained to minimize a loss function that drives correct combinations of labels and pose to be associated with lower energy values than incorrect ones. The system is designed to handle very large range of poses without retraining. The performance of the system was tested on three standard data sets—for frontal views, rotated faces, and profiles— is comparable to previous systems that are designed to handle a single one of these data sets. We show that a system trained simuiltaneously for detection and pose estimation is more accurate on both tasks than similar systems trained for each task separately.1 Keywords: face detection, pose estimation, convolutional networks, energy based models, object recognition


reference text

L. Bottou and Y. LeCun. The Lush Manual. http://lush.sf.net, 2002. R. Caruana. Multitask learning. Machine Learning, 28:41–75, 1997. F. Fleuret and D. Geman. Coarse-to-fine face detection. IJCV, pages 85–107, 2001. C. Garcia and M. Delakis. A neural architecture for fast and robust face detection. IEEE-IAPR Int. Conference on Pattern Recognition, pages 40–43, 2002. 1213 O SADCHY, L E C UN AND M ILLER C. Huang, B. Wu, H. Ai, and S. Lao. Omni-directional face detection based on real adaboost. In International Conference on Image Processing, Singapore, 2004. M. Jones and P. Viola. Fast multi-view face detection. Technical Report TR2003-96, Mitsubishi Electric Research Laboratories, 2003. Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. Backpropagation applied to handwritten zip code recognition. Neural Computation, 1(4):541– 551, Winter 1989. Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, November 1998. Y. LeCun, R. Hadsell, S. Chopra, F.-J. Huang, and M.-A. Ranzato. A tutorial on energy-based learning. In Predicting Structured Outputs. Bakir et al. (eds),MIT Press, 2006. Y. LeCun and F. J. Huang. Loss functions for discriminative training of energy-based models. In Proc. of the 10-th International Workshop on Artificial Intelligence and Statistics (AIStats’05), 2005. Y. LeCun, F.-J. Huang, and L. Bottou. Learning methods for generic object recognition with invariance to pose and lighting. In Proceedings of CVPR’04. IEEE Press, 2004. S. Z. Li, L. Zhu, Z. Zhang, A. Blake, H. Zhang, and H. Shum. Statistical learning of multi-view face detection. In Proceedings of the 7th European Conference on Computer Vision-Part IV, 2002. Y. Li, S. Gong, and H. Liddell. Support vector regression and classification based multi-view face detection and recognition. In Face and Gesture, 2000. S. Nowlan and J. Platt. A convolutional neural network hand tracker. In Advances in Neural Information Processing Systems (NIPS 1995), pages 901–908, San Mateo, CA, 1995. Morgan Kaufmann. M. Osadchy, M. Miller, and Y. LeCun. Synergistic face detection and pose estimation with energybased model. In Advances in Neural Information Processing Systems (NIPS 2004). MIT Press, 2005. A. Pentland, B. Moghaddam, and T. Starner. View-based and modular eigenspaces for face recognition. In CVPR, 1994. H. A. Rowley, S. Baluja, and T. Kanade. Neural network-based face detection. PAMI, 20:22–38, 1998a. H. A. Rowley, S. Baluja, and T. Kanade. Rotation invariant neural network-based face detection. In Computer Vision and Pattern Recognition, 1998b. H. Schneiderman and T. Kanade. A statistical method for 3d object detection applied to faces and cars. In Computer Vision and Pattern Recognition, 2000. K. Sung and T. Poggio. Example-based learning of view-based human face detection. PAMI, 20: 39–51, 1998. 1214 S YNERGISTIC FACE D ETECTION AND P OSE E STIMATION WITH E NERGY-BASED M ODELS R. Vaillant, C. Monrocq, and Y. LeCun. Original approach for the localisation of objects in images. IEE Proc on Vision, Image, and Signal Processing, 141(4):245–250, August 1994. P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition, pages 511–518, 2001. M.-H. Yang, D. Kriegman, and N. Ahuja. Detecting faces in images: A survey. PAMI, 24(1):34–58, 2002. 1215