119 cvpr-2013-Detecting and Aligning Faces by Image Retrieval

Source: pdf

Author: Xiaohui Shen, Zhe Lin, Jonathan Brandt, Ying Wu

Abstract: Detecting faces in uncontrolled environments continues to be a challenge to traditional face detection methods [24] due to the large variation in facial appearances, as well as occlusion and clutter. In order to overcome these challenges, we present a novel and robust exemplar-based face detector that integrates image retrieval and discriminative learning. A large database of faces with bounding rectangles and facial landmark locations is collected, and simple discriminative classifiers are learned from each of them. A voting-based method is then proposed to let these classifiers cast votes on the test image through an efficient image retrieval technique. As a result, faces can be very efficiently detected by selecting the modes from the voting maps, without resorting to exhaustive sliding window-style scanning. Moreover, due to the exemplar-based framework, our approach can detect faces under challenging conditions without explicitly modeling their variations. Evaluation on two public benchmark datasets shows that our new face detection approach is accurate and efficient, and achieves state-of-the-art performance. We further propose to use image retrieval for face validation (in order to remove false positives) and for face alignment/landmark localization. The same methodology can also be easily generalized to other face-related tasks, such as attribute recognition, as well as general object detection.
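
The voting step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the retrieval stage has already produced a list of weighted votes for candidate face centers, and the names `accumulate_votes` and `find_modes`, the vote format, and the `threshold` and `radius` parameters are all hypothetical. It only shows the general idea of accumulating votes into a map and picking its modes instead of scanning every window.

```python
from typing import List, Tuple

# Hypothetical vote format: (row, col, weight) for a predicted face center.
Vote = Tuple[int, int, float]
Mode = Tuple[int, int, float]


def accumulate_votes(votes: List[Vote], height: int, width: int) -> List[List[float]]:
    """Sum weighted votes into a height x width voting map."""
    voting_map = [[0.0] * width for _ in range(height)]
    for row, col, weight in votes:
        # Votes that fall outside the image are ignored.
        if 0 <= row < height and 0 <= col < width:
            voting_map[row][col] += weight
    return voting_map


def find_modes(voting_map: List[List[float]], threshold: float, radius: int = 1) -> List[Mode]:
    """Return cells that reach `threshold` and strictly exceed all neighbours.

    Neighbours are the cells within `radius` rows and columns. Exact ties
    between neighbouring cells are dropped in this simplified version.
    """
    height = len(voting_map)
    width = len(voting_map[0]) if height else 0
    modes: List[Mode] = []
    for r in range(height):
        for c in range(width):
            score = voting_map[r][c]
            if score < threshold:
                continue
            neighbours = [
                voting_map[rr][cc]
                for rr in range(max(0, r - radius), min(height, r + radius + 1))
                for cc in range(max(0, c - radius), min(width, c + radius + 1))
                if (rr, cc) != (r, c)
            ]
            if all(score > n for n in neighbours):
                modes.append((r, c, score))
    # Strongest candidates first.
    return sorted(modes, key=lambda m: -m[2])


if __name__ == "__main__":
    # Toy votes, invented for illustration only.
    toy_votes = [(2, 2, 1.0), (2, 2, 0.5), (2, 3, 0.4), (7, 7, 0.9), (0, 0, 0.1)]
    vmap = accumulate_votes(toy_votes, height=10, width=10)
    print(find_modes(vmap, threshold=0.8))
```

In this sketch the cost of detection depends on the number of votes and the map size rather than on the number of candidate windows, which is the property the abstract attributes to mode selection over sliding-window scanning.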

reference text

[1] P. Belhumeur, D. Jacobs, D. Kriegman, and N. Kumar. Localizing parts of faces using a consensus of exemplars. In CVPR, 2011.

[2] L. Bourdev and J. Brandt. Robust object detection via soft cascade. In CVPR, 2005.

[3] S. C. Brubaker, J. Wu, J. Sun, M. D. Mullin, and J. M. Rehg. On the design of cascades of boosted ensembles for face detection. IJCV, 77, 2008.

[4] H. Cevikalp and B. Triggs. Efficient object detection using cascades of nearest convex model classifiers. In CVPR, 2012.

[5] S. Dai, M. Yang, Y. Wu, and A. K. Katsaggelos. Detector ensemble. In CVPR, 2007.

[6] P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part based models. PAMI, 2009.

[7] B. Heisele, T. Serre, and T. Poggio. A component-based framework for face detection and identification. IJCV, 74(2), 2007.

[8] C. Huang, H. Ai, Y. Li, and S. Lao. High-performance rotation invariant multiview face detection. PAMI, 2007.

[9] V. Jain and E. Learned-Miller. Fddb: A benchmark for face detection in unconstrained settings. Technical Report UMCS-2010-009, 2010.

[10] V. Jain and E. Learned-Miller. Online domain adaptation of a pre-trained cascade of classifiers. In CVPR, 2011.

[11] Z. Kalal, J. Matas, and K. Mikolajczyk. Weighted sampling for large-scale boosting. In BMVC, 2008.

[12] M. Koestinger, P. Wohlhart, P. M. Roth, and H. Bischof. Annotated facial landmarks in the wild: A large-scale, real-world database for facial landmark localization. In First IEEE International Workshop on Benchmarking Facial Image Analysis Technologies, 2011.

[13] C. H. Lampert. Detecting objects in large image collections and videos by efficient subimage retrieval. In ICCV, 2009.

[14] B. Leibe, A. Leonardis, and B. Schiele. Combined object categorization and segmentation with an implicit shape model. In ECCV Workshop on Statistical Learning in Computer Vision, 2004.

[15] J. Li, T. Wang, and Y. Zhang. Face detection using surf cascade. In ICCV Workshops, 2011.

[16] Z. Lin and J. Brandt. A local bag-of-features model for large-scale object retrieval. In ECCV, 2010.

[17] D. G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2):91–110, 2004.

[18] K. Mikolajczyk, C. Schmid, and A. Zisserman. Human detection based on a probabilistic assembly of robust part detectors. In ECCV, 2004.

[19] M. Muja and D. G. Lowe. Fast approximate nearest neighbors with automatic algorithm configuration. In VISAPP, 2009.

[20] X. Shen, Z. Lin, J. Brandt, S. Avidan, and Y. Wu. Object retrieval and localization with spatially-constrained similarity measure and k-nn reranking. In CVPR, 2012.

[21] X. Shen, Z. Lin, J. Brandt, and Y. Wu. Mobile product image search by automatic query object extraction. In ECCV, 2012.

[22] J. Sivic and A. Zisserman. Video google: A text retrieval approach to object matching in videos. In ICCV, 2003.

[23] B. S. Venkatesh and S. Marcel. Fast bounding box estimation based face detection. In ECCV Workshop on Face Detection, 2010.

[24] P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features. In CVPR, 2001.

[25] X. Wang, T. X. Han, and S. Yan. An hog-lbp human detector with partial occlusion handling. In ICCV, 2009.

[26] B. Wu, H. Ai, C. Huang, and S. Lao. Fast rotation invariant multi-view face detection based on real adaboost. In FG, 2004.

[27] Z. Wu, Q. Ke, J. Sun, and H.-Y. Shum. Scalable face image retrieval with identity-based quantization and multireference reranking. PAMI, 33(10), 2011.

[28] C. Zhang and Z. Zhang. A survey of recent advances in face detection. Technical Report, MSR-TR-2010-66, 2010.

[29] X. Zhu and D. Ramanan. Face detection, pose estimation, and landmark localization in the wild. In CVPR, 2012.