cvpr cvpr2013 cvpr2013-167 cvpr2013-167-reference knowledge-graph by maker-knowledge-mining
Author: Dan Levi, Shai Silberstein, Aharon Bar-Hillel
Abstract: In this work we present a new part-based object detection algorithm with hundreds of parts performing real-time detection. Part-based models are currently state-of-the-art for object detection due to their ability to represent large appearance variations. However, due to their high computational demands such methods are limited to several parts only and are too slow for practical real-time implementation. Our algorithm is an accelerated version of the “Feature Synthesis” (FS) method [1], which uses multiple object parts for detection and is among state-of-the-art methods on human detection benchmarks, but also suffers from a high computational cost. The proposed Accelerated Feature Synthesis (AFS) uses several strategies for reducing the number of locations searched for each part. The first strategy uses a novel algorithm for approximate nearest neighbor search which we developed, termed “KDFerns”, to compare each image location to only a subset of the model parts. Candidate part locations for a specific part are further reduced using spatial inhibition, and using an object-level “coarse-to-fine” strategy. In our empirical evaluation on pedestrian detection benchmarks, AFS maintains almost fully the accuracy performance of the original FS, while running more than 4× faster than existing part-based methods which use only several parts. AFS is to our best knowledge the first part-based object detection method achieving real-time running performance: nearly 10 frames per second on 640 × 480 images on a regular CPU.
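The idea of comparing each image location to only a subset of the model parts can be illustrated with a generic bucketed lookup. The sketch below is not the KDFerns algorithm, whose details the abstract does not give. It is a minimal illustration under assumed conditions: descriptors are plain lists of floats, and the binary tests `(dimension, threshold)` are supplied by hand. All function names are hypothetical.

```python
def bucket_index(descriptor, tests):
    """Hash a descriptor to a bucket using a sequence of binary tests.

    Each test is an assumed (dimension, threshold) pair; the outcome of
    each comparison contributes one bit to the bucket index.
    """
    index = 0
    for dim, thresh in tests:
        index = (index << 1) | (1 if descriptor[dim] > thresh else 0)
    return index


def build_buckets(parts, tests):
    """Group part indices by the bucket their descriptor falls into."""
    buckets = {}
    for part_id, desc in enumerate(parts):
        buckets.setdefault(bucket_index(desc, tests), []).append(part_id)
    return buckets


def candidate_parts(descriptor, buckets, tests):
    """Return only the parts sharing a bucket with the query descriptor."""
    return buckets.get(bucket_index(descriptor, tests), [])


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length descriptors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_part(descriptor, parts, buckets, tests):
    """Approximate nearest part: search the query's bucket only.

    Returns None when the bucket is empty, so the location is compared
    to no part at all.
    """
    cands = candidate_parts(descriptor, buckets, tests)
    if not cands:
        return None
    return min(cands, key=lambda pid: squared_distance(descriptor, parts[pid]))
```

With `n` binary tests there are at most `2**n` buckets, so a query touches roughly a `1 / 2**n` fraction of the parts when they are spread evenly. The price is that the true nearest part may sit in a neighbouring bucket and be missed, which is what makes the search approximate.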
[1] A. Bar-Hillel, D. Levi, E. Krupka, and C. Goldberg. Part-based feature synthesis for human detection. In ECCV 2010, volume 6314, pages 127–142. 2010.
[2] R. Benenson, M. Mathias, R. Timofte, and L. J. V. Gool. Pedestrian detection at 100 frames per second. In CVPR, pages 2903–2910. IEEE, 2012.
[3] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, 2005.
[4] P. Dollár, R. Appel, and W. Kienzle. Crosstalk cascades for frame-rate pedestrian detection. In ECCV, 2012.
[5] P. Dollár, S. Belongie, and P. Perona. The fastest pedestrian detector in the west. In BMVC, 2010.
[6] P. Dollár, Z. Tu, P. Perona, and S. Belongie. Integral channel features. In BMVC, 2009.
[7] P. Dollár, C. Wojek, B. Schiele, and P. Perona. Pedestrian detection: An evaluation of the state of the art. PAMI, 99, 2011.
[8] C. Dubout and F. Fleuret. Exact acceleration of linear object detectors. In Proceedings of the European Conference on Computer Vision, 2012.
[9] M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2009 (VOC2009) Results. http://www.pascalnetwork.org/challenges/VOC/voc2009/workshop/index.html.
[10] P. F. Felzenszwalb, R. B. Girshick, and D. A. McAllester. Cascade object detection with deformable part models. In CVPR, pages 2241–2248. IEEE, 2010.
[11] P. F. Felzenszwalb, R. B. Girshick, D. A. McAllester, and D. Ramanan. Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell., 32(9):1627–1645, 2010.
[12] R. Fergus, P. Perona, and A. Zisserman. Object class recognition by unsupervised scale invariant learning. In CVPR, 2003.
[13] J. H. Friedman, J. L. Bentley, and R. A. Finkel. An algorithm for finding best matches in logarithmic expected time. ACM Trans. Math. Softw., 3(3):209–226, 1977.
[14] K. Fukunaga and P. M. Narendra. A branch and bound algorithm for computing k-nearest neighbors. IEEE Trans. Computers, 24(7):750–753, 1975.
[15] D. G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60:91–110, 2004.
[16] M. Muja and D. G. Lowe. Fast approximate nearest neighbors with automatic algorithm configuration. In VISAPP (1), pages 331–340, 2009.
[17] M. Pedersoli, A. Vedaldi, and J. Gonzàlez. A coarse-to-fine approach for fast deformable object detection. In CVPR, pages 1353–1360. IEEE, 2011.
[18] A. Shashua, Y. Gdalyahu, and G. Hayun. Pedestrian detection for driving assistance systems: Single-frame classification and system level performance. In Intelligent Vehicles Symposium, pages 1–6, 2004.
[19] C. Silpa-Anan and R. Hartley. Optimised kd-trees for fast image descriptor matching. In CVPR, 2008.
[20] www.mobileye.com.
[21] P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features. CVPR, 1:511, 2001.
[22] C. Wojek and B. Schiele. A performance evaluation of single and multi-feature people detection. In G. Rigoll, editor, DAGM-Symposium, volume 5096 of Lecture Notes in Computer Science, pages 82–91. Springer, 2008.