299 cvpr-2013-Multi-source Multi-scale Counting in Extremely Dense Crowd Images


Source: pdf

Author: Haroon Idrees, Imran Saleemi, Cody Seibert, Mubarak Shah

Abstract: We propose to leverage multiple sources of information to compute an estimate of the number of individuals present in an extremely dense crowd visible in a single image. Due to problems including perspective, occlusion, clutter, and few pixels per person, counting by human detection in such images is almost impossible. Instead, our approach relies on multiple sources such as low confidence head detections, repetition of texture elements (using SIFT), and frequency-domain analysis to estimate counts, along with confidence associated with observing individuals, in an image region. Secondly, we employ a global consistency constraint on counts using Markov Random Field. This caters for disparity in counts in local neighborhoods and across scales. We tested our approach on a new dataset of fifty crowd images containing 64K annotated humans, with the head counts ranging from 94 to 4543. This is in stark contrast to datasets used for existing methods which contain not more than tens of individuals. We experimentally demonstrate the efficacy and reliability of the proposed approach by quantifying the counting performance.


reference text

[1] O. Arandjelovic. Crowd detection from still images. In BMVC, 2008.

[2] R. Azencott, J.-P. Wang, and L. Younes. Texture classification using windowed Fourier filters. PAMI, 19(2):148–153, 1997.

[3] G. Brostow and R. Cipolla. Unsupervised Bayesian detection of independent motion in crowds. In CVPR, 2006.

[4] A. Chan, Z. Liang, and N. Vasconcelos. Privacy preserving crowd monitoring: Counting people without people models or tracking. In CVPR, 2008.

[5] K. Chen, C. Loy, S. Gong, and T. Xiang. Feature mining for localised crowd counting. In BMVC, 2012.

[6] S. Cho, T. Chow, and C. Leung. A neural-based crowd estimation by hybrid global learning algorithm. Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on, 29(4):535–541, 1999.

[7] P. Felzenszwalb, D. McAllester, and D. Ramanan. A discriminatively trained, multiscale, deformable part model. In CVPR, 2008.

[8] P. F. Felzenszwalb and D. P. Huttenlocher. Efficient belief propagation for early vision. Int. J. Comput. Vision, 70(1):41–54, Oct. 2006.

[9] J. Ferryman and A. Ellis. PETS2010: Dataset and challenge. In AVSS, 2010.

[10] W. Ge and R. Collins. Marked point processes for crowd counting. In CVPR, 2009.

[11] D. Kong, D. Gray, and H. Tao. Counting pedestrians in crowds using viewpoint invariant training. In BMVC, 2005.

[12] L. Kratz and K. Nishino. Anomaly detection in extremely crowded scenes using spatio-temporal motion pattern models. In CVPR, 2009.

[13] V. Lempitsky and A. Zisserman. Learning to count objects in images. In NIPS, 2010.

[14] T. Leung and J. Malik. Recognizing surfaces using three-dimensional textons. In ICCV, 1999.

[15] M. Li, Z. Zhang, K. Huang, and T. Tan. Estimating the number of people in crowded scenes by mid based foreground segmentation and head-shoulder detection. In ICPR, 2008.

[16] W. Ma, L. Huang, and C. Liu. Crowd density analysis using co-occurrence texture features. In ICCIT, 2010.

[17] A. Marana, S. Velastin, L. Costa, and R. Lotufo. Automatic estimation of crowd density using texture. In IWSIP, 1997.

[18] R. Melina. How is crowd size estimated? In Life'sLittleMysteries.com, 2010.

[19] V. Rabaud and S. Belongie. Counting crowded moving objects. In CVPR, 2006.

[20] M. Rodriguez, J. Sivic, I. Laptev, and J. Y. Audibert. Density-aware person detection and tracking in crowds. In ICCV, 2011.

[21] D. Ryan, S. Denman, C. Fookes, and S. Sridharan. Crowd counting using multiple local features. In Digital Image Computing: Techniques and Applications, 2009.

[22] X. Wang, X. Ma, and E. Grimson. Unsupervised activity perception by hierarchical Bayesian models. In CVPR, 2007.

[23] T. Xiang and S. Gong. Beyond tracking: Modelling activity and understanding behaviour. IJCV, 67(1):21–51, 2006.

[24] B. Zhou, F. Zhang, and L. Peng. Higher-order SVD analysis for crowd density estimation. CVIU, 116(9):1014–1021, 2012.

[25] S. Zhu, C. Guo, Y. Wu, and Y. Wang. What are textons? IJCV, pages 121–143, 2002.