iccv iccv2013 iccv2013-325 iccv2013-325-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Hyun Soo Park, Eakta Jain, Yaser Sheikh
Abstract: We present a method to predict primary gaze behavior in a social scene. Inspired by the study of electric fields, we posit “social charges ”—latent quantities that drive the primary gaze behavior of members of a social group. These charges induce a gradient field that defines the relationship between the social charges and the primary gaze direction of members in the scene. This field model is used to predict primary gaze behavior at any location or time in the scene. We present an algorithm to estimate the time-varying behavior of these charges from the primary gaze behavior of measured observers in the scene. We validate the model by evaluating its predictive precision via cross-validation in a variety of social scenes.
[1] R. Adolphs. The social brain: Neural basis of social knowledge. Annual Review of Psychology, 2009. 1
[2] L. Bazzani, D. Tosato, M. Cristani, M. Farenzena, G. Pagetti, G. Menegaz, and V. Murino. Social interactions by visual focus of attention in a threedimensional environment. Expert Systems, 2011. 2
[3] M. Bernhard, E. Stavrakis, and M. Wimmer. An empirical pipeline to derive gaze prediction heuristics for 3d action games. TAP. 2
[4] E. Birmingham and A. Kingstone. Human social attention. Brain Research, 2009. 2
[5] K. Chawarska, S. Macari, and F. Shic. Decreased spontaneous attention to social scenes in 6-month-old infants later diagnosed with autism spectrum disorders. Biological Psychiatry, 2013. 1
[6] W. Choi, K. Shahid, and S. Savarese. Learning context for collective activity recognition. In CVPR, 2011. 2
[7] M. Cristani, L. Bazzani, G. Paggetti, A. Fossati, D. Tosato, A. Del Bue, G. Menegaz, and V. Murino. Social interaction discovery by statistical analysis of F-formations. In BMVC, 2011. 2
[8] M. Cristani, G. Paggetti, A. Vinciarelli, L. Bazzani, G. Menegaz, and V. Murino. Towards computational proxemics: Inferring social relations from
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]
[24]
[25]
[26] interpersonal distances. In SocialCom, 2011. 2 N. J. Emery. The eyes have it: The neuroethology, function and evolution of social gaze. Neuroscience and Biobehavioral Reviews, 2000. 1, 7 A. Fathi, J. K. Hodgins, and J. M. Rehg. Social interaction: A first-person perspective. In CVPR, 2012. 2, 3, 5 C. K. Friesen and A. Kingstone. The eyes have it! reflexive orienting is triggered by nonpredictive gaze. Psychonomic Bulletin and Review, 1998. 2 K. Fukunaga and L. D. Hostetler. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Transactions on Information Theory, 1975. 5 R. Gayle, W. Moss, M. C. Lin, and D. Manocha. Multi-robot coordination using generalized social potential fields. In ICRA, 2009. 2 C. Granger. Investigating causal relations by econometric models and crossspectral methods. Econometrica, 1969. 2 A. Gupta, P. Srinivasan, J. Shi, and L. S. Davis. Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos. In CVPR, 2009. 2 E. Hall. The hidden dimension. Doubleday New York, 1966. 2 R. I. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Cambridge University Press, 2004. 4 D. Helbing and P. Moln ´ar. Social force model for pedestrian dynamics. Physics Review E, 1995. 2 J. M. Henderson. Human gaze control during real-world scene perception. Trends in Cognitive Sciences. 2 L. Itti, C. Koch, and E. Niebur. A model of saliency-based visual attention for rapid scene analysis. TPAMI, 1998. 2 R. S. Jampel and D. X. Shi. The primary position of the eyes, the resetting saccade, and the transverse visual head plane. head movements around the cervical joints. Investigative Ophthalmology and Vision Science, 1992. 3 A. Johansson, D. Helbing, and P. K. Shukla. Specification of a microscopic pedestrian model by evolutionary adjustment to video tracking data. Advances in Complex Systems, 2008. 2 A. Kendon. Conducting Interaction: Patterns of Behavior in Focused Encounters. Cambridge University Press, 1990. 2 A. Kendon. Spacing and orientation in co-present interaction. In Development of Multimodal Interfaces: active Listening and Synchrony, 2010. 2 O. Khatib. Real-time obstacle avoidance for manipulators and mobile robots. In ICRA, 1985. 2 K. Kim, M. Grundmann, A. Shamir, I. Matthews, J. Hodgins, and I. Essa. Motion fields to predict play evolution in dynamic sport scenes. In CVPR, 2010. 2, 3, 5, 6
[27] C. Koch and S. Ullman. Shifts in selective visual attention: towards the underlying neural circuitry. Human Neurobiology, 1985. 2
[28] O. Komogortsev and J. Khan. Perceptual attention focus prediction for multiple viewers in case of multimedia perceptual compression with feedback delay. In ETRA, 2006. 2
[29] M. Marin-Jimenez, A. Zisserman, and V. Ferrari. heres looking at you, kid. detecting people looking at each other in videos. In BMVC, 2011. 2
[30] P. Marshall, Y. Rogers, and N. Pantidi. Using f-formations to analyse spatial patterns of interaction in physical environments. In CSCW, 2011. 2
[31] R. Mehran, A. Oyama, and M. Shah. Abnormal crowd behavior detection using social force model. In CVPR, 2009. 2
[32] H. Murphy and A. T. Duchowski. Gaze-contingent level of detail rendering. In Eurographics, 2001 . 2
[33] N. Oliver, B. Rosario, and A. Pentland. Graphical models for recognising human interactions. In NIPS, 1998. 2
[34] H. S. Park, E. Jain, and Y. Shiekh. 3D social saliency from head-mounted cameras. In NIPS, 2012. 2, 5, 6, 7
[35] S. Pellegrini, A. Ess, K. Schindler, and L. van Gool. Youll never walk alone: Modeling social behavior for multi-target tracking. In ICCV, 2009. 2
[36] A. S. Pentland. To signal is human. American Scientist, 2010. 1
[37] R. J. Peters and L. Itti. Applying computational tools to predict gaze direction in interactive visual environments. ACM Transactions on Applied Perception, 2007. 2
[38] M. I. Posner. Orienting of attention. Quarterly Journal of Experimental Psychology, 1980. 2, 7
[39] K. Prabhakar, S. Oh, P. Wang, G. D. Abowd, and J. M. Rehg. Temporal causality for the analysis of visual events. In CVPR, 2010. 2
[40] K. Prabhakar and J. M. Rehg. Categorizing turn-taking interactions. In ECCV, 2012. 2
[41] R. Raghavendra, A. Del Bue, M. Cristani, and V. Murino. Abnormal crowd behavior detection by social force optimization. In HBU, 2011. 2
[42] V. Ramanathan, B. Yao, and L. Fei-Fei. Social role discovery in human events. In CVPR, 2013. 2
[43] J. H. Reif and H. Wang. Social potential field: a distributed behavioral control for autonomous robots. Robotics and Autonomous Systems, 1999. 2
[44] M. S. Ryoo and J. K. Aggarwal. Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities. In ICCV, 2009. 2
[45] D. J. Simons and C. F. Chabris. Gorillas in our midst: Sustained inattentional blindness for dynamic events. Perception. 2
[46] M. Sophie, G. Nathalie, and P. Denis. Gaze prediction improvement by adding a face feature to a saliency model. Recent Advances in Signal Processing, 2009. 2
[47] G. Strang. The discrete cosine transform. SIAM Reivew, 1999. 5
[48] A. Vinciarelli, M. Pantic, and H. Bourlard. Social signal processing: Survey of an emerging domain. Image and Vision Computing, 2009. 1
[49] P. Wang, G. D. Abowd, and J. M. Rehg. Quasi-periodic event analysis for social game retrieval. In ICCV, 2009. 2
[50] X. Wang, V. H. Ablavsky, B. Shitrit, H. Beny, and P. Fua. Take your eyes off the ball: Improving ball-tracking by focusing on team play. CVIU, 2013. 2 [5 1] X. Xiong and F. De la Torre Frade. Supervised descent method and its applications to face alignment. In CVPR, 2013. 6
[52] Y. Yang, S. Baker, A. Kannan, and D. Ramanan. Recognizing proxemics in personal photos. In CVPR, 2012. 2
[53] Y. Zhou, S. Yan, and T. S. Huang. Pair-activity classification by bi-trajectories analysis. In CVPR, 2008. 2 3355 0103