acl acl2012 acl2012-190 acl2012-190-reference knowledge-graph by maker-knowledge-mining

190 acl-2012-Syntactic Stylometry for Deception Detection

Source: pdf

Author: Song Feng ; Ritwik Banerjee ; Yejin Choi

Abstract: Most previous studies in computerized deception detection have relied only on shallow lexico-syntactic patterns. This paper investigates syntactic stylometry for deception detection, adding a somewhat unconventional angle to prior literature. Over four different datasets spanning from the product review to the essay domain, we demonstrate that features driven from Context Free Grammar (CFG) parse trees consistently improve the detection performance over several baselines that are based only on shallow lexico-syntactic features. Our results improve the best published result on the hotel review data (Ott et al., 2011) reaching 91.2% accuracy with 14% error reduction. ,

reference text

S. Argamon-Engelson, M. Koppel, and G. Avneri. 1998. Style-based text categorization: What newspaper am i reading. In Proc. of the AAAI Workshop on Text Categorization, pages 1–4. Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. 2008. LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research, 9:1871– 1874. S. Feng, L. Xing, Gogar A., and Y. Choi. 2012. Distributional footprints of deceptive product reviews. In Proceedings of the 2012 International AAAI Conference on WebBlogs and Social Media, June. S. Greene and P. Resnik. 2009. More than words: Syntactic packaging and implicit sentiment. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 503–511. Association for Computational Linguistics. J.T. Hancock, L.E. Curry, S. Goorha, and M. Woodworth. 2007. On lying and being lied to: A linguistic analysis of deception in computer-mediated communication. Discourse Processes, 45(1) :1–23. Nitin Jindal and Bing Liu. 2008. Opinion spam and analysis. In Proceedings of the international conference on Web search and web data mining, WSDM ’08, pages 219–230, New York, NY, USA. ACM. Nitin Jindal, Bing Liu, and Ee-Peng Lim. 2010. Finding unusual review patterns using unexpected rules. In Proceedings of the 19th A CM Conference on Information and Knowledge Management, pages 1549–1552. X. Li, J. Shen, X. Gao, and X. Wang. 2010. Ex- ploiting rich features for detecting hedges and their scope. In Proceedings of the Fourteenth Conference on Computational Natural Language Learning—Shared Task, pages 78–83. Association for Computational Linguistics. Ee-Peng Lim, Viet-An Nguyen, Nitin Jindal, Bing Liu, and Hady Wirawan Lauw. 2010. Detecting product review spammers using rating behaviors. In Proceedings of the 19th A CM international conference on Information and knowledge management, CIKM ’10, pages 939–948, New York, NY, USA. ACM. R. Mihalcea and C. Strapparava. 2009. The lie detector: Explorations in the automatic recognition of deceptive language. In Proceedings of the A CLIJCNLP 2009 Conference Short Papers, pages 175 309–312. Association for Computational Linguistics. Arjun Mukherjee, Bing Liu, Junhui Wang, Natalie S. Glance, and Nitin Jindal. 2011. Detecting group review spam. In Proceedings of the 20th International Conference on World Wide Web (Companion Volume), pages 93–94. Myle Ott, Yejin Choi, Claire Cardie, and Jeffrey T. Hancock. 2011. Finding deceptive opinion spam by any stretch of the imagination. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 309–319, Portland, Oregon, USA, June. Association for Computational Linguistics. J.W. Pennebaker, C.K. Chung, M. Ireland, A. Gonzales, and R.J. Booth. 2007. The development and psychometric properties of liwc2007. Austin, TX, LIWC. Net. S. Petrov and D. Klein. 2007. Improved inference for unlexicalized parsing. In Proceedings of NAA CL HLT 2007, pages 404–41 1. A. Vrij , S. Mann, S. Kristen, and R.P. Fisher. 2007. Cues to deception and ability to detect lies as a function of police interview styles. Law and human behavior, 31(5) :499–518. Ying Zhao and Justin Zobel. 2007. Searching with style: authorship attribution in classic literature. In Proceedings of the thirtieth Australasian conference on Computer science - Volume 62, ACSC ’07, pages 59–68, Darlinghurst, Australia, Australia. Australian Computer Society, Inc.