Classifying Sentences as Speech Acts in Message Board Posts (EMNLP 2011)


Source: pdf

Authors: Ashequl Qadir; Ellen Riloff

Abstract: This research studies the text genre of message board forums, which contain a mixture of expository sentences that present factual information and conversational sentences that include communicative acts between the writer and readers. Our goal is to create sentence classifiers that can identify whether a sentence contains a speech act, and can recognize sentences containing four different speech act classes: Commissives, Directives, Expressives, and Representatives. We conduct experiments using a wide variety of features, including lexical and syntactic features, speech act word lists from external resources, and domain-specific semantic class features. We evaluate our results on a collection of message board posts in the domain of veterinary medicine.


reference text

James Allen. 1995. Natural language understanding (2nd ed.). Benjamin-Cummings Publishing Co., Inc., Redwood City, CA, USA.

Jean Carletta. 1996. Assessing agreement on classification tasks: the kappa statistic. Comput. Linguist., 22:249–254, June.

Vitor R. Carvalho and William W. Cohen. 2005. On the collective classification of email “speech acts”. In SIGIR ’05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 345–352, New York, NY, USA. ACM Press.

Vitor R. Carvalho and William W. Cohen. 2006. Improving “email speech acts” analysis via n-gram selection. In Proceedings of the HLT-NAACL 2006 Workshop on Analyzing Conversations in Text and Speech, ACTS ’09, pages 35–41, Stroudsburg, PA, USA. Association for Computational Linguistics.

William W. Cohen, Vitor R. Carvalho, and Tom M. Mitchell. 2004. Learning to classify email into “speech acts”. In EMNLP, pages 309–316. ACL.

Jade Goldstein and Roberta Evans Sabin. 2006. Using speech acts to categorize email and identify email genres. In Proceedings of the 39th Annual Hawaii International Conference on System Sciences - Volume 03, pages 50.2–, Washington, DC, USA. IEEE Computer Society.

Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H. Witten. 2009. The weka data mining software: an update. SIGKDD Explor. Newsl., 11:10–18, November.

Ruihong Huang and Ellen Riloff. 2010. Inducing domain-specific semantic class taggers from (almost) nothing. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL ’10, pages 275–285, Stroudsburg, PA, USA. Association for Computational Linguistics.

Minwoo Jeong, Chin-Yew Lin, and Gary Geunbae Lee. 2009. Semi-supervised speech act recognition in emails and forums. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3, EMNLP ’09, pages 1250–1259, Stroudsburg, PA, USA. Association for Computational Linguistics.

Andrew Lampert, Robert Dale, and Cecile Paris. 2006. Classifying speech acts using verbal response modes. In Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), pages 34–41. Sydney, Australia: ALTA.

Tara McIntosh. 2010. Unsupervised discovery of negative categories in lexicon bootstrapping. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP ’10, pages 356–365, Stroudsburg, PA, USA. Association for Computational Linguistics.

John Mildinhall and Jan Noyes. 2008. Toward a stochastic speech act model of email behavior. In CEAS.

Jacqueline Nastri, Jorge Pena, and Jeffrey T. Hancock. 2006. The construction of away messages: A speech act analysis. J. Computer-Mediated Communication, pages 1025–1045.

Sujith Ravi and Jihie Kim. 2007. Profiling student interactions in threaded discussions with speech act classifiers. In Proceeding of the 2007 conference on Artificial Intelligence in Education: Building Technology Rich Learning Contexts That Work, pages 357–364, Amsterdam, The Netherlands. IOS Press.

Ellen Riloff, Janyce Wiebe, and Theresa Wilson. 2003. Learning subjective nouns using extraction pattern bootstrapping. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4, CONLL ’03, pages 25–32, Stroudsburg, PA, USA. Association for Computational Linguistics.

John R. Searle. 1976. A classification of illocutionary acts. Language in Society, 5(1):1–23.

Michael Thelen and Ellen Riloff. 2002. A bootstrapping method for learning semantic lexicons using extraction pattern contexts. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10, EMNLP ’02, pages 214–221, Stroudsburg, PA, USA. Association for Computational Linguistics.

Kristina Toutanova, Dan Klein, Christopher D. Manning, and Yoram Singer. 2003. Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1, NAACL ’03, pages 173–180, Stroudsburg, PA, USA. Association for Computational Linguistics.

Douglas P. Twitchell and Jay F. Nunamaker Jr. 2004. Speech act profiling: a probabilistic method for analyzing persistent conversations and their participants. In System Sciences, 2004. Proceedings of the 37th Annual Hawaii International Conference on, pages 1–10, January.

Douglas P. Twitchell, Mark Adkins, Jay F. Nunamaker Jr., and Judee K. Burgoon. 2004. Using speech act theory to model conversations for automated classification and retrieval. In Proceedings of the International Working Conference Language Action Perspective Communication Modelling (LAP 2004), pages 121–130.

A. Wierzbicka. 1987. English speech act verbs: a semantic dictionary. Academic Press, Sydney, Orlando.