acl acl2011 acl2011-101 acl2011-101-reference knowledge-graph by maker-knowledge-mining

101 acl-2011-Disentangling Chat with Local Coherence Models

Source: pdf

Author: Micha Elsner ; Eugene Charniak

Abstract: We evaluate several popular models of local discourse coherence for domain and task generality by applying them to chat disentanglement. Using experiments on synthetic multiparty conversations, we show that most models transfer well from text to dialogue. Coherence models improve results overall when good parses and topic models are available, and on a constrained task for real chat data.

reference text

Paige H. Adams. 2008. Conversation Thread Extraction and Topic Detection in Text-based Chat. Ph.D. thesis, Naval Postgraduate School. David Aldous. 1985. Exchangeability and related topics. In Ecole d'Ete de Probabilities de Saint-Flour XIII 1983, pages 1–198. Springer. Paul M. Aoki, Matthew Romaine, Margaret H. Szymanski, James D. Thornton, Daniel Wilson, and Allison Woodruff. 2003. The mad hatter's cocktail party: a social mobile audio space supporting multiple simultaneous conversations. In CHI '03: Proceedings of the SIGCHI conference on Human factors in computing systems, pages 425–432, New York, NY, USA. ACM Press. Paul M. Aoki, Margaret H. Szymanski, Luke D. Plurkowski, James D. Thornton, Allison Woodruff, and Weilie Yi. 2006. Where's the “party” in “multiparty”?: analyzing the structure of small-group sociable talk. In CSCW '06: Proceedings of the 2006 20th anniversary conference on Computer supported cooperative work, pages 393–402, New York, NY, USA. ACM Press. Regina Barzilay and Mirella Lapata. 2005. Modeling local coherence: an entity-based approach. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05). Regina Barzilay and Lillian Lee. 2004. Catching the drift: Probabilistic content models, with applications to generation and summarization. In HLT-NAACL 2004: Proceedings of the Main Conference, pages 113–120. David Blei, Andrew Y. Ng, and Michael I. Jordan. 2001. Latent Dirichlet allocation. Journal of Machine Learning Research, 3:2003. Eugene Charniak and Micha Elsner. 2009. EM works for pronoun anaphora resolution. In Proceedings of EACL, Athens, Greece. Harr Chen, S.R.K. Branavan, Regina Barzilay, and David R. Karger. 2009. Global models of document structure using latent permutations. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 371– 379, Boulder, Colorado, June. Association for Computational Linguistics. Jacob Eisenstein and Regina Barzilay. 2008. Bayesian unsupervised topic segmentation. In EMNLP, pages 334–343. Micha Elsner and Eugene Charniak. 2008a. Coreference-inspired coherence modeling. In Proceedings of ACL-08: HLT, Short Papers, pages 41–44, Columbus, Ohio, June. Association for Computational Linguistics. Micha Elsner and Eugene Charniak. 2008b. You talking to me? a corpus and algorithm for conversation disentanglement. In Proceedings of ACL-08: HLT, pages 834–842, Columbus, Ohio, June. Association for Computational Linguistics. Peter Foltz, Walter Kintsch, and Thomas Landauer. 1998. The measurement of textual coherence with latent semantic analysis. Discourse Processes, 25(2&3):285–307. Jennifer Foster. 2010. “cba to check the spelling”: Investigating parser performance on discussion forum posts. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 381–384, Los Angeles, California, June. Association for Computational Linguistics. Niyu Ge, John Hale, and Eugene Charniak. 1998. A statistical approach to anaphora resolution. In Proceed- ings of the Sixth Workshop on Very Large Corpora, pages 161–171, Orlando, Florida. Harcourt Brace. Fred Glover and Manuel Laguna. 1997. Tabu Search. University of Colorado at Boulder. Barbara J. Grosz, Aravind K. Joshi, and Scott Weinstein. 1995. Centering: A framework for modeling the local coherence of discourse. Computational Linguistics, 21(2):203–225. Simon Haykin and Zhe Chen. 2005. The Cocktail Party Problem. Neural Computation, 17(9): 1875–1902. Nikiforos Karamanis, Massimo Poesio, Chris Mellish, and Jon Oberlander. 2004. Evaluating centeringbased metrics of coherence. In ACL, pages 391–398. Mirella Lapata and Regina Barzilay. 2005. Automatic evaluation of text coherence: Models and representations. In IJCAI, pages 1085–1090. Mirella Lapata. 2003. Probabilistic text structuring: Experiments with sentence ordering. In Proceedings of the annual meeting of ACL, 2003. Mirella Lapata. 2006. Automatic evaluation of information ordering: Kendall's tau. Computational Linguistics, 32(4): 1–14. 1188 Gideon Mann, Ryan McDonald, Mehryar Mohri, Nathan Silberman, and Dan Walker. 2009. Ef?cient largescale distributed training of conditional maximum entropy models. In Y. Bengio, D. Schuurmans, J. Laf- ferty, C. K. I. Williams, and A. Culotta, editors, Advances in Neural Information Processing Systems 22, pages 123 1–1239. David McClosky, Eugene Charniak, and Mark Johnson. 2006. Effective self-training for parsing. In Proceedings of the Human Language Technology Conference of the NAACL, Main Conference, pages 152–159. David McClosky, Eugene Charniak, and Mark Johnson. 2010. Automatic domain adaptation for parsing. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 28–36, Los Angeles, California, June. Association for Computational Linguistics. Eleni Miltsakaki and K. Kukich. 2004. Evaluation oftext coherence for electronic essay scoring systems. Nat. Lang. Eng., 10(1):25–55. Neville Moray. 1959. Attention in dichotic listening: Affective cues and the in?uence of instructions. Quarterly Journal of Experimental Psychology, 11(1):56– 60. Ani Nenkova and Kathleen McKeown. 2003. References to named entities: a corpus study. In NAACL '03, pages 70–72. Malvina Nissim. 2006. Learning information status of discourse entities. In Proceedings of EMNLP, pages 94–102, Morristown, NJ, USA. Association for Com- putational Linguistics. Jacki O'Neill and David Martin. 2003. Text chat in action. In GROUP '03: Proceedings of the 2003 international ACM SIGGROUP conference on Supporting group work, pages 40–49, New York, NY, USA. ACM Press. Emily Pitler and Ani Nenkova. 2008. Revisiting readability: A uni?ed framework for predicting text quality. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pages 186–195, Honolulu, Hawaii, October. Association for Computational Linguistics. Massimo Poesio, Mijail Alexandrov-Kabadjov, Renata Vieira, Rodrigo Goulart, and Olga Uryupina. 2005. Does discourse-new detection help de?nite description resolution? In Proceedings of the Sixth International Workshop on Computational Semantics, Tillburg. Amruta Purandare and Diane J. Litman. 2008. Analyzing dialog coherence using transition patterns in lexical and semantic features. In FLAIRS Conference '08, pages 195–200. Dou Shen, Qiang Yang, Jian-Tao Sun, and Zheng Chen. 2006. Thread detection in dynamic text message streams. In SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 35–42, New York, NY, USA. ACM. Radu Soricut and Daniel Marcu. 2006. Discourse generation using utility-trained coherence models. In Proceedings of the Association for Computational Linguistics Conference (ACL-2006). Lidan Wang and Douglas W. Oard. 2009. Context-based message expansion for disentanglement of interleaved text conversations. In Proceedings of NAACL-09. 1189