emnlp emnlp2010 emnlp2010-102 emnlp2010-102-reference knowledge-graph by maker-knowledge-mining

102 emnlp-2010-Summarizing Contrastive Viewpoints in Opinionated Text


Source: pdf

Author: Michael Paul ; ChengXiang Zhai ; Roxana Girju

Abstract: This paper presents a two-stage approach to summarizing multiple contrastive viewpoints in opinionated text. In the first stage, we use an unsupervised probabilistic approach to model and extract multiple viewpoints in text. We experiment with a variety of lexical and syntactic features, yielding significant performance gains over bag-of-words feature sets. In the second stage, we introduce Comparative LexRank, a novel random walk formulation to score sentences and pairs of sentences from opposite viewpoints based on both their representativeness of the collection as well as their contrastiveness with each other. Exper- imental results show that the proposed approach can generate informative summaries of viewpoints in opinionated text.


reference text

David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993–1022. Samuel Brody and Noemie Elhadad. 2010. An unsupervised aspect-sentiment model for online reviews. In NAACL ’10. Jaime Carbonell and Jade Goldstein. 1998. The use of mmr, diversity-based reranking for reordering documents and producing summaries. In SIGIR ’98, pages 335–336. Dennis Chong and James N. Druckman. 2010. Identifying frames in political news. In Erik P. Bucy and R. Lance Holbert, editors, Sourcebook for Political Communication Research: Methods, Measures, and Analytical Techniques. Routledge. Cindy Chung and James W. Pennebaker. 2007. The psychological function of function words. Social Communication: Frontiers of Social Psychology, pages 343– 359. G u¨nes Erkan and Dragomir R. Radev. 2004. Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Int. Res., 22(1):457–479. Stephan Greene and Philip Resnik. 2009. More than words: syntactic packaging and implicit sentiment. In NAACL ’09, pages 503–51 1. Aria Haghighi and Lucy Vanderwende. 2009. Exploring content models for multi-document summarization. In NAACL ’09, pages 362–370. Sanda Harabagiu, Andrew Hickl, and Finley Lacatusu. 2006. Negation, contrast and contradiction in text processing. Minqing Hu and Bing Liu. 2004. Mining opinion features in customer reviews. In Proceedings of AAAI, pages 755–760. Minqing Hu and Bing Liu. 2006. Opinion extraction and summarization on the Web. In Proceedings of the 21st National Conference on Artificial Intelligence (AAAI2006), Nectar Paper Track, Boston, MA. Jeffrey M. Jones. 2010. “in u.s., 45% favor, 48% oppose obama healthcare plan”, March. Mahesh Joshi and Carolyn Penstein Ros e´. 2009. Generalizing dependency features for opinion mining. In ACL-IJCNLP ’09: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pages 313–316. Hyun Duk Kim and ChengXiang Zhai. 2009. Generating comparative summaries of contradictory opinions in text. In CIKM ’09: Proceeding of the 18th ACM conference on Information and knowledge management, pages 385–394, New York, NY, USA. ACM. Kevin Lerman and Ryan McDonald. 2009. Contrastive summarization: an experiment with consumer reviews. 76 In NAACL ’09, pages 113–1 16, Morristown, NJ, USA. Association for Computational Linguistics. Wei-Hao Lin, Theresa Wilson, Janyce Wiebe, and Alexander Hauptmann. 2006. Which side are you on?: identifying perspectives at the document and sentence levels. In CoNLL-X ’06: Proceedings of the Tenth Conference on Computational Natural Language Learning, pages 109–1 16. Wei-Hao Lin, Eric Xing, and Alexander Hauptmann. 2008. A joint topic and perspective model for ideological discourse. In ECML PKDD ’08: Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II, pages 17–32, Berlin, Heidelberg. Springer-Verlag. Chin-Yew Lin. 2004. Rouge: A package for automatic evaluation of summaries. In Stan Szpakowicz MarieFrancine Moens, editor, Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, pages 74– 81, Barcelona, Spain, July. Bing Liu, Minqing Hu, and Junsheng Cheng. 2005. Opinion observer: analyzing and comparing opinions on the web. In WWW ’05: Proceedings of the 14th international conference on World Wide Web, pages 342–351, New York, NY, USA. ACM Press. Marie-Catherine De Marneffe and Christopher Manning. 2008. Stanford typed dependencies manual. Technical report, Stanford University. Marie-Catherine De Marneffe, Anna Rafferty, and Christopher Manning. 2008. Finding contradictions in text. In Proceedings of the Association for Compu- tational Linguistics Conference (ACL). Ryan McDonald. 2007. A study of global inference algorithms in multi-document summarization. In ECIR ’07: Proceedings of the 29th European conference on IR research, pages 557–564, Berlin, Heidelberg. Springer-Verlag. Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1998. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project. Michael Paul and Roxana Girju. 2010. A twodimensional topic-aspect model for discovering multifaceted topics. In AAAI-2010: Twenty-Fourth Conference on Artificial Intelligence. Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. 2005. Recognizing contextual polarity in phraselevel sentiment analysis. In HLT ’05: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 347–354. Li Zhuang, Feng Jing, Xiao-yan Zhu, and Lei Zhang. 2006. Movie review mining and summarization. In Proceedings of the ACM SIGIR Conference on Information and Knowledge Management (CIKM).