emnlp emnlp2011 emnlp2011-30 emnlp2011-30-reference knowledge-graph by maker-knowledge-mining

30 emnlp-2011-Compositional Matrix-Space Models for Sentiment Analysis

Source: pdf

Author: Ainur Yessenalina ; Claire Cardie

Abstract: We present a general learning-based approach for phrase-level sentiment analysis that adopts an ordinal sentiment scale and is explicitly compositional in nature. Thus, we can model the compositional effects required for accurate assignment of phrase-level sentiment. For example, combining an adverb (e.g., “very”) with a positive polar adjective (e.g., “good”) produces a phrase (“very good”) with increased polarity over the adjective alone. Inspired by recent work on distributional approaches to compositionality, we model each word as a matrix and combine words using iterated matrix multiplication, which allows for the modeling of both additive and multiplicative semantic effects. Although the multiplication-based matrix-space framework has been shown to be a theoretically elegant way to model composition (Rudolph and Giesbrecht, 2010), training such models has to be done carefully: the optimization is nonconvex and requires a good initial starting point. This paper presents the first such algorithm for learning a matrix-space model for semantic composition. In the context of the phrase-level sentiment analysis task, our experimental results show statistically significant improvements in performance over a bagof-words model.

reference text

Marco Baroni and Roberto Zamparelli. 2010. Nouns are vectors, adjectives are matrices: representing adjective-noun constructions in semantic space. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP ’ 10, pages 1183–1 193, Morristown, NJ, USA. Association for Computational Linguistics. Yoshua Bengio, J ´er ˆome Louradour, Ronan Collobert, and Jason Weston. 2009. Curriculum learning. In Proceedings of the 26th Annual International Conference on Machine Learning, ICML ’09. ACM. R. H. Byrd, P. Lu, and J. Nocedal. 1995. A limited memory algorithm for bound constrained optimization. SIAM Journal on Scientific and Statistical Computing, pages 1190–1208. Yejin Choi and Claire Cardie. 2008. Learning with compositional semantics as structural inference for subsentential sentiment analysis. In Empirical Methods in Natural Language Processing (EMNLP). Isaac G. Councill, Ryan McDonald, and Leonid Velikovich. 2010. What’s great and what’s not: learning to classify the scope of negation for improved sentiment analysis. In Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, NeSp-NLP ’ 10, Stroudsburg, PA, USA. Association for Computational Linguistics. Koby Crammer and Yoram Singer. 2001 . Pranking with ranking. In Advances in Neural Information Processing Systems 14, pages 641–647. MIT Press. Marie-Catherine de Marneffe, Christopher D. Manning, and Christopher Potts. 2010. Was it good? It was provocative. learning the meaning of scalar adjectives. In Proceedings ofthe 48thAnnual Meeting ofthe Association for Computational Linguistics, Uppsala, Sweden, July 11–16. ACL. I. S. Dhillon. 2001. Co-clustering documents and words using bipartite spectral graph partitioning. In KDD. Andrew B. Goldberg and Jerry Zhu. 2006. Seeing stars when there aren’t many stars: Graph-based semi- supervised learning for sentiment categorization. In HLT-NAACL Workshop on Textgraphs: Graph-based Algorithms for Natural Language Processing. James W. Hardin and Joseph Hilbe. 2007. Generalized Linear Models and Extensions. Stata Press. Vasileios Hatzivassiloglou and Kathleen R. McKeown. 1997. Predicting the semantic orientation of adjectives. In EACL, pages 174–181 . Alistair Kennedy and Diana Inkpen. 2006. Sentiment classification of movie reviews using contextual valence shifters. Computational Intelligence, 22(2, Special Issue on Sentiment Analysis)): 110–125. M. Pawan Kumar, Benjamin Packer, and Daphne Koller. 2010. Self-paced learning for latent variable models. In Advances in Neural Information Processing Systems 23. NIPS. D. Lee and H. Seung. 2001. Algorithms for non-negative matrix factorization. In NIPS. Jingjing Liu and Stephanie Seneff. 2009. Review sentiment scoring via a parse-and-paraphrase paradigm. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 161–169, Singapore, August. Association for Computational Linguistics. Jeff Mitchell and Mirella Lapata. 2010. Composition in distributional models of semantics. Cognitive Science, 34(8): 1388–1429. Saif Mohammad, Cody Dunne, and Bonnie Dorr. 2009. Generating high-coverage semantic orientation lexicons from overtly marked words and a thesaurus. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 599–608, Singapore, August. Association for Computational Linguistics. Karo Moilanen and Stephen Pulman. 2007. Sentiment composition. In Proceedings of Recent Advances in Natural Language Processing (RANLP 2007), pages 378–382, September 27-29. Tetsuji Nakagawa, Kentaro Inui, and Sadao Kurohashi. 2010. Dependency tree-based sentiment classification using crfs with hidden variables. In Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). Bo Pang and Lillian Lee. 2005. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the ACL, pages 115–124. Bo Pang and Lillian Lee. 2008. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1-2): 1–135. K. B. Petersen and M. S. Pedersen. ”2008”. The Matrix Cookbook. ”Technical University of Denmark”, ”oct”. ”Version 20081 110”. Livia Polanyi and Annie Zaenen. 2004. Contextual lexical valence shifters. In Proceedings of the AAAI Spring Symposium on Exploring Attitude and Affect in Text: Theories and Applications. 182 Delip Rao and Deepak Ravichandran. 2009. Semisupervised polarity lexicon induction. In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), pages 675–682, Athens, Greece, March. Association for Computational Linguistics. Sebastian Rudolph and Eugenie Giesbrecht. 2010. Compositional matrix-space models of language. In Proceedings ofthe 48thAnnual Meeting ofthe Association for Computational Linguistics, ACL ’ 10, pages 907– 916, Morristown, NJ, USA. Association for Computational Linguistics. Mostafa Shaikh, Helmut Prendinger, and Ishizuka Mitsuru. 2007. Assessing sentiment of text by semantic dependency and contextual valence analysis. Maite Taboada, Julian Brooke, Milan Tofiloskiy, and Kimberly Vollz. 2011). Lexicon-based methods for sentiment analysis. In Computational Linguistics. Leonid Velikovich, Sasha Blair-Goldensohn, Kerry Hannan, and Ryan McDonald. 2010. The viability ofwebderived polarity lexicons. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 777–785, Los Angeles, California, June. Association for Computational Linguis- tics. Janyce Wiebe, Theresa Wilson, and Claire Cardie. 2005. Annotating expressions of opinions and emotions in language. Language Resources and Evaluation (formerly Computers and the Humanities), 39(2/3): 164– 210. Janyce M. Wiebe. 2000. Learning subjective adjectives from corpora. In In AAAI, pages 735–740. Theresa Wilson, Janyce Wiebe, and Rebecca Hwa. 2004. Just how mad are you? In AAAI. AAAI. Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. 2005. Recognizing contextual polarity in phrase-level sentiment analysis. In Empirical Methods in Natural Language Processing (EMNLP).