198 acl-2011-Latent Semantic Word Sense Induction and Disambiguation

Source: pdf

Author: Tim Van de Cruys ; Marianna Apidianaki

Abstract: In this paper, we present a unified model for the automatic induction of word senses from text, and the subsequent disambiguation of particular word instances using the automatically extracted sense inventory. The induction step and the disambiguation step are based on the same principle: words and contexts are mapped to a limited number of topical dimensions in a latent semantic word space. The intuition is that a particular sense is associated with a particular topic, so that different senses can be discriminated through their association with particular topical dimensions; in a similar vein, a particular instance of a word can be disambiguated by determining its most important topical dimensions. The model is evaluated on the SEMEVAL-2010 word sense induction and disambiguation task, on which it reaches state-of-the-art results.
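
The principle stated in the abstract (contexts mapped to a small number of topical dimensions, with an instance disambiguated by its strongest dimension) can be illustrated with a minimal sketch. The abstract does not name the factorization technique; the sketch assumes non-negative matrix factorization with the multiplicative updates of Lee and Seung (2000), which appears in the reference list. The toy count matrix, the vocabulary, and the function names `nmf` and `dominant_dimension` are all hypothetical and are not taken from the paper.

```python
import numpy as np

def nmf(V, k, iters=500, seed=0, eps=1e-9):
    """Factorize a non-negative matrix V (n x m) as W (n x k) @ H (k x m)
    with multiplicative updates for the squared-error objective."""
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def dominant_dimension(context_counts, H, iters=500, eps=1e-9):
    """Fold a new context vector (length m) into the fixed basis H and
    return the index of its strongest dimension plus all weights."""
    k = H.shape[0]
    w = np.full(k, 1.0 / k)
    for _ in range(iters):
        w *= (H @ context_counts) / (H @ H.T @ w + eps)
    return int(np.argmax(w)), w

# Hypothetical toy data: four instances of "bank" (rows) counted against
# six context words (columns): money, loan, account, river, water, shore.
V = np.array([
    [3, 2, 2, 0, 0, 0],
    [2, 3, 1, 0, 0, 0],
    [0, 0, 0, 3, 2, 2],
    [0, 0, 0, 2, 2, 3],
], dtype=float)

W, H = nmf(V, k=2)

# Induction (simplified): label each instance by its strongest dimension.
instance_dims = W.argmax(axis=1)

# Disambiguation: map unseen contexts onto the same dimensions.
finance_dim, _ = dominant_dimension(np.array([1., 1., 0., 0., 0., 0.]), H)
river_dim, _ = dominant_dimension(np.array([0., 0., 0., 0., 1., 1.]), H)
print(instance_dims, finance_dim, river_dim)
```

In this simplified setting each latent dimension is treated as one sense. How the paper's model groups dimensions into senses and how it weights contexts are not described in the abstract, so those steps are left out here.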

reference text

Eneko Agirre and Aitor Soroa. 2007. SemEval-2007 Task 02: Evaluating word sense induction and discrimination systems. In Proceedings of the fourth International Workshop on Semantic Evaluations (SemEval), ACL, pages 7–12, Prague, Czech Republic.

Eneko Agirre, David Martínez, Ojer López de Lacalle, and Aitor Soroa. 2006. Two graph-based algorithms for state-of-the-art WSD. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-06), pages 585–593, Sydney, Australia.

Marianna Apidianaki and Tim Van de Cruys. 2011. A Quantitative Evaluation of Global Word Sense Induction. In Proceedings of the 12th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), published in Springer Lecture Notes in Computer Science (LNCS), volume 6608, pages 253–264, Tokyo, Japan.

Javier Artiles, Enrique Amigó, and Julio Gonzalo. 2009. The role of named entities in web people search. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-09), pages 534–542, Singapore.

Stefan Bordag. 2006. Word sense induction: Triplet-based clustering and automatic evaluation. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-06), pages 137–144, Trento, Italy.

Zellig S. Harris. 1954. Distributional structure. Word, 10(2-3):146–162.

Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw, and Ralph Weischedel. 2006. Ontonotes: the 90% solution. In Proceedings of the Human Language Technology / North American Association of Computational Linguistics conference (HLT-NAACL06), pages 57–60, New York, NY.

Nancy Ide and Yorick Wilks. 2007. Making Sense About Sense. In Eneko Agirre and Philip Edmonds, editors, Word Sense Disambiguation, Algorithms and Applications, pages 47–73. Springer.

Thomas Landauer and Susan Dumais. 1997. A solution to Plato’s problem: The Latent Semantic Analysis theory of the acquisition, induction, and representation of knowledge. Psychological Review, 104:211–240.

Thomas Landauer, Peter Foltz, and Darrell Laham. 1998. An Introduction to Latent Semantic Analysis. Discourse Processes, 25:259–284.

Daniel D. Lee and H. Sebastian Seung. 2000. Algorithms for non-negative matrix factorization. In Advances in Neural Information Processing Systems, volume 13, pages 556–562.

Dekang Lin. 1998. Automatic Retrieval and Clustering of Similar Words. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (COLING-ACL98), volume 2, pages 768–774, Montreal, Quebec, Canada.

Suresh Manandhar, Ioannis P. Klapaftis, Dmitriy Dligach, and Sameer S. Pradhan. 2010. SemEval-2010 Task 14: Word Sense Induction & Disambiguation. In Proceedings of the fifth International Workshop on Semantic Evaluation (SemEval), ACL-10, pages 63–68, Uppsala, Sweden.

Roberto Navigli. 2009. Word Sense Disambiguation: a Survey. ACM Computing Surveys, 41(2):1–69.

Joakim Nivre, Johan Hall, and Jens Nilsson. 2006. Maltparser: A data-driven parser-generator for dependency parsing. In Proceedings of the fifth International Conference on Language Resources and Evaluation (LREC-06), pages 2216–2219, Genoa, Italy.

Sebastian Padó and Mirella Lapata. 2007. Dependency-based construction of semantic space models. Computational Linguistics, 33(2):161–199.

Patrick Pantel and Dekang Lin. 2002. Discovering word senses from text. In ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 613–619, Edmonton, Alberta, Canada.

Ted Pedersen. 2010. Duluth-WSI: SenseClusters Applied to the Sense Induction Task of SemEval-2. In Proceedings of the fifth International Workshop on Semantic Evaluations (SemEval-2010), pages 363–366, Uppsala, Sweden.

Amruta Purandare and Ted Pedersen. 2004. Word Sense Discrimination by Clustering Contexts in Vector and Similarity Spaces. In Proceedings of the Conference on Computational Natural Language Learning (CoNLL), pages 41–48, Boston, MA.

Andrew Rosenberg and Julia Hirschberg. 2007. V-measure: A conditional entropy-based external cluster evaluation measure. In Proceedings of the Joint 2007 Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pages 410–420, Prague, Czech Republic.

Hinrich Schütze. 1998. Automatic Word Sense Discrimination. Computational Linguistics, 24(1):97–123.

Kristina Toutanova and Christopher D. Manning. 2000. Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC-2000), pages 63–70.

Kristina Toutanova, Dan Klein, Christopher Manning, and Yoram Singer. 2003. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network. In Proceedings of the Human Language Technology / North American Association of Computational Linguistics conference (HLT-NAACL-03), pages 252–259, Edmonton, Canada.

Tim Van de Cruys. 2008. Using Three Way Data for Word Sense Discrimination. In Proceedings of the 22nd International Conference on Computational Linguistics (COLING-08), pages 929–936, Manchester, UK.

Jean Véronis. 2004. Hyperlex: lexical cartography for information retrieval. Computer Speech & Language, 18(3):223–252.

Dominic Widdows and Beate Dorow. 2002. A Graph Model for Unsupervised Lexical Acquisition. In Proceedings of the 19th International Conference on Computational Linguistics (COLING-02), pages 1093–1099, Taipei, Taiwan.