emnlp emnlp2010 emnlp2010-103 emnlp2010-103-reference knowledge-graph by maker-knowledge-mining

103 emnlp-2010-Tense Sense Disambiguation: A New Syntactic Polysemy Task

Source: pdf

Author: Roi Reichart ; Ari Rappoport

Abstract: Polysemy is a major characteristic of natural languages. Like words, syntactic forms can have several meanings. Understanding the correct meaning of a syntactic form is of great importance to many NLP applications. In this paper we address an important type of syntactic polysemy the multiple possible senses of tense syntactic forms. We make our discussion concrete by introducing the task of Tense Sense Disambiguation (TSD): given a concrete tense syntactic form present in a sentence, select its appropriate sense among a set of possible senses. Using English grammar textbooks, we compiled a syntactic sense dictionary comprising common tense syntactic forms and semantic senses for each. We annotated thousands of BNC sentences using the – defined senses. We describe a supervised TSD algorithm trained on these annotations, which outperforms a strong baseline for the task.

reference text

Eneko Agirre and Philip Edmonds (Eds). 2006. Word Sense Disambiguation: Algorithms and Applications. Springer Verlag. Timothy Baldwin, Mark Dras, Julia Hockenmaier, Tracy Holloway King, and Gertjan van Noord. 2007. The Impact of Deep Linguistic Processing on Parsing Technology. IWPT ’07. Douglas Biber, Stig Johansson, Geoffrey Leech, Susan Conard, Edward Finegan. 1999. Longman Grammar of Spoken and Written English. Longman. Samuel Brody, Roberto Navigli and Mirella Lapata. 2006. Ensemble Methods for Unsupervised WSD. ACL-COLING ’06. Lou Burnard. 2000. The British National Corpus User Reference Guide. Technical Report, Oxford University. Nathanael Chambers and Dan Jurafsky. 2008. Jointly Combining Implicit Constraints Improves Temporal Ordering. EMNLP ’08. Michelle Cutrer. 1994. Time and Tense in Narratives and in Everyday Language. PhD dissertation, University of California at San Diego. Ido Dagan, Oren Glickman and Bernardo Magnini. 2006. The PASCAL Recognising Textual Entailment Challenge. Lecture Notes in Computer Science 2006, 3944: 177-190. John Dinsmore. 1991 . Partitioned representations. Dordrecht, Netherlands: Kluwer. Bonnie Dorr. 1992. A Two-Level Knowledge Representation for Machine Translation: Lexical Semantics and Tense/Aspect. In James Pustejovsky and Sabine Bergler, editors, Lexical Semantics and Knowledge Representation. Yair Even-Zohar and Dan Roth. 2001 . A Sequential Model for Multi-Class Classification. EMNLP ’01. Gilles Fauconnier. 2007. Mental Spaces. in Dirk Geeraerts and Hubert Cuyckens, editors, The Oxford Handbook of Cognitive Linguistics. Radu Florian and David Yarowsky. 2002. Modeling Consensus: Classifier Combination for Word Sense Disambiguation. EMNLP ’02. Adele E. Goldberg. 1995. Constructions: A Construction Grammar Approach to Argument Structure. University of Chicago Press. Jan Hajic. 1998. Building a Syntactically Annotated Corpus: The Prague Dependency Treebank. Issues of Valency and Meaning, 106–132. Martin Hewings. 2005. Advanced Grammar in Use, Second Edition. Cambridge University University. Mirella Lapata and Alex Lascarides. 2006. Learning Sentence-internal Temporal Relations. Journal of Artificial Intelligence Research, 27:85–1 17. Nick Littlestone. 1988. Learning Quickly When Irrelevant Attributes Abound: A New Linear-threshold Algorithm. Machine Learning, 285–3 18. David MacKay. 2002. Information Theory, Inference and Learning Algorithms. Cambridge University Press. Mitchell P. Marcus, Beatrice Santorini and Mary Ann Marcinkiewicz. 1993. Building a Large Annotated Corpus of English: The Penn Treebank. Computational Linguistics, 19(2):3 13–330. David Martinez, Eneko Agirre, Lluis Marquez. 2002. Syntactic Features for High Precision Word Sense Disambiguation. COLING ’02. Rada Mihalcea and Ted Pedersen. 2005. Advances in Word Sense Disambiguation. Tutorial in ACL ’05. Raymond J. Mooney. 1996. Comparative Experiments on Disambiguating Word Senses: An Illustration of the Role of Bias in Machine Learning. EMNLP ’96. Masaki Murata, Qing Ma, Kiyotaka Uchimoto, Toshiyuki Kanamaru and Hitoshi Isahara. 2007. Japanese-toEnglish translations of Tense, Aspect, and Modality Using Machine-Learning Methods and Comparison with Cachine-Translation Systems on Market. LREC ’07. 334 Raymond Murphy. 1994. English Grammar In Use, Second Edition. Cambridge University Press. Raymond Murphy. 2007. Essential Grammar In Use, Third Edition. Cambridge University Press. Roberto Navigli. 2009. Word Sense Disambiguation: a Survey. ACM Computing Surveys, 41(2) 1–69. Rebecca J. Passonneau. 1988. A Computational Model of Semantics of Tenses and Aspect. Computational Linguistics, 14(2):44–60. Ted Pedersen. 2000. A Simple Approach to Building Ensembles of Naive Bayesian Classifiers for Word Sense Disambiguation. NAACL ’00. Adwait Ratnaparkhi. 1996. A Maximum Entropy PartOf-Speech Tagger. EMNLP ’06. Dan Roth. 1998. Learning to Resolve Natural Language Ambiguities: A Unified Approach. AAAI ’98. Michael Schiehlen. 2000. Granularity Effects in Tense Translation. COLING ’00. Marc Verhagen, Robert Gaizauskas, Frank Schilder, Mark Hepple, Graham Katz, and James Pustejovsky. 2007. SemEval-2007 Task 15: TempEval Temporal Relation Identification. ACL ’07. Takaaki Tanaka, Francis Bond, Timothy Baldwin, Sanae Fujita and Chikara Hashimoto. 2007. Word Sense Disambiguation Incorporating Lexical and Structural Semantic Information. EMNLP-CoNLL ’07. Dave Willis and Jon Wright. 2003. Collins Cobuild Elementary English Grammar, Second Edition. HarperCollins Publishers. Dave Willis. 2004. Collins CobuildIntermediate English Grammar, Second Edition. HarperCollins Publishers. Yang Ye, Victoria Li Fossum and Steven Abney. 2006. Latent Features in Automatic Tense Translation between Chinese and English. SIGHAN ’06. Yang Ye and Zhu Zhang. 2005. Tense Tagging for Verbs in Cross-Lingual Context: A Case Study. IJCNLP ’05. Harry Zhang. 2004. The Optimality of Naive Bayes. FLAIRS ’04.