acl acl2013 acl2013-113 acl2013-113-reference knowledge-graph by maker-knowledge-mining

113 acl-2013-Derivational Smoothing for Syntactic Distributional Semantics


Source: pdf

Author: Sebastian Pado ; Jan Snajder ; Britta Zeller

Abstract: Syntax-based vector spaces are used widely in lexical semantics and are more versatile than word-based spaces (Baroni and Lenci, 2010). However, they are also sparse, with resulting reliability and coverage problems. We address this problem by derivational smoothing, which uses knowledge about derivationally related words (oldish → old) to improve semantic similarity est→imates. We develop a set of derivational smoothing methods and evaluate them on two lexical semantics tasks in German. Even for models built from very large corpora, simple derivational smoothing can improve coverage considerably.


reference text

James Allan and Giridhar Kumaran. 2003. Stemming in the Language Modeling Framework. In Proceedings of SIGIR, pages 455–456. Harald R. Baayen, Richard Piepenbrock, and Leon Gulikers. 1996. The CELEX Lexical Database. Release 2. LDC96L14. Linguistic Data Consortium, University of Pennsylvania, Philadelphia, Pennsylvania. Marco Baroni and Alessandro Lenci. 2010. Distributional Memory: A General Framework for Corpus-based Semantics. Computational Linguis- tics, 36(4):673–721. Shane Bergsma, Dekang Lin, and Randy Goebel. 2008. Discriminative Learning of Selectional Preference from Unlabeled Text. In Proceedings of EMNLP, pages 59–68, Honolulu, Hawaii. Bernd Bohnet. 2010. Top Accuracy and Fast Dependency Parsing is not a Contradiction. In Proceedings of the 23rd International Conference on Computational Linguistics, pages 89–97, Beijing, China. Stanley F. Chen and Joshua Goodman. 1999. An Empirical Study of Smoothing Techniques for Language Modeling. Computer Speech and Language, 13(4):359–394. Ido Dagan, Lillian Lee, and Fernando C. N. Pereira. 1999. Similarity-Based Models of Word Cooccurrence Probabilities. Machine Learning, 34(1–3):43– 69. Bradley Efron and Robert J. Tibshirani. 1993. An Introduction to the Bootstrap. Chapman and Hall, New York. Katrin Erk, Sebastian Pad o´, and Ulrike Pad o´. 2010. A Flexible, Corpus-driven Model of Regular and Inverse Selectional Preferences. Computational Linguistics, 36(4):723–763. Katrin Erk. 2012. Vector Space Models of Word Meaning and Phrase Meaning: A Survey. Language and Linguistics Compass, 6(10):635–653. Gertrud Faaß, Ulrich Heid, and Helmut Schmid. 2010. Design and Application of a Gold Standard for Morphological Analysis: SMOR in Validation. In Proceedings of LREC-2010, pages 803–810. Jenny Rose Finkel, Trond Grenager, and Christopher Manning. 2005. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In Proceedings of the 43rd Annual Meeting of the ACL, pages 363–370. Ronald Aylmer Fisher. 1925. Statistical methods for research workers. Oliver and Boyd, Edinburgh. Arne Fitschen. 2004. Ein computerlinguistisches Lexikon als komplexes System. Ph.D. thesis, IMS, Universit a¨t Stuttgart. Julio Gonzalo, Felisa Verdejo, Irina Chugur, and Juan M. Cigarr´ an. 1998. Indexing with WordNet Synsets Can Improve Text Retrieval. In Proceedings of the COLING/ACL Workshop on Usage of WordNet in Natural Language Processing Systems, Montr ´eal, Canada. Rochelle Lieber. 2009. Morphology and Lexical Semantics. Cambridge University Press. Saif Mohammad, Iryna Gurevych, Graeme Hirst, and Torsten Zesch. 2007. Cross-Lingual Distributional Profiles of Concepts for Measuring Semantic Distance. In Proceedings of the 2007 Joint Conference on EMNLP and CoNLL, pages 571–580, Prague, Czech Republic. Roberto Navigli and Paola Velardi. 2003. An Analysis of Ontology-based Query Expansion Strategies. In Workshop on Adaptive Text Extraction and Mining, Dubrovnik, Croatia. Sebastian Pad o´ and Jason Utt. 2012. A Distributional Memory for German. In Proceedings of KONVENS 2012 workshop on lexical-semantic resources and applications, pages 462–470, Vienna, Austria. Patrick Pantel and Dekang Lin. 2002. Discovering Word Senses from Text. In In Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 613–619. Philip Resnik. 1996. Selectional Constraints: An Information-theoretic Model and its Computational Realization. Cognition, 61(1-2): 127–159. Herbert Rubenstein and John B. Goodenough. 1965. Contextual Correlates of Synonymy. Communications of the ACM, 8(10):627–633. Peter D. Turney and Patrick Pantel. 2010. From Frequency to Meaning: Vector Space Models of Semantics. Journal of Artificial Intelligence Research, 37(1): 141–188. Ellen M. Voorhees. 1994. Query Expansion Using Lexical-semantic Relations. In Proceedings of SIGIR, pages 61–69. DeWitt Wallace and Lila Acheson Wallace. 2005. Reader’s Digest, das Beste f u¨r Deutschland. Verlag Das Beste, Stuttgart. Qin Iris Wang, Dale Schuurmans, and Dekang Lin. 2005. Strictly Lexical Dependency Parsing. In Proceedings of IWPT, pages 152–159. Britta Zeller, Jan Sˇnajder, and Sebastian Pad o´. 2013. DErivBase: Inducing and Evaluating a Derivational Morphology Resource for German. In Proceedings of ACL, Sofia, Bulgaria. Torsten Zesch, Iryna Gurevych, and Max M ¨uhlh a¨user. 2007. Comparing Wikipedia and German Wordnet by Evaluating Semantic Relatedness on Multiple Datasets. In Proceedings of NAACL/HLT, pages 205–208, Rochester, NY. 735