22 acl-2013-A Structured Distributional Semantic Model for Event Co-reference

Source: pdf

Author: Kartik Goyal ; Sujay Kumar Jauhar ; Huiying Li ; Mrinmaya Sachan ; Shashank Srivastava ; Eduard Hovy

Abstract: In this paper we present a novel approach to modelling distributional semantics that represents meaning as distributions over relations in syntactic neighborhoods. We argue that our model approximates meaning in compositional configurations more effectively than standard distributional vectors or bag-of-words models. We test our hypothesis on the problem of judging event coreferentiality, which involves compositional interactions in the predicate-argument structure of sentences, and demonstrate that our model outperforms both state-of-the-art window-based word embeddings and simple approaches to compositional semantics previously employed in the literature.

reference text

Marco Baroni and Alessandro Lenci. 2010. Distributional memory: A general framework for corpus-based semantics. Comput. Linguist., 36(4):673–721, December.
Marco Baroni and Roberto Zamparelli. 2010. Nouns are vectors, adjectives are matrices: representing adjective-noun constructions in semantic space. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP ’10, pages 1183–1193, Stroudsburg, PA, USA. Association for Computational Linguistics.
Cosmin Adrian Bejan and Sanda Harabagiu. 2010. Unsupervised event coreference resolution with rich linguistic features. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL ’10, pages 1412–1422, Stroudsburg, PA, USA. Association for Computational Linguistics.
Massimiliano Ciaramita and Yasemin Altun. 2006. Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, EMNLP ’06, pages 594–602, Stroudsburg, PA, USA. Association for Computational Linguistics.
Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural language processing (almost) from scratch. J. Mach. Learn. Res., 12:2493–2537, November.
Georgiana Dinu and Mirella Lapata. 2010. Measuring distributional similarity in context. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP ’10, pages 1162–1172, Stroudsburg, PA, USA. Association for Computational Linguistics.
Katrin Erk and Sebastian Padó. 2008. A structured vector space model for word meaning in context. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’08, pages 897–906, Stroudsburg, PA, USA. Association for Computational Linguistics.
Katrin Erk. 2007. A simple, similarity-based model for selectional preferences.
Christiane Fellbaum. 1998. WordNet: An Electronic Lexical Database. Bradford Books.
John R. Firth. 1957. A Synopsis of Linguistic Theory, 1930-1955. Studies in Linguistic Analysis, pages 1–32.
Kartik Goyal, Sujay Kumar Jauhar, Mrinmaya Sachan, Shashank Srivastava, Huiying Li, and Eduard Hovy. 2013. A structured distributional semantic model: Integrating structure with semantics. In Proceedings of the 1st Continuous Vector Space Models and their Compositionality Workshop at the conference of ACL 2013.
Edward Grefenstette, Mehrnoosh Sadrzadeh, Stephen Clark, Bob Coecke, and Stephen Pulman. 2011. Concrete sentence spaces for compositional distributional models of meaning. In Proceedings of the Ninth International Conference on Computational Semantics, IWCS ’11, pages 125–134, Stroudsburg, PA, USA. Association for Computational Linguistics.
Emiliano Guevara. 2010. A regression model of adjective-noun compositionality in distributional semantics. In Proceedings of the 2010 Workshop on GEometrical Models of Natural Language Semantics, GEMS ’10, pages 33–37, Stroudsburg, PA, USA. Association for Computational Linguistics.
Aria Haghighi and Dan Klein. 2009. Simple coreference resolution with rich syntactic and semantic features. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3, EMNLP ’09, pages 1152–1161, Stroudsburg, PA, USA. Association for Computational Linguistics.
E.H. Hovy, T. Mitamura, M.F. Verdejo, J. Araki, and A. Philpot. 2013. Events are not simple: Identity, non-identity, and quasi-identity. In Proceedings of the 1st Events Workshop at the conference of the HLT-NAACL 2013.
Heeyoung Lee, Marta Recasens, Angel Chang, Mihai Surdeanu, and Dan Jurafsky. 2012. Joint entity and event coreference resolution across documents. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL ’12, pages 489–500, Stroudsburg, PA, USA. Association for Computational Linguistics.
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. 2008. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA.
Diana McCarthy and John Carroll. 2003. Disambiguating nouns, verbs, and adjectives using automatically acquired selectional preferences. Comput. Linguist., 29(4):639–654, December.
Diana McCarthy, Rob Koeling, Julie Weeds, and John Carroll. 2004. Finding predominant word senses in untagged text. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, ACL ’04, Stroudsburg, PA, USA. Association for Computational Linguistics.
Jeff Mitchell and Mirella Lapata. 2008. Vector-based models of semantic composition. In Proceedings of ACL-08: HLT, pages 236–244.
Patrick Pantel and Dekang Lin. 2000. Word-for-word glossing with contextually similar words. In Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference, NAACL 2000, pages 78–85, Stroudsburg, PA, USA. Association for Computational Linguistics.
Karthik Raghunathan, Heeyoung Lee, Sudarshan Rangarajan, Nathanael Chambers, Mihai Surdeanu, Dan Jurafsky, and Christopher Manning. 2010. A multipass sieve for coreference resolution. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP ’10, pages 492–501, Stroudsburg, PA, USA. Association for Computational Linguistics.
Sebastian Rudolph and Eugenie Giesbrecht. 2010. Compositional matrix-space models of language. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL ’10, pages 907–916, Stroudsburg, PA, USA. Association for Computational Linguistics.
Hinrich Schütze. 1998. Automatic word sense discrimination. Comput. Linguist., 24(1):97–123.
Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Probabilistic models of similarity in syntactic context. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pages 1047–1057, Stroudsburg, PA, USA. Association for Computational Linguistics.
Richard Socher, Brody Huval, Christopher D. Manning, and Andrew Y. Ng. 2012. Semantic compositionality through recursive matrix-vector spaces. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL ’12, pages 1201–1211, Stroudsburg, PA, USA. Association for Computational Linguistics.
Veselin Stoyanov, Nathan Gilbert, Claire Cardie, and Ellen Riloff. 2009. Conundrums in noun phrase coreference resolution: making sense of the state-of-the-art. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2, ACL ’09, pages 656–664, Stroudsburg, PA, USA. Association for Computational Linguistics.
Stefanie Tellex, Boris Katz, Jimmy J. Lin, Aaron Fernandes, and Gregory Marton. 2003. Quantitative evaluation of passage retrieval algorithms for question answering. In SIGIR, pages 41–47.
Stefan Thater, Hagen Fürstenau, and Manfred Pinkal. 2010. Contextualizing semantic representations using syntactically enriched vector models. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL ’10, pages 948–957, Stroudsburg, PA, USA. Association for Computational Linguistics.
Stephen Tratz and Eduard Hovy. 2011. A fast, accurate, non-projective, semantically-enriched parser. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pages 1257–1268, Stroudsburg, PA, USA. Association for Computational Linguistics.
Hila Weisman, Jonathan Berant, Idan Szpektor, and Ido Dagan. 2012. Learning verb inference rules from linguistically-motivated evidence. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL ’12, pages 194–204, Stroudsburg, PA, USA. Association for Computational Linguistics.
S. K. M. Wong and Vijay V. Raghavan. 1984. Vector space model of information retrieval: a reevaluation. In Proceedings of the 7th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’84, pages 167–185, Swinton, UK. British Computer Society.