
Domain-Specific Coreference Resolution with Lexicalized Features (ACL 2013)


Authors: Nathan Gilbert; Ellen Riloff

Abstract: Most coreference resolvers rely heavily on string matching, syntactic properties, and semantic attributes of words, but they lack the ability to make decisions based on individual words. In this paper, we explore the benefits of lexicalized features in the setting of domain-specific coreference resolution. We show that adding lexicalized features to off-the-shelf coreference resolvers yields significant performance gains on four domain-specific data sets and with two types of coreference resolution architectures.

reference text

ACE03. 2003. NIST ACE evaluation website. In http://www.nist.gov/speech/tests/ace/2003.
ACE04. 2004. NIST ACE evaluation website. In http://www.nist.gov/speech/tests/ace/2004.
ACE05. 2005. NIST ACE evaluation website. In http://www.nist.gov/speech/tests/ace/2005.
Amit Bagga and Breck Baldwin. 1998. Entity-based cross-document coreference using the Vector Space Model. Proceedings of the 17th international conference on Computational Linguistics (COLING).
Riza Theresa Batista-Navarro and Sophia Ananiadou. 2011. Building a coreference-annotated corpus from the domain of biochemistry. In Proceedings of BioNLP 2011 Workshop, BioNLP '11, pages 83–91.
David Bean and Ellen Riloff. 2004. Unsupervised learning of Contextual Role Knowledge for coreference resolution. Proceedings of the HLT/NAACL 2004.
Eric Bengston and Dan Roth. 2008. Understanding the value of features for coreference resolution. Empirical Methods in Natural Language Processing.
Anders Björkelund and Pierre Nugues. 2011. Exploring lexicalized features for coreference resolution. Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, pages 45–50.
Nadjet Bouayad-Agha, Gerard Casamayor, Gabriela Ferraro, Simon Mille, Vanesa Vidal, and Leo Wanner. 2009. Improving the comprehension of legal documentation: the case of patent claims. In Proceedings of the 12th International Conference on Artificial Intelligence and Law, pages 78–87.
José Castaño, Jason Zhang, and James Pustejovsky. 2002. Anaphora resolution in biomedical literature. International Symposium on Reference Resolution.
Radu Florian, Hany Hassan, Abraham Ittycheriah, Hongyan Jing, Nanda Kambhatla, Xiaoqiang Luo, Nicolas Nicolov, Salim Roukos, and T Zhang. 2004. A statistical model for multilingual entity detection and tracking. HLT-NAACL.
Caroline Gasperin and Ted Briscoe. 2008. Statistical anaphora resolution in biomedical texts. Proceedings of the 22nd Annual Conference on Computational Linguistics, pages 257–264.
Demetrios G. Glinos. 2011. A search based method for clinical text coreference resolution. In Proceedings of the Fifth i2b2/VA Track on Challenges in Natural Language Processing for Clinical Data (i2b2 2011).
Phil Gooch and Abdul Roudsari. 2012. Lexical patterns, features and knowledge resources for coreference resolution in clinical notes. Journal of Biomedical Informatics, 45.
Tian Ye He. 2007. Coreference resolution on entities and events for hospital discharge summaries. Ph.D. thesis, Massachusetts Institute of Technology.
Lynette Hirschman. 1997. Proceedings of MUC-7. MUC-7 task definition.
Youngjun Kim, Ellen Riloff, and Nathan Gilbert. 2011. The taming of Reconcile as a Biomedical coreference resolver. ACL/HLT 2011 Workshop on Biomedical Natural Language Processing (BioNLP 2011) Shared Task Paper.
Tyne Liang and Yu-Hsiang Lin. 2005. Anaphora resolution for biomedical literature by exploiting multiple resources. Natural Language Processing – IJCNLP 2005, pages 742–753.
Vincent Ng and Claire Cardie. 2002. Improving machine learning approaches to coreference resolution. Proceedings of the 40th Annual Meeting of the ACL, pages 104–111.
Vincent Ng. 2007. Shallow semantics for coreference resolution. Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI-07), pages 1689–1694.
Fortunato Pesarin. 2001. Multivariate permutation tests: with applications in biostatistics, volume 240. Wiley Chichester.
Sameer S. Pradhan, Lance Ramshaw, Ralph Weischedel, Jessice MacBride, and Linnea Micciulla. 2007. Unrestricted coreference: Identifying entities and events in ontonotes. In Proceedings of the International Conference on Semantic Computing.
Karthik Raghunathan, Heeyoung Lee, Sudarshan Rangarajan, Nathanael Chambers, Mihai Surdeanu, Dan Jurafsky, and Christopher Manning. 2010. A Multi-Pass Sieve for coreference resolution. Empirical Methods in Natural Language Processing 2010.
Altaf Rahman and Vincent Ng. 2011a. Coreference resolution with world knowledge. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics and Human Language Technologies (ACL-HLT), pages 814–824.
Altaf Rahman and Vincent Ng. 2011b. Narrowing the modelling gap: A cluster-ranking approach to coreference resolution. Journal of Artificial Intelligence Research.
Veselin Stoyanov, Nathan Gilbert, Claire Cardie, and Ellen Riloff. 2009. Conundrums in noun phrase coreference resolution: Making sense of the State-of-the-Art. Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th IJCNLP (ACL-IJCNLP 2009).
Veselin Stoyanov, Nathan Gilbert, Claire Cardie, and Ellen Riloff. 2010. Coreference resolution with Reconcile. Proceedings of the Joint Conference of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010).
Marc Villain, John Aberdeen, John Berger, Dennis Connolly, and Lynette Hirschman. 1995. A model-theoretic coreference scoring scheme. Proceedings of the 6th conference on Message understanding.
Ian H. Witten and Eibe Frank. 2005. Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann, 2nd edition.
Jiaping Zheng, Wendy Chapman, Rebecca Crowley, and Guergana Savova. 2011. Coreference resolution: A review of general methodologies and applications in the clinical domain. Journal of Biomedical Informatics, 44: 1113–1122.