acl acl2010 acl2010-28 acl2010-28-reference knowledge-graph by maker-knowledge-mining

28 acl-2010-An Entity-Level Approach to Information Extraction


Source: pdf

Author: Aria Haghighi ; Dan Klein

Abstract: We present a generative model of template-filling in which coreference resolution and role assignment are jointly determined. Underlying template roles first generate abstract entities, which in turn generate concrete textual mentions. On the standard corporate acquisitions dataset, joint resolution in our entity-level model reduces error over a mention-level discriminative approach by up to 20%.


reference text

M. C. de Marneffe, B. Maccartney, and C. D. Manning. 2006. Generating typed dependency parses from phrase structure parses. In LREC. Dayne Freitag and Andrew McCallum. 2000. Information extraction with hmm structures learned by stochastic optimization. In Association for the Advancement of Artificial Intelligence (AAAI). Dayne Freitag. 1998. Machine learning for information extraction in informal domains. A. Haghighi and D. Klein. 2010. Coreference resolution in a modular, entity-centered model. In North American Association of Computational Linguistics (NAACL). P. Liang and D. Klein. 2007. Structured Bayesian nonparametric models with variational inference (tutorial). In Association for Computational Linguistics (ACL). S. Patwardhan and E. Riloff. 2007. Effective information extraction with semantic affinity patterns and relevant regions. In Joint Conference on Empirical Methods in Natural Language Processing. S. Patwardhan and E Riloff. 2009. A unified model of phrasal and sentential evidence for information extraction. In Empirical Methods in Natural Language Processing (EMNLP). Slav Petrov, Leon Barrett, Romain Thibaux, and Dan Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 433–440, Sydney, Australia, July. Association for Computational Linguistics. Christian Siefkes. 2008. An Incrementally Trainable Statistical Approach to Information Extraction: Based on Token Classification and Rich Context Model. VDM Verlag, Saarbr¨ ucken, Germany, Ger- many. 295