acl acl2011 acl2011-293 acl2011-293-reference knowledge-graph by maker-knowledge-mining

293 acl-2011-Template-Based Information Extraction without the Templates

Source: pdf

Author: Nathanael Chambers ; Dan Jurafsky

Abstract: Standard algorithms for template-based information extraction (IE) require predefined template schemas, and often labeled data, to learn to extract their slot fillers (e.g., an embassy is the Target of a Bombing template). This paper describes an approach to template-based IE that removes this requirement and performs extraction without knowing the template structure in advance. Our algorithm instead learns the template structure automatically from raw text, inducing template schemas as sets of linked events (e.g., bombings include detonate, set off, and destroy events) associated with semantic roles. We also solve the standard IE task, using the induced syntactic patterns to extract role fillers from specific documents. We evaluate on the MUC-4 terrorism dataset and show that we induce template structure very similar to handcreated gold structure, and we extract role fillers with an F1 score of .40, approaching the performance of algorithms that require full knowledge of the templates.

reference text

Collin F. Baker, Charles J. Fillmore, and John B. Lowe. 1998. The Berkeley FrameNet project. In Christian Boitet and Pete Whitelock, editors, ACL-98, pages 86– 90, San Francisco, California. Morgan Kaufmann Publishers. Michele Banko, Michael J Cafarella, Stephen Soderland, Matt Broadhead, and Oren Etzioni. 2007a. Learning relations from the web. In Proceedings of the International Joint Conferences on Artificial Intelligence (IJCAI). Michele Banko, Michael J Cafarella, Stephen Soderland, Matt Broadhead, and Oren Etzioni. 2007b. Open information extraction from the web. In Proceedings of the International Joint Conferences on Artificial Intelligence (IJCAI). David Blei, Andrew Ng, and Michael Jordan. 2003. Latent dirichlet allocation. Journal of Machine Learning Research. Razvan Bunescu and Raymond Mooney. 2004. Collective information extraction with relational markov networks. In Proceedings of the Association of Computational Linguistics (ACL), pages 438–445. Andrew Carlson, J. Betteridge, R.C. Wang, E.R. Hruschka Jr., and T.M. Mitchell. 2010. Coupled semisupervised learning for information extraction. In Proceedings of the ACM International Conference on Web Search and Data Mining (WSDM). Nathanael Chambers and Dan Jurafsky. 2008. Unsupervised learning of narrative event chains. In Proceedings of the Association of Computational Linguistics (ACL), Hawaii, USA. Nathanael Chambers and Dan Jurafsky. 2009. Unsupervised learning of narrative schemas and their participants. In Proceedings of the Association of Computational Linguistics (ACL), Columbus, Ohio. Hai Leong Chieu, Hwee Tou Ng, and Yoong Keok Lee. 2003. Closing the gap: Learning-based information extraction rivaling knowledge-engineering methods. In Proceedings of the Association of Computational Linguistics (ACL). Nancy Chinchor, David Lewis, and Lynette Hirschman. 1993. Evaluating message understanding systems: an analysis of the third message understanding conference. Computational Linguistics, 19:3:409–449. Katrin Erk and Sebastian Pado. 2008. A structured vector space model for word meaning in context. In Proceedings of the 2008 Conference on Empirical Methods on Natural Language Processing (EMNLP). Elena Filatova, Vasileios Hatzivassiloglou, and Kathleen McKeown. 2006. Automatic creation of domain templates. In Proceedings of the Association of Computational Linguistics (ACL). Dayne Freitag. 985 1998. Toward general-purpose learning for information extraction. In Proceedings of the Association of Computational Linguistics (ACL), pages 404–408. David Graff. 2002. English gigaword. Linguistic Data Consortium. Trond Grenager and Christopher D. Manning. 2006. Unsupervised discovery of a statistical verb lexicon. In Proceedings of the the 2006 Conference on Empirical Methods on Natural Language Processing (EMNLP). Shan He and Daniel Gildea. 2006. Self-training and co-training for semantic role labeling: Primary report. Technical Report 891, University of Rochester. Heng Ji and Ralph Grishman. 2008. Refining event extraction through unsupervised cross-document inference. In Proceedings of the Association of Computational Linguistics (ACL). Niels Kasch and Tim Oates. 2010. Mining script-like structures from the web. In Proceedings of NAACL HLT, pages 34–42. Joel Lang and Mirella Lapata. 2010. Unsupervised induction of semantic roles. In Proceedings of the North American Association of Computational Linguistics. Mstislav Maslennikov and Tat-Seng Chua. 2007. Automatic acquisition of domain knowledge for information extraction. In Proceedings of the Association of Computational Linguistics (ACL). Siddharth Patwardhan and Ellen Riloff. 2007. Effective ie with semantic affinity patterns and relevant regions. In Proceedings of the 2007 Conference on Empirical Methods on Natural Language Processing (EMNLP). Siddharth Patwardhan and Ellen Riloff. 2009. A unified model of phrasal and sentential evidence for information extraction. In Proceedings ofthe 2009 Conference on Empirical Methods on Natural Language Processing (EMNLP). Lisa Rau, George Krupka, Paul Jacobs, Ira Sider, and Lois Childs. 1992. Ge nltoolset: Muc-4 test results and analysis. In Proceedings of the Message Understanding Conference (MUC-4), pages 94–99. Ellen Riloff and Mark Schmelzenbach. 1998. An empirical approach to conceptual case frame acquisition. In Proceedings of the Sixth Workshop on Very Large Corpora. Ellen Riloff, Janyce Wiebe, and William Phillips. 2005. Exploiting subjectivity classification to improve information extraction. In Proceedings of AAAI-05. Roger C. Schank and Robert P. Abelson. 1977. Scripts, plans, goals and understanding. Lawrence Erlbaum. Yusuke Shinyama and Satoshi Sekine. 2006. Preemptive ie using unrestricted relation discovery. In Proceedings of NAACL. Kiyoshi 2003. model Sudo, Satoshi Sekine, An improved extraction for automatic and Ralph Grishman. pattern representation ie pattern acquisition. In Pro- ceedings of the Association of Computational Linguistics (ACL), pages 224–23 1 . Beth M. Sundheim. 1991. Third message understanding evaluation and conference (muc-3): Phase 1 status report. In Proceedings of the Message Understanding Conference. Mihai Surdeanu, Jordi Turmo, and Alicia Ageno. 2006. A hybrid approach for the acquisition of information extraction patterns. In Proceedings of the EACL Workshop on Adaptive Text Extraction and Mining. Robert S. Swier and Suzanne Stevenson. 2004. Unsupervised semantic role labelling. In Proceedings of the 2004 Conference on Empirical Methods on Natural Language Processing (EMNLP). Jing Xiao, Tat-Seng Chua, and Hang Cui. 2004. Cascading use of soft and hard matching pattern rules for weakly supervised information extraction. In Proceedings of the 20th International Conference on Computational Linguistics (COLING). Roman Yangarber, Ralph Grishman, Pasi Tapanainen, and Silja Huttunen. 2000. Automatic acquisition of domain knowledge for information extraction. In COLING, pages 940–946. 986