
31 emnlp-2010-Constraints Based Taxonomic Relation Classification

Source: pdf

Author: Quang Do ; Dan Roth

Abstract: Determining whether two terms in text have an ancestor relation (e.g., Toyota and car) or a sibling relation (e.g., Toyota and Honda) is an essential component of textual inference in NLP applications such as Question Answering, Summarization, and Recognizing Textual Entailment. Significant work has been done on developing stationary knowledge sources that could potentially support these tasks, but these resources often suffer from low coverage and noise, and are inflexible when needed to support terms that are not identical to those placed in them, making their use as general-purpose background knowledge resources difficult. In this paper, rather than building a stationary hierarchical structure of terms and relations, we describe a system that, given two terms, determines the taxonomic relation between them using a machine learning-based approach that makes use of existing resources. Moreover, we develop a global constraint optimization inference process and use it to leverage an existing knowledge base also to enforce relational constraints among terms and thus improve the classifier predictions. Our experimental evaluation shows that our approach significantly outperforms other systems built upon existing well-known knowledge sources.
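The idea of letting relational constraints among terms correct a local classifier can be illustrated with a minimal sketch. It is not the paper's implementation: the label set, the constraint rules in `consistent`, the score values, and the function names are all hypothetical, and a brute-force search over a single term triple stands in for whatever optimization procedure the authors actually use. A label on a pair (a, b) is read as "a is an ancestor of b", "a is a descendant of b", and so on.

```python
from itertools import product

# Hypothetical label set for an ordered term pair (a, b).
LABELS = ("ancestor", "descendant", "sibling", "none")


def consistent(xy, yz, xz):
    """Hypothetical relational constraints over a term triple (x, y, z)."""
    # Transitivity of the ancestor relation, in both directions.
    if xy == "ancestor" and yz == "ancestor" and xz != "ancestor":
        return False
    if xy == "descendant" and yz == "descendant" and xz != "descendant":
        return False
    # A sibling of z's sibling cannot sit above or below z.
    if xy == "sibling" and yz == "sibling" and xz in ("ancestor", "descendant"):
        return False
    # Siblings share their ancestors: if y is below z, so is y's sibling x.
    if xy == "sibling" and yz == "descendant" and xz != "descendant":
        return False
    return True


def best_joint_labeling(scores):
    """Return the highest-scoring consistent labeling of (xy, yz, xz).

    scores maps each pair name to a dict of label -> local classifier score.
    """
    best, best_score = None, float("-inf")
    for xy, yz, xz in product(LABELS, repeat=3):
        if not consistent(xy, yz, xz):
            continue  # skip labelings that violate a constraint
        total = scores["xy"][xy] + scores["yz"][yz] + scores["xz"][xz]
        if total > best_score:
            best, best_score = (xy, yz, xz), total
    return best, best_score


# Made-up local scores for x = Toyota, y = Honda, z = car.
scores = {
    "xy": {"ancestor": 0.05, "descendant": 0.05, "sibling": 0.80, "none": 0.10},
    "yz": {"ancestor": 0.05, "descendant": 0.85, "sibling": 0.05, "none": 0.05},
    # Taken alone, the classifier wrongly prefers "sibling" for (Toyota, car).
    "xz": {"ancestor": 0.05, "descendant": 0.40, "sibling": 0.50, "none": 0.05},
}

labels, total = best_joint_labeling(scores)
print(labels)
```

With these made-up scores, the locally preferred label for (Toyota, car) is "sibling", but the joint search picks "descendant" because the other two pairs are labeled with high confidence and the constraints rule out the inconsistent combination. Enumeration is only feasible for tiny examples like this one; a larger term network would need a proper solver.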

reference text

A. Abad, L. Bentivogli, I. Dagan, D. Giampiccolo, S. Mirkin, E. Pianta, and A. Stern. 2010. A resource for investigating the impact of anaphora and coreference on inference. In LREC.
M. Banko and O. Etzioni. 2008. The tradeoffs between open and traditional relation extraction. In ACL-HLT.
M. Baroni and A. Lenci. 2010. Distributional memory: A general framework for corpus-based semantics. Computational Linguistics, 36.
S. Chakrabarti, B. Dom, R. Agrawal, and P. Raghavan. 1997. Using taxonomy, discriminants, and signatures for navigating in text databases. In VLDB.
M. Chang, L. Ratinov, and D. Roth. 2008. Constraints as prior knowledge. In ICML Workshop on Prior Knowledge for Text and Language Processing.
D. Davidov and A. Rappoport. 2008. Unsupervised discovery of generic relationships using pattern clusters and its evaluation by automatically generated SAT analogy questions. In ACL.
P. Denis and J. Baldridge. 2007. Joint determination of anaphoricity and coreference resolution using integer programming. In NAACL.
C. Fellbaum. 1998. WordNet: An Electronic Lexical Database. MIT Press.
Y. Freund and R. E. Schapire. 1999. Large margin classification using the perceptron algorithm. Machine Learning.
M. A. Hearst. 1992. Acquisition of hyponyms from large text corpora. In COLING.
A. Hotho, S. Staab, and G. Stumme. 2003. Ontologies improve text document clustering. In ICDM.
Z. Kozareva, E. Riloff, and E. Hovy. 2008. Semantic class learning from the web with hyponym pattern linkage graphs. In ACL-HLT.
B. MacCartney and C. D. Manning. 2008. Modeling semantic containment and exclusion in natural language inference. In COLING.
B. MacCartney and C. D. Manning. 2009. An extended model of natural logic. In IWCS-8.
M. Paşca and B. Van Durme. 2008. Weakly-supervised acquisition of open-domain classes and class attributes from web documents and query logs. In ACL-HLT.
M. Paşca. 2007. Organizing and searching the world wide web of facts step two: Harnessing the wisdom of the crowds. In WWW.
P. Pantel and M. Pennacchiotti. 2006. Espresso: Leveraging generic patterns for automatically harvesting semantic relations. In ACL, pages 113–120.
S. P. Ponzetto and M. Strube. 2007. Deriving a large scale taxonomy from Wikipedia. In AAAI.
V. Punyakanok, D. Roth, and W. Yih. 2008. The importance of syntactic parsing and inference in semantic role labeling. Computational Linguistics, 34(2).
D. Roth and W. Yih. 2004. A linear programming formulation for global inference in natural language tasks. In CoNLL.
D. Roth and W. Yih. 2007. Global inference for entity and relation identification via a linear programming formulation. In Lise Getoor and Ben Taskar, editors, Introduction to Statistical Relational Learning. MIT Press.
M. Sammons, V.G. Vydiswaran, and D. Roth. 2010. Ask not what textual entailment can do for you... In ACL.
L. Sarmento, V. Jijkoun, M. de Rijke, and E. Oliveira. 2007. "More like these": growing entity classes from seeds. In CIKM.
A. K. Saxena, G. V. Sambhu, S. Kaushik, and L. V. Subramaniam. 2007. IITD-IBMIRL system for question answering using pattern matching, semantic type and semantic category recognition. In TREC.
R. Snow, D. Jurafsky, and A. Y. Ng. 2005. Learning syntactic patterns for automatic hypernym discovery. In NIPS.
R. Snow, D. Jurafsky, and A. Y. Ng. 2006. Semantic taxonomy induction from heterogenous evidence. In ACL.
F. M. Suchanek, G. Kasneci, and G. Weikum. 2007. Yago: A Core of Semantic Knowledge. In WWW.
O. Vikas, A. K. Meshram, G. Meena, and A. Gupta. 2008. Multiple document summarization using principal component analysis incorporating semantic vector space model. In Computational Linguistics and Chinese Language Processing.
V. Vyas and P. Pantel. 2009. Semi-automatic entity set refinement. In NAACL-HLT.
D. Yarowsky. 1995. Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of ACL-95.