emnlp emnlp2010 emnlp2010-72 emnlp2010-72-reference knowledge-graph by maker-knowledge-mining

72 emnlp-2010-Learning First-Order Horn Clauses from Web Text

Source: pdf

Author: Stefan Schoenmackers ; Jesse Davis ; Oren Etzioni ; Daniel Weld

Abstract: input. Even the entire Web corpus does not explicitly answer all questions, yet inference can uncover many implicit answers. But where do inference rules come from? This paper investigates the problem of learning inference rules from Web text in an unsupervised, domain-independent manner. The SHERLOCK system, described herein, is a first-order learner that acquires over 30,000 Horn clauses from Web text. SHERLOCK embodies several innovations, including a novel rule scoring function based on Statistical Relevance (Salmon et al., 1971) which is effective on ambiguous, noisy and incomplete Web extractions. Our experiments show that inference over the learned rules discovers three times as many facts (at precision 0.8) as the TEXTRUNNER system which merely extracts facts explicitly stated in Web text.

reference text

M. Banko, M. Cafarella, S. Soderland, M. Broadhead, and O. Etzioni. 2007. Open information extraction from the Web. In Procs. of IJCAI. Andrew Carlson, Justin Betteridge, Bryan Kisiel, Burr Settles, Estevam R. Hruschka Jr., and Tom M. Mitchell. 2010. Toward an architecture for neverending language learning. In Proceedings of the Twenty-Fourth Conference on Artificial Intelligence (AAAI 2010). M. Craven, D. DiPasquo, D. Freitag, A.K. McCallum, T. Mitchell, K. Nigam, and S. Slattery. 1998. Learning to Extract Symbolic Knowledge from the World Wide Web. In Procs. of the 15th Conference of the American Association for Artificial Intelligence, pages 509–516, Madison, US. AAAI Press, Menlo Park, US. I. Dagan, O. Glickman, and B. Magnini. 2005. The PASCAL Recognising Textual Entailment Challenge. Proceedings of the PASCAL Challenges Workshop on Recognising Textual Entailment, pages 1–8. S. Dzeroski and I. Bratko. 1992. Handling noise in inductive logic programming. In Proceedings of the 2nd International Workshop on Inductive Logic Programming. M. Hearst. 1992. Automatic Acquisition of Hyponyms from Large Text Corpora. In Procs. of the 14th International Conference on Computational Linguistics, pages 539–545, Nantes, France. T.N. Huynh and R.J. Mooney. 2008. Discriminative structure and parameter learning for Markov logic networks. In Proceedings of the 25th international conference on Machine learning, pages 416–423. ACM. Stanley Kok and Pedro Domingos. 2005. Learning the structure of markov logic networks. In ICML ’05: Proceedings of the 22nd international conference on Machine learning, pages 441–448, New York, NY, USA. ACM. N. Lavrac and S. Dzeroski, editors. 2001 . Relational Data Mining. Springer-Verlag, Berlin, September. D. Lin and P. Pantel. 2001 . DIRT Discovery of Inference Rules from Text. In KDD. D. Lin and P. Pantel. 2002. Concept discovery from text. In Proceedings of the 19th International Conference on Computational linguistics (COLING-02), pages 1– – 7. E. McCreath and A. Sharma. 1997. ILP with noise and fixed example size: a Bayesian approach. In Proceedings ofthe Fifteenth internationaljoint conference on Artifical intelligence-Volume 2, pages 13 10–1315. Morgan Kaufmann Publishers Inc. G. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. Miller. 1990. Introduction to WordNet: An on-line lexical database. International Journal of Lexicography, 3(4):235–312. S. Muggleton. 1995. Inverse entailment and Progol. New Generation Computing, 13:245–286. S. Muggleton. 1997. Learning from positive data. Lecture Notes in Computer Science, 13 14:358–376. P. Pantel, R. Bhagat, B. Coppola, T. Chklovski, and E. Hovy. 2007. ISP: Learning inferential selectional preferences. In Proceedings of NAACL HLT, volume 7, pages 564–571 . M. Pennacchiotti and F.M. Zanzotto. 2007. Learning Shallow Semantic Rules for Textual Entailment. Proceedings of RANLP 2007. J. R. Quinlan. 1990. Learning logical definitions from relations. Machine Learning, 5:239–2666. Philip Resnik. 1997. Selectional preference and sense disambiguation. In Proc. of the ACL SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What, and How? M. Richardson and P. Domingos. 2006. Markov Logic Networks. Machine Learning, 62(1-2): 107–136. W.C. Salmon, R.C. Jeffrey, and J.G. Greeno. 1971. Statistical explanation & statistical relevance. Univ of Pittsburgh Pr. 1098 S. Schoenmackers, O. Etzioni, and D. Weld. 2008. Scal- ing Textual Inference to the Web. In Procs. ofEMNLP. Y. Shinyama and S. Sekine. 2006. Preemptive information extraction using unrestricted relation discovery. In Procs. of HLT/NAACL. R. Snow, D. Jurafsky, and A. Y. Ng. 2006. Semantic taxonomy induction from heterogenous evidence. In COLING/ACL 2006. M. Tatu and D. Moldovan. 2007. COGEX at RTE3. In Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, pages 22–27. M.P. Wellman, J.S. Breese, and R.P. Goldman. 1992. From knowledge bases to decision models. The Knowledge Engineering Review, 7(1):35–53. A. Yates and O. Etzioni. 2007. Unsupervised resolution of objects and relations on the Web. In Procs. of HLT.