acl acl2010 acl2010-139 acl2010-139-reference knowledge-graph by maker-knowledge-mining

139 acl-2010-Identifying Generic Noun Phrases

Source: pdf

Author: Nils Reiter ; Anette Frank

Abstract: This paper presents a supervised approach for identifying generic noun phrases in context. Generic statements express rulelike knowledge about kinds or events. Therefore, their identification is important for the automatic construction of knowledge bases. In particular, the distinction between generic and non-generic statements is crucial for the correct encoding of generic and instance-level information. Generic expressions have been studied extensively in formal semantics. Building on this work, we explore a corpus-based learning approach for identifying generic NPs, using selections of linguistically motivated features. Our results perform well above the baseline and existing prior work.

reference text

R. Harald Baayen, Richard Piepenbrock, and Leon Gulikers. 1996. CELEX2. Linguistic Data Consor- tium, Philadelphia. Johan Bos. 2009. Applying automated deduction to natural language understanding. Journal of Applied 8Consider example (1.a), which is contextually restricted to a certain time and space. 48 Logic, 7(1): 100 – 1 12. Special Issue: Empirically Successful Computerized Reasoning. Miriam Butt, Helge Dyvik, Tracy Holloway King, Hiroshi Marsuichi, and Christian Rohrer. 2002. The Parallel Grammar Project. In Proceedings of Grammar Engineering and Evaluation Workshop. Gregory Norman Carlson. 1977. Reference to Kinds in English. Ph.D. thesis, University of Massachusetts. Philipp Cimiano. 2006. Ontology Learning and Populating from Text. Springer. Dick Crouch, Mary Dalrymple, Ron Kaplan, Tracy King, John Maxwell, and Paula Newman, 2010. XLE Documentation. www2.parc.com/isl/groups/nltt/xle/doc/xle toc.html. O¨sten Dahl. 1975. On Generics. In Edward Keenan, editor, Formal Semantics of Natural Lan- guage, pages 99–1 11. Cambridge University Press, Cambridge. Renaat Declerck. 1991 . The Origins of Genericity. Linguistics, 29:79–102. Christiane Fellbaum. 1998. WordNet: An Electronic Lexical Database. MIT Press. Lisa Ferro, Laurie Gerber, Janet Hitzeman, Elizabeth Lima, and Beth Sundheim. 2005. ACE English Training Data. Linguistic Data Consortium, Philadelphia. Michael Gelfond. 2007. Answer sets. In Handbook of Knowledge Representation. Elsevier Science. Marti A. Hearst. 1992. Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th International Conference on Computational Linguistics, pages 539–545. Irene Heim. 1982. The Semantics of Definite and Indefinite Noun Phrases. Ph.D. thesis, University of Massachusetts, Amherst. Aurelie Herbelot and Ann Copestake. 2008. Annotating genericity: How do humans decide? (a case study in ontology extraction). In Sam Featherston and Susanne Winkler, editors, The Fruits of Empirical Linguistics, volume 1. de Gruyter. Dan Klein and Christopher Manning. 2003. Accurate unlexicalized parsing. In Proceedings of the 41st Meeting of the Association for Computational Linguistics, pages 423–430. Manfred Krifka, Francis Jeffry Pelletier, Gregory N. Carlson, Alice ter Meulen, Gennaro Chierchia, and Godehard Link. 1995. Genericity: An Introduction. In Gregory Norman Carlson and Francis Jeffry Pelletier, editors, The Generic Book. University of Chicago Press, Chicago. Douglas B. Lenat. 1995. Cyc: a large-scale investment in knowledge infrastructure. Commun. ACM, 38(1 1):33–38. Vladimir Lifschitz. 2002. Answer set programming and plan generation. Artificial Intelligence, 138(12):39 54. – Vladimir Lifschitz. 2008. What is Answer Set Programming? In Proceedings of AAAI. Mitchell P. Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini. 1993. Building a large annotated corpus of English: the Penn treebank. Computational Linguistics, 19(2):3 13–330. Thomas Mathew and Graham Katz. 2009. Supervised Categorization of Habitual and Episodic Sentences. In Sixth Midwest Computational Linguistics Colloquium. Bloomington, Indiana: Indiana University. Alexis Mitchell, Stephanie Strassel, Mark Przybocki, JK Davis, George Doddington, Ralph Grishman, Adam Meyers, Ada Brunstein, Lisa Ferro, and Beth Sundheim. 2003. ACE-2 Version 1.0. Linguistic Data Consortium, Philadelphia. Ian Niles and Adam Pease. 2001 . Towards a Standard Upper Ontology. In Proceedings of the 2nd International Conference on Formal Ontology in Information Systems. Athina Pappas and Susan A. Gelman. 1998. Generic noun phrases in mother–child conversations. Journal of Child Language, 25(1): 19–33. Simone Paolo Ponzetto and Michael Strube. 2007. Deriving a large scale taxonomy from wikipedia. In Proceedings of the 22nd Conference on the Advancement of Artificial Intelligence, pages 1440– 1445, Vancouver, B.C., Canada, July. Willard Van Orman Quine. 1960. Word and Object. MIT Press, Cambridge, Massachusetts. Raymond Reiter. 1980. A logic for default reasoning. Artificial Intelligence, 13:81–132. Helmut Schmid. 1994. Probabilistic part-of-speech tagging using decision trees. Proceedings of the conference on New Methods in Language Processing, 12. Sangweon Suh. 2006. Extracting Generic Statements for the Semantic Web. Master’s thesis, University of Edinburgh. Ian H. Witten and Eibe Frank. 2002. Data mining: practical machine learning tools and techniques with Java implementations. ACM SIGMOD Record, 31(1):76–77. C ¨acilia Zirn, Vivi Nastase, and Michael Strube. 2008. Distinguishing between instances and classes in the Wikipedia taxonomy. In Proceedings of the 5th European Semantic Web Conference. 49