emnlp emnlp2012 emnlp2012-97 emnlp2012-97-reference knowledge-graph by maker-knowledge-mining

97 emnlp-2012-Natural Language Questions for the Web of Data


Source: pdf

Author: Mohamed Yahya ; Klaus Berberich ; Shady Elbassuoni ; Maya Ramanath ; Volker Tresp ; Gerhard Weikum

Abstract: The Linked Data initiative comprises structured databases in the Semantic-Web data model RDF. Exploring this heterogeneous data by structured query languages is tedious and error-prone even for skilled users. To ease the task, this paper presents a methodology for translating natural language questions into structured SPARQL queries over linked-data sources. Our method is based on an integer linear program to solve several disambiguation tasks jointly: the segmentation of questions into phrases; the mapping of phrases to semantic entities, classes, and relations; and the construction of SPARQL triple patterns. Our solution harnesses the rich type system provided by knowledge bases in the web of linked data, to constrain our semantic-coherence objective function. We present experiments on both the . in question translation and the resulting query answering.


reference text

Auer, S.; Bizer, C.; Kobilarov, G.; Lehmann, J.; Cyganiak, R.; and Ives, Z. G. 2007. DBpedia: A Nucleus for a Web of Open Data. In ISWC/ASWC. Bhalotia, G.; Hulgeri, A.; Nakhe, C.; Chakrabarti, S.; and Sudarshan, S. 2002. Keyword Searching and Browsing in Databases using BANKS. In ICDE. Bollacker, K. D.; Evans, C.; Paritosh, P.; Sturge, T.; and Taylor, J. 2008. Freebase: a Collaboratively Created Graph Database for Structuring Human Knowledge. In SIGMOD. Chu-Carroll, J.; Fan, J.; Boguraev, B. K.; Carmel, D.; and Sheinwald, D.; Welty, C. 2012. Finding needles in the haystack: Search and candidate generation. In IBM J. Res. & Dev., vol 56, no.3/4. Damljanovic, D.; Agatonovic, M.; and Cunningham, H. 2011. FREyA: an Interactive Way of Querying Linked Data using Natural Language. Dang, H. T.; Kelly, D.; and Lin, J. J. 2007. Overview of the trec 2007 question answering track. In TREC. de Marneffe, M. C.; Maccartney, B.; and Manning, C. D. 2006. Generating typed dependency parses from phrase structure parses. In LREC. Elbassuoni, S.; Ramanath, M.; Schenkel, R.; Sydow, M.; and Weikum, G. 2009. Language-model-based ranking for queries on rdf-graphs. In CIKM. Elbassuoni, S.; Ramanath, M.; and Weikum, G. 2011. Query relaxation for entity-relationship search. In ESWC. Fader, A.; Soderland, S.; and Etzioni, O. 2011. Identifying relations for open information extraction. In EMNLP. Ferrucci, D. A.; Brown, E. W.; Chu-Carroll, J.; Fan, J.; Gondek, D.; Kalyanpur, A.; Lally, A.; Murdock, J. W.; Nyberg, E.; Prager, J. M.; Schlaefer, N.; and Welty, C. A. 2010. Building Watson: An Overview of the DeepQA Project. AIMagazine 3 1(3). Frank, A.; Krieger, H.-U.; Xu, F.; Uszkoreit, H.; Crys- mann, B.; J o¨rg, B.; and Sch a¨fer, U. 2007. Question Answering from Structured Knowledge Sources. J. Applied Logic 5(1). Gurobi Optimization, Inc. 2012. Gurobi Optimizer Reference Manual. http://www.gurobi.com/. Heath, T., and Bizer, C. 2011. Linked Data: Evolving the Web into a Global Data Space. San Rafael, CA: Morgan & Claypool, 1edition. Hirschman, L., and Gaizauskas, R. 2001. Natural Language Question Answering: The View from Here. Nat. Lang. Eng. 7. Hoffart, J.; Mohamed, A. Y.; Bordino, I.; F ¨urstenau, H.; Pinkal, M.; Spaniol, M.; Taneva, B.; Thaterm S.; and Weikum, G. 2011. Robust Disambiguation of Named Entities in Text. In EMNLP. 389 Hoffart, J.; Suchanek, F. M.; Berberich, K.; LewisKelham, E.; de Melo, G.; and Weikum, G. 2011. Yago2: exploring and querying world knowledge in time, space, context, and many languages. In WWW (Companion Volume). Kalyanpur, A.; Murdock, J. W.; Fan, J.; and Welty, C. A. 2011. Leveraging community-built knowledge for type coercion in question answering. In International Semantic Web Conference. Katz, B.; Felshin, S.; Marton, G.; Mora, F.; Shen, Y. K.; Zaccak, G.; Ammar, A.; Eisner, E.; Turgut, A.; and Westrick, L. B. 2007. CSAIL at TREC 2007 Question Answering. In TREC. Kulkarni, S.; Singh, A.; Ramakrishnan, G.; and Chakrabarti, S. 2009. Collective annotation of wikipedia entities in web text. In KDD. Kwok, C. C. T.; Etzioni, O.; and Weld, D. S. 2001. Scaling Question Answering to the Web. In WWW. Li, Y.; Yang, H.; and Jagadish, H. V. 2007. NaLIX: A Generic Natural Language Search Environment for XML Data. ACM Trans. Database Syst. 32(4). Milne, D. N., and Witten, I. H. 2008. Learning to link with wikipedia. In CIKM. Ndapandula Nakashole, Gerhard Weikum and Fabian Suchanek 2012. PATTY: A Taxonomy of Relational Patterns with Semantic Types. In EMNLP. Navigli, R. 2009. Word sense disambiguation: A survey. ACM Comput. Surv. 41(2). Pound, J.; Ilyas, I. F.; and Weddell, G. E. 2010. Expressive and Flexible Access to Web-extracted Data: A Keyword-based Structured Query Language. In SIGMOD. 2011. 1st Workshop on Question Answering over Linked Data (QALD-1). http://www.sc.cit-ec.unibielefeld.de/qald-1. Resnik, P. 1995. Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In IJCAI. Spitkovsky, V. I. Spitkovsky; Chang, A. X. ; 2012. A Cross-Lingual Dictionary for English Wikipedia Concepts. In LREC. Suchanek, F. M.; Kasneci, G.; and Weikum, G. 2007. Yago: a core of semantic knowledge. In WWW. Tummarello, G.; Cyganiak, R.; Catasta, M.; Danielczyk, S.; Delbru, R.; and Decker, S. 2010. Sig.ma: Live views on the web of data. J. Web Sem. 8(4). Unger, C.; and Cimiano, P. 2011. Pythia: Compositional Meaning Construction for Ontology-Based Question Answering on the Semantic Web. In NLDB. Unger, C.; B ¨uhmann, L.; Lehmann, J.; Ngonga Ngomo, A.-C.; Gerber, D.; and Cimiano, P. 2012. Templatebased question answering over RDF data. In WWW. Voorhees, E. M. 2003. Overview of the trec 2003 question answering track. In TREC. Yahya, M.; Berberich, K.; Elbassuoni, S.; Ramanath, M.; Tresp, V.; for and Weikum, naturally WWW. G. asked questions 2012. Deep answers on the web of data. In 390 Zheng, Z. 2002. tem. In HLT. AnswerBus Question Answering Sys-