acl acl2012 acl2012-206 acl2012-206-reference knowledge-graph by maker-knowledge-mining

206 acl-2012-UWN: A Large Multilingual Lexical Knowledge Base


Source: pdf

Author: Gerard de Melo ; Gerhard Weikum

Abstract: We present UWN, a large multilingual lexical knowledge base that describes the meanings and relationships of words in over 200 languages. This paper explains how link prediction, information integration and taxonomy induction methods have been used to build UWN based on WordNet and extend it with millions of named entities from Wikipedia. We additionally introduce extensions to cover lexical relationships, frame-semantic knowledge, and language data. An online interface provides human access to the data, while a software API enables applications to look up over 16 million words and names.


reference text

Jordi Atserias et al. 2004. The MEANING multilingual central repository. In Proc. GWC 2004. Collin F. Baker, Charles J. Fillmore, and John B. Lowe. 1998. The Berkeley FrameNet project. In Proc. COLING-ACL 1998. Niladri Chatterjee, Shailly Goyal, and Anjali Naithani. 2005. Resolving pattern ambiguity for English to 156 Hindi machine translation using WordNet. In Proc. Workshop Translation Techn. at RANLP 2005. Bonaventura Coppola et al. 2009. Frame detection over the Semantic Web. In Proc. ESWC. Gerard de Melo and Gerhard Weikum. 2008. Language as a foundation of the Semantic Web. In Proc. ISWC. Gerard de Melo and Gerhard Weikum. 2009. Towards a universal wordnet by learning from combined evidence. In Proc. CIKM 2009. Gerard de Melo and Gerhard Weikum. 2010a. MENTA: Inducing multilingual taxonomies from Wikipedia. In Proc. CIKM 2010. Gerard de Melo and Gerhard Weikum. 2010b. Untangling the cross-lingual link structure of Wikipedia. In Proc. ACL 2010. Oren Etzioni, Kobi Reiter, Stephen Soderland, and Marcus Sammer. 2007. Lexical translation with application to image search on the Web. In Proc. MT Summit. Christiane Fellbaum, editor. 1998. WordNet: An Electronic Lexical Database. The MIT Press. Namrata Godbole, Manjunath Srinivasaiah, and Steven Skiena. 2007. Large-scale sentiment analysis for news and blogs. In Proc. ICWSM. Zhiguo Gong, Chan Wa Cheang, and Leong Hou U. 2005. Web query expansion by WordNet. In Proc. DEXA 2005. Iryna Gurevych et al. 2012. Uby: A large-scale unified lexical-semantic resource based on LMF. In Proc. EACL 2012. Yves R. Jean-Mary and Mansur R. Kabuka. 2008. ASMOV: Results for OAEI 2008. In Proc. OM 2008. Alex Judea, Vivi Nastase, and Michael Strube. 2011. WikiNetTk – A tool kit for embedding world knowledge in NLP applications. In Proc. IJCNLP 2011. Zoubida Kedad and Elisabeth Métais. 2002. Ontologybased data cleaning. In Proc. NLDB 2002. Jayant Madhavan, P. Bernstein, and E. Rahm. 2001. Generic schema matching with Cupid. In Proc. VLDB. Marcin Marszałek and C. Schmid. 2007. Semantic hierarchies for visual object recognition. In Proc. CVPR. Roberto Navigli and Simone Paolo Ponzetto. 2010. BabelNet: Building a very large multilingual semantic network. In Proc. ACL 2010. Emanuele Pianta, Luisa Bentivogli, and Christian Girardi. 2002. MultiWordNet: Developing an aligned multilingual database. In Proc. GWC. Daniel L. Rubin et al. 2006. National Center for Biomedical Ontology. OMICS, 10(2): 185–98. Lei Shi and Rada Mihalcea. 2005. Putting the pieces together: Combining FrameNet, VerbNet, and WordNet for robust semantic parsing. In Proc. CICLing. Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2007. YAGO: A core of semantic knowledge. In Proc. WWW 2007.