acl acl2013 acl2013-234 acl2013-234-reference knowledge-graph by maker-knowledge-mining

234 acl-2013-Linking and Extending an Open Multilingual Wordnet


Source: pdf

Author: Francis Bond ; Ryan Foster

Abstract: We create an open multilingual wordnet with large wordnets for over 26 languages and smaller ones for 57 languages. It is made by combining wordnets with open licences, data from Wiktionary and the Unicode Common Locale Data Repository. Overall there are over 2 million senses for over 100 thousand concepts, linking over 1.4 million words in hundreds of languages.


reference text

Stephen Bird, Ewan Klein, and Edward Loper. 2010. Nyumon Shizen Gengo Shori [Introduction to Natural Language Processing]. O’Reilly. (translated by Hagiwara, Nakamura and Mizuno). W. Black, S. Elkateb, H. Rodriguez, M. Alkhalifa, P. Vossen, A. Pease, M. Bertran, and C. Fellbaum. 2006. The Arabic wordnet project. In Proceedings of LREC 2006. Francis Bond and Kyonghee Paik. 2012. A survey of wordnets and their licenses. In Proceedings of the 6th Global WordNet Conference (GWC 2012). Matsue. 64–71. Jordan Boyd-Graber, Christiane Fellbaum, Daniel Osherson, and Robert Schapire. 2006. Adding dense, weighted connections to WordNet. In Proceedings of the Third Global WordNet Meeting. Jeju. Thatsanee Charoenporn, Virach Sornlerlamvanich, Chumpol Mokarat, and Hitoshi Isahara. 2008. Semi-automatic compilation of Asian WordNet. In 14th Annual Meeting of the Association for Natural Language Processing, pages 1041–1044. Tokyo. Jordi Daude, Lluis Padro, and German Rigau. 2003. Validation and tuning of Wordnet mapping techniques. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP’03). Borovets, Bulgaria. Gerard de Melo and Gerhard Weikum. 2009. Towards a universal wordnet by learning from combined evidence. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2009), pages 513– 522. ACM, New York, NY, USA. Gerard de Melo and Gerhard Weikum. 2010. Towards universal multilingual knowledge bases. In Pushpak Bhattacharyya, Christiane Fellbaum, and Piek Vossen, editors, Principles, Construction, and Applications of Multilingual Wordnets. Proceedings of the 5th Global WordNet Conference (GWC 2010), pages 149–156. Narosa Publishing, New Delhi, India. Valeria de Paiva and Alexandre Rademaker. 2012. Revisiting a Brazilian wordnet. In Proceedings of the 6th Global WordNet Conference (GWC 2012). Matsue. Christiane Fellbaum and Piek Vossen. 2012. Challenges for a multilingual wordnet. Language Resources and Evaluation, 46(2):3 13– 326. Doi=10.1007/s10579-012-9186-z. Christine Fellbaum, editor. 1998. WordNet: An Electronic Lexical Database. MIT Press. Aitor Gonzalez-Agirre, Egoitz Laparra, and German Rigau. 2012. Multilingual central repos1360 itory version 3.0: upgrading a very large lexical knowledge base. In Proceedings of the 6th Global WordNet Conference (GWC 2012). Matsue. Val e´rie Hanoka and Beno ıˆt Sagot. 2012. Wordnet creation and extension made simple: A multilingual lexicon-based approach using wiki resources. In Proceedings of LREC 2012. Istanbul. Chu-Ren Huang, Shu-Kai Hsieh, Jia-Fei Hong, Yun-Zhu Chen, I-Li Su, Yong-Xiang Chen, and Sheng-Wei Huang. 2010. Chinese wordnet: Design and implementation of a cross-lingual knowledge processing infrastructure. Journal of Chinese Information Processing, 24(2): 14–23. (in Chinese). Hitoshi Isahara, Francis Bond, Kiyotaka Uchimoto, Masao Utiyama, and Kyoko Kanzaki. 2008. Development of the Japanese WordNet. In Sixth International conference on Language Resources and Evaluation (LREC 2008). Marrakech. Toru Ishida. 2006. Language grid: An infrastructure for intercultural collaboration. In IEEE/IPSJ Symposium on Applications and the Internet (SAINT-06), pages 96–100. URL http ://langrid .nict .go .j p/file/ langrid2006021 1.pdf, (keynote address). Kow Kuroda, Takayuki Kuribayashi, Francis Bond, Kyoko Kanzaki, and Hitoshi Isahara. 2011. Orthographic variants and multilingual sense tagging with the Japanese WordNet. In 17th Annual Meeting of the Association for Natural Language Processing, pages A4–1. Toyohashi. Krister Lind e´n and Lauri Carlson. 2010. Finnwordnet wordnet p ˚afinska via o¨vers¨ attning. LexicoNordica Nordic Journal of Lexicography, 17: 119–140. In — — Swedish with an English abstract. John McCrae, Philipp Cimiano, and Elena Montiel-Ponsoda. 2012. Integrating wordnet and wiktionary with lemon. In Christian Chiarcos, Sebastian Nordhoff, and Sebastian Hellman, editors, Linked Data in Linguistics. Springer. Christian M. Meyer and Iryna Gurevych. 2011. What psycholinguists know about chemistry: Aligning wiktionary and wordnet for increased domain coverage. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP), pages 883–892. Nurril Hirfana Mohamed Noor, Suerya Sapuan, and Francis Bond. 2011. Creating the open Wordnet Bahasa. In Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation (PACLIC 25), pages 258–267. Singapore. Mortaza Montazery and Heshaam Faili. 2010. Automatic Persian wordnet construction. In 23rd International conference on computational linguistics, pages 846–850. Roberto Navigli and Simone Paolo Ponzetto. 2012. BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence, 193:217–250. NIST. 2012. Secure hash standard (shs). Fips pub 180-4, National Institute of Standards and Technology. Noam Ordan and Shuly Wintner. 2007. Hebrew wordnet: a test case of aligning lexical databases across languages. International Journal of Translation, 19(1):39–58. B.S Pedersen, S. Nimb, J. Asmussen, N. Sørensen, L. Trap-Jensen, and H. Lorentzen. 2009. DanNet the challenge of compiling a wordnet for Danish by reusing a monolingual dictionary. Language Resources and Evaluation. Ted Pederson. 2008. Empiricism is not a matter of faith. Computational Linguistics, 34(3):465– 470. Emanuele Pianta, Luisa Bentivogli, and Christian Girardi. 2002. Multiwordnet: Developing an aligned multilingual database. In In Proceed— ings of the First International Conference on Global WordNet, pages 293–302. Mysore, India. Maciej Piasecki, Stan Szpakowicz, and Bartosz Broda. 2009. A Wordnet from the Ground Up. Wroclaw University of Technology Press. URL http : //www.plwordnet .pwr .wroc .pl/ main/content/files/publications/A_ Wordnet_from_the_Ground_Up . pdf, (ISBN 978-83-7493-476-3). Chiew Kin Quah, Francis Bond, and Takefumi Yamazaki. 2001. Design and construction of a machine-tractable Malay-English lexicon. 1361 In Asialex 2001 Proceedings, pages 200–205. Seoul. Ervin Ruci. 2008. On the current state of Albanet and related applications. Technical report, University of Vlora. (http ://fj alnet .com/ technicalreportalbanet .pdf). Beno ıˆt Sagot and Darja Fiˇ ser. 2008. Building a free French wordnet from multilingual resources. In European Language Resources Association (ELRA), editor, Proceedings of the Sixth International Language Resources and Evaluation (LREC’08). Marrakech, Morocco. Sareewan Thoongsup, Thatsanee Charoenporn, Kergrit Robkop, Tan Sinthurahat, Chumpol Mokarat, Virach Sornlertlamvanich, and Hitoshi Isahara. 2009. Thai wordnet construction. In Proceedings of The 7th Workshop on Asian Language Resources (ALR7), Joint conference of the 47th Annual Meeting of the Association for Computational Linguistics (ACL) and the 4th International Joint Conference on Natural Language Processing (IJCNLP), . Suntec, Singapore. Dan Tufi ¸s, Dan Cristea, and Sofia Stamou. 2004. BalkaNet: Aims, methods, results and perspectives. a general overview. Romanian Journal of Information Science and Technology, 7(1–2):9– 34. Unicode, Inc. 2012. Unicode, Inc. license agreement - data files and software. http ://www . unicode .org/copyright .html. Piek Vossen, editor. 1998. Euro WordNet. Kluwer. Piek Vossen. 2005. Building wordnets. http ://www. globalwordnet .org/gwa/ BuildingWordnets .ppt. Wikimedia. 2013. List of wiktionaries. http ://meta .wikimedia. .org/w/index . php?title=Wiktionary&oldid;=4729333. (accessed on 2013-02-14). Wikipedia. 2013. Wikipedia wikipedia, the free encyclopedia. URL http : //en.wikipedia. org/w/index .php? title=Wikipedia&oldid;=552515903, [Online; accessed 30-April-2013]. Renjie Xu, Zhiqiang Gao, Yuzhong Qu, and Zhisheng Huang. 2008. An integrated approach for automatic construction of bilingual ChineseEnglish WordNet. In 3rd Asian Semantic Web Conference (ASWC 2008), pages 302–341. — 1362