emnlp emnlp2013 emnlp2013-182 emnlp2013-182-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Jimmy Dubuisson ; Jean-Pierre Eckmann ; Christian Scheible ; Hinrich Schutze
Abstract: Studies of the graph of dictionary definitions (DD) (Picard et al., 2009; Levary et al., 2012) have revealed strong semantic coherence of local topological structures. The techniques used in these papers are simple and the main results are found by understanding the structure of cycles in the directed graph (where words point to definitions). Based on our earlier work (Levary et al., 2012), we study a different class of word definitions, namely those of the Free Association (FA) dataset (Nelson et al., 2004). These are responses by subjects to a cue word, which are then summarized by a directed, free association graph. We find that the structure of this network is quite different from both the Wordnet and the dictionary networks. This difference can be explained by the very nature of free association as compared to the more “logical” construction of dictionaries. It thus sheds some (quantitative) light on the psychology of free association. In NLP, semantic groups or clusters are interesting for various applications such as word sense disambiguation. The FA graph is tighter than the DD graph, because of the large number of triangles. This also makes drift of meaning quite measurable so that FA graphs provide a quantitative measure of the semantic coherence of small groups of words.
Eneko Agirre and Aitor Soroa. 2009. Personalizing pagerank for word sense disambiguation. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL ’09, pages 33–41, Stroudsburg, PA, USA. Association for Computational Linguistics. Mark H Ashcraft and Gabriel A Radvansky. 2009. Cognition. Pearson Prentice Hall. Chris Biemann. 2006. Chinese whispers: an efficient graph clustering algorithm and its application to natural language processing problems. In Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing, TextGraphs-1, pages 73–80, Stroudsburg, PA, USA. Association for Computational Linguistics. B e´la Bollob´ as. 2001. Random graphs, volume 73 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge, second edition. Etienne Brunet. 1974. Le traitement des faits linguistiques et stylistiques sur ordinateur. Texte d’application: Giraudoux, Statistique et Linguistique. David, J. y Martin, R.(eds.). Paris: Klincksieck, pages 105–137. Scott Deerwester, Susan T. Dumais, George W Furnas, Thomas K Landauer, and Richard Harshman. 1990. Indexing by latent semantic analysis. Journal of the American society for information science, 41(6):391 407. James Deese. 1962. On the structure of associative meaning. Psychological review, 69: 161. James Deese. 1967. Meaning and change of meaning. The American psychologist, 22(8):641 . Beate Dorow, Dominic Widdows, Katerina Ling, JeanPierre Eckmann, Danilo Sergi, and Elisha Moses. 2005. Using curvature and Markov clustering in graphs for lexical acquisition and word sense discrimination. In MEANING-2005, 2nd Workshop organized by the MEANING Project, February 3rd-4th 2005, Trento, Italy. Jean-Pierre Eckmann and Elisha Moses. 2002. Curvature of co-links uncovers hidden thematic layers in the World Wide Web. Proc. Natl. Acad. Sci. USA, 99(9):5825–5829 (electronic). Brian Everitt, Sabine Landau, and Morven Leese. 2001. Cluster analysis. 4th Edition. Arnold, London. Santo Fortunato. 2010. Community Detection in Graphs. Physics Reports, 486(3):75–174. Pietro Gravino, Vito DP Servedio, Alain Barrat, and Vittorio Loreto. 2012. Complex structures and semantics in free word association. Advances in Complex Systems, 15(03n04). Jeffrey T Hancock, Michael T Woodworth, and Stephen Porter. 2013. Hungry like the wolf: A word-pattern analysis of the language of psychopaths. Legal and Criminological Psychology, 18(1): 102–1 14. Ahmed Hassan and Dragomir Radev. 2010. Identifying text polarity using random walks. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 395–403. Association for Computational Linguistics. Francis Heylighen. 2001 . Mining associative meanings from the web: from word disambiguation to the global brain. In Proceedings of Trends in Special Language & Language Technology, pages 15–44. Donald B Johnson. 1975. Finding all the elementary circuits of a directed graph. SIAM Journal on Computing, 4(1):77–84. Istvan Jonyer, Diane J Cook, and Lawrence B Holder. 2002. Graph-based hierarchical conceptual clustering. The Journal of Machine Learning Research, 2: 19–43. Grace H Kent and Aaron J Rosanoff. 1910. A study of association in insanity. American Journal of Insanity. George R Kiss, Christine Armstrong, Robert Milroy, and James Piper. 1973. An associative thesaurus of english and its computer analysis. The computer and lit- erary studies, pages 153–165. Andrea Lancichinetti and Santo Fortunato. 2009. Community detection algorithms: A comparative analysis. Physical review E, 80(5):0561 17. David Levary, Jean-Pierre Eckmann, Elisha Moses, and Tsvi Tlusty. 2012. Loops and self-reference in the construction of dictionaries. Phys. Rev. X, 2:03 1018. Christopher D Manning, Prabhakar Raghavan, and Hinrich Sch u¨tze. 2008. Introduction to information retrieval, volume 1. Cambridge University Press Cambridge. 679 Yutaka Matsuo, Takeshi Sakaki, K oˆki Uchiyama, and Mitsuru Ishizuka. 2006. Graph-based word clustering using a web search engine. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, EMNLP ’06, pages 542–550, Stroudsburg, PA, USA. Association for Computational Linguistics. Ken McRae and Kazunaga Matsuki. 2009. People use their knowledge of common events to understand language, and do so as quickly as possible. Language and linguistics compass, 3(6): 1417–1429. George Miller and Christiane Fellbaum. 1998. WordNet: An Electronic Lexical database. MIT Press, Cambridge, MA. George A Miller. 1995. WordNet: a lexical database for english. Communications of the ACM, 38(1 1):39–41 . Douglas L Nelson, Cathy L McEvoy, and Thomas A Schreiber. 2004. The University of South Florida free association, rhyme, and word fragment norms. Behavior Research Methods, Instruments, & Computers, 36(3):402–407. David S Palermo and James J Jenkins. 1964. Word association norms: Grade school through college. University of Minnesota Press. Olivier Picard, Alexandre Blondin-Mass e´, Stevan Harnad, Odile Marcotte, Guillaume Chicoisne, and Yassine Gargouri. 2009. Hierarchies in dictionary definition space. In Annual Conference on Neural Information Processing Systems. Pascal Pons and Matthieu Latapy. 2006. Computing communities in large networks using random walks. In Journal of Graph Algorithms and Applications, pages 284–293. Springer. Christian Scheible. 2010. Sentiment translation through lexicon induction. In Proceedings of the ACL 2010 Student Research Workshop, pages 25–30, Uppsala, Sweden, July. Association for Computational Linguistics. David R Shanks. 1995. The psychology of associative learning, volume 13. Cambridge University Press. Mark Steyvers and Joshua B Tenenbaum. 2005. The large-scale structure of semantic networks: Statistical analyses and a model of semantic growth. Cognitive Science, 29(1):41–78. Mark Steyvers, Richard M Shiffrin, and Douglas L Nelson. 2004. Word association spaces for predicting semantic similarity effects in episodic memory. Experimental cognitive psychology and its applications: Festschrift in honor of Lyle Bourne, Walter Kintsch, and Thomas Landauer, pages 237–249. Robert Tarjan. 1972. Depth-first search and linear graph algorithms. SIAM journal on computing, 1(2): 146– 160. Dominic Widdows and Beate Dorow. 2002. A graph model for unsupervised lexical acquisition. In Proceedings of the 19th international conference on Computational linguistics - Volume 1, COLING ’02, pages 1–7, Stroudsburg, PA, USA. Association for Computational Linguistics. Zhibiao Wu and Martha Palmer. 1994. Verbs semantics and lexical selection. In Proceedings of the 32nd annual meeting on Association for Computational Linguistics, pages 133–138. Association for Computational Linguistics. 680