
238 acl-2013-Measuring semantic content in distributional vectors


Source: pdf

Author: Aurelie Herbelot; Mohan Ganesalingam

Abstract: Some words are more contentful than others: for instance, make is intuitively more general than produce and fifteen is more ‘precise’ than a group. In this paper, we propose to measure the ‘semantic content’ of lexical items, as modelled by distributional representations. We investigate the hypothesis that semantic content can be computed using the Kullback-Leibler (KL) divergence, an information-theoretic measure of the relative entropy of two distributions. In a task focusing on retrieving the correct ordering of hyponym-hypernym pairs, the KL divergence achieves close to 80% precision but does not outperform a simpler (linguistically unmotivated) frequency measure. We suggest that this result illustrates the rather ‘intensional’ aspect of distributions.
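The idea in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state: that a word's 'semantic content' is the KL divergence of its context distribution from a corpus-wide context distribution, that the word with the higher divergence is guessed to be the hyponym, and that the frequency measure guesses the more frequent word to be the more general one. The counts, the dictionary `word_counts`, and all function names are hypothetical and are not taken from the paper.

```python
import math


def normalize(counts):
    """Turn raw co-occurrence counts into a probability distribution."""
    total = sum(counts)
    return [c / total for c in counts]


def kl_divergence(p, q):
    """D(p || q) in bits; assumes q[i] > 0 wherever p[i] > 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical toy counts over four contexts (made up for illustration).
corpus_counts = [400, 300, 200, 100]
word_counts = {
    "make": [90, 70, 45, 20],    # spread roughly like the whole corpus
    "produce": [5, 10, 60, 5],   # concentrated on a single context
}


def semantic_content(word):
    """Assumed measure: divergence of the word's contexts from the corpus."""
    return kl_divergence(normalize(word_counts[word]), normalize(corpus_counts))


def order_by_kl(a, b):
    """Guess (hyponym, hypernym): the higher-divergence word is the hyponym."""
    return (a, b) if semantic_content(a) > semantic_content(b) else (b, a)


def order_by_frequency(a, b):
    """Assumed baseline: the less frequent word is guessed to be the hyponym."""
    return (a, b) if sum(word_counts[a]) < sum(word_counts[b]) else (b, a)


print(order_by_kl("make", "produce"))
print(order_by_frequency("make", "produce"))
```

On these toy counts both measures order the pair the same way, with produce as the more contentful word. This says nothing about how the two measures compare on real data.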


reference text

Baroni, Marco, and Alessandro Lenci. 2008. Concepts and properties in word spaces. In Alessandro Lenci (ed.), From context to meaning: Distributional models of the lexicon in linguistics and cognitive science (Special issue of the Italian Journal of Linguistics 20(1)), pages 55–88.
Baroni, Marco, Raffaella Bernardi, Ngoc-Quynh Do, and Chung-chieh Shan. 2012. Entailment above the word level in distributional semantics. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL2012), pages 23–32.
Baroni, Marco, Raffaella Bernardi, and Roberto Zamparelli. 2012. Frege in Space: a Program for Compositional Distributional Semantics. Under review.
Curran, James. 2003. From Distributional to Semantic Similarity. Ph.D. thesis, University of Edinburgh, Scotland, UK.
Erk, Katrin. 2012. Vector space models of word meaning and phrase meaning: a survey. Language and Linguistics Compass, 6(10): 635–653.
Erk, Katrin. 2013. Towards a semantics for distributional representations. In Proceedings of the Tenth International Conference on Computational Semantics (IWCS2013).
Evert, Stefan. 2004. The statistics of word cooccurrences: word pairs and collocations. Ph.D. thesis, University of Stuttgart.
Leech, Geoffrey, Roger Garside, and Michael Bryant. 1994. CLAWS4: The tagging of the British National Corpus. In Proceedings of the 15th International Conference on Computational Linguistics (COLING 94), pages 622–628, Kyoto, Japan.
Lund, Kevin, Curt Burgess, and Ruth Ann Atchley. 1995. Semantic and associative priming in high-dimensional semantic space. In Proceedings of the 17th annual conference of the Cognitive Science Society, Vol. 17, pages 660–665.
McNally, Louise. 2013. Formal and distributional semantics: From romance to relationship. In Proceedings of the ‘Towards a Formal Distributional Semantics’ workshop, 10th International Conference on Computational Semantics (IWCS2013), Potsdam, Germany. Invited talk.
Mitchell, Jeff, and Mirella Lapata. 2010. Composition in Distributional Models of Semantics. Cognitive Science, 34(8): 1388–1429, November.
Resnik, Philip. 1995. Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI-95), pages 448–453.
Searle, John R. 1969. Speech acts: An essay in the philosophy of language. Cambridge University Press.
Turney, Peter D., and Patrick Pantel. 2010. From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research, 37: 141–188.