emnlp emnlp2010 emnlp2010-7 emnlp2010-7-reference knowledge-graph by maker-knowledge-mining

7 emnlp-2010-A Mixture Model with Sharing for Lexical Semantics


Source: pdf

Author: Joseph Reisinger ; Raymond Mooney

Abstract: We introduce tiered clustering, a mixture model capable of accounting for varying degrees of shared (context-independent) feature structure, and demonstrate its applicability to inferring distributed representations of word meaning. Common tasks in lexical semantics such as word relatedness or selectional preference can benefit from modeling such structure: Polysemous word usage is often governed by some common background metaphoric usage (e.g. the senses of line or run), and likewise modeling the selectional preference of verbs relies on identifying commonalities shared by their typical arguments. Tiered clustering can also be viewed as a form of soft feature selection, where features that do not contribute meaningfully to the clustering can be excluded. We demonstrate the applicability of tiered clustering, highlighting particular cases where modeling shared structure is beneficial and where it can be detrimental.


reference text

Eneko Agirre, Enrique Alfonseca, Keith Hall, Jana Kravalova, Marius Pas ¸ca, and Aitor Soroa. 2009. A study on similarity and relatedness using distributional and Wordnet-based approaches. In Proc. of NAACL-HLT-09, pages 19–27. David J. Aldous. 1985. Exchangeability and related topics. In E´cole d’´ et´ e de probabilit e´s de SaintFlour, XIII—1983, volume 1117, pages 1–198. Springer, Berlin. David Blei, Thomas Griffiths, Michael Jordan, and Joshua Tenenbaum. 2003. Hierarchical topic models and the nested Chinese restaurant process. In Proc. NIPS-2003. Stephen Clark and David Weir. 2002. Class-based probability estimation using a semantic hierarchy. Computational Linguistics, 28(2): 187–206. James Richard Curran. 2004. From Distributional to Semantic Similarity. Ph.D. thesis, University of Edinburgh. College of Science. Katrin Erk and Sebastian Pado. 2008. A structured vector space model for word meaning in context. In Proceedings of EMNLP 2008. Christiane Fellbaum, editor. 1998. WordNet: An Electronic Lexical Database and Some of its Applications. MIT Press. Lev Finkelstein, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, and Eytan Ruppin. 2001. Placing search in context: the concept revisited. In Proc. of WWW 2001. Evgeniy Gabrilovich and Shaul Markovitch. 2007. Computing semantic relatedness using Wikipedia-based explicit semantic analysis. In Proc. of IJCAI-07, pages 1606–161 1. Daniel Gildea and Daniel Jurafsky. 2002. Automatic labeling of semantic roles. Computational Linguistics, 28(3):245–288. James Gorman and James R. Curran. 2006. Scaling distributional similarity to large corpora. In Proc. of ACL 2006. Thomas L. Griffiths, Mark Steyvers, and Joshua B. Tenenbaum. 2007. Topics in semantic representation. Psychological Review, 114:2007. Ama ¸c Herda ˇgdelen and Marco Baroni. 2009. Bagpack: A general framework to represent semantic relations. In Proc. of GEMS 2009. Donald Hindle and Mats Rooth. 1991. Structural ambiguity and lexical relations. In Proc. of ACL 1991. Thomas Landauer and Susan Dumais. 1997. A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychological Review, 104(2):21 1–240. Martin H. C. Law, Anil K. Jain, and M a´rio A. T. Figueiredo. 2002. Feature selection in mixturebased clustering. In Proc. of NIPS 2002. Will Lowe. 2001 . Towards a theory of semantic space. In Proceedings of the 23rd Annual Meeting of the Cognitive Science Society, pages 576–581 . Xiaojuan Ma, Jordan Boyd-Graber, Sonya S. Nikolova, and Perry Cook. 2009. Speaking through pictures: Images vs. icons. In ACM Conference on Computers and Accessibility. Christopher D. Manning, Prabhakar Raghavan, and Hinrich Sch u¨tze. 2008. Introduction to Information Retrieval. Cambridge University Press. Diana McCarthy and John Carroll. 2003. Disambiguating nouns, verbs, and adjectives using automatically acquired selectional preferences. Computational Linguistics, 29(4):639–654. George A. Miller and Walter G. Charles. 1991. Contextual correlates of semantic similarity. Language and Cognitive Processes, 6(1): 1–28. Sebastian Pad o´ and Mirella Lapata. 2007. Dependency-based construction of semantic space models. Computational Linguistics, 33(2): 161–199. Sebastian Pad o´, Ulrike Pad o´, and Katrin Erk. 2007. Flexible, corpus-based modelling of human plausibility judgements. In Proc. of EMNLP 2007. Ulrike Pad o´. 2007. The Integration ofSyntax and Semantic Plausibility in a Wide-Coverage Model of Sentence Processing. Ph.D. thesis, Saarland University, Saarbr¨ ucken. 1182 Patrick Pantel, Rahul Bhagat, Timothy Chklovski, and Eduard Hovy. 2007. ISP: Learning inferential selectional preferences. In In Proceedings of NAACL 2007. Patrick Andre Pantel. 2003. Clustering by committee. Ph.D. thesis, Edmonton, Alta., Canada. Lance Parsons, Ehtesham Haque, and Huan Liu. 2004. Subspace clustering for high dimensional data: A review. SIGKDD Explor. Newsl., 6(1). Fernando Pereira, Naftali Tishby, and Lillian Lee. 1993. Distributional clustering of English words. In Proc. of ACL 1993. Carl E. Rasmussen. 2000. The infinite Gaussian mixture model. In Advances in Neural Information Processing Systems. MIT Press. Joseph Reisinger and Raymond Mooney. 2010. Multi-prototype vector-space models of word meaning. In Proc. of NAACL 2010. Philip Resnik. 1997. Selectional preference and sense disambiguation. In Proceedings of ACL SIGLEX Workshop on Tagging Text with Lexical Semantics, pages 52–57. ACL. Adam N. Sanborn, Thomas L. Griffiths, and Daniel J. Navarro. 2006. A more rational model of categorization. In Proceedings of the 28th An- nual Conference of the Cognitive Science Society. Hinrich Sch u¨tze. 1998. Automatic word sense discrimination. Computational Linguistics, 24(1):97–123. Patrick Shafto, Charles Kemp, Vikash Mansinghka, Matthew Gordon, and Joshua B. Tenenbaum. 2006. Learning cross-cutting systems of categories. In Proc. CogSci 2006. Rion Snow, Daniel Jurafsky, and Andrew Ng. 2006. Semantic taxonomy induction from heterogenous evidence. In Proc. of ACL 2006. Peter D. Turney. 2006. Similarity of semantic relations. Computational Linguistics, 32(3):379–416. Benjamin Van Durme and Marius Pas ¸ca. 2008. Finding cars, goddesses and enzymes: Parametrizable acquisition of labeled instances for open-domain information extraction. In Proc. of AAAI 2008. Nianwen Xue, Jinying Chen, and Martha Palmer. 2006. Aligning features with sense distinction dimensions. In Proc. of COLING/ACL 2006.