emnlp emnlp2011 emnlp2011-67 emnlp2011-67-reference knowledge-graph by maker-knowledge-mining

67 emnlp-2011-Hierarchical Verb Clustering Using Graph Factorization


Source: pdf

Author: Lin Sun ; Anna Korhonen

Abstract: Most previous research on verb clustering has focussed on acquiring flat classifications from corpus data, although many manually built classifications are taxonomic in nature. Also Natural Language Processing (NLP) applications benefit from taxonomic classifications because they vary in terms of the granularity they require from a classification. We introduce a new clustering method called Hierarchical Graph Factorization Clustering (HGFC) and extend it so that it is optimal for the task. Our results show that HGFC outperforms the frequently used agglomerative clustering on a hierarchical test set extracted from VerbNet, and that it yields state-of-the-art performance also on a flat test set. We demonstrate how the method can be used to acquire novel classifications as well as to extend existing ones on the basis of some prior knowledge about the classification.


reference text

Arik Azran and Zoubin Ghahramani. A new approach to data driven clustering. In Proceedings of the 23rd international conference on Machine learning, ICML ’06, pages 57–64, New York, NY, USA, 2006a. ISBN 1-59593-383-2. Arik Azran and Zoubin Ghahramani. Spectral methods for automatic multiscale data clustering. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition-Volume 1, pages 190–197. IEEE Computer Society Washington, DC, USA, 2006b. Collin F. Baker, Charles J. Fillmore, and John B. Lowe. The berkeley framenet project. In In COLING-ACL, pages 86–90, 1998. Roberto. Basili, Maria Teresa Pazienza, and Paola Velardi. Hierarchical clustering of verbs. In Proceedings of the Workshop on Acquisition of Lexical Knowledge from Text, 1993. Nikoletta Bassiou and Constantine Kotropoulos. Long distance bigram models applied to word clustering. Pattern Recogn., 44: 145–158, January 2011. ISSN 0031-3203. Ted Briscoe, John Carroll, and Rebecca Watson. The second release of the rasp system. In Proceedings of the COLING/ACL on Interactive presentation sessions, 2006. Hoa Trang Dang. Investigations into the Role of Lexical Semantics in Word Sense Disambiguation. PhD thesis, CIS, University of Pennsylvania, 2004. Katrin Erk, Andrea Kowalski, Sebastian Pad o´, and Manfred Pinkal. Towards a resource for lexical semantics: a large german corpus with extensive semantic annotation. In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1, ACL ’03, pages 537–544, Stroudsburg, PA, USA, 2003. Association for Computational Linguistics. Eva Esteve Ferrer. Towards a semantic classification of spanish verbs based on subcategorisation information. In Proceedings of the ACL 2004 workshop on Student research, ACLstudent ’04, Stroudsburg, PA, USA, 2004. Association for Computational Linguistics. Douglas H. Fisher. Knowledge acquisition via incremental conceptual clustering. Machine Learning, 2: 139– 172, 1987. ISSN 0885-6125. David Graff. North american news text corpus. Linguistic Data Consortium, 1995. 1032 Ralph Grishman, Catherine Macleod, and Adam Meyers. Comlex syntax: Building a computational lexicon. In COLING, pages 268–272, 1994. Katherine A. Heller and Zoubin Ghahramani. Bayesian hierarchical clustering. In Proceedings of the 22nd international conference on Machine learning, pages 297–304. ACM, 2005. ISBN 1595931805. Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw, and Ralph Weischedel. Ontonotes: the 90% solution. In Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, NAACL-Short ’06, pages 57–60, Stroudsburg, PA, USA, 2006. Association for Computational Linguistics. Lawrence Hubert and Phipps Arabie. Comparing partitions. Journal of Classification, 2: 193–218, 1985. ISSN 0176-4268. Eric Joanis, Suzanne Stevenson, and David James. A general feature space for automatic verb classification. Natural Language Engineering, 14(3):337–367, 2008. Karin Kipper. VerbNet: A broad-coverage, comprehensive verb lexicon. 2005. Anna Korhonen, Yuval Krymolowski, and Nigel Collier. The Choice of Features for Classification of Verbs in Biomedical Texts. In Proceedings of COLING, 2008. Claudia Kunze and Lothar Lemnitzer. GermaNetrepresentation, visualization, application. In Proceedings of LREC, 2002. Geoffrey Leech. 100 million words of english: the british national corpus. Language Research, 28(1): 1– 13, 1992. Beth. Levin. English verb classes and alternations: A preliminary investigation. Chicago, IL, 1993. Jianguo Li and Chris Brew. Which Are the Best Features for Automatic Verb Classification. In Proceedings of ACL, 2008. Yu-Ru Lin, Yun Chi, Shenghuo Zhu, Hari Sundaram, and Belle L. Tseng. Facetnet: a framework for analyzing communities and their evolutions in dynamic networks. In Proceeding of the 17th international conference on World Wide Web, pages 685–694, New York, NY, USA, 2008. ACM. Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schtze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008. ISBN 0521865719, 9780521865715. Yutaka Matsuo, Takeshi Sakaki, K oˆki Uchiyama, and Mitsuru Ishizuka. Graph-based word clustering using a web search engine. In Proceedings of the EMNLP, pages 542–550, 2006. George A. Miller. WordNet: a lexical database for En- glish. 1995. Communications of the ACM, 38(1 1):39–41, Travis E. Oliphant. Python for scientific computing. Computing in Science and Engineering, 9: 10–20, 2007. ISSN 1521-9615. Diarmuid O´ S ´eaghdha and Ann Copestake. Semantic classification with distributional kernels. In Proceedings of COLING, 2008. Martha Palmer, Daniel Gildea, and Paul Kingsbury. The proposition bank: An annotated corpus of semantic roles. Computational Linguistics, 31(1):71–106, 2005. Judita Preiss, Ted Briscoe, and Anna Korhonen. A system for large-scale acquisition of verbal, nominal and adjectival subcategorization frames from corpora. In Proceedings of ACL, pages 912–919, 2007. Andrew Rosenberg and Julia Hirschberg. V-measure: A conditional entropy-based external cluster evaluation measure. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2007. Sabine Schulte im Walde. Experiments on the automatic induction of german semantic verb classes. Computational Linguistics, 32(2), 2006. Sabine Schulte im Walde. Human associations and the choice of features for semantic verb classification. Research on Language and Computation, 6:79–1 11, 2008. ISSN 1570-7075. Sabine Schulte im Walde and Chris Brew. Inducing german semantic verb classes from purely syntactic subcategorisation information. In Proceedings of ACL, pages 223–230, 2002. Jianbo Shi and Jitendra Malik. Normalized cuts and image segmentation. IEEE Transactions on pattern analysis and machine intelligence, 22(8):888–905, 2000. Lei Shi and Rada Mihalcea. Putting pieces together: Combining FrameNet, VerbNet and WordNet for robust semantic parsing. In Proceedings of CICLING, 2005. Suzanne Stevenson and Eric Joanis. Semi-supervised verb class discovery using noisy features. In Proceedings of HLT-NAACL 2003, pages 71–78, 2003. Lin Sun and Anna Korhonen. Improving verb clustering with automatically acquired selectional preferences. In Proceedings of the EMNLP 2009, 2009. Lin Sun, Anna Korhonen, and Yuval Krymolowski. Verb class discovery from rich syntactic data. Lecture Notes in Computer Science, 4919: 16, 2008. 1033 Robert Swier and Suzanne Stevenson. Unsupervised semantic role labelling. In Proceedings of EMNLP, pages 95–102, 2004. Yee Whye Teh, Hal Daum e´ III, and Daniel Roy. Bayesian agglomerative clustering with coalescents. In Ad- vances in Neural Information Processing Systems, volume 20, 2008. Akira Ushioda. Hierarchical clustering of words. In Proceedings of the 16th conference on Computational linguistics-Volume 2, pages 1159–1 162. Association for Computational Linguistics, 1996. Gloria V ´azquez, Ana Fern a´ndez-Montraveta, and M. Ant o`nia Mart ı´. Clasificaci o´n verbal:(alternancias de di ´atesis). Universitat de Lleida, 2000. ISBN 8484090671. Nguyen Xuan Vinh, Julien Epps, and James Bailey. Information theoretic measures for clusterings comparison: is a correction for chance necessary? In ICML ’09: Proceedings of the 26th Annual International Conference on Machine Learning, pages 1073–1080, New York, NY, USA, 2009. ACM. ISBN 978-1-60558-5161. Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. Unsupervised and constrained dirichlet process mixture models for verb clustering. In Proceedings of the Workshop on Geometrical Models of Natural Language Semantics, pages 74–82, 2009. Joe H. Ward Jr. Hierarchical grouping to optimize an objective function. Journal oftheAmerican statistical association, 58(301):236–244, 1963. ISSN 0162-1459. Zhenyu Wu and Richard Leahy. An optimal graph theoretic approach to data clustering: Theory and its ap- plication to image segmentation. IEEE transactions on pattern analysis and machine intelligence, pages 1101–1 113, 1993. ISSN 0162-8828. Kai Yu, Shipeng Yu, and Volker Tresp. Soft clustering on graphs. Advances in Neural Information Processing Systems, 18: 1553, 2006. Be˜ nat Zapirain, Eneko Agirre, and Llu ı´s M `arquez. Robustness and generalization of role sets: PropBank vs. VerbNet. In Proceedings of ACL-08: HLT, pages 550– 558, 2008.