acl acl2012 acl2012-64 acl2012-64-reference knowledge-graph by maker-knowledge-mining

64 acl-2012-Crosslingual Induction of Semantic Roles


Source: pdf

Author: Ivan Titov ; Alexandre Klementiev

Abstract: We argue that multilingual parallel data provides a valuable source of indirect supervision for induction of shallow semantic representations. Specifically, we consider unsupervised induction of semantic roles from sentences annotated with automatically-predicted syntactic dependency representations and use a stateof-the-art generative Bayesian non-parametric model. At inference time, instead of only seeking the model which explains the monolingual data available for each language, we regularize the objective by introducing a soft constraint penalizing for disagreement in argument labeling on aligned sentences. We propose a simple approximate learning algorithm for our set-up which results in efficient inference. When applied to German-English parallel data, our method obtains a substantial improvement over a model trained without using the agreement signal, when both are tested on non-parallel sentences.


reference text

Omri Abend, Roi Reichart, and Ari Rappoport. 2009. Unsupervised argument identification for semantic role labeling. In ACL-IJCNLP. Roberto Basili, Diego De Cao, Danilo Croce, Bonaventura Coppola, and Alessandro Moschitti. 2009. Crosslanguage frame semantics transfer in bilingual corpora. In CICLING. A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado, and M. Pinkal. 2006. The SALSA corpus: a german corpus resource for lexical semantics. In LREC. Ming-Wei Chang, Lev Ratinov, and Dan Roth. 2007. Guiding semi-supervision with constraintdriven learning. In ACL. Hal Daume III. 2007. Fast search for dirichlet process mixture models. In AISTATS. Marie-Catherine de Marneffe, Bill MacCartney, and Christopher D. Manning. 2006. Generating typed dependency parses from phrase structure parses. In LREC 2006. Koen Deschacht and Marie-Francine Moens. 2009. Semi-supervised semantic role labeling using the Latent Words Language Model. In EMNLP. Thomas S. Ferguson. 1973. A Bayesian analysis of some nonparametric problems. The Annals of Statistics, 1(2):209–230. Hagen F ¨urstenau and Mirella Lapata. 2009. Graph alignment for semi-supervised semantic role labeling. In EMNLP. Kuzman Ganchev, Joao Graca, Jennifer Gillenwater, and Ben Taskar. 2010. Posterior regularization for structured latent variable models. Journal of Machine Learning Research (JMLR), 11:2001–2049. Qin Gao and Stephan Vogel. 2011. Corpus expansion for statistical machine translation with semantic role label substitution rules. In ACL:HLT. Daniel Gildea and Daniel Jurafsky. 2002. Automatic labelling of semantic roles. Computational Linguistics, 28(3):245–288. Dan Goldwasser, Roi Reichart, James Clarke, and Dan Roth. 2011. Confidence driven unsupervised semantic parsing. In ACL. Trond Grenager and Christoph Manning. 2006. Unsupervised discovery of a statistical verb lexicon. In EMNLP. Jan Haji cˇ, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Ant o`nia Mart ı´, Llu ı´s M `arquez, Adam Meyers, Joakim Nivre, Sebastian Pad o´, Jan Sˇt eˇp a´nek, Pavel Stra nˇ a´k, Mihai Surdeanu, Nianwen Xue, and Yi Zhang. 2009. The conll-2009 shared task: Syntactic and semantic dependencies in multiple languages. In CoNLL 2009: Shared Task. 655 Richard Johansson and Pierre Nugues. 2008. Dependency-based semantic role labeling of PropBank. In EMNLP. Michael Kaisser and Bonnie Webber. 2007. Question answering based on semantic roles. In ACL Workshop on Deep Linguistic Processing. Rohit J. Kate and Raymond J. Mooney. 2007. Learning language semantics from ambigous supervision. In AAAI. Philipp Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. In Proceedings of the MT Summit. Harold W. Kuhn. 1955. The hungarian method for the assignment problem. Naval Research Logistics Quarterly, 2:83–97. Jonas Kuhn. 2004. Experiments in parallel-text based grammar induction. In ACL. Joel Lang and Mirella Lapata. 2010. Unsupervised in- duction of semantic roles. In ACL. Joel Lang and Mirella Lapata. 2011a. Unsupervised semantic role induction via split-merge clustering. In ACL. Joel Lang and Mirella Lapata. 2011b. Unsupervised semantic role induction with graph partitioning. In EMNLP. Beth Levin. 1993. English Verb Classes and Alternations: A Preliminary Investigation. University of Chicago Press. Percy Liang, Michael I. Jordan, and Dan Klein. 2009. Learning semantic correspondences with less supervision. In ACL-IJCNLP. Percy Liang, Michael Jordan, and Dan Klein. 2011. Learning dependency-based compositional semantics. In ACL: HLT. Ding Liu and Daniel Gildea. 2010. Semantic role features for machine translation. In Coling. Mitchell P. Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313–330. Andrew McCallum, Gideon Mann, and Gregory Druck. 2007. Generalized expectation criteria. Technical Report TR 2007-60, University of Massachusetts, Amherst, MA. Ryan McDonald, Slav Petrov, and Keith Hall. 2011. Multi-source transfer of delexicalized dependency parsers. In EMNLP. J. Nivre, J. Hall, S. K ¨ubler, R. McDonald, J. Nilsson, S. Riedel, and D. Yuret. 2007. The CoNLL 2007 shared task on dependency parsing. In EMNLPCoNLL. Franz Josef Och and Hermann Ney. 2003. A systematic comparison of various statistical alignment models. Computational Linguistics, 29: 19–5 1. Sebastian Pado and Mirella Lapata. 2009. Cross-lingual annotation projection for semantic roles. Journal of Artificial Intelligence Research, 36:307–340. Jim Pitman. 2002. Poisson-Dirichlet and GEM invariant distributions for split-and-merge transformations of an interval partition. Combinatorics, Probability and Computing, 11:501–5 14. Hoifung Poon and Pedro Domingos. 2009. Unsupervised semantic parsing. In EMNLP. Sameer Pradhan, Wayne Ward, and James H. Martin. 2008. Towards robust semantic role labeling. Computational Linguistics, 34:289–3 10. M. Sammons, V. Vydiswaran, T. Vieira, N. Johri, M. Chang, D. Goldwasser, V. Srikumar, G. Kundu, Y. Tu, K. Small, J. Rule, Q. Do, and D. Roth. 2009. Relation alignment for textual entailment recognition. In Text Analysis Conference (TAC). Dan Shen and Mirella Lapata. 2007. Using semantic roles to improve question answering. In EMNLP. Benjamin Snyder and Regina Barzilay. 2008. Unsupervised multilingual learning for morphological segmentation. In ACL. Benjamin Snyder and Regina Barzilay. 2010. Climbing the tower of Babel: Unsupervised multilingual learning. In ICML. Benjamin Snyder, Tahira Naseem, Jacob Eisenstein, and Regina Barzilay. 2008. Unsupervised multilingual learning for POS tagging. In EMNLP. Benjamin Snyder, Tahira Naseem, and Regina Barzilay. 2009. Unsupervised multilingual grammar induction. In ACL. Mihai Surdeanu, Adam Meyers Richard Johansson, Llu ı´s M `arquez, and Joakim Nivre. 2008. The CoNLL-2008 shared task on joint parsing of syntactic and semantic dependencies. In CoNLL 2008: Shared Task. Richard Swier and Suzanne Stevenson. 2004. Unsupervised semantic role labelling. In EMNLP. Yee Whye Teh. 2007. Dirichlet process. Encyclopedia of Machine Learning. Ivan Titov and Alexandre Klementiev. 2011. A Bayesian model for unsupervised semantic parsing. In ACL. Ivan Titov and Alexandre Klementiev. 2012. A Bayesian approach to unsupervised semantic role induction. In EACL. Ivan Titov and Mikhail Kozhevnikov. 2010. Bootstrap- Dekai Wu, Marianna Apidianaki, Marine Carpuat, and Lucia Specia, editors. 2011. Proc. of Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation. ACL. ping semantic analyzers from non-contradictory texts. In ACL. Lonneke van der Plas, Paola Merlo, and James Henderson. 2011. Scaling up automatic cross-lingual semantic role annotation. In ACL. Dekai Wu and Pascale Fung. 2009. Semantic roles for SMT: A hybrid two-pass model. In NAACL. 656