emnlp emnlp2011 emnlp2011-95 emnlp2011-95-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Ryan McDonald ; Slav Petrov ; Keith Hall
Abstract: We present a simple method for transferring dependency parsers from source languages with labeled training data to target languages without labeled training data. We first demonstrate that delexicalized parsers can be directly transferred between languages, producing significantly higher accuracies than unsupervised parsers. We then use a constraint driven learning algorithm where constraints are drawn from parallel corpora to project the final parser. Unlike previous work on projecting syntactic resources, we show that simple methods for introducing multiple source lan- guages can significantly improve the overall quality of the resulting parsers. The projected parsers from our system result in state-of-theart performance when compared to previously studied unsupervised and projected parsing systems across eight different languages.
T. Berg-Kirkpatrick and D. Klein. 2010. Phylogenetic grammar induction. In Proc. of ACL. P. Blunsom and T. Cohn. 2010. Unsupervised induction of tree substitution grammars for dependency parsing. Proc. of EMNLP. P. F. Brown, V. J. Della Pietra, S. A. Della Pietra, and R. L. Mercer. 1993. The mathematics of statistical machine translation: parameter estimation. Computational Linguistics, 19. S. Buchholz and E. Marsi. 2006. CoNLL-X shared task on multilingual dependency parsing. In Proc. of CoNLL. D. Burkett, S. Petrov, J. Blitzer, and D. Klein. 2010. Learning better monolingual models with unannotated bilingual text. In Proc. of CoNLL. G. Carroll and E. Charniak. 1992. Two experiments on learning probabilistic dependency grammars from corpora. In Proc. of the Working Notes of the Workshop Statistically-Based NLP Techniques. 71 M.W. Chang, L. Ratinov, and D. Roth. 2007. Guiding semi-supervision with constraint-driven learning. In Proc. of ACL. M. Chang, D. Goldwasser, D. Roth, and V. Srikumar. 2010. Structured output learning with indirect super- vision. In Proc. of ICML. E. Charniak. 2000. A maximum-entropy-inspired parser. In Proc. of NAACL. S. Clark and J. R. Curran. 2004. Parsing the WSJ using CCG and log-linear models. In Proc. of ACL. S.B. Cohen and N.A. Smith. 2009. Shared logistic normal distributions for soft parameter tying in unsupervised grammar induction. In Proc. of NAACL. S.B. Cohen, D. Das, and N.A. Smith. 2011. Unsupervised structure prediction with non-parallel multilingual guidance. In Proc. of EMNLP. M. Collins, J. Haji cˇ, L. Ramshaw, and C. Tillmann. 1999. A statistical parser for Czech. In Proc. of ACL. M. Collins. 1997. Three generative, lexicalised models for statistical parsing. In Proc. of ACL. M. Collins. 2002. Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In Proc. of ACL. D. Das and S. Petrov. 2011. Unsupervised part-ofspeech tagging with bilingual graph-based projections. In Proc. of ACL-HLT. K. Ganchev, J. Gillenwater, and B. Taskar. 2009. Dependency grammar induction via bitext projection constraints. In Proc. of ACL-IJCNLP. K. Ganchev, J. Gra ¸ca, J. Gillenwater, and B. Taskar. 2010. Posterior regularization for structured latent variable models. Journal of Machine Learning Research. D. Gildea. 2001. Corpus variation and parser performance. In Proc of EMNLP. K. Hall, R. McDonald, J. Katz-Brown, and M. Ringgaard. 2011. Training dependency parsers by jointly optimizing multiple objectives. In Proc. of EMNLP. R. Hwa, P. Resnik, A. Weinberg, C. Cabezas, and O. Kolak. 2005. Bootstrapping parsers via syntactic projection across parallel texts. Natural Language Engineering, 11(03):31 1–325. D. Klein and C. D. Manning. 2004. Corpus-based induction of syntactic structure: models of dependency and constituency. In Proc. of ACL. P. Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. In MT Summit. M. P. Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini. 1993. Building a large annotated corpus of English: the Penn treebank. Computational Linguistics, 19. D. McClosky, E. Charniak, and M. Johnson. 2006. Reranking and self-training for parser adaptation. In Proc. of ACL. R. McDonald, K. Crammer, and F. Pereira. 2005. Online large-margin training of dependency parsers. In Proc. of ACL. T. Naseem, H. Chen, R. Barzilay, and M. Johnson. 2010. Using universal linguistic knowledge to guide grammar induction. In Proc. of EMNLP. J. Nivre and J. Nilsson. 2005. Pseudo-projective dependency parsing. In Proc. of ACL. J. Nivre, J. Hall, and J. Nilsson. 2006. Maltparser: A data-driven parser-generator for dependency parsing. In Proc. of LREC. J. Nivre, J. Hall, S. K ¨ubler, R. McDonald, J. Nilsson, S. Riedel, and D. Yuret. 2007. The CoNLL 2007 shared task on dependency parsing. In Proc. of EMNLP-CoNLL. J. Nivre. 2008. Algorithms for deterministic incremental dependency parsing. Computational Linguistics, 34(4):513–553. S. Petrov, L. Barrett, R. Thibaux, and D. Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In Proc. of ACL. S. Petrov, P. Chang, M. Ringgaard, and H. Alshawi. 2010. Uptraining for accurate deterministic question parsing. In EMNLP ’10. S. Petrov, D. Das, and R. McDonald. 2011. A universal part-of-speech tagset. In ArXiv:1104.2086. Y. Seginer. 2007. Fast unsupervised incremental parsing. In Proc. of ACL. L. Shen and A.K. Joshi. 2008. Ltag dependency parsing with bidirectional incremental construction. In Proc. of EMNLP. N.A. Smith and J. Eisner. 2005. Contrastive estimation: Training log-linear models on unlabeled data. In Proc. of ACL. 72 D.A. Smith and J. Eisner. 2009. Parser adaptation and projection with quasi-synchronous grammar features. In Proc. of EMNLP. D.A. Smith and N.A. Smith. 2004. Bilingual parsing with factored estimation: Using english to parse korean. In Proc. of EMNLP. B. Snyder, T. Naseem, J. Eisenstein, and R. Barzilay. 2009. Adding more languages improves unsupervised multilingual part-of-speech tagging: A Bayesian nonparametric approach. In Proc. of NAACL. A. Søgaard. 2011. Data point selection for crosslanguage adaptation of dependency parsers. In Proc. ACL. V.I. Spitkovsky, H. Alshawi, and D. Jurafsky. 2010. From baby steps to leapfrog: How “less is more” in unsupervised dependency parsing. In Proc. of NAACLHLT. S. Vogel, H. Ney, and C. Tillmann. 1996. HMM-based word alignment in statistical translation. In Proc. of COLING. W. Wang and M. P. Harper. 2004. A statistical constraint dependency grammar (CDG) parser. In Proc. of the Workshop on Incremental Parsing: Bringing Engineering and Cognition Together. D. Zeman and P. Resnik. 2008. Cross-language parser adaptation between related languages. In NLP for Less Privileged Languages. Y. Zhang and S. Clark. 2008. A Tale of Two Parsers: Investigating and Combining Graph-based and Transition-based Dependency Parsing. In Proc. of EMNLP.