emnlp emnlp2011 emnlp2011-136 emnlp2011-136-reference knowledge-graph by maker-knowledge-mining

136 emnlp-2011-Training a Parser for Machine Translation Reordering

Source: pdf

Author: Jason Katz-Brown ; Slav Petrov ; Ryan McDonald ; Franz Och ; David Talbot ; Hiroshi Ichikawa ; Masakazu Seno ; Hideto Kazawa

Abstract: We propose a simple training regime that can improve the extrinsic performance of a parser, given only a corpus of sentences and a way to automatically evaluate the extrinsic quality of a candidate parse. We apply our method to train parsers that excel when used as part of a reordering component in a statistical machine translation system. We use a corpus of weakly-labeled reference reorderings to guide parser training. Our best parsers contribute significant improvements in subjective translation quality while their intrinsic attachment scores typically regress.

reference text

A. Birch and M. Osborne. 2010. LRscore for evaluating lexical and reordering quality in MT. In ACL-2010 WMT. T. Brants. 2000. TnT a statistical part-of-speech tagger. In ANLP ’00. D. Burkett and D. Klein. 2008. Two languages are better than one (for syntactic parsing). In EMNLP ’08. D. Burkett, S. Petrov, J. Blitzer, and D. Klein. 2010. – Learning better monolingual models with unannotated bilingual text. In CoNLL ’10. M. Chang, L. Ratinov, and D. Roth. 2007. Guiding semisupervision with constraint-driven learning. In ACL ’07. M. Chang, D. Goldwasser, D. Roth, and V. Srikumar. 2010. Structured output learning with indirect supervision. In ICML ’10. E. Charniak. 2000. A maximum–entropy–inspired parser. In NAACL ’00. J. Clarke, D. Goldwasser, M. Chang, and D. Roth. 2010. Driving semantic parsing from the world’s response. In CoNLL ’10. M. Collins, P. Koehn, and I. Kuˇ cerov a´. 2005. Clause restructuring for statistical machine translation. In ACL ’05. M. Collins. 1997. Three generative, lexicalised models for statistical parsing. In ACL ’97. M.-C. de Marneffe, B. MacCartney, and C. Manning. 2006. Generating typed dependency parses from phrase structure parses. In LREC ’06. J. Foster. 2010. “cba to check the spelling”: Investigating parser performance on discussion forum posts. In NAACL ’10. M. Galley, M. Hopkins, K. Knight, and D. Marcu. 2004. What’s in a translation rule? In HLT-NAACL ’04. K. Ganchev, J. Gra ¸ca, J. Gillenwater, and B. Taskar. 2010. Posterior regularization for structured latent variable models. Journal of Machine Learning Research. D. Genzel. 2010. Automatically learning source-side reordering rules for large scale machine translation. In COLING ’10. D. Gildea. 2001 . Corpus variation and parser performance. In EMNLP ’01. N. Habash. 2007. Syntactic preprocessing for statistical machine translation. In MTS ’07. K. Hall, R. McDonald, J. Katz-Brown, and M. Ringgaard. 2011. Training dependency parsers by jointly optimizing multiple objectives. In EMNLP ’11. H. Isozaki, T. Hirao, K. Duh, K. Sudoh, and H. Tsukada. 2010a. Automatic evaluation of translation quality for distant language pairs. In EMNLP ’10. H. Isozaki, K. Sudoh, H. Tsukada, and K. Duh. 2010b. Head finalization: A simple reordering rule for SOV languages. In ACL-2010 WMT. J. Judge, A. Cahill, and J. v. Genabith. 2006. QuestionBank: creating a corpus of parse-annotated questions. In ACL ’06. A. Lavie and M. Denkowski. 2009. The Meteor metric for automatic evaluation of machine translation. Machine Translation, 23(2-3). 192 P. Liang, A. Bouchard-C oˆt´ e, D. Klein, and B. Taskar. 2006. An end-to-end discriminative approach to ma- chine translation. In ACL ’06. M. Marcus, B. Santorini, and M. Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn Treebank. In Computational Linguistics. D. McClosky., E. Charniak, and M. Johnson. 2006. Effective self-training for parsing. In NAACL ’06. R. McDonald, K. Crammer, and F. Pereira. 2005. Online large-margin training of dependency parsers. In ACL ’05. Y. Miyao, R. Sætre,K. Sagae, T. Matsuzaki, and J. Tsujii. 2008. Task-oriented evaluation of syntactic parsers and their representations. In ACL ’08. J. Nivre. 2008. Algorithms for deterministic incremental dependency parsing. Computational Linguistics, 34(4). F. Och. 2003. Minimum error rate training in statistical machine translation. In ACL ’03. K. Papineni, S. Roukos, T. Ward, and W. Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In ACL ’02. S. Petrov and D. Klein. 2007. Improved inference for unlexicalized parsing. In NAACL ’07. S. Petrov, L. Barrett, R. Thibaux, and D. Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In ACL ’06. S. Petrov, P. Chang, and M. Ringgaard H. Alshawi. 2010. Uptraining for accurate deterministic question parsing. In EMNLP ’10. D. Talbot, H. Kazawa, H. Ichikawa, J. Katz-Brown, M. Seno, and F. Och. 2011. A lightweight evaluation framework for machine translation reordering. In EMNLP-2011 WMT. C. Wang. 2007. Chinese syntactic reordering for statistical machine translation. In EMNLP ’07. F. Xia and M. McCord. 2004. Improving a statistical MT system with automatically learned rewrite patterns. In Coling ’04. P. Xu, J. Kang, M. Ringgaard, and F. Och. 2009. Using a dependency parser to improve SMT for subject-objectverb languages. In NAACL-HLT ’09. K. Yamada and K. Knight. 2001. A syntax-based statistical translation model. In ACL ’01. R. Zens and H. Ney. 2006. Discriminative reordering models for statistical machine translation. In NAACL06 WMT. Y. Zhang and S. Clark. 2008. A tale of two parsers: Investigating and combining graph-based and transitionbased dependency parsing. In EMNLP ’08.