emnlp emnlp2013 emnlp2013-127 emnlp2013-127-reference knowledge-graph by maker-knowledge-mining

127 emnlp-2013-Max-Margin Synchronous Grammar Induction for Machine Translation

Source: pdf

Author: Xinyan Xiao ; Deyi Xiong

Abstract: Traditional synchronous grammar induction estimates parameters by maximizing likelihood, which only has a loose relation to translation quality. Alternatively, we propose a max-margin estimation approach to discriminatively inducing synchronous grammars for machine translation, which directly optimizes translation quality measured by BLEU. In the max-margin estimation of parameters, we only need to calculate Viterbi translations. This further facilitates the incorporation of various non-local features that are defined on the target side. We test the effectiveness of our max-margin estimation framework on a competitive hierarchical phrase-based system. Experiments show that our max-margin method significantly outperforms the traditional twostep pipeline for synchronous rule extraction by 1.3 BLEU points and is also better than previous max-likelihood estimation method.

reference text

Phil Blunsom and Trevor Cohn. 2006. Discriminative word alignment with conditional random fields. In Prob. ACL 2006, July. Phil Blunsom, Trevor Cohn, and Miles Osborne. 2008. A discriminative latent variable model for statistical machine translation. In Proc. ACL 2008. Phil Blunsom, Trevor Cohn, Chris Dyer, and Miles Osborne. 2009. A gibbs sampler for phrasal synchronous grammar induction. In Proc. ACL 2009. David Burkett, John Blitzer, and Dan Klein. 2010. Joint parsing and alignment with weakly synchronized grammars. In Proc. NAACL 2010. Colin Cherry and Dekang Lin. 2007. Inversion transduction grammar for joint phrasal translation modeling. In Proc. SSST 2007, NAACL-HLT Workshop on Syntax and Structure in Statistical Translation, April. David Chiang, Kevin Knight, and Wei Wang. 2009. 11,001 new features for statistical machine translation. In Proc. NAACL 2009. David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 33(2):201–228. Trevor Cohn and Phil Blunsom. 2009. A Bayesian model of syntax-directed tree to string grammar induction. In Proc. EMNLP 2009. John DeNero and Dan Klein. 2010. Discriminative modeling of extraction sets for machine translation. In Proc. ACL 2010. John DeNero, Alexandre Bouchard-C oˆt´ e, and Dan Klein. 2008. Sampling alignment structure under a Bayesian translation model. In Proc. EMNLP 2008. Chris Dyer, Kevin Gimpel, Jonathan H. Clark, and Noah A. Smith. 2011. The cmu-ark german-english translation system. In Proc. WMT 2011. Kevin Gimpel and Noah A. Smith. 2012. Structured ramp loss minimization for machine translation. In Proc. NAACL 2012. Xiaodong He and Li Deng. 2012. Maximum expected bleu training of phrase and lexicon translation models. In Proc. ACL 2012. Fei Huang and Bing Xiang. 2010. Feature-rich discriminative phrase rescoring for smt. In Proc. Coling 2010. Liang Huang. 2008. Forest reranking: Discriminative parsing with non-local features. In Proc. ACL 2008. Matti K a¨ a¨ri a¨inen. 2009. Sinuhe – statistical machine translation using a globally trained conditional exponential family translation model. In Proc. EMNLP 2009. Philipp Koehn, Franz Josef Och, and Daniel Marcu. 2003. Statistical phrase-based translation. In Proc. HLT-NAACL 2003. Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open source toolkit for statistical machine translation. In Proc. ACL 2007 (demonstration session). Philipp Koehn. 2004. Statistical significance tests for machine translation evaluation. In Proc. EMNLP 2004. Abby Levenberg, Chris Dyer, and Phil Blunsom. 2012. A bayesian model for learning scfgs with discontiguous rules. In Proc. EMNLP 2012. Association for Computational Linguistics, July. Zhifei Li, Jason Eisner, and Sanjeev Khudanpur. 2009. Variational decoding for statistical machine translation. In Proc. ACL 2009. Chin-Yew Lin and Franz Josef Och. 2004. Orange: a method for evaluating automatic evaluation metrics for machine translation. In Pro. Coling 2004. Daniel Marcu and William Wong. 2002. A phrase-based, joint probability model for statistical machine translation. In Proc. EMNLP 2002. 264 Jonathan May and Kevin Knight. 2007. Syntactic realignment models for machine translation. In Proc. EMNLP 2007. Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shinsuke Mori, and Tatsuya Kawahara. 2011. An unsupervised model for joint phrase alignment and extraction. In Proc. ACL 2011. Franz Josef Och. 2003. Minimum error rate training in statistical machine translation. In Proc. ACL 2003. Kishore Papineni, Salim Roukos, Todd Ward, and WeiJing Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proc. ACL 2002. Adam Pauls, Dan Klein, David Chiang, and Kevin Knight. 2010. Unsupervised syntactic alignment with inversion transduction grammars. In Proc. NAACL 2010. Jason Riesa, Ann Irvine, and Daniel Marcu. 2011. Feature-rich language-independent syntax-based alignment for statistical machine translation. In Proc. EMNLP 2011. Shai Shalev-Shwartz, Yoram Singer, and Nathan Srebro. 2007. Pegasos: Primal estimated sub-gradient solver for svm. In Proc. ICML 2007. Libin Shen, Jinxi Xu, Bing Zhang, Spyros Matsoukas, and Ralph Weischedel. 2009. Effective use of linguistic and contextual information for statistical machine translation. In Proc. EMNLP 2009. Andreas Stolcke. 2002. Srilm an extensible language modeling toolkit. Ben Taskar, Dan Klein, Mike Collins, Daphne Koller, and – Christopher Manning. 2004. Max-margin parsing. In Proc. EMNLP 2004. Taro Watanabe, Jun Suzuki, Hajime Tsukada, and Hideki Isozaki. 2007. Online large-margin training for statistical machine translation. In Proc. EMNLP-CoNLL 2007. Xinyan Xiao, Yang Liu, Qun Liu, and Shouxun Lin. 2011. Fast generation of translation forest for largescale smt discriminative training. In Proc. EMNLP 2011. Xinyan Xiao, Deyi Xiong, Yang Liu, Qun Liu, and Shouxun Lin. 2012. Unsupervised discriminative induction of synchronous grammar for machine translation. In Proc. Coling 2012. Deyi Xiong, Min Zhang, and Haizhou Li. 2010. Learning translation boundaries for phrase-based decoding. In Proc. NAACL2010. Richard Zens and Hermann Ney. 2004. Improvements in phrase-based statistical machine translation. In Prob. NAACL 2004. Hao Zhang, Chris Quirk, Robert C. Moore, and Daniel Gildea. 2008. Bayesian learning of noncompositional phrases with synchronous parsing. In Proc. ACL 2008.