acl acl2011 acl2011-30 acl2011-30-reference knowledge-graph by maker-knowledge-mining

30 acl-2011-Adjoining Tree-to-String Translation

Source: pdf

Author: Yang Liu ; Qun Liu ; Yajuan Lu

Abstract: We introduce synchronous tree adjoining grammars (TAG) into tree-to-string translation, which converts a source tree to a target string. Without reconstructing TAG derivations explicitly, our rule extraction algorithm directly learns tree-to-string rules from aligned Treebank-style trees. As tree-to-string translation casts decoding as a tree parsing problem rather than parsing, the decoder still runs fast when adjoining is included. Less than 2 times slower, the adjoining tree-tostring system improves translation quality by +0.7 BLEU over the baseline system only allowing for tree substitution on NIST ChineseEnglish test sets.

reference text

Anne Abeille, Yves Schabes, and Aravind Joshi. 1990. Using lexicalized tags for machine translation. In Proc. of COLING 1990. John Chen and K. Vijay-Shanker. 2000. Automated extraction of tags from the penn treebank. In Proc. of IWPT 2000. David Chiang. 2003. Statistical parsing with an automatically extracted tree adjoining grammar. DataOriented Parsing. David Chiang. 2006. An introduction to synchronous grammars. ACL Tutorial. David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 33(2):201–228. David Chiang. 2010. Learning to translate with source and target syntax. In Proc. of ACL 2010. Michael Collins. 2003. Head-driven statistical models for natural language parsing. Computational Linguistics, 29(4). Steve DeNeefe and Kevin Knight. 2009. Synchronous tree adjoining machine translation. In Proc. of EMNLP 2009. Mark Dras. 1999. A meta-level grammar: Redefining synchronous tag for translation and paraphrase. In Proc. of ACL 1999. Jason Eisner. 2003. Learning non-isomorphic tree mappings for machine translation. In Proc. of ACL 2003. Michel Galley, Mark Hopkins, Kevin Knight, and Daniel Marcu. 2004. What’s in a translation rule? In Proc. of NAACL 2004. Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, and Ignacio Thayer. 2006. Scalable inference and training of context-rich syntactic translation models. In Proc. of ACL 2006. Liang Huang and David Chiang. 2007. Forest rescoring: Faster decoding with integrated language models. In Proc. of ACL 2007. Liang Huang and Haitao Mi. 2010. Efficient incremental decoding for tree-to-string translation. In Proc. of EMN?LP 2010. Liang H?uang, Kevin Knight, and Aravind Joshi. 2006. Statis?tical syntax-directed translation with extended doma?in of locality. In Proc. of AMTA 2006. Aravind? ?Joshi, L. Levy, and M. Takahashi. 1975. Tree adjun?ct grammars. Journal of Computer and System Scien?ces, 10(1). Aravind? Joshi. 1985. How much contextsensitivity is? ?necessary for characterizing structural descriptions ?tree adjoining grammars. Natural Language Processing ?Theoretical, Computational, and Psychological Perspectives. Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open source toolkit for statistical machine translation. In Proceedings of ACL 2007 (poster), pages 77–80, Prague, Czech Republic, June. Yang Liu, Qun Liu, and Shouxun Lin. 2006. Tree-tostring alignment template for statistical machine translation. In Proc. of ACL 2006. Yang Liu, Yajuan L u¨, and Qun Liu. 2009. Improving tree-to-tree translation with packed forests. In Proc. of ACL 2009. Haitao Mi and Liang Huang. 2008. Forest-based translation rule extraction. In Proceedings of EMNLP 2008. Haitao Mi, Liang Huang, and Qun Liu. 2008. Forestbased translation. In Proceedings of ACL/HLT 2008, pages 192–199, Columbus, Ohio, USA, June. Rebecca Nesson, Stuart Shieber, and Alexander Rush. 2006. Induction of probabilistic synchronous treeinsertion grammars for machine translation. In Proc. of AMTA 2006. Franz J. Och and Hermann Ney. 2003. A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1): 19–5 1. Franz Och. 2003. Minimum error rate training in statistical machine translation. In Proc. of ACL 2003. Gilles Prigent. 1994. Synchronous tags and machine translation. In Proc. of TAG+3. Yves Schabes and Richard Waters. 1995. A cubic-time, parsable formalism that lexicalizes context-free grammar without changing the trees produced. Computational Linguistics, 21(4). Stuart M. Shieber and Yves Schabes. 1990. Synchronous tree-adjoining grammars. In Proc. of COLING 1990. Stuart M. Shieber. 2007. Probabilistic synchronous treeadjoining grammars for machine translation: The argument from bilingual dictionaries. In Proc. of SSST 2007. Andreas Stolcke. 2002. Srilm - an extensible language modeling toolkit. In Proceedings of ICSLP 2002, pages 901–904. Dekai Wu. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3):377–404. Fei Xia. 1999. Extracting tree adjoining grammars from Min Zhang, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, and Sheng Li. 2008. A tree sequence alignment-based tree-to-tree translation model. In Proc. of ACL 2008. bracketed corpora. In Proc. of the Fifth Natural Language Processing Pacific Rim Symposium. Deyi Xiong, Qun Liu, and Shouxun Lin. 2006. Maximum entropy based phrase reordering model for statistical machine translation. In Proc. of ACL 2006. 1287