emnlp emnlp2011 emnlp2011-15 emnlp2011-15-reference knowledge-graph by maker-knowledge-mining

15 emnlp-2011-A novel dependency-to-string model for statistical machine translation


Source: pdf

Author: Jun Xie ; Haitao Mi ; Qun Liu

Abstract: Dependency structure, as a first step towards semantics, is believed to be helpful to improve translation quality. However, previous works on dependency structure based models typically resort to insertion operations to complete translations, which make it difficult to specify ordering information in translation rules. In our model of this paper, we handle this problem by directly specifying the ordering information in head-dependents rules which represent the source side as head-dependents relations and the target side as strings. The head-dependents rules require only substitution operation, thus our model requires no heuristics or separate ordering models of the previous works to control the word order of translations. Large-scale experiments show that our model performs well on long distance reordering, and outperforms the state- of-the-art constituency-to-string model (+1.47 BLEU on average) and hierarchical phrasebased model (+0.46 BLEU on average) on two Chinese-English NIST test sets without resort to phrases or parse forest. For the first time, a source dependency structure based model catches up with and surpasses the state-of-theart translation models.


reference text

David Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In Proceedings of ACL 2005, pages 263–270. David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 33. Yuan Ding and Martha Palmer. 2005. Machine translation using probabilistic synchronous dependency insertion grammars. In Proceedings of ACL 2005. Heidi J. Fox. 2002. Phrasal cohesion and statistical machine translation. In In Proceedings of EMNLP 2002, pages 304–3 11. Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, and Ignacio Thayer. 2006. Scalable inference and training of context-rich syntactic translation models. In Proceedings of ACL 2006, pages 961–968, Sydney, Australia, July. Association for Computational Linguistics. Peter Hellwig. 2006. Parsing with dependency grammars. In Dependenz und Valenz / Dependency and Valency, volume 2, pages 1081–1 109. Berlin, New York. Liang Huang and David Chiang. 2007. Forest rescoring: Faster decoding with integrated language models. In Proceedings of ACL 2007, pages 144–15 1, Prague, Czech Republic, June. Liang Huang, Kevin Knight, and Aravind Joshi. 2006. A syntax-directed translator with extended domain of locality. In Proceedings of the Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing, pages 1–8, New York City, New York, June. Association for Computational Linguistics. Richard Hudson. 1990. English Word Grammar. Blackell. Dan Klein and Christopher D. Manning. 2003. Fast exact inference with a factored model for natural language parsing. In In Advances in Neural Information Processing Systems 15 (NIPS, pages 3–10. MIT Press. Philipp Koehn, Franz J. Och, and Daniel Marcu. 2003. Statistical phrase-based translation. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Edmonton, Canada, July. Dekang Lin. 2004. A path-based transfer model for machine translation. In Proceedings of Coling 2004, pages 625–630, Geneva, Switzerland, Aug 23–Aug 27. Yang Liu, Qun Liu, and Shouxun Lin. 2006. Tree-tostring alignment template for statistical machine translation. In Proceedings of ACL 2006, pages 609–616, Sydney, Australia, July. Franz Josef Och and Hermann Ney. 2002. Discriminative training and maximum entropy models for statistical machine translation. In Proceedings of 40th Annual Meeting of the Association for Computational Linguistics, pages 295–302, Philadelphia, Pennsylvania, USA, July. 226 Franz Josef Och and Hermann Ney. 2003. A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1): 19–5 1. Franz Josef Och and Hermann Ney. 2004. The alignment template approach to statistical machine translation. Franz Josef Och. 2003. Minimum error rate training in statistical machine translation. In Proceedings ofACL2003, pages 160–167, Sapporo, Japan, July. Kishore Papineni, Salim Roukos, Todd Ward, and WeiJing Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proceedings of ACL 2002, pages 3 11–3 18, Philadelphia, Pennsylvania, USA, July. Chris Quirk, Arul Menezes, and Colin Cherry. 2005. Dependency treelet translation: Syntactically informed phrasal smt. In Proceedings of ACL 2005, pages 271– 279. Libin Shen, Jinxi Xu, and Ralph Weischedel. 2008. A new string-to-dependency machine translation al- gorithm with a target dependency language model. In Proceedings of ACL 2008: HLT, pages 577–585, Columbus, Ohio, June. Association for Computational Linguistics. Andreas Stolcke. 2002. Srilm - an extensible language modeling toolkit. In Proceedings ofICSLP, volume 30, pages 901–904. Deyi Xiong, Qun Liu, and Shouxun Lin. 2007. A dependency treelet string correspondence model for statistical machine translation. In Proceedings of the Second Workshop on Statistical Machine Translation, pages 40–47, Prague, Czech Republic, June. Arnold M. Zwicky. 1985. Heads. Journal of Linguistics, 21: 1–29.