54 acl-2010-Boosting-Based System Combination for Machine Translation

Source: pdf

Author: Tong Xiao ; Jingbo Zhu ; Muhua Zhu ; Huizhen Wang

Abstract: In this paper, we present a simple and effective method to address the issue of how to generate diversified translation systems from a single Statistical Machine Translation (SMT) engine for system combination. Our method is based on the framework of boosting. First, a sequence of weak translation systems is generated from a baseline system in an iterative manner. Then, a strong translation system is built from the ensemble of these weak translation systems. To adapt boosting to SMT system combination, several key components of the original boosting algorithms are redesigned in this work. We evaluate our method on Chinese-to-English Machine Translation (MT) tasks in three baseline systems, including a phrase-based system, a hierarchical phrase-based system and a syntax-based system. The experimental results on three NIST evaluation test sets show that our method leads to significant improvements in translation accuracy over the baseline systems.

reference text

David Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In Proc. of ACL 2005, Ann Arbor, Michigan, pages 263-270.
David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 33(2):201-228.
David Chiang, Yuval Marton and Philip Resnik. 2008. Online Large-Margin Training of Syntactic and Structural Translation Features. In Proc. of EMNLP 2008, Honolulu, pages 224-233.
Michael Collins and Yoram Singer. 1999. Unsupervised Models for Named Entity Classification. In Proc. of EMNLP/VLC 1999, pages 100-110.
Michael Collins, Robert Schapire and Yoram Singer. 2002. Logistic Regression, AdaBoost and Bregman Distances. Machine Learning, 48(3):253-285.
Brooke Cowan, Ivona Kučerová and Michael Collins. 2006. A discriminative model for tree-to-tree translation. In Proc. of EMNLP 2006, pages 232-241.
Yuan Ding and Martha Palmer. 2005. Machine translation using probabilistic synchronous dependency insertion grammars. In Proc. of ACL 2005, Ann Arbor, Michigan, pages 541-548.
Nan Duan, Mu Li, Tong Xiao and Ming Zhou. 2009. The Feature Subspace Method for SMT System Combination. In Proc. of EMNLP 2009, pages 1096-1104.
Jason Eisner. 2003. Learning non-isomorphic tree mappings for machine translation. In Proc. of ACL 2003, pages 205-208.
Yoav Freund. 1995. Boosting a weak learning algorithm by majority. Information and Computation, 121(2):256-285.
Yoav Freund and Robert Schapire. 1997. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119-139.
Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang and Ignacio Thayer. 2006. Scalable inferences and training of context-rich syntax translation models. In Proc. of ACL 2006, Sydney, Australia, pages 961-968.
Michel Galley and Christopher D. Manning. 2008. A Simple and Effective Hierarchical Phrase Reordering Model. In Proc. of EMNLP 2008, Hawaii, pages 848-856.
Almut Silja Hildebrand and Stephan Vogel. 2008. Combination of machine translation systems via hypothesis selection from combined n-best lists. In Proc. of the 8th AMTA conference, pages 254-261.
Liang Huang and David Chiang. 2007. Forest rescoring: Faster decoding with integrated language models. In Proc. of ACL 2007, Prague, Czech Republic, pages 144-151.
Philipp Koehn, Franz Och and Daniel Marcu. 2003. Statistical Phrase-Based Translation. In Proc. of HLT-NAACL 2003, Edmonton, USA, pages 48-54.
Philipp Koehn. 2004. Statistical Significance Tests for Machine Translation Evaluation. In Proc. of EMNLP 2004, Barcelona, Spain, pages 388-395.
Antonio Lagarda and Francisco Casacuberta. 2008. Applying Boosting to Statistical Machine Translation. In Proc. of the 12th EAMT conference, pages 88-96.
Mu Li, Nan Duan, Dongdong Zhang, Chi-Ho Li and Ming Zhou. 2009. Collaborative Decoding: Partial Hypothesis Re-Ranking Using Translation Consensus between Decoders. In Proc. of ACL-IJCNLP 2009, Singapore, pages 585-592.
Percy Liang, Alexandre Bouchard-Côté, Dan Klein and Ben Taskar. 2006. An end-to-end discriminative approach to machine translation. In Proc. of COLING/ACL 2006, pages 104-111.
Yang Liu, Qun Liu and Shouxun Lin. 2006. Tree-to-String Alignment Template for Statistical Machine Translation. In Proc. of ACL 2006, pages 609-616.
Wolfgang Macherey and Franz Och. 2007. An Empirical Study on Computing Consensus Translations from Multiple Machine Translation Systems. In Proc. of EMNLP 2007, pages 986-995.
Daniel Marcu, Wei Wang, Abdessamad Echihabi and Kevin Knight. 2006. SPMT: Statistical machine translation with syntactified target language phrases. In Proc. of EMNLP 2006, Sydney, Australia, pages 44-52.
Evgeny Matusov, Nicola Ueffing and Hermann Ney. 2006. Computing consensus translation from multiple machine translation systems using enhanced hypotheses alignment. In Proc. of EACL 2006, pages 33-40.
Franz Och and Hermann Ney. 2002. Discriminative Training and Maximum Entropy Models for Statistical Machine Translation. In Proc. of ACL 2002, Philadelphia, pages 295-302.
Franz Och. 2003. Minimum Error Rate Training in Statistical Machine Translation. In Proc. of ACL 2003, Japan, pages 160-167.
Antti-Veikko Rosti, Spyros Matsoukas and Richard Schwartz. 2007. Improved Word-Level System Combination for Machine Translation. In Proc. of ACL 2007, pages 312-319.
Cynthia Rudin, Robert Schapire and Ingrid Daubechies. 2007. Analysis of boosting algorithms using the smooth margin function. The Annals of Statistics, 35(6):2723-2768.
Robert Schapire and Yoram Singer. 2000. BoosTexter: A boosting-based system for text categorization. Machine Learning, 39(2/3):135-168.
Robert Schapire. 2001. The boosting approach to machine learning: an overview. In Proc. of MSRI Workshop on Nonlinear Estimation and Classification, Berkeley, CA, USA, pages 1-23.
Matthew Snover, Bonnie Dorr, Richard Schwartz, Linnea Micciulla and John Makhoul. 2006. A Study of Translation Edit Rate with Targeted Human Annotation. In Proc. of the 7th AMTA conference, pages 223-231.
Tong Xiao, Mu Li, Dongdong Zhang, Jingbo Zhu and Ming Zhou. 2009. Better Synchronous Binarization for Machine Translation. In Proc. of EMNLP 2009, Singapore, pages 362-370.
Deyi Xiong, Qun Liu and Shouxun Lin. 2006. Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation. In Proc. of ACL 2006, Sydney, pages 521-528.
Hao Zhang, Liang Huang, Daniel Gildea and Kevin Knight. 2006. Synchronous Binarization for Machine Translation. In Proc. of HLT-NAACL 2006, New York, USA, pages 256-263.