acl acl2011 acl2011-313 acl2011-313-reference knowledge-graph by maker-knowledge-mining

313 acl-2011-Two Easy Improvements to Lexical Weighting

Source: pdf

Author: David Chiang ; Steve DeNeefe ; Michael Pust

Abstract: We introduce two simple improvements to the lexical weighting features of Koehn, Och, and Marcu (2003) for machine translation: one which smooths the probability of translating word f to word e by simplifying English morphology, and one which conditions it on the kind of training data that f and e co-occurred in. These new variations lead to improvements of up to +0.8 BLEU, with an average improvement of +0.6 BLEU across two language pairs, two genres, and two translation systems.

reference text

David Chiang, Yuval Marton, and Philip Resnik. Online large-margin training 2008. of syntactic and struc- tural translation features. In Proc. EMNLP 2008, pages 224–233. David Chiang, Kevin Knight, and Wei Wang. 11 ,001 new 2009. features for statistical machine translation. In Proc. NAACL HLT, pages David Chiang. 2005. 218–226. A hierarchical phrase-based model for statistical machine translation. 263–270. Chiang. 2010. In Proc. ACL 2005, pages David Learning to translate with source and target syntax. In Proc. ACL, pages 1443–1452. Koby Crammer, Ofer Dekel, Joseph Keshet, Shai ShalevShwartz, and Yoram Singer. aggressive Research, algorithms. 7:551–585. 2006. Online passive- Journal of Machine Learning Michel Galley, Mark Hopkins, Kevin Knight, and Daniel Marcu. 2004. What’s in a translation rule? In Proc. HLT-NAACL 2004, pages 273–280. Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, and Ignacio Thayer. 2006. Scalable inference and training of context-rich syntactic translation models. In Proc. COLING-ACL 2006, pages 961–968. Philipp Koehn, Franz Josef Och, and Daniel Marcu. 2003. Statistical phrase-based translation. In Proc. HLT-NAACL 2003, pages 127–133. Spyros Matsoukas, Antti-Veikko I. Rosti, and Bing Zhang. 2009. Discriminative corpus weight estimation for machine translation. In Proc. EMNLP 2009, pages 708–717. Dragos Stefan Munteanu and Daniel Marcu. 2005. Improving machine translation performance by exploiting non-parallel corpora. Computational Linguistics, 31:477–504. M. F. Porter. 1980. An algorithm for suffix stripping. Program, 14(3): 130–137. Taro Watanabe, Jun Suzuki, Hajime Tsukuda, and Hideki Isozaki. 2007. Online large-margin training for statistical machine translation. In Proc. EMNLP-CoNLL 2007, pages 764–773. Ian H. Witten and Timothy C. Bell. 1991 . The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression. IEEE Trans. Information Theory, 37(4): 1085–1094. 460