Discriminative Pruning for Discriminative ITG Alignment (ACL 2010, paper 88)


Source: pdf

Author: Shujie Liu ; Chi-Ho Li ; Ming Zhou

Abstract: Although Inversion Transduction Grammar (ITG) has regained attention in recent years, speed remains a major obstacle. We propose a discriminative ITG pruning framework using Minimum Error Rate Training and various features from previous work on ITG alignment. Experimental results show that it outperforms all existing ITG pruning heuristics. On top of the pruning framework, we also propose a discriminative ITG alignment model using hierarchical phrase pairs, which improves both F-score and BLEU score over the GIZA++ baseline alignment system.


reference text

Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, and Robert L. Mercer. 1993. The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics, 19(2):263-311.
Colin Cherry and Dekang Lin. 2006. Soft Syntactic Constraints for Word Alignment through Discriminative Training. In Proceedings of ACL-COLING.
Colin Cherry and Dekang Lin. 2007. Inversion Transduction Grammar for Joint Phrasal Translation Modeling. In Proceedings of SSST, NAACL-HLT, Pages: 17-24.
David Chiang. 2007. Hierarchical Phrase-based Translation. Computational Linguistics, 33(2).
John DeNero, Mohit Bansal, Adam Pauls, and Dan Klein. 2009. Efficient Parsing for Transducer Grammars. In Proceedings of NAACL, Pages: 227-235.
Alexander Fraser and Daniel Marcu. 2006. Semi-Supervised Training for Statistical Word Alignment. In Proceedings of ACL, Pages: 769-776.
Aria Haghighi, John Blitzer, John DeNero, and Dan Klein. 2009. Better Word Alignments with Supervised ITG Models. In Proceedings of ACL, Pages: 923-931.
Liang Huang and David Chiang. 2005. Better k-best Parsing. In Proceedings of IWPT 2005, Pages: 173-180.
Franz Josef Och and Hermann Ney. 2000. Improved Statistical Alignment Models. In Proceedings of ACL, Pages: 440-447.
Franz Josef Och. 2003. Minimum Error Rate Training in Statistical Machine Translation. In Proceedings of ACL, Pages: 160-167.
Dan Klein and Christopher D. Manning. 2001. Parsing and Hypergraphs. In Proceedings of IWPT, Pages: 17-19.
Philipp Koehn. 2004. Statistical Significance Tests for Machine Translation Evaluation. In Proceedings of EMNLP, Pages: 388-395.
Yang Liu, Qun Liu, and Shouxun Lin. 2005. Log-linear Models for Word Alignment. In Proceedings of ACL, Pages: 81-88.
Robert Moore. 2005. A Discriminative Framework for Bilingual Word Alignment. In Proceedings of EMNLP 2005, Pages: 81-88.
Robert Moore, Wen-tau Yih, and Andreas Bode. 2006. Improved Discriminative Bilingual Word Alignment. In Proceedings of ACL, Pages: 513-520.
Stephan Vogel, Hermann Ney, and Christoph Tillmann. 1996. HMM-based Word Alignment in Statistical Translation. In Proceedings of COLING, Pages: 836-841.
Stephan Vogel. 2005. PESA: Phrase Pair Extraction as Sentence Splitting. In Proceedings of MT Summit.
Dekai Wu. 1997. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora. Computational Linguistics, 23(3).
Hao Zhang and Daniel Gildea. 2005. Stochastic Lexicalized Inversion Transduction Grammar for Alignment. In Proceedings of ACL.
Hao Zhang, Chris Quirk, Robert Moore, and Daniel Gildea. 2008. Bayesian Learning of Non-compositional Phrases with Synchronous Parsing. In Proceedings of ACL, Pages: 314-323.