nips nips2008 nips2008-35 nips2008-35-reference knowledge-graph by maker-knowledge-mining

35 nips-2008-Bayesian Synchronous Grammar Induction

Source: pdf

Author: Phil Blunsom, Trevor Cohn, Miles Osborne

Abstract: We present a novel method for inducing synchronous context free grammars (SCFGs) from a corpus of parallel string pairs. SCFGs can model equivalence between strings in terms of substitutions, insertions and deletions, and the reordering of sub-strings. We develop a non-parametric Bayesian model and apply it to a machine translation task, using priors to replace the various heuristics commonly used in this ﬁeld. Using a variational Bayes training procedure, we learn the latent structure of translation equivalence through the induction of synchronous grammar categories for phrasal translations, showing improvements in translation performance over maximum likelihood models. 1

reference text

[1] Andreas Zollmann and Ashish Venugopal. Syntax augmented machine translation via chart parsing. In Proc. of the HLT-NAACL 2006 Workshop on Statistical Machine Translation, New York City, June 2006.

[2] David Chiang. Hierarchical phrase-based translation. Computational Linguistics, 33(2):201–228, 2007.

[3] Percy Liang, Slav Petrov, Michael Jordan, and Dan Klein. The inﬁnite PCFG using hierarchical Dirichlet processes. In Proc. of the 2007 Conference on Empirical Methods in Natural Language Processing (EMNLP-2007), pages 688–697, Prague, Czech Republic, 2007.

[4] Jenny Rose Finkel, Trond Grenager, and Christopher D. Manning. The inﬁnite tree. In Proc. of the 45th Annual Meeting of the ACL (ACL-2007), Prague, Czech Republic, 2007.

[5] Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei. Hierarchical Dirichlet processes. Journal of the American Statistical Association, 101(476):1566–1581, 2006.

[6] Philipp Koehn, Franz Josef Och, and Daniel Marcu. Statistical phrase-based translation. In Proc. of the 3rd International Conference on Human Language Technology Research and 4th Annual Meeting of the NAACL (HLT-NAACL 2003), pages 81–88, Edmonton, Canada, May 2003.

[7] Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, and Ignacio Thayer. Scalable inference and training of context-rich syntactic translation models. In Proc. of the 44th Annual Meeting of the ACL and 21st International Conference on Computational Linguistics (COLING/ACL-2006), pages 961–968, Sydney, Australia, July 2006.

[8] Dekai Wu. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3):377–403, 1997.

[9] Colin Cherry and Dekany Lin. Inversion transduction grammar for joint phrasal translation modeling. In Proc. of the HLT-NAACL Workshop on Syntax and Structure in Statistical Translation (SSST 2007), Rochester, USA, 2007.

[10] Hao Zhang, Chris Quirk, Robert C. Moore, and Daniel Gildea. Bayesian learning of non-compositional phrases with synchronous parsing. In Proc. of the 46th Annual Conference of the Association for Computational Linguistics: Human Language Technologies (ACL-08:HLT), pages 97–105, Columbus, Ohio, June 2008.

[11] Daniel Marcu and William Wong. A phrase-based, joint probability model for statistical machine translation. In Proc. of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP2002), pages 133–139, Philadelphia, July 2002. Association for Computational Linguistics.

[12] John DeNero, Dan Gillick, James Zhang, and Dan Klein. Why generative phrase models underperform surface heuristics. In Proc. of the HLT-NAACL 2006 Workshop on Statistical Machine Translation, pages 31–38, New York City, June 2006.

[13] Philip M. Lewis II and Richard E. Stearns. Syntax-directed transduction. J. ACM, 15(3):465–488, 1968.

[14] P. F. Brown, S. A. Della Pietra, V. J. Della Pietra, and R. L. Mercer. The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics, 19(2):263–311, 1993.

[15] Mark Johnson. The DOP estimation method is biased and inconsistent. Computational Linguistics, 28(1):71–76, 2002.

[16] Matthew Beal. Variational Algorithms for Approximate Bayesian Inference. PhD thesis, The Gatsby Computational Neuroscience Unit, University College London, 2003.

[17] Matthias Eck and Chiori Hori. Overview of the IWSLT 2005 evaluation campaign. In Proc. of the International Workshop on Spoken Language Translation, Pittsburgh, October 2005.

[18] Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. Bleu: a method for automatic evaluation of machine translation. In Proc. of the 40th Annual Meeting of the ACL and 3rd Annual Meeting of the NAACL (ACL-2002), pages 311–318, Philadelphia, Pennsylvania, 2002.