
65 emnlp-2011-Heuristic Search for Non-Bottom-Up Tree Structure Prediction



Author: Andrea Gesmundo ; James Henderson

Abstract: State-of-the-art tree structure prediction techniques rely on bottom-up decoding. These approaches allow the use of context-free features and bottom-up features. We discuss the limitations of mainstream techniques in solving common Natural Language Processing tasks. We then devise a new framework that goes beyond bottom-up decoding and allows better integration of contextual features. Furthermore, we design a system that addresses these issues and test it on Hierarchical Machine Translation, a well-known tree structure prediction problem. The structure of the proposed system allows the incorporation of non-bottom-up features and relies on a more sophisticated decoding approach. We show that the proposed approach can find better translations using a smaller portion of the search space.


reference text

Sharon A. Caraballo and Eugene Charniak. 1998. New figures of merit for best-first probabilistic chart parsing. Computational Linguistics, 24:275-298.

J. C. Chappelier, M. Rajman, R. Arages and A. Rozenknop. 1999. Lattice Parsing for Speech Recognition. In Proceedings of TALN 1999, Cargèse, France.

David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 33(2):201-228.

Anna Corazza, Renato De Mori, Roberto Gretter and Giorgio Satta. 1994. Optimal Probabilistic Evaluation Functions for Search Controlled by Stochastic Context-Free Grammars. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(10):1018-1027.

Chris Dyer, Adam Lopez, Juri Ganitkevitch, Jonathan Weese, Ferhan Ture, Phil Blunsom, Hendra Setiawan, Vladimir Eidelman, and Philip Resnik. 2010. cdec: A Decoder, Alignment, and Learning framework for finite-state and context-free translation models. In Proceedings of the Conference of the Association of Computational Linguistics 2010, Uppsala, Sweden.

Andrea Gesmundo and James Henderson. 2010. Faster Cube Pruning. In Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT), Paris, France.

Andrea Gesmundo. 2011. Bidirectional Sequence Classification for Tagging Tasks with Guided Learning. In Proceedings of TALN 2011, Montpellier, France.

Mark Hopkins and Greg Langmead. 2009. Cube pruning as heuristic search. In Proceedings of the Conference on Empirical Methods in Natural Language Processing 2009, Singapore.

Liang Huang and David Chiang. 2007. Forest rescoring: Faster decoding with integrated language models. In Proceedings of the Conference of the Association of Computational Linguistics 2007, Prague, Czech Republic.

Dan Klein and Christopher D. Manning. 2001. Parsing and Hypergraphs. In Proceedings of the International Workshop on Parsing Technologies 2001, Beijing, China.

Dan Klein and Christopher D. Manning. 2003. A* Parsing: Fast Exact Viterbi Parse Selection. In Proceedings of the Conference of the North American Association for Computational Linguistics 2003, Edmonton, Canada.

Daphne Koller and Nir Friedman. 2010. Probabilistic Graphical Models: Principles and Techniques. The MIT Press, Cambridge, Massachusetts.

Shankar Kumar, Wolfgang Macherey, Chris Dyer, and Franz Och. 2009. Efficient Minimum Error Rate Training and Minimum Bayes-Risk decoding for translation hypergraphs and lattices. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Suntec, Singapore.

Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N. G. Thornton, Jonathan Weese, and Omar F. Zaidan. 2009. Joshua: An Open Source Toolkit for Parsing-based Machine Translation. In Proceedings of the Workshop on Statistical Machine Translation 2009, Athens, Greece.

Adam Lopez. 2007. Hierarchical Phrase-Based Translation with Suffix Arrays. In Proceedings of the Conference on Empirical Methods in Natural Language Processing 2007, Prague, Czech Republic.

Haitao Mi, Liang Huang and Qun Liu. 2008. Forest-Based Translation. In Proceedings of the Conference of the Association of Computational Linguistics 2008, Columbus, OH.

Libin Shen, Giorgio Satta and Aravind Joshi. 2007. Guided Learning for Bidirectional Sequence Classification. In Proceedings of the Conference of the Association of Computational Linguistics 2007, Prague, Czech Republic.

Andreas Stolcke. 2002. SRILM - An extensible language modeling toolkit. In Proceedings of the International Conference on Spoken Language Processing 2002, Denver, CO.

Andreas Zollmann and Ashish Venugopal. 2006. Syntax augmented machine translation via chart parsing. In Proceedings of the Workshop on Statistical Machine Translation, New York City, New York.