96 acl-2011-Disambiguating temporal-contrastive connectives for machine translation


Source: pdf

Author: Thomas Meyer

Abstract: Temporal–contrastive discourse connectives (although, while, since, etc.) signal various types of relations between clauses such as temporal, contrast, concession and cause. They are often ambiguous and therefore difficult to translate from one language to another. We discuss several new and translation-oriented experiments for the disambiguation of a specific subset of discourse connectives in order to correct some of the translation errors made by current statistical machine translation systems.


reference text

Leo Breiman. 2001. Random Forests. Machine Learning, 45(1):5–32.
Michael Collins, Philipp Koehn, Ivona Kucerova. 2005. Clause Restructuring for Statistical Machine Translation. Proceedings of the 43rd Annual Meeting of the ACL, 531–540.
David duVerle, Helmut Prendinger. 2009. A Novel Discourse Parser Based on Support Vector Machine Classification. Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, 665–673.
Barry Haddow. 2005. Acquiring a Disambiguation Model For Discourse Connectives. Master Thesis. University of Edinburgh, School of Informatics.
Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, Ian H. Witten. 2009. The WEKA Data Mining Software: An Update. SIGKDD Explorations, 11(1).
Philipp Koehn. 2005. Europarl: A Parallel Corpus for Statistical Machine Translation. Proceedings of MT Summit X, 79–86.
Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, Evan Herbst. 2007. Moses: Open Source Toolkit for Statistical Machine Translation. Proceedings of the 45th Annual Meeting of the ACL, Demonstration session, 177–180.
Ronan Le Nagard, Philipp Koehn. 2010. Aiding Pronoun Translation with Co-Reference Resolution. Proceedings of the Joint 5th Workshop on Statistical Machine Translation and Metrics MATR, 258–267.
Ziheng Lin, Hwee Tou Ng, Min-Yen Kan. 2010. A PDTB-Styled End-to-End Discourse Parser. Technical Report TRB8/10. School of Computing, National University of Singapore, 1–15.
William C. Mann, Sandra A. Thompson. 1988. Rhetorical structure theory: towards a functional theory of text organization. Text, 8(3):243–281.
Eleni Miltsakaki, Nikhil Dinesh, Rashmi Prasad, Aravind Joshi, Bonnie Webber. 2005. Experiments on Sense Annotations and Sense Disambiguation of Discourse Connectives. Proceedings of the Fourth Workshop on Treebanks and Linguistic Theories (TLT).
Marie-Paule Péry-Woodley, Nicholas Asher, Patrice Enjalbert, Farah Benamara, Myriam Bras, Cécile Fabre, Stéphane Ferrari, Lydia-Mai Ho-Dac, Anne Le Draoulec, Yann Mathet, Philippe Muller, Laurent Prévot, Josette Rebeyrolle, Ludovic Tanguy, Marianne Vergez-Couret, Laure Vieu, Antoine Widlöcher. 2009. ANNODIS: une approche outillée de l’annotation de structures discursives. Proceedings of TALN.
Emily Pitler, Ani Nenkova. 2009. Using Syntax to Disambiguate Explicit Discourse Connectives in Text. Proceedings of the ACL-IJCNLP 2009 Conference, Short Papers, 13–16.
Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi, Bonnie Webber. 2008. The Penn Discourse Treebank 2.0. Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC), 2961–2968.
Charlotte Roze, Laurence Danlos, Philippe Muller. 2010. LEXCONN: a French Lexicon of Discourse Connectives. Proceedings of Multidisciplinary Approaches to Discourse (MAD).
Manfred Stede, Carla Umbach. 1998. DiMLex: a lexicon of discourse markers for text generation and understanding. Proceedings of the 36th Annual Meeting of the ACL, 1238–1242.
Yannick Versley. 2010. Discovery of Ambiguous and Unambiguous Discourse Connectives via Annotation Projection. Proceedings of Workshop on Annotation and Exploitation of Parallel Corpora (AEPC), 83–82.
Šárka Zikánová, Lucie Mladová, Jiří Mírovský, Pavlina Jínová. 2010. Typical Cases of Annotators’ Disagreement in Discourse Annotations in Prague Dependency Treebank. Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC), 2002–2006.