acl acl2011 acl2011-311 acl2011-311-reference knowledge-graph by maker-knowledge-mining

311 acl-2011-Translationese and Its Dialects


Source: pdf

Author: Moshe Koppel ; Noam Ordan

Abstract: While it is has often been observed that the product of translation is somehow different than non-translated text, scholars have emphasized two distinct bases for such differences. Some have noted interference from the source language spilling over into translation in a source-language-specific way, while others have noted general effects of the process of translation that are independent of source language. Using a series of text categorization experiments, we show that both these effects exist and that, moreover, there is a continuum between them. There are many effects of translation that are consistent among texts translated from a given source language, some of which are consistent even among texts translated from families of source languages. Significantly, we find that even for widely unrelated source languages and multiple genres, differences between translated texts and non-translated texts are sufficient for a learned classifier to accurately determine if a given text is translated or original.


reference text

Mona Baker. 1993. Corpus linguistics and translation studies: Implications and applications. In Gill Francis Mona Baker and Elena Tognini Bonelli, editors, Text and technology: in honour of John Sinclair, pages 233-252. John Benjamins, Amsterdam. Marco Baroni and Silvia proach to the study learning the difference lated text. Literary 21(3):259-274. Bernardini. 2006. A new apof Translationese: Machinebetween original and transand Linguistic Computing, Shoshan Blum-Kulka. Shifts of cohesion and coherence in translation. 1986. In Juliane House and Shoshana Blum-Kulka (Eds), Interlingual and Intercultural Communication (17-35). Tübingen: Günter Narr Verlag. William Frawley. 1984. Prolegomenon to a theory of translation. In William Frawley (ed), Translation. Literary, Linguistic and Philosophical Perspectives (179-175). Newark: University of Delaware Press. Maria Antonietta Fusco. 1990. Quality in conference interpreting between cognate languages: A preliminary approach to the Spanish-Italian case. The Interpreters ’ Newsletter, 3, 93-97. Martin Gellerstam. 1986. Translationese in Swedish novels translated from English, in Lars Wollin & Hans Lindquist (eds.), Translation Studies in Scandinavia (88-95). Lund: CWK Gleerup. Iustina Ilisei, Diana Inkpen, Gloria Corpas Pastor, and Ruslan Mitkov. Identification of translationese: A machine learning approach. In Alexander F. Gelbukh, editor, Proceedings of CICLing-2010: Computational Linguistics and Intelligent Text Processing, 11th International, volume 6008 of Lecture Notes in Computer Science, pages 503-5 11. Springer, 2010. Kenneth Katzner. 2002. The Languages of the World. Routledge. Grzegorz Kondrak and Tarek Sherif. 2006. Evaluation of several phonetic similarity algorithms on the task of cognate identification. In Proceedings of the Workshop on Linguistic Distances (LD '06). 43-50. David Kurokawa, Cyril Goutte, and Pierre Isabelle. 2009. Automatic detection of translated text and its impact on machine translation. In Proceedings of MT-Summit Sara XII. Laviosa: 1997. How Comparable Corpora' Be?. Target, can 'Comparable 9 (2), pp. 289-3 19. Sara Laviosa-Braithwaite. 1998. In Mona Baker (ed.) Routledge Encyclopedia of Translation Studies. London/New York: Routledge, pp.288-291. Sara Laviosa. 2002. Corpus-based Translation Studies. Theory, Findings, Applications. Amsterdam/New York: Rodopi. David Madigan, Alexander Genkin, David D. Lewis and Dmitriy Fradkin 2005. Bayesian Multinomial Logistic Regression for Author Identification, In Maxent Conference, 509-516. James W. Pennebaker, Martha E. Francis, and Roger J. Booth. 2001. Linguistic Inquiry and Word Count (LIWC): LIWC2001 Manual. Erlbaum Publishers, Mahwah, NJ, USA. Helmut Schmid. Probabilistic Part-of-Speech Tagging Using Decision Trees. 2004. In Proceedings of International Conference on New Methods in Language Processing. Larry Selinker. 1972. Interlanguage. International Review of Applied Linguistics. 10, 209-241 . Gideon Toury. 1995. Descriptive Translation Studies and beyond. John Benjamins, Amsterdam / Philadelphia. Hans van Halteren. 2008. Source language markers in EUROPARL translations. In COLING '08: Proceedings of the 22nd International Conference on Computational Linguistics, pages 937-944. 1326