
58 acl-2013-Automated Collocation Suggestion for Japanese Second Language Learners

Source: pdf

Author: Lis Pereira ; Erlyn Manguilimotan ; Yuji Matsumoto

Abstract: This study addresses issues of Japanese language learning concerning word combinations (collocations). Learners of Japanese may be able to construct grammatically correct sentences; however, these sentences may still sound “unnatural”. In this work, we analyze correct word combinations using different collocation measures and word similarity methods. While other methods use well-formed text, our approach makes use of a large Japanese language learner corpus for generating collocation candidates, in order to build a system that is more sensitive to constructions that are difficult for learners. Our results show that this approach outperforms methods that use only well-formed text.
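The abstract does not say which collocation measures are used. As an illustration only, the sketch below computes pointwise mutual information (PMI), the association measure introduced by Church and Hanks (1990) in the reference list, over noun-verb pairs. The function name `pmi_scores` and the romanized toy pairs are assumptions made for this example; they are not taken from the paper.

```python
import math
from collections import Counter


def pmi_scores(pairs):
    """Return the PMI (in bits) of every distinct (noun, verb) pair.

    PMI(n, v) = log2( P(n, v) / (P(n) * P(v)) ), with all probabilities
    estimated by relative frequency over the observed pairs.
    """
    pair_counts = Counter(pairs)
    noun_counts = Counter(noun for noun, _ in pairs)
    verb_counts = Counter(verb for _, verb in pairs)
    total = len(pairs)

    scores = {}
    for (noun, verb), count in pair_counts.items():
        p_pair = count / total
        p_noun = noun_counts[noun] / total
        p_verb = verb_counts[verb] / total
        scores[(noun, verb)] = math.log2(p_pair / (p_noun * p_verb))
    return scores


# Invented toy data (hypothetical, not from the paper's corpora).
pairs = [
    ("yume", "miru"), ("yume", "miru"), ("yume", "suru"),
    ("benkyou", "suru"), ("benkyou", "suru"), ("eiga", "miru"),
]
scores = pmi_scores(pairs)

# Rank candidate verbs for the noun "yume" by association strength.
candidates = sorted(
    (pair for pair in scores if pair[0] == "yume"),
    key=scores.get,
    reverse=True,
)
print(candidates)
```

On this toy data the pair ("yume", "miru") scores higher than ("yume", "suru"), so a suggestion system built on such a measure would rank "miru" first for the noun "yume". A real system would estimate the counts from a large corpus and would likely need smoothing for rare pairs.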

reference text

Y. C. Chang, J. S. Chang, H. J. Chen, and H. C. Liou. 2008. An automatic collocation writing assistant for Taiwanese EFL learners: A case of corpus-based NLP technology. Computer Assisted Language Learning, 21(3):283–299.

K. Church and P. Hanks. 1990. Word Association Norms, Mutual Information and Lexicography. Computational Linguistics, 16(1):22–29.

J. R. Curran. 2004. From Distributional to Semantic Similarity. Ph.D. thesis, University of Edinburgh.

D. Dahlmeier and H. T. Ng. 2011. Correcting Semantic Collocation Errors with L1-induced Paraphrases. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 107–117, Edinburgh, Scotland, UK, July. Association for Computational Linguistics.

T. Dunning. 1993. Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1):61–74.

Y. Futagi, P. Deane, M. Chodorow, and J. Tetreault. 2008. A computational approach to detecting collocation errors in the writing of non-native speakers of English. Computer Assisted Language Learning, 21(4):353–367.

M. Gamon. 2010. Using mostly native data to correct errors in learners’ writing: A meta-classifier approach. In Proceedings of Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL, pages 163–171, Los Angeles, California, June. Association for Computational Linguistics.

D. Jurafsky and J. H. Martin. 2009. Speech and Language Processing: An Introduction to Natural Language Processing, Speech Recognition, and Computational Linguistics. 2nd edition. Prentice-Hall.

K. Maekawa. 2008. Balanced corpus of contemporary written Japanese. In Proceedings of the 6th Workshop on Asian Language Resources (ALR), pages 101–102.

M. Kitamura and Y. Matsumoto. 1997. Automatic extraction of translation patterns in parallel corpora. In IPSJ, Vol. 38(4), pages 108–117, April. In Japanese.

S. Kullback and R. A. Leibler. 1951. On Information and Sufficiency. Annals of Mathematical Statistics, 22(1):79–86.

L. Lee. 1999. Measures of Distributional Similarity. In Proc of the 37th annual meeting of the ACL, Stroudsburg, PA, USA, 25.

A. L. Liu, D. Wible, and N. L. Tsao. 2009. Automated suggestions for miscollocations. In Proceedings of the NAACL HLT Workshop on Innovative Use of NLP for Building Educational Applications, pages 47–50, Boulder, Colorado, June. Association for Computational Linguistics.

Mainichi Newspaper Co. 1991. Mainichi Shimbun CD-ROM 1991.

T. Mizumoto, M. Komachi, M. Nagata, and Y. Matsumoto. 2011. Mining Revision Log of Language Learning SNS for Automated Japanese Error Correction of Second Language Learners. In Proceedings of the 5th International Joint Conference on Natural Language Processing, pages 147–155, Chiang Mai, Thailand, November. AFNLP.

R. Östling and O. Knutsson. 2009. A corpus-based tool for helping writers with Swedish collocations. In Proceedings of the Workshop on Extracting and Using Constructions in NLP, Nodalida, Odense, Denmark. 70, 77.

H. Oyama and Y. Matsumoto. 2010. Automatic Error Detection Method for Japanese Case Particles in Japanese Language Learners. In Corpus, ICT, and Language Education, pages 235–245.

A. Rozovskaya and D. Roth. 2010. Generating confusion sets for context-sensitive error correction. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pages 961–970, MIT, Massachusetts, USA, October. Association for Computational Linguistics.

F. Smadja, K. R. McKeown, and V. Hatzivassiloglou. 1996. Translating collocations for bilingual lexicons: A statistical approach. Computational Linguistics, 22:1–38.

H. Suzuki and K. Toutanova. 2006. Learning to Predict Case Markers in Japanese. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, pages 1049–1056, Sydney, July. Association for Computational Linguistics.

J. Tetreault, J. Foster, and M. Chodorow. 2010. Using parse features for preposition selection and error detection. In Proceedings of ACL 2010 Conference Short Papers, pages 353–358, Uppsala, Sweden, July. Association for Computational Linguistics.

The National Institute for Japanese Language, editor. 1964. Bunrui-Goi-Hyo. Shuei shuppan. In Japanese.

J. C. Wu, Y. C. Chang, T. Mitamura, and J. S. Chang. 2010. Automatic collocation suggestion in academic writing. In Proceedings of the ACL 2010 Conference Short Papers, pages 115–119, Uppsala, Sweden, July. Association for Computational Linguistics.