emnlp emnlp2013 emnlp2013-169 emnlp2013-169-reference knowledge-graph by maker-knowledge-mining

169 emnlp-2013-Semi-Supervised Representation Learning for Cross-Lingual Text Classification

Source: pdf

Author: Min Xiao ; Yuhong Guo

Abstract: Cross-lingual adaptation aims to learn a prediction model in a label-scarce target language by exploiting labeled data from a labelrich source language. An effective crosslingual adaptation system can substantially reduce the manual annotation effort required in many natural language processing tasks. In this paper, we propose a new cross-lingual adaptation approach for document classification based on learning cross-lingual discriminative distributed representations of words. Specifically, we propose to maximize the loglikelihood of the documents from both language domains under a cross-lingual logbilinear document model, while minimizing the prediction log-losses of labeled documents. We conduct extensive experiments on cross-lingual sentiment classification tasks of Amazon product reviews. Our experimental results demonstrate the efficacy of the pro- posed cross-lingual adaptation approach.

reference text

M. Amini, N. Usunier, and C. Goutte. Learning from multiple partially observed views - an application to multilingual text categorization. In Advances in Neural Information Processing Systems (NIPS), 2009. B. A.R., A. Joshi, and P. Bhattacharyya. Crosslingual sentiment analysis for indian languages using linked wordnets. In Proceedings of the International Conference on Computational Linguistics (COLING), 2012. N. Bel, C. Koster, and M. Villegas. Cross-lingual Table 2: Examples of source seed words together with five closest English words and five closest German words estimated using the Euclidean distance in the cross-lingual representation space on the task GB. books English German absolutely English German love English German bpwbteoa xo gotrekd s bw tbeu ¨lax oc trh te rcd taoe btsfrmationalpiuyntle tely ely ads kbie ocsf mhio nelpiu rtliev tlf oie okven eld lwf ui¨e iheb led en r ce porvx oisep cte r lpyn rs iecve dht pe¨ro ouec hei hser teng bwgieroce toelaedtrgbng re uro st¨ßs eatretnig n cno aen tvneortkn kei e cihn te s expensive English German good English German not English German text categorization. In Proceedings of European Conference on Digital Libraries (ECDL), 2003. Y. Bengio, R. Ducharme, and P. Vincent. A neu- ral probabilistic language model. In Advances in Neural Information Processing Systems (NIPS), 2000. D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research (JMLR), 3:993–1022, 2003. J. Blitzer, M. Dredze, and F. Pereira. Biographies, bollywood, boomboxes and blenders: Domain adaptation for sentiment classification. In Proceedings of the Annual Meeting of the Asso. for Computational Linguistics (ACL), 2007. K. Diamantaras and S. Kung. Principal component neural networks: theory and applications. WileyInterscience, 1996. L. Duan, D. Xu, and I. Tsang. Learning with augmented features for heterogeneous domain adaptation. In Proceedings of the International Conference on Machine Learning (ICML), 2012. R. Fan, K. Chang, C. Hsieh, X. Wang, and C. Lin. LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research (JMLR), 9: 1871–1874, 2008. 1474 C. Fellbaum, editor. WordNet: an electronic lexical database. MIT Press, 1998. L. Gillick and S. Cox. Some statistical issues in the comparison of speech recognition algorithms. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1989. A. Gliozzo. Exploiting comparable corpora and bilingual dictionaries for cross-language text categorization. In Proceedings of the International Conference on Computational Linguistics and the Annual Meeting of the Association for Computational Linguistics (ICCL-ACL), 2006. Y. Guo and M. Xiao. Transductive representation learning for cross-lingual text classification. In Proceedings of the IEEE International Conference on Data Mining (ICDM), 2012a. Y. Guo and M. Xiao. Cross language text classification via subspace co-regularized multi-view learning. In Proceedings ofthe International Conference on Machine Learning (ICML), 2012b. T. Hofmann. Probabilistic latent semantic analysis. In Proceedings of Uncertainty in Artificial Intelligence (UAI), 1999. A. Klementiev, I. Titov, and B. Bhattarai. Inducing crosslingual distributed representations of words. In Proceedings ofthe International Conference on Computational Linguistics (COLING), 2012. X. Ling, G. Xue, W. Dai, Y. Jiang, Q. Yang, and Y. Yu. Can chinese web pages be classified with english data source? In Proceedings of the International Conference on World Wide Web (WWW), 2008. M. Littman, S. Dumais, and T. Landauer. Automatic Cross-Language Information Retrieval using Latent Semantic Indexing, chapter 5, pages 5 1–62. Kluwer Academic Publishers, 1998. A. Maas, R. Daly, P. Pham, D. Huang, A. Ng, and C. Potts. Learning word vectors for sentiment analysis. In Proceedings of the Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL), 2011. D. Mimno, H. Wallach, J. Naradowsky, D. Smith, and A. McCallum. Polylingual topic models. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2, 2009. A. Mnih and G. Hinton. Three new graphical models for statistical language modelling. In Proceedings of the International Conference on Machine Learning (ICML), 2007. X. Ni, J. Sun, J. Hu, and Z. Chen. Cross lingual text classification by mining multilingual topics from wikipedia. In Proceedings of the ACM International Conference on Web Search and Data Mining (WSDM), 2011. J. Pan, G. Xue, Y. Yu, and Y. Wang. Cross-lingual sentiment classification via bi-view non-negative matrix tri-factorization. In Proceedings of the Pacific-Asia conference on Advances in knowledge discovery and data mining (PAKDD), 2011. P. Petrenz and B. Webber. Label propagation for fine-grained cross-lingual genre classification. In Proceedings of the NIPS xLiTe workshop, 2012. S. Petrov, D. Das, and R. McDonald. A universal part-of-speech tagset. In Proceedings of the International Conference on Language Resources and Evaluation (LREC), 2012. 1475 J. Platt, K. Toutanova, and W. Yih. Translingual document representations from discriminative projec- tions. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2010. P. Prettenhofer and B. Stein. Cross-language text classification using structural correspondence learning. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2010. L. Rigutini and M. Maggini. An em based training algorithm for cross-language text categorization. In Proceedings of the Web Intelligence Conference, 2005. J. Shanahan, G. Grefenstette, Y. Qu, and D. Evans. Mining multilingual opinions through classification and translation. In Proceedings of AAAI Spring Symposium on Exploring Attitude and Affect in Text, 2004. W. Smet, J. Tang, and M. Moens. Knowledge transfer across multilingual corpora via latent topics. In Proceedings of the Pacific-Asia conference on Advances in knowledge discovery and data mining (PAKDD), 2011. A. Vinokourov, J. Shawe-taylor, and N. Cristianini. Inferring a semantic representation of text via cross-language correlation analysis. In Advances in Neural Information Processing Systems (NIPS), 2002. C. Wan, R. Pan, and J. Li. Bi-weighting domain adaptation for cross-language text classification. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), 2011. X. Wan. Co-training for cross-lingual sentiment classification. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2009. B. Wei and C. Pal. Cross lingual adaptation: An experiment on sentiment classifications. In Proceedings of the Annual Meeting of the Asso. for Computational Linguistics (ACL), 2010.