acl acl2011 acl2011-150 acl2011-150-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Xipeng Qiu ; Xuanjing Huang ; Zhao Liu ; Jinlong Zhou
Abstract: Recently, hierarchical text classification has become an active research topic. The essential idea is that the descendant classes can share the information of the ancestor classes in a predefined taxonomy. In this paper, we claim that each class has several latent concepts and its subclasses share information with these different concepts respectively. Then, we propose a variant Passive-Aggressive (PA) algorithm for hierarchical text classification with latent concepts. Experimental results show that the performance of our algorithm is competitive with the recently proposed hierarchical classification algorithms.
L. Cai and T. Hofmann. 2004. Hierarchical document categorization with support vector machines. In Proceedings of CIKM. L. Cai and T. Hofmann. 2007. Exploiting known taxonomies in learning overlapping concepts. In Proceedings of International Joint Conferences on Artificial Intelligence. R. Caruana. 1997. Multi-task learning. Machine Learning, 28(1):41–75. D. Koller and M Sahami. 1997. Hierarchically classifying documents using very few words. In Proceedings of the International Conference on Machine Learning (ICML). T.Y. Liu, Y. Yang, H. Wan, H.J. Zeng, Z. Chen, and W.Y. Ma. 2005. Support vector machines classification with a very large-scale taxonomy. ACM SIGKDD Explorations Newsletter, 7(1):43. Youdong Miao and Xipeng Qiu. 2009. Hierarchical centroid-based classifier for large scale text classification. In Large Scale Hierarchical Text classification (LSHTC) Pascal Challenge. Xipeng Qiu, Wenjun Gao, and Xuanjing Huang. 2009. Hierarchical multi-class text categorization with glob- al margin maximization. In Proceedings of the ACLIJCNLP 2009 Conference, pages 165–168, Suntec, Singapore, August. Association for Computational Linguistics. Xipeng Qiu, Jinlong Zhou, and Xuanjing Huang. 2011. An effective feature selection method for text categorization. In Proceedings of the 15th Pacific-Asia Conference on Knowledge Discovery and Data Mining. Juho Rousu, Craig Saunders, Sandor Szedmak, and John Shawe-Taylor. 2006. Kernel-based learning of hierarchical multilabel classification models. In Journal of Machine Learning Research. G. Salton, A. Wong, and CS Yang. 1975. A vector space model for automatic indexing. Communications of the ACM, 18(1 1):613–620. F. Sebastiani. 2002. Machine learning in automated text categorization. ACM computing surveys, 34(1): 1–47. A. Sun and E.-P Lim. 2001 . Hierarchical text classification and evaluation. In Proceedings of the IEEE International Conference on Data Mining. A. Weigend, E. Wiener, and J Pedersen. 1999. Exploiting hierarchy in text categorization. In Information Retrieval. 602 Y. Xue, X. Liao, L. Carin, and B. Krishnapuram. 2007. Multi-task learning for classification with dirichlet process priors. The Journal of Machine Learning Re- search, 8:63. Y. Yang and X. Liu. 1999. A re-examination of text categorization methods. In Proc. ofSIGIR. ACM Press New York, NY, USA. Y. Yang and J.O. Pedersen. 1997. A comparative study on feature selection in text categorization. In Proc. of Int. Conf. on Mach. Learn. (ICML), volume 97.