
49 jmlr-2007-Learning to Classify Ordinal Data: The Data Replication Method

Source: pdf

Author: Jaime S. Cardoso, Joaquim F. Pinto da Costa

Abstract: Classification of ordinal data is one of the most important tasks of relation learning. This paper introduces a new machine learning paradigm specifically intended for classification problems where the classes have a natural order. The technique reduces the problem of classifying ordered classes to the standard two-class problem. The introduced method is then mapped into support vector machines and neural networks. Generalization bounds of the proposed ordinal classifier are also provided. An experimental study with artificial and real data sets, including an application to gene expression analysis, verifies the usefulness of the proposed approach.

Keywords: classification, ordinal data, support vector machines, neural networks
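The abstract does not spell out how the reduction to a two-class problem is built. The following is a minimal sketch of one way such a reduction can be set up, under stated assumptions: each point is copied once per boundary between adjacent classes, the copy is tagged with extra coordinates identifying the boundary, and its binary label says whether the true rank lies above that boundary. The function names (`replicate`, `rank_from_binary`), the scale parameter `h`, and the exact encoding of the extra coordinates are illustrative choices, not details taken from the text above.

```python
from typing import List, Sequence, Tuple


def replicate(
    X: Sequence[Sequence[float]],
    y: Sequence[int],
    num_classes: int,
    h: float = 1.0,
) -> Tuple[List[List[float]], List[int]]:
    """Turn an ordinal data set (ranks 1..num_classes) into one binary data set.

    Assumption: one replica per boundary q = 1..num_classes-1, each replica
    extended with num_classes-2 extra coordinates that identify the boundary.
    """
    rows: List[List[float]] = []
    labels: List[int] = []
    for q in range(1, num_classes):
        # Extra coordinates: all zeros for the first boundary,
        # otherwise the value h at position q-2.
        extra = [0.0] * (num_classes - 2)
        if q >= 2:
            extra[q - 2] = h
        for x, rank in zip(X, y):
            rows.append(list(x) + extra)
            # Binary target: is the true rank above boundary q?
            labels.append(1 if rank > q else 0)
    return rows, labels


def rank_from_binary(decisions: Sequence[int]) -> int:
    """Map the binary answers (one per boundary) back to a rank.

    decisions[q-1] is 1 if the binary classifier says "rank > q".
    The predicted rank is 1 plus the number of boundaries exceeded.
    """
    return 1 + sum(decisions)


if __name__ == "__main__":
    X = [[0.1], [0.5], [0.9]]
    y = [1, 2, 3]
    rows, labels = replicate(X, y, num_classes=3)
    for row, label in zip(rows, labels):
        print(row, label)
```

Any standard binary learner can then be trained on `rows` and `labels`. At prediction time, a new point would be extended once per boundary in the same way, and the resulting binary answers combined with `rank_from_binary`.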

reference text

A. Agarwal, J. T. Davis, and T. Ward. Supporting ordinal four-state classification decisions using neural networks. In Information Technology and Management, pages 5–26, 2001.

P. L. Bartlett and J. Shawe-Taylor. Generalization performance of support vector machines and other pattern classifiers. In B. Schölkopf, C. J. C. Burges, and A. J. Smola, editors, Advances in Kernel Methods - Support Vector Learning, pages 43–54. MIT Press, Cambridge, MA, 1999.

J. S. Cardoso, J. F. Pinto da Costa, and M. J. Cardoso. Modelling ordinal relations with SVMs: an application to objective aesthetic evaluation of breast cancer conservative treatment. Neural Networks, 18:808–817, June–July 2005.

W. Chu and Z. Ghahramani. Gaussian processes for ordinal regression. Journal of Machine Learning Research, 6:1019–1041, 2005.

W. Chu and S. S. Keerthi. New approaches to support vector ordinal regression. In Proceedings of International Conference on Machine Learning (ICML05), pages 145–152, 2005.

M. Costa. Probabilistic interpretation of feedforward network outputs, with relationships to statistical prediction of ordinal quantities. International Journal Neural Systems, 7(5):627–638, 1996.

J. Dong, A. Krzyzak, and C. Y. Suen. Fast SVM training algorithm with decomposition on very large data sets. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(4):603–618, 2005.

E. Frank and M. Hall. A simple approach to ordinal classification. In Proceedings of the 12th European Conference on Machine Learning, volume 1, pages 145–156, 2001.

I. Guyon, J. Weston, S. Barnhill, and V. Vapnik. Gene selection for cancer classification using support vector machines. Machine Learning, 46:389–422, 2002.

R. Herbrich, T. Graepel, and K. Obermayer. Regression models for ordinal data: a machine learning approach. Technical Report TR-99/03, TU Berlin, 1999a.

R. Herbrich, T. Graepel, and K. Obermayer. Support vector learning for ordinal regression. In Ninth International Conference on Artificial Neural Networks ICANN, volume 1, pages 97–102, 1999b.

T. Joachims. Making large-scale support vector machine learning practical. In A. Smola, B. Schölkopf, C. Burges, editors, Advances in Kernel Methods: Support Vector Machines. MIT Press, Cambridge, MA, 1998.

Y. Lin, Y. Lee, and G. Wahba. Support vector machines for classification in nonstandard situations. Machine Learning, 46:191–202, 2002.

M. J. Mathieson. Ordinal models for neural networks. In A.-P. N. Refenes, Y. Abu-Mostafa, and J. Moody, editors, Neural Networks for Financial Engineering. World Scientific, Singapore, 1995.

P. McCullagh. Regression models for ordinal data. Journal of the Royal Statistical Society Series, 42:109–142, 1980.

P. McCullagh and J. A. Nelder. Generalized Linear Models. Chapman and Hall, 1989.

J. Moody and J. Utans. Architecture selection strategies for neural networks: application to corporate bond rating prediction. In A.-P. Refenes, editor, Neural Networks in the Capital Markets, pages 277–300, Chichester, 1995. Wiley.

J. Platt. Fast training of support vector machines using sequential minimal optimization. In Advances in Kernel Methods-Support Vector Learning, pages 185–208, 1998.

W. Press, B. Flannery, S. Teukolsky, and W. Vetterling. Numerical Recipes in C: the Art of Scientific Computing. Cambridge University Press, 1992.

B. Schölkopf, A. J. Smola, R. C. Williamson, and P. L. Bartlett. New support vector algorithms. Neural Computation, 12:1207–1245, 2000.

A. Shashua and A. Levin. Ranking with large margin principle: Two approaches. In Neural Information and Processing Systems (NIPS), 2002.

L. Shen and A. K. Joshi. Ranking and reranking with perceptron. Machine Learning, 60:73–96, September 2005.

D. Singh, P. G. Febbo, K. Ross, D. G. Jackson, J. Manola, C. Ladd, P. Tamayo, A. A. Renshaw, A. V. D’Amico, J. P. Richie, E. S. Lander, M. Loda, P. W. Kantoff, T. R. Golub, and W. R. Sellers. Gene expression correlates of clinical prostate cancer behavior. Cancer Cell, 1:1019–1041, 2002.

V. N. Vapnik. Statistical Learning Theory. John Wiley, 1998.