jmlr jmlr2007 jmlr2007-49 jmlr2007-49-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Jaime S. Cardoso, Joaquim F. Pinto da Costa
Abstract: Classification of ordinal data is one of the most important tasks of relation learning. This paper introduces a new machine learning paradigm specifically intended for classification problems where the classes have a natural order. The technique reduces the problem of classifying ordered classes to the standard two-class problem. The introduced method is then mapped into support vector machines and neural networks. Generalization bounds of the proposed ordinal classifier are also provided. An experimental study with artificial and real data sets, including an application to gene expression analysis, verifies the usefulness of the proposed approach. Keywords: classification, ordinal data, support vector machines, neural networks
A. Agarwal, J. T. Davis, and T. Ward. Supporting ordinal four-state classification decisions using neural networks. In Information Technology and Management, pages 5–26, 2001. P. L. Bartlett and J. Shawe-Taylor. Generalization performance of support vector machines and other pattern classifiers. In B. Scholkopf, C. J. C. Burges, and A. J. Smola, editors, Advances in Kernel Methods - Support Vector Learning, pages 43–54. MIT Press, Cambridge, MA, 1999. J. S. Cardoso, J. F. Pinto da Costa, and M. J. Cardoso. Modelling ordinal relations with SVMs: an application to objective aesthetic evaluation of breast cancer conservative treatment. Neural Networks, 18:808–817, june-july 2005. 1427 C ARDOSO AND P INTO DA C OSTA W. Chu and Z. Ghahramani. Gaussian processes for ordinal regression. Journal of Machine Learning Research, 6:1019–1041, 2005. W. Chu and S. S. Keerthi. New approaches to support vector ordinal regression. In Proceedings of International Conference on Machine Learning (ICML05), pages 145–152, 2005. M. Costa. Probabilistic interpretation of feedforward network outputs, with relationships to statistical prediction of ordinal quantities. International Journal Neural Systems, 7(5):627–638, 1996. J. Dong, A. Krzyzak, and C. Y. Suen. Fast SVM training algorithm with decomposition on very large data sets. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(4):603–618, 2005. E. Frank and M. Hall. A simple approach to ordinal classification. In Proceedings of the 12th European Conference on Machine Learning, volume 1, pages 145–156, 2001. I. Guyon, J. Weston, S. Barnhill, and V. Vapnik. Gene selection for cancer classification using support vector machines. Machine Learning, 46:389–422, 2002. R. Herbrich, T. Graepel, and K. Obermayer. Regression models for ordinal data: a machine learning approach. Technical Report TR-99/03, TU Berlin, 1999a. R. Herbrich, T. Graepel, and K. Obermayer. Support vector learning for ordinal regression. In Ninth International Conference on Artificial Neural Networks ICANN, volume 1, pages 97–102, 1999b. T. Joachims. Making large-scale support vector machine learning practical. In A. Smola B. Sch¨ lkopf, C. Burges, editor, Advances in Kernel Methods: Support Vector Machines. MIT o Press, Cambridge, MA, 1998. Y. Lin, Y. Lee, and G. Wahba. Support vector machines for classification in nonstandard situations. Machine Learning, 46:191–202, 2002. M. J. Mathieson. Ordinal models for neural networks. In A.-P.N Refenes, Y. Abu-Mostafa, and J. Moody, editors, Neural Networks for Financial Engineering. World Scientific, Singapore, 1995. P. McCullagh. Regression models for ordinal data. Journal of the Royal Statistical Society Series, 42:109–142, 1980. P. McCullagh and J. A. Nelder. Generalized Linear Models. Chapman and Hall, 1989. J. Moody and J. Utans. Architecture selection strategies for neural networks: application to corporate bond rating prediction. In A.-P. Refenes, editor, Neural Networks in the Capital Markets, pages 277–300, Chichester, 1995. Wiley. J. Platt. Fast training of support vector machines using sequential minimal optimization. In Advances in Kernel Methods-Support Vector Learning, pages 185–208, 1998. W. Press, B. Flannery, S. Teukolsky, and W. Vetterling. Numerical Recipes in C: the Art of Scientific Computing. Cambridge University Press, 1992. 1428 L EARNING TO C LASSIFY O RDINAL DATA : T HE DATA R EPLICATION M ETHOD B. Scholkopf, A. J. Smola, R. C. Williamson, and P. L. Bartlett. New support vector algorithms. Neural Computation, 12:1207–1245, 2000. A. Shashua and A. Levin. Ranking with large margin principle: Two approaches. In Neural Information and Processing Systems (NIPS), 2002. L. Shen and A. K. Joshi. Ranking and reranking with perceptron. Machine Learning, 60:73–96, September 2005. D. Singh, P. G. Febbo, K. Ross, D. G. Jackson, J. Manola, C. Ladd, P. Tamayo, A. A. Renshaw, A. V. D’Amico, J. P. Richie, E. S. Lander, M. Loda, P. W. Kantoff, T. R. Golub, and W. R. Sellers. Gene expression correlates of clinical prostate cancer behavior. Cancer Cell, 1:1019–1041, 2002. V. N. Vapnik. Statistical Learning Theory. John Wiley, 1998. 1429