NIPS 2008, paper 155: Nonparametric regression and classification with joint sparsity constraints


Source: pdf

Author: Han Liu, Larry Wasserman, John D. Lafferty

Abstract: We propose new families of models and algorithms for high-dimensional nonparametric learning with joint sparsity constraints. Our approach is based on a regularization method that enforces common sparsity patterns across different function components in a nonparametric additive model. The algorithms employ a coordinate descent approach that is based on a functional soft-thresholding operator. The framework yields several new models, including multi-task sparse additive models, multi-response sparse additive models, and sparse additive multi-category logistic regression. The methods are illustrated with experiments on synthetic data and gene microarray data.
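
The abstract does not spell out the functional soft-thresholding operator, so the following is only a minimal sketch of the standard finite-dimensional analogue, group soft-thresholding, which shrinks a block of coefficients jointly and zeroes the whole block when its norm falls below a threshold. The function name `group_soft_threshold` and the parameter `lam` are hypothetical names chosen for illustration; the paper's operator acts on function components rather than on plain coefficient vectors, and may differ in detail.

```python
import math


def group_soft_threshold(block, lam):
    """Jointly shrink a block of coefficients (illustrative, not the paper's code).

    Returns max(0, 1 - lam / ||block||_2) * block, so the whole block is
    either scaled toward zero together or set exactly to zero together.
    """
    norm = math.sqrt(sum(x * x for x in block))
    if norm <= lam:
        # The block is too small to survive the penalty: drop it entirely.
        return [0.0] * len(block)
    scale = 1.0 - lam / norm
    return [scale * x for x in block]
```

Because every coordinate in a block shares one scaling factor, all of them become zero or nonzero at once, which is the kind of common sparsity pattern that a joint penalty is meant to encourage.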


reference text

Fornasier, M. and Rauhut, H. (2008). Recovery algorithms for vector valued data with joint sparsity constraints. SIAM Journal on Numerical Analysis 46 577–613.

Friedman, J. H. (1991). Multivariate adaptive regression splines. The Annals of Statistics 19 1–67.

Khan, J., Wei, J. S., Ringner, M., Saal, L. H., Ladanyi, M., Westermann, F., Berthold, F., Schwab, M., Antonescu, C. R., Peterson, C. and Meltzer, P. S. (2001). Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nature Medicine 7 673–679.

Krishnapuram, B., Carin, L., Figueiredo, M. and Hartemink, A. (2005). Sparse multinomial logistic regression: Fast algorithms and generalization bounds. IEEE Transactions on Pattern Analysis and Machine Intelligence 27 957–968.

Ravikumar, P., Liu, H., Lafferty, J. and Wasserman, L. (2007). SpAM: Sparse additive models. In Advances in Neural Information Processing Systems 20. MIT Press.

Rockafellar, R. T. and Wets, R. J.-B. (1998). Variational Analysis. Springer-Verlag Inc.

Tibshirani, R., Hastie, T., Narasimhan, B. and Chu, G. (2002). Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci U.S.A. 99 6567–6572.

Tropp, J., Gilbert, A. C. and Strauss, M. J. (2006). Algorithms for simultaneous sparse approximation. Part II: Convex relaxation. Signal Processing 86 572–588.

Turlach, B., Venables, W. N. and Wright, S. J. (2005). Simultaneous variable selection. Technometrics 47 349–363.

Zhang, H. H., Liu, Y., Wu, Y. and Zhu, J. (2008). Variable selection for the multicategory SVM via adaptive sup-norm regularization. Electronic Journal of Statistics 2 149–167.

Zhang, J. (2006). A probabilistic framework for multitask learning. Tech. Rep. CMU-LTI-06-006, Ph.D. thesis, Carnegie Mellon University.