nips nips2011 nips2011-104 nips2011-104-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Artin Armagan, Merlise Clyde, David B. Dunson
Abstract: In recent years, a rich variety of shrinkage priors have been proposed that have great promise in addressing massive regression problems. In general, these new priors can be expressed as scale mixtures of normals, but have more complex forms and better properties than traditional Cauchy and double exponential priors. We first propose a new class of normal scale mixtures through a novel generalized beta distribution that encompasses many interesting priors as special cases. This encompassing framework should prove useful in comparing competing priors, considering properties and revealing close connections. We then develop a class of variational Bayes approximations through the new hierarchy presented that will scale more efficiently to the types of truly massive data sets that are now encountered routinely. 1
[1] A. Armagan. Variational bridge regression. JMLR: W&CP;, 5:17–24, 2009. 8
[2] A. Armagan, D. B. Dunson, and J. Lee. arXiv:1104.0861v2, 2011. Generalized double Pareto shrinkage.
[3] C. Armero and M. J. Bayarri. Prior assessments for prediction in queues. The Statistician, 43(1):pp. 139–153, 1994.
[4] J. Berger. A robust generalized Bayes estimator and confidence region for a multivariate normal mean. The Annals of Statistics, 8(4):pp. 716–761, 1980.
[5] C. M. Bishop and M. E. Tipping. Variational relevance vector machines. In UAI ’00: Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence, pages 46–53, San Francisco, CA, USA, 2000. Morgan Kaufmann Publishers Inc.
[6] C. M. Carvalho, N. G. Polson, and J. G. Scott. Handling sparsity via the horseshoe. JMLR: W&CP;, 5, 2009.
[7] C. M. Carvalho, N. G. Polson, and J. G. Scott. The horseshoe estimator for sparse signals. Biometrika, 97(2):465–480, 2010.
[8] M. A. T. Figueiredo. Adaptive sparseness for supervised learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25:1150–1159, 2003.
[9] E. I. George and R. E. McCulloch. Variable selection via Gibbs sampling. Journal of the American Statistical Association, 88, 1993.
[10] M. Gordy. A generalization of generalized beta distributions. Finance and Economics Discussion Series 1998-18, Board of Governors of the Federal Reserve System (U.S.), 1998.
[11] J. E. Griffin and P. J. Brown. Bayesian adaptive lassos with non-convex penalization. Technical Report, 2007.
[12] J. E. Griffin and P. J. Brown. Inference with normal-gamma prior distributions in regression problems. Bayesian Analysis, 5(1):171–188, 2010.
[13] C. Hans. Bayesian lasso regression. Biometrika, 96:835–845, 2009.
[14] C. J. Hoggart, J. C. Whittaker, and David J. Balding M. De Iorio. Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies. PLoS Genetics, 4(7), 2008.
[15] H. Ishwaran and J. S. Rao. Spike and slab variable selection: Frequentist and Bayesian strategies. The Annals of Statistics, 33(2):pp. 730–773, 2005.
[16] I. M. Johnstone and B. W. Silverman. Needles and straw in haystacks: Empirical Bayes estimates of possibly sparse sequences. Annals of Statistics, 32(4):pp. 1594–1649, 2004.
[17] M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul. An introduction to variational methods for graphical models. MIT Press, Cambridge, MA, USA, 1999.
[18] T. J. Mitchell and J. J. Beauchamp. Bayesian variable selection in linear regression. Journal of the American Statistical Association, 83(404):pp. 1023–1032, 1988.
[19] T. Park and G. Casella. The Bayesian lasso. Journal of the American Statistical Association, 103:681–686(6), 2008.
[20] N. G. Polson and J. G. Scott. Alternative global-local shrinkage rules using hypergeometricbeta mixtures. Discussion Paper 2009-14, Department of Statistical Science, Duke University, 2009.
[21] W. E. Strawderman. Proper Bayes minimax estimators of the multivariate normal mean. The Annals of Mathematical Statistics, 42(1):pp. 385–388, 1971.
[22] R. Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological), 58(1):267–288, 1996.
[23] L. Tierney and J. B. Kadane. Accurate approximations for posterior moments and marginal densities. Journal of the American Statistical Association, 81(393):82–86, 1986.
[24] M. E. Tipping. Sparse Bayesian learning and the relevance vector machine. Journal of Machine Learning Research, 1, 2001. 9