1856 andrew gelman stats-2013-05-14-GPstuff: Bayesian Modeling with Gaussian Processes

meta infos for this blog

Source: html

Introduction: I think it’s part of my duty as a blogger to intersperse, along with the steady flow of jokes, rants, and literary criticism, some material that will actually be useful to you. So here goes. Jarno Vanhatalo, Jaakko Riihimäki, Jouni Hartikainen, Pasi Jylänki, Ville Tolvanen, and Aki Vehtari write: The GPstuff toolbox is a versatile collection of Gaussian process models and computational tools required for Bayesian inference. The tools include, among others, various inference methods, sparse approximations and model assessment methods. We can actually now fit Gaussian processes in Stan. But for big problems (or even moderately-sized problems), full Bayes can be slow. GPstuff uses EP, which is faster. At some point we’d like to implement EP in Stan. (Right now we’re working with Dave Blei to implement VB.) GPstuff really works. I saw Aki use it to fit a nonparametric version of the Bangladesh well-switching example in ARM. He was sitting in his office and just whip

Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 I think it’s part of my duty as a blogger to intersperse, along with the steady flow of jokes, rants, and literary criticism, some material that will actually be useful to you. [sent-1, score-0.816]

2 Jarno Vanhatalo, Jaakko Riihimäki, Jouni Hartikainen, Pasi Jylänki, Ville Tolvanen, and Aki Vehtari write: The GPstuff toolbox is a versatile collection of Gaussian process models and computational tools required for Bayesian inference. [sent-3, score-0.638]

3 The tools include, among others, various inference methods, sparse approximations and model assessment methods. [sent-4, score-0.672]

4 We can actually now fit Gaussian processes in Stan. [sent-5, score-0.326]

5 But for big problems (or even moderately-sized problems), full Bayes can be slow. [sent-6, score-0.151]

6 At some point we’d like to implement EP in Stan. [sent-8, score-0.206]

7 (Right now we’re working with Dave Blei to implement VB. [sent-9, score-0.206]

8 I saw Aki use it to fit a nonparametric version of the Bangladesh well-switching example in ARM. [sent-11, score-0.415]

9 He was sitting in his office and just whipped up the model and fit it. [sent-12, score-0.575]

similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('gpstuff', 0.469), ('ep', 0.289), ('aki', 0.231), ('implement', 0.206), ('gaussian', 0.202), ('whipped', 0.166), ('bangladesh', 0.166), ('intersperse', 0.166), ('jaakko', 0.166), ('tools', 0.164), ('fit', 0.161), ('jouni', 0.156), ('toolbox', 0.156), ('vehtari', 0.144), ('blei', 0.144), ('rants', 0.131), ('steady', 0.122), ('jokes', 0.122), ('approximations', 0.117), ('flow', 0.116), ('duty', 0.116), ('sparse', 0.114), ('nonparametric', 0.106), ('literary', 0.106), ('dave', 0.106), ('blogger', 0.102), ('assessment', 0.102), ('sitting', 0.096), ('processes', 0.093), ('collection', 0.091), ('problems', 0.091), ('office', 0.087), ('computational', 0.084), ('required', 0.08), ('uses', 0.079), ('material', 0.075), ('saw', 0.075), ('criticism', 0.074), ('bayes', 0.073), ('version', 0.073), ('actually', 0.072), ('stan', 0.072), ('model', 0.065), ('process', 0.063), ('full', 0.06), ('include', 0.059), ('among', 0.057), ('along', 0.054), ('useful', 0.053), ('inference', 0.053)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1856 andrew gelman stats-2013-05-14-GPstuff: Bayesian Modeling with Gaussian Processes

2 0.27090484 2067 andrew gelman stats-2013-10-18-EP and ABC

Introduction: Expectation propagation and approximate Bayesian computation. Here are X’s comments on a paper, “Expectation-Propagation for Likelihood-Free Inference,” by Simon Barthelme and Nicolas Chopin. The paper is not new but the topic is still hot. Also there’s this paper by Maurizio Filippone and Mark Girolami on computation for Gaussian process models. I wonder how this connects to GPstuff, which I think is what Aki did to fit the birthdays model: This stuff is where it’s at.

3 0.18247622 1384 andrew gelman stats-2012-06-19-Slick time series decomposition of the birthdays data

Introduction: Aki updates : Here is my plot using the full time series data to make the model. Data analysis could be made in many different ways, but my hammer is Gaussian process, and so I modeled the data with a Gaussian process with six components 1) slowly changing trend 2) 7 day periodical component capturing day of week effect 3) 365.25 day periodical component capturing day of year effect 4) component to take into account the special days and interaction with weekends 5) small time scale correlating noise 6) independent Gaussian noise - Day of the week effect has been increasing in 80′s - Day of year effect has changed only a little during years - 22nd to 31st December is strange time I [Aki] will make the code available this week, but we have to first make new release of our GPstuff toolbox, as I used our development code to do this. I have no idea what’s going on with 29 Feb; I wouldn’t see why births would be less likely on that day. Also, the above graphs are g

4 0.13984649 2139 andrew gelman stats-2013-12-19-Happy birthday

Introduction: (Click for bigger image.) The above is Aki’s decomposition of the birthdays data (the number of babies born each day in the United States, from 1968 through 1988) using a Gaussian process model, as described in more detail in our book.

5 0.12046277 1379 andrew gelman stats-2012-06-14-Cool-ass signal processing using Gaussian processes (birthdays again)

Introduction: Aki writes: Here’s my version of the birthday frequency graph. I used Gaussian process with two slowly varying components and periodic component with decay, so that periodic form can change in time. I used Student’s t-distribution as observation model to allow exceptional dates to be outliers. I guess that periodic component due to week effect is still in the data because there is data only from twenty years. Naturally it would be better to model the whole timeseries, but it was easier to just use the csv by Mulligan. All I can say is . . . wow. Bayes wins again. Maybe Aki can supply the R or Matlab code? P.S. And let’s not forget how great the simple and clear time series plots are, compared to various fancy visualizations that people might try. P.P.S. More here.

6 0.11762951 1648 andrew gelman stats-2013-01-02-A important new survey of Bayesian predictive methods for model assessment, selection and comparison

7 0.092357688 2291 andrew gelman stats-2014-04-14-Transitioning to Stan

8 0.088833787 1454 andrew gelman stats-2012-08-11-Weakly informative priors for Bayesian nonparametric models?

9 0.08774174 1858 andrew gelman stats-2013-05-15-Reputations changeable, situations tolerable

10 0.086140133 1950 andrew gelman stats-2013-07-22-My talks that were scheduled for Tues at the Data Skeptics meetup and Wed at the Open Statistical Programming meetup

11 0.085428007 288 andrew gelman stats-2010-09-21-Discussion of the paper by Girolami and Calderhead on Bayesian computation

12 0.084844172 1469 andrew gelman stats-2012-08-25-Ways of knowing

13 0.082046136 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion

14 0.081853598 2011 andrew gelman stats-2013-09-07-Here’s what happened when I finished my PhD thesis

15 0.081661731 1961 andrew gelman stats-2013-07-29-Postdocs in probabilistic modeling! With David Blei! And Stan!

16 0.080917194 2016 andrew gelman stats-2013-09-11-Zipfian Academy, A School for Data Science

17 0.07980141 2351 andrew gelman stats-2014-05-28-Bayesian nonparametric weighted sampling inference

18 0.079012118 1205 andrew gelman stats-2012-03-09-Coming to agreement on philosophy of statistics

19 0.075996846 2161 andrew gelman stats-2014-01-07-My recent debugging experience

20 0.074406855 754 andrew gelman stats-2011-06-09-Difficulties with Bayesian model averaging
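
For readers wondering how word weights and similarity scores like those in the tfidf listing above might be produced, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the pipeline that generated this page: the smoothed idf formula, the whitespace tokenizer, the L2 normalisation, and the use of cosine similarity are all assumptions, and the function names (`tfidf_vectors`, `cosine`, `top_words`) are hypothetical. The toy documents are made up for the example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one L2-normalised {word: weight} dict per document."""
    # Assumption: naive lowercase + whitespace tokenisation.
    tokenised = [doc.lower().split() for doc in docs]
    n_docs = len(tokenised)
    # Document frequency: in how many documents each word appears.
    df = Counter(word for tokens in tokenised for word in set(tokens))
    vectors = []
    for tokens in tokenised:
        tf = Counter(tokens)
        # Assumption: smoothed idf, log((1 + N) / (1 + df)) + 1.
        vec = {
            word: count * (math.log((1 + n_docs) / (1 + df[word])) + 1)
            for word, count in tf.items()
        }
        norm = math.sqrt(sum(v * v for v in vec.values()))
        vectors.append({w: v / norm for w, v in vec.items()} if norm else vec)
    return vectors


def cosine(u, v):
    """Dot product of two sparse vectors that are already L2-normalised."""
    if len(u) > len(v):
        u, v = v, u  # iterate over the shorter vector
    return sum(weight * v.get(word, 0.0) for word, weight in u.items())


def top_words(vec, n):
    """The n highest-weighted words of one document, like a topN-words list."""
    return sorted(vec.items(), key=lambda kv: -kv[1])[:n]


if __name__ == "__main__":
    # Toy corpus, invented for illustration only.
    docs = [
        "gpstuff fits gaussian process models with ep",
        "aki used a gaussian process for the birthdays data",
        "jokes rants and literary criticism",
    ]
    vecs = tfidf_vectors(docs)
    print(top_words(vecs[0], 3))
    for i, vec in enumerate(vecs):
        print(i, round(cosine(vecs[0], vec), 3))
```

Under these assumptions a document compared with itself scores 1, which would be consistent with the same-blog row in the list above, and documents with no shared words score 0.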

similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.104), (1, 0.084), (2, -0.043), (3, 0.055), (4, 0.005), (5, 0.037), (6, -0.021), (7, -0.067), (8, 0.026), (9, -0.023), (10, -0.005), (11, 0.017), (12, -0.067), (13, -0.012), (14, 0.003), (15, -0.014), (16, 0.03), (17, 0.006), (18, -0.018), (19, 0.0), (20, -0.005), (21, -0.025), (22, -0.036), (23, -0.02), (24, 0.001), (25, -0.001), (26, -0.019), (27, -0.005), (28, 0.004), (29, 0.01), (30, 0.004), (31, -0.004), (32, -0.015), (33, -0.023), (34, 0.009), (35, -0.002), (36, -0.033), (37, -0.005), (38, 0.017), (39, 0.005), (40, -0.017), (41, 0.034), (42, -0.002), (43, 0.005), (44, 0.04), (45, 0.018), (46, -0.012), (47, -0.004), (48, -0.013), (49, -0.014)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.96138197 1856 andrew gelman stats-2013-05-14-GPstuff: Bayesian Modeling with Gaussian Processes

2 0.8138237 964 andrew gelman stats-2011-10-19-An interweaving-transformation strategy for boosting MCMC efficiency

Introduction: Yaming Yu and Xiao-Li Meng write in with a cool new idea for improving the efficiency of Gibbs and Metropolis in multilevel models: For a broad class of multilevel models, there exist two well-known competing parameterizations, the centered parameterization (CP) and the non-centered parameterization (NCP), for effective MCMC implementation. Much literature has been devoted to the questions of when to use which and how to compromise between them via partial CP/NCP. This article introduces an alternative strategy for boosting MCMC efficiency via simply interweaving—but not alternating—the two parameterizations. This strategy has the surprising property that failure of both the CP and NCP chains to converge geometrically does not prevent the interweaving algorithm from doing so. It achieves this seemingly magical property by taking advantage of the discordance of the two parameterizations, namely, the sufficiency of CP and the ancillarity of NCP, to substantially reduce the Markovian

3 0.79056317 2003 andrew gelman stats-2013-08-30-Stan Project: Continuous Relaxations for Discrete MRFs

Introduction: Hamiltonian Monte Carlo (HMC), as used by Stan , is only defined for continuous parameters. We’d love to be able to do discrete sampling. So I was excited when I saw this: Yichuan Zhang, Charles Sutton, Amos J Storkey, and Zoubin Ghahramani. 2012. Continuous Relaxations for Discrete Hamiltonian Monte Carlo . NIPS 25. Abstract: Continuous relaxations play an important role in discrete optimization, but have not seen much use in approximate probabilistic inference. Here we show that a general form of the Gaussian Integral Trick makes it possible to transform a wide class of discrete variable undirected models into fully continuous systems. The continuous representation allows the use of gradient-based Hamiltonian Monte Carlo for inference, results in new ways of estimating normalization constants (partition functions), and in general opens up a number of new avenues for inference in difficult discrete systems. We demonstrate some of these continuous relaxation inference a

4 0.78394997 2020 andrew gelman stats-2013-09-12-Samplers for Big Science: emcee and BAT

Introduction: Over the past few months, we’ve talked about modeling with particle physicists ( Allen Caldwell ), astrophysicists ( David Hogg , who regularly comments here), and climate and energy usage modelers ( Phil Price , who regularly posts here). Big Science Black Boxes We’ve gotten pretty much the same story from all of them: their models involve “big science” components that are hugely complex and provided by outside implementations from labs like CERN or LBL. Some concrete examples for energy modeling are the TOUGH2 thermal simulator, the EnergyPlus building energy usage simulator, and global climate model (GCM) implementations. These models have the character of not only being black boxes, but taking several seconds or more to generate the equivalent of a likelihood function evaluation. So we can’t use something like Stan, because nobody has the person years required to implement something like TOUGH2 in Stan (and Stan doesn’t have the debugging or modularity tools to suppor

5 0.77472091 1739 andrew gelman stats-2013-02-26-An AI can build and try out statistical models using an open-ended generative grammar

Introduction: David Duvenaud writes: I’ve been following your recent discussions about how an AI could do statistics [see also here ]. I was especially excited about your suggestion for new statistical methods using “a language-like approach to recursively creating new models from a specified list of distributions and transformations, and an automatic approach to checking model fit.” Your discussion of these ideas was exciting to me and my colleagues because we recently did some work taking a step in this direction, automatically searching through a grammar over Gaussian process regression models. Roger Grosse previously did the same thing , but over matrix decomposition models using held-out predictive likelihood to check model fit. These are both examples of automatic Bayesian model-building by a search over more and more complex models, as you suggested. One nice thing is that both grammars include lots of standard models for free, and they seem to work pretty well, although the

6 0.75500762 1041 andrew gelman stats-2011-12-04-David MacKay and Occam’s Razor

7 0.750736 2299 andrew gelman stats-2014-04-21-Stan Model of the Week: Hierarchical Modeling of Supernovas

8 0.74851978 2242 andrew gelman stats-2014-03-10-Stan Model of the Week: PK Calculation of IV and Oral Dosing

9 0.74791431 1406 andrew gelman stats-2012-07-05-Xiao-Li Meng and Xianchao Xie rethink asymptotics

10 0.74319595 1459 andrew gelman stats-2012-08-15-How I think about mixture models

11 0.74016207 773 andrew gelman stats-2011-06-18-Should we always be using the t and robit instead of the normal and logit?

12 0.73763019 1528 andrew gelman stats-2012-10-10-My talk at MIT on Thurs 11 Oct

13 0.72978854 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion

14 0.72833025 1886 andrew gelman stats-2013-06-07-Robust logistic regression

15 0.72549087 72 andrew gelman stats-2010-06-07-Valencia: Summer of 1991

16 0.7211042 1950 andrew gelman stats-2013-07-22-My talks that were scheduled for Tues at the Data Skeptics meetup and Wed at the Open Statistical Programming meetup

17 0.72099596 2035 andrew gelman stats-2013-09-23-Scalable Stan

18 0.71779239 1392 andrew gelman stats-2012-06-26-Occam

19 0.71673328 2291 andrew gelman stats-2014-04-14-Transitioning to Stan

20 0.70252526 1036 andrew gelman stats-2011-11-30-Stan uses Nuts!

similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(13, 0.02), (15, 0.013), (16, 0.037), (24, 0.126), (30, 0.017), (53, 0.363), (55, 0.015), (62, 0.015), (63, 0.014), (86, 0.053), (99, 0.21)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.96452141 413 andrew gelman stats-2010-11-14-Statistics of food consumption

Introduction: Visual Economics shows statistics on average food consumption in America: My brief feedback is that water is confounded with these results. They should have subtracted water content from the weight of all dietary items, as it inflates the proportion of milk, vegetable and fruit items that contain more water. They did that for soda (which is represented as sugar/corn syrup), amplifying the inconsistency. Time Magazine had a beautiful gallery that visualizes diets around the world in a more appealing way.

2 0.92289221 1589 andrew gelman stats-2012-11-25-Life as a blogger: the emails just get weirder and weirder

Introduction: In the email the other day, subject line “Casting blogger, writer, journalist to host cable series”: Hi there Andrew, I’m casting a male journalist, writer, blogger, documentary filmmaker or comedian with a certain type personality for a television pilot along with production company, Pipeline39. See below: A certain type of character – no cockiness, no ego, a person who is smart, savvy, dry humor, but someone who isn’t imposing, who can infiltrate these organizations. This person will be hosting his own show and covering alternative lifestyles and secret societies around the world. If you’re interested in hearing more or would like to be considered for this project, please email me a photo and a bio of yourself, along with contact information. I’ll respond to you ASAP. I’m looking forward to hearing from you. *** Casting Producer (646) ***.**** ***@gmail.com I was with them until I got to the “no ego” part. . . . Also, I don’t think I could infiltrate any org

3 0.89879119 298 andrew gelman stats-2010-09-27-Who is that masked person: The use of face masks on Mexico City public transportation during the Influenza A (H1N1) outbreak

Introduction: Tapen Sinha writes: Living in Mexico, I have been witness to many strange (and beautiful) things. Perhaps the strangest happened during the first outbreak of A(H1N1) in Mexico City. We had our university closed, football (soccer) was played in empty stadiums (or should it be stadia) because the government feared a spread of the virus. The Metro was operating and so were the private/public buses and taxis. Since the university was closed, we took the opportunity to collect data on facemask use in the public transport systems. It was a simple (but potentially deadly!) exercise in first hand statistical data collection that we teach our students (Although I must admit that I did not dare sending my research assistant to collect data – what if she contracted the virus?). I believe it was a unique experiment never to be repeated. The paper appeared in the journal Health Policy. From the abstract: At the height of the influenza epidemic in Mexico City in the spring of 2009, the f

same-blog 4 0.89142966 1856 andrew gelman stats-2013-05-14-GPstuff: Bayesian Modeling with Gaussian Processes

5 0.85755157 1677 andrew gelman stats-2013-01-16-Greenland is one tough town

Introduction: Americans (including me) don’t know much about other countries. Jeff Lax sent me to this blog post by Myrddin pointing out that Belgium has a higher murder rate than the rest of Western Europe. I have no particular take on this, but it’s a good reminder that other countries differ from each other. Here in the U.S., we tend to think all western European countries are the same, all eastern European countries are the same, etc. In reality, Sweden is not Finland . P.S. According to the Wiki , Greenland is one tough town. I guess there’s nothing much to do out there but watch satellite TV, chew the blubber, and kill people.

6 0.82934314 1468 andrew gelman stats-2012-08-24-Multilevel modeling and instrumental variables

7 0.8292858 1802 andrew gelman stats-2013-04-14-Detecting predictability in complex ecosystems

8 0.82597286 46 andrew gelman stats-2010-05-21-Careers, one-hit wonders, and an offer of a free book

9 0.82159114 991 andrew gelman stats-2011-11-04-Insecure researchers aren’t sharing their data

10 0.8096019 1905 andrew gelman stats-2013-06-18-There are no fat sprinters

11 0.80521512 1902 andrew gelman stats-2013-06-17-Job opening at new “big data” consulting firm!

12 0.78192103 733 andrew gelman stats-2011-05-27-Another silly graph

13 0.77873099 1555 andrew gelman stats-2012-10-31-Social scientists who use medical analogies to explain causal inference are, I think, implicitly trying to borrow some of the scientific and cultural authority of that field for our own purposes

14 0.77859324 1047 andrew gelman stats-2011-12-08-I Am Too Absolutely Heteroskedastic for This Probit Model

15 0.77480769 2022 andrew gelman stats-2013-09-13-You heard it here first: Intense exercise can suppress appetite

16 0.75845551 795 andrew gelman stats-2011-07-10-Aleks says this is the future of visualization

17 0.75577301 2067 andrew gelman stats-2013-10-18-EP and ABC

18 0.75378418 495 andrew gelman stats-2010-12-31-“Threshold earners” and economic inequality

19 0.7465862 880 andrew gelman stats-2011-08-30-Annals of spam

20 0.7458328 547 andrew gelman stats-2011-01-31-Using sample size in the prior distribution