andrew_gelman_stats andrew_gelman_stats-2013 andrew_gelman_stats-2013-1948 knowledge-graph by maker-knowledge-mining

1948 andrew gelman stats-2013-07-21-Bayes related


meta infos for this blog

Source: html

Introduction: Dave Decker writes: I’ve seen some Bayes related things recently that might make for interesting fodder on your blog. There are two books, teaching Bayesian analysis from a programming perspective. And also a “web application for data analysis using powerful Bayesian statistical methods.” I took a look. The first book is Think Bayes: Bayesian Statistics Made Simple, by Allen B. Downey . It’s super readable and, amazingly, has approximately zero overlap with Bayesian Data Analysis. Downey discusses lots of little problems in a conversational way. In some ways it’s like an old-style math stat textbook (although with a programming rather than mathematical flavor) in that the examples are designed for simplicity rather than realism. I like it! Our book already exists; it’s good to have something else for people to read, coming from an entirely different perspective. The second book is Probabilistic Programming and Bayesian Methods for Hackers , by Cameron Davidson-P


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 There are two books, teaching Bayesian analysis from a programming perspective. [sent-2, score-0.293]

2 And also a “web application for data analysis using powerful Bayesian statistical methods. [sent-3, score-0.587]

3 The first book is Think Bayes: Bayesian Statistics Made Simple, by Allen B. [sent-5, score-0.224]

4 It’s super readable and, amazingly, has approximately zero overlap with Bayesian Data Analysis. [sent-7, score-0.171]

5 Downey discusses lots of little problems in a conversational way. [sent-8, score-0.187]

6 In some ways it’s like an old-style math stat textbook (although with a programming rather than mathematical flavor) in that the examples are designed for simplicity rather than realism. [sent-9, score-0.369]

7 Our book already exists; it’s good to have something else for people to read, coming from an entirely different perspective. [sent-11, score-0.224]

8 The second book is Probabilistic Programming and Bayesian Methods for Hackers , by Cameron Davidson-Pilon with contributions from may others. [sent-12, score-0.224]

9 This book is a bit more polished and less conversational (which is good in some ways and bad in others; overall it’s a plus for book #2 to differ in some ways from book #1, so you can get something out of reading each of them), with a more of a focus on business-analytics sorts of problems. [sent-13, score-1.215]

10 I didn’t read this book in detail either, but from what I saw, I like it too! [sent-14, score-0.335]

11 My main criticism with both books is that they talk a lot about inference but not so much about model building or model checking (recall the three steps of Bayesian data analysis). [sent-15, score-0.634]

12 I think it’s ok for an introductory book to focus on inference, which of course is central to the data-analytic process—but I’d like them to at least mention that Bayesian ideas arise in model building and model checking as well. [sent-16, score-0.726]

13 Finally I looked at BayesHive , which describes itself as “a web application for data analysis using powerful Bayesian statistical methods. [sent-17, score-0.69]

14 We help you build statistical models for your entire dataset such that final decisions and measurements are based on all the available information. [sent-18, score-0.082]

15 ” This sounds great, but unfortunately the website has zero detail on what they are doing. [sent-19, score-0.196]

16 Their documentation has things like, “BayesHive is what you get when you ask functional programmers to re-invent statistics and data analysis software. [sent-23, score-0.362]

17 UIRouter module, we can define a hierarchical set of user interface “states”, each of which has associated Hamlet files defining the user interface appearance and Julius files defining its behaviour. [sent-25, score-1.222]

18 The layout and behaviour definitions for all the states are collected together using Template Haskell code that traverses the directory tree of state definitions and constructs the state dispatch structure needed to initialise the Angular ui-router system. [sent-29, score-1.067]

19 Once Stan becomes more of a standard, this should be a good thing, as developers of this sort of software will be able to focus on specific applications rather than having to write their own code for statistical modeling and inference. [sent-34, score-0.382]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('bayeshive', 0.235), ('book', 0.224), ('bayesian', 0.223), ('downey', 0.202), ('conversational', 0.187), ('programming', 0.178), ('definitions', 0.163), ('interface', 0.163), ('files', 0.148), ('stan', 0.14), ('defining', 0.138), ('focus', 0.128), ('powerful', 0.124), ('user', 0.122), ('analysis', 0.115), ('ways', 0.114), ('detail', 0.111), ('julius', 0.107), ('dispatch', 0.107), ('hamlet', 0.107), ('building', 0.105), ('application', 0.104), ('web', 0.103), ('inference', 0.102), ('checking', 0.099), ('cameron', 0.097), ('layout', 0.097), ('bayes', 0.094), ('code', 0.094), ('directory', 0.093), ('programmers', 0.091), ('using', 0.089), ('behaviour', 0.088), ('constructs', 0.088), ('module', 0.086), ('super', 0.086), ('zero', 0.085), ('books', 0.085), ('states', 0.085), ('model', 0.085), ('documentation', 0.083), ('statistical', 0.082), ('appearance', 0.08), ('allen', 0.079), ('flavor', 0.079), ('template', 0.078), ('developers', 0.078), ('simplicity', 0.077), ('data', 0.073), ('amazingly', 0.072)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1948 andrew gelman stats-2013-07-21-Bayes related

Introduction: Dave Decker writes: I’ve seen some Bayes related things recently that might make for interesting fodder on your blog. There are two books, teaching Bayesian analysis from a programming perspective. And also a “web application for data analysis using powerful Bayesian statistical methods.” I took a look. The first book is Think Bayes: Bayesian Statistics Made Simple, by Allen B. Downey . It’s super readable and, amazingly, has approximately zero overlap with Bayesian Data Analysis. Downey discusses lots of little problems in a conversational way. In some ways it’s like an old-style math stat textbook (although with a programming rather than mathematical flavor) in that the examples are designed for simplicity rather than realism. I like it! Our book already exists; it’s good to have something else for people to read, coming from an entirely different perspective. The second book is Probabilistic Programming and Bayesian Methods for Hackers , by Cameron Davidson-P

2 0.28975996 1972 andrew gelman stats-2013-08-07-When you’re planning on fitting a model, build up to it by fitting simpler models first. Then, once you have a model you like, check the hell out of it

Introduction: In response to my remarks on his online book, Think Bayes, Allen Downey wrote: I [Downey] have a question about one of your comments: My [Gelman's] main criticism with both books is that they talk a lot about inference but not so much about model building or model checking (recall the three steps of Bayesian data analysis). I think it’s ok for an introductory book to focus on inference, which of course is central to the data-analytic process—but I’d like them to at least mention that Bayesian ideas arise in model building and model checking as well. This sounds like something I agree with, and one of the things I tried to do in the book is to put modeling decisions front and center. But the word “modeling” is used in lots of ways, so I want to see if we are talking about the same thing. For example, in many chapters, I start with a simple model of the scenario, do some analysis, then check whether the model is good enough, and iterate. Here’s the discussion of modeling

3 0.20783204 1112 andrew gelman stats-2012-01-11-A blog full of examples for your statistics class

Introduction: From Allen Downey .

4 0.17837821 1950 andrew gelman stats-2013-07-22-My talks that were scheduled for Tues at the Data Skeptics meetup and Wed at the Open Statistical Programming meetup

Introduction: Statistical Methods and Data Skepticism Data analysis today is dominated by three paradigms: null hypothesis significance testing, Bayesian inference, and exploratory data analysis. There is concern that all these methods lead to overconfidence on the part of researchers and the general public, and this concern has led to the new “data skepticism” movement. But the history of statistics is already in some sense a history of data skepticism. Concepts of bias, variance, sampling and measurement error, least-squares regression, and statistical significance can all be viewed as formalizations of data skepticism. All these methods address the concern that patterns in observed data might not generalize to the population of interest. We discuss the challenge of attaining data skepticism while avoiding data nihilism, and consider some proposed future directions. Stan Stan (mc-stan.org) is an open-source package for obtaining Bayesian inference using the No-U-Turn sampler, a

5 0.17329971 1554 andrew gelman stats-2012-10-31-It not necessary that Bayesian methods conform to the likelihood principle

Introduction: Bayesian inference, conditional on the model and data, conforms to the likelihood principle. But there is more to Bayesian methods than Bayesian inference. See chapters 6 and 7 of Bayesian Data Analysis for much discussion of this point. It saddens me to see that people are still confused on this issue.

6 0.17204531 2291 andrew gelman stats-2014-04-14-Transitioning to Stan

7 0.16839036 1205 andrew gelman stats-2012-03-09-Coming to agreement on philosophy of statistics

8 0.16687486 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion

9 0.16647455 2273 andrew gelman stats-2014-03-29-References (with code) for Bayesian hierarchical (multilevel) modeling and structural equation modeling

10 0.16506009 781 andrew gelman stats-2011-06-28-The holes in my philosophy of Bayesian data analysis

11 0.15819535 1469 andrew gelman stats-2012-08-25-Ways of knowing

12 0.15539606 2106 andrew gelman stats-2013-11-19-More on “data science” and “statistics”

13 0.15262485 1475 andrew gelman stats-2012-08-30-A Stan is Born

14 0.152068 1779 andrew gelman stats-2013-03-27-“Two Dogmas of Strong Objective Bayesianism”

15 0.1508857 1695 andrew gelman stats-2013-01-28-Economists argue about Bayes

16 0.14939208 1719 andrew gelman stats-2013-02-11-Why waste time philosophizing?

17 0.14359641 1712 andrew gelman stats-2013-02-07-Philosophy and the practice of Bayesian statistics (with all the discussions!)

18 0.14241447 1611 andrew gelman stats-2012-12-07-Feedback on my Bayesian Data Analysis class at Columbia

19 0.13942139 2368 andrew gelman stats-2014-06-11-Bayes in the research conversation

20 0.13735859 690 andrew gelman stats-2011-05-01-Peter Huber’s reflections on data analysis


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.271), (1, 0.125), (2, -0.145), (3, 0.148), (4, -0.001), (5, 0.115), (6, -0.074), (7, -0.032), (8, 0.072), (9, -0.056), (10, -0.016), (11, -0.055), (12, -0.028), (13, -0.017), (14, 0.174), (15, 0.018), (16, -0.006), (17, 0.031), (18, 0.048), (19, -0.042), (20, 0.005), (21, 0.03), (22, -0.01), (23, 0.022), (24, -0.02), (25, -0.007), (26, 0.025), (27, -0.033), (28, 0.008), (29, 0.012), (30, -0.035), (31, -0.075), (32, 0.009), (33, 0.018), (34, 0.022), (35, 0.121), (36, 0.01), (37, -0.03), (38, -0.008), (39, -0.057), (40, -0.034), (41, -0.0), (42, -0.055), (43, 0.042), (44, 0.01), (45, 0.0), (46, 0.037), (47, 0.012), (48, 0.008), (49, 0.026)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.97977769 1948 andrew gelman stats-2013-07-21-Bayes related

Introduction: Dave Decker writes: I’ve seen some Bayes related things recently that might make for interesting fodder on your blog. There are two books, teaching Bayesian analysis from a programming perspective. And also a “web application for data analysis using powerful Bayesian statistical methods.” I took a look. The first book is Think Bayes: Bayesian Statistics Made Simple, by Allen B. Downey . It’s super readable and, amazingly, has approximately zero overlap with Bayesian Data Analysis. Downey discusses lots of little problems in a conversational way. In some ways it’s like an old-style math stat textbook (although with a programming rather than mathematical flavor) in that the examples are designed for simplicity rather than realism. I like it! Our book already exists; it’s good to have something else for people to read, coming from an entirely different perspective. The second book is Probabilistic Programming and Bayesian Methods for Hackers , by Cameron Davidson-P

2 0.80484164 2273 andrew gelman stats-2014-03-29-References (with code) for Bayesian hierarchical (multilevel) modeling and structural equation modeling

Introduction: A student writes: I am new to Bayesian methods. While I am reading your book, I have some questions for you. I am interested in doing Bayesian hierarchical (multi-level) linear regression (e.g., random-intercept model) and Bayesian structural equation modeling (SEM)—for causality. Do you happen to know if I could find some articles, where authors could provide data w/ R and/or BUGS codes that I could replicate them? My reply: For Bayesian hierarchical (multi-level) linear regression and causal inference, see my book with Jennifer Hill. For Bayesian structural equation modeling, try google and you’ll find some good stuff. Also, I recommend Stan (http://mc-stan.org/) rather than Bugs.

3 0.80094671 427 andrew gelman stats-2010-11-23-Bayesian adaptive methods for clinical trials

Introduction: Scott Berry, Brad Carlin, Jack Lee, and Peter Muller recently came out with a book with the above title. The book packs a lot into its 280 pages and is fun to read as well (even if they do use the word “modalities” in their first paragraph, and later on they use the phrase “DIC criterion,” which upsets my tidy, logical mind). The book starts off fast on page 1 and never lets go. Clinical trials are a big part of statistics and it’s cool to see the topic taken seriously and being treated rigorously. (Here I’m not talking about empty mathematical rigor (or, should I say, “rigor”), so-called optimal designs and all that, but rather the rigor of applied statistics, mapping models to reality.) Also I have a few technical suggestions. 1. The authors fit a lot of models in Bugs, which is fine, but they go overboard on the WinBUGS thing. There’s WinBUGS, OpenBUGS, JAGS: they’re all Bugs recommend running Bugs from R using the clunky BRugs interface rather than the smoother bugs(

4 0.79660153 1783 andrew gelman stats-2013-03-31-He’s getting ready to write a book

Introduction: Eric Novik does some open-source planning : My co-author, Jacki Buros, and I [Novik] have just signed a contract with Apress to write a book tentatively entitled “Predictive Analytics with R”, which will cover programming best practices, data munging, data exploration, and single and multi-level models with case studies in social media, healthcare, politics, marketing, and the stock market. Why does the world need another R book? We think there is a shortage of books that deal with the complete and programmer centric analysis of real, dirty, and sometimes unstructured data. Our target audience are people who have some familiarity with statistics, but do not have much experience with programming. . . . The book is projected to be about 300 pages across 8 chapters. This is my first experience with writing a book and everything I heard about the process tells me that this is going to be a long and arduous endeavor lasting anywhere from 6 to 8 months. Novik emailed me and wrot

5 0.78708017 1199 andrew gelman stats-2012-03-05-Any available cookbooks on Bayesian designs?

Introduction: Robert Bell writes: I [Bell] just finished “The Emperor of All Maladies” from 2010. One of the sections is captioned with the “In God we trust, all others bring data” quote which is supposedly from Deming. If you haven’t read it, I think you might like it even though the topic is pretty morbid (i.e. cancer). I was rereading chapter 7 in Bayesian Data Analysis. While the principles there would allow me to “roll my own”, I was wondering whether or not there is now a comprehensive “cookbook” out there that gathers together such analyses for a variety of experimental designs – sort of a Bayesian equivalent to Montgomery’s “Design and Analysis of Experiments”. My reply: I don’t know, but now I’m thinking we should put some of this in our forthcoming book, Statistics for Psychology Research. In the meantime, if anyone has any suggestions, feel free to post them in comments.

6 0.77651334 1188 andrew gelman stats-2012-02-28-Reference on longitudinal models?

7 0.76529944 1586 andrew gelman stats-2012-11-21-Readings for a two-week segment on Bayesian modeling?

8 0.75684845 2009 andrew gelman stats-2013-09-05-A locally organized online BDA course on G+ hangout?

9 0.75093257 690 andrew gelman stats-2011-05-01-Peter Huber’s reflections on data analysis

10 0.74420202 1950 andrew gelman stats-2013-07-22-My talks that were scheduled for Tues at the Data Skeptics meetup and Wed at the Open Statistical Programming meetup

11 0.72977817 1912 andrew gelman stats-2013-06-24-Bayesian quality control?

12 0.71820813 117 andrew gelman stats-2010-06-29-Ya don’t know Bayes, Jack

13 0.71424115 1469 andrew gelman stats-2012-08-25-Ways of knowing

14 0.71155262 183 andrew gelman stats-2010-08-04-Bayesian models for simultaneous equation systems?

15 0.70916766 1719 andrew gelman stats-2013-02-11-Why waste time philosophizing?

16 0.70349222 1634 andrew gelman stats-2012-12-21-Two reviews of Nate Silver’s new book, from Kaiser Fung and Cathy O’Neil

17 0.69995469 1542 andrew gelman stats-2012-10-20-A statistical model for underdispersion

18 0.69078004 1528 andrew gelman stats-2012-10-10-My talk at MIT on Thurs 11 Oct

19 0.69057333 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion

20 0.6899578 590 andrew gelman stats-2011-02-25-Good introductory book for statistical computation?


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(0, 0.014), (15, 0.02), (16, 0.073), (21, 0.029), (24, 0.114), (50, 0.029), (59, 0.026), (77, 0.152), (86, 0.071), (99, 0.352)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.98280072 1784 andrew gelman stats-2013-04-01-Wolfram on Mandelbrot

Introduction: The most perfect pairing of author and subject since Nicholson Baker and John Updike. Here’s Wolfram on the great researcher of fractals : In his way, Mandelbrot paid me some great compliments. When I was in my 20s, and he in his 60s, he would ask about my scientific work: “How can so many people take someone so young so seriously?” In 2002, my book “A New Kind of Science”—in which I argued that many phenomena across science are the complex results of relatively simple, program-like rules—appeared. Mandelbrot seemed to see it as a direct threat, once declaring that “Wolfram’s ‘science’ is not new except when it is clearly wrong; it deserves to be completely disregarded.” In private, though, several mutual friends told me, he fretted that in the long view of history it would overwhelm his work. In retrospect, I don’t think Mandelbrot had much to worry about on this account. The link from the above review came from Peter Woit, who also points to a review by Brian Hayes wit

2 0.98036724 1124 andrew gelman stats-2012-01-17-How to map geographically-detailed survey responses?

Introduction: David Sparks writes: I am experimenting with the mapping/visualization of survey response data, with a particular focus on using transparency to convey uncertainty. See some examples here . Do you think the examples are successful at communicating both local values of the variable of interest, as well as the lack of information in certain places? Also, do you have any general advice for choosing an approach to spatially smoothing the data in a way that preserves local features, but prevents individual respondents from standing out? I have experimented a lot with smoothing in these maps, and the cost of preventing the Midwest and West from looking “spotty” is the oversmoothing of the Northeast. My quick impression is that the graphs are more pretty than they are informative. But “pretty” is not such a bad thing! The conveying-information part is more difficult: to me, the graphs seem to be displaying a somewhat confusing mix of opinion level and population density. Consider

3 0.97946405 1684 andrew gelman stats-2013-01-20-Ugly ugly ugly

Introduction: Denis Cote sends the following , under the heading, “Some bad graphs for your enjoyment”: To start with, they don’t know how to spell “color.” Seriously, though, the graph is a mess. The circular display implies a circular or periodic structure that isn’t actually in the data, the cramped display requires the use of an otherwise-unnecessary color code that makes it difficult to find or make sense of the information, the alphabetical ordering (without even supplying state names, only abbreviations) makes it further difficult to find any patterns. It would be so much better, and even easier, to just display a set of small maps shading states on whether they have different laws. But that’s part of the problem—the clearer graph would also be easier to make! To get a distinctive graph, there needs to be some degree of difficulty. The designers continue with these monstrosities: Here they decide to display only 5 states at a time so that it’s really hard to see any big pi

4 0.97621191 978 andrew gelman stats-2011-10-28-Cool job opening with brilliant researchers at Yahoo

Introduction: Duncan Watts writes: The Human Social Dynamics Group in Yahoo Research is seeking highly qualified candidates for a post-doctoral research scientist position. The Human and Social Dynamics group is devoted to understanding the interplay between individual-level behavior (e.g. how people make decisions about what music they like, which dates to go on, or which groups to join) and the social environment in which individual behavior necessarily plays itself out. In particular, we are interested in: * Structure and evolution of social groups and networks * Decision making, social influence, diffusion, and collective decisions * Networking and collaborative problem solving. The intrinsically multi-disciplinary and cross-cutting nature of the subject demands an eclectic range of researchers, both in terms of domain-expertise (e.g. decision sciences, social psychology, sociology) and technical skills (e.g. statistical analysis, mathematical modeling, computer simulations, design o

5 0.97459513 1481 andrew gelman stats-2012-09-04-Cool one-day miniconference at Columbia Fri 12 Oct on computational and online social science

Introduction: One thing we do here at the Applied Statistics Center is hold mini-conferences. The next one looks really cool. It’s organized by Sharad Goel and Jake Hofman (Microsoft Research, formerly at Yahoo Research), David Park (Columbia University), and Sergei Vassilvitskii (Google). As with our other conferences, one of our goals is to mix the academic and nonacademic research communities. Here’s the website for the workshop, and here’s the announcement from the organizers: With an explosion of data on every aspect of our everyday existence — from what we buy, to where we travel, to who we know — we are able to observe human behavior with granularity largely thought impossible just a decade ago. The growth of such online activity has further facilitated the design of web-based experiments, enhancing both the scale and efficiency of traditional methods. Together these advances have created an unprecedented opportunity to address longstanding questions in the social sciences, rang

6 0.97410983 57 andrew gelman stats-2010-05-29-Roth and Amsterdam

7 0.9738903 1373 andrew gelman stats-2012-06-09-Cognitive psychology research helps us understand confusion of Jonathan Haidt and others about working-class voters

same-blog 8 0.96421105 1948 andrew gelman stats-2013-07-21-Bayes related

9 0.96363932 1604 andrew gelman stats-2012-12-04-An epithet I can live with

10 0.96159512 1561 andrew gelman stats-2012-11-04-Someone is wrong on the internet

11 0.96013308 2054 andrew gelman stats-2013-10-07-Bing is preferred to Google by people who aren’t like me

12 0.95920336 380 andrew gelman stats-2010-10-29-“Bluntly put . . .”

13 0.95616055 93 andrew gelman stats-2010-06-17-My proposal for making college admissions fairer

14 0.95551372 128 andrew gelman stats-2010-07-05-The greatest works of statistics never published

15 0.95533502 216 andrew gelman stats-2010-08-18-More forecasting competitions

16 0.9532696 614 andrew gelman stats-2011-03-15-Induction within a model, deductive inference for model evaluation

17 0.95212811 562 andrew gelman stats-2011-02-06-Statistician cracks Toronto lottery

18 0.95056319 1071 andrew gelman stats-2011-12-19-“NYU Professor Claims He Was Fired for Giving James Franco a D”

19 0.94793999 2122 andrew gelman stats-2013-12-03-Objects of the class “Lawrence Summers”: Arne Duncan edition

20 0.94300818 530 andrew gelman stats-2011-01-22-MS-Bayes?