
1389 andrew gelman stats-2012-06-23-Larry Wasserman’s statistics blog

meta infos for this blog

Source: html

Introduction: Larry Wasserman, a leading theoretical statistician and generally thoughtful guy, has started a blog on . . . theoretical statistics! Good stuff, and readers of this blog should enjoy the different perspective that Larry offers. Here are some earlier references to Larry on this blog, and here’s a discussion that gives a sense of our different (but not extremely different) attitudes about statistical methods.

Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 Larry Wasserman, a leading theoretical statistician and generally thoughtful guy, has started a blog on . [sent-1, score-1.311]

2 Good stuff, and readers of this blog should enjoy the different perspective that Larry offers. [sent-5, score-0.885]

3 Here are some earlier references to Larry on this blog, and here’s a discussion that gives a sense of our different (but not extremely different) attitudes about statistical methods. [sent-6, score-1.242]

similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('larry', 0.592), ('theoretical', 0.297), ('wasserman', 0.251), ('blog', 0.233), ('different', 0.208), ('thoughtful', 0.207), ('references', 0.19), ('enjoy', 0.185), ('extremely', 0.17), ('attitudes', 0.168), ('leading', 0.162), ('started', 0.152), ('guy', 0.142), ('statistician', 0.138), ('gives', 0.137), ('stuff', 0.135), ('perspective', 0.134), ('earlier', 0.133), ('readers', 0.125), ('generally', 0.122), ('methods', 0.102), ('discussion', 0.086), ('sense', 0.083), ('statistics', 0.073), ('statistical', 0.067), ('good', 0.058)]
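The page does not say how these weights were produced. As a rough sanity check only, the sketch below copies the listed (word, weight) pairs and computes their Euclidean norm; a value near 1 would be consistent with an L2-normalized tf-idf vector, but that normalization is an assumption, not something the source states. The names `top_words` and `l2_norm` are illustrative.

```python
import math

# (word, tf-idf weight) pairs copied from the listing above.
top_words = [
    ('larry', 0.592), ('theoretical', 0.297), ('wasserman', 0.251),
    ('blog', 0.233), ('different', 0.208), ('thoughtful', 0.207),
    ('references', 0.19), ('enjoy', 0.185), ('extremely', 0.17),
    ('attitudes', 0.168), ('leading', 0.162), ('started', 0.152),
    ('guy', 0.142), ('statistician', 0.138), ('gives', 0.137),
    ('stuff', 0.135), ('perspective', 0.134), ('earlier', 0.133),
    ('readers', 0.125), ('generally', 0.122), ('methods', 0.102),
    ('discussion', 0.086), ('sense', 0.083), ('statistics', 0.073),
    ('statistical', 0.067), ('good', 0.058),
]


def l2_norm(pairs):
    """Euclidean length of a sparse vector given as (key, weight) pairs."""
    return math.sqrt(sum(weight * weight for _, weight in pairs))


norm = l2_norm(top_words)
print(f"L2 norm of listed weights: {norm:.4f}")
```

If the norm comes out close to 1, the listed words account for essentially the whole vector, which would fit a short post with few distinct terms.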

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1389 andrew gelman stats-2012-06-23-Larry Wasserman’s statistics blog

2 0.31744811 1165 andrew gelman stats-2012-02-13-Philosophy of Bayesian statistics: my reactions to Wasserman

Introduction: Continuing with my discussion of the articles in the special issue of the journal Rationality, Markets and Morals on the philosophy of Bayesian statistics: Larry Wasserman, “Low Assumptions, High Dimensions”: This article was refreshing to me because it was so different from anything I’ve seen before. Larry works in a statistics department and I work in a statistics department but there’s so little overlap in what we do. Larry and I both work in high dimensions (maybe his dimensions are higher than mine, but a few thousand dimensions seems like a lot to me!), but there the similarity ends. His article is all about using few to no assumptions, while I use assumptions all the time. Here’s an example. Larry writes: P. Laurie Davies (and his co-workers) have written several interesting papers where probability models, at least in the sense that we usually use them, are eliminated. Data are treated as deterministic. One then looks for adequate models rather than true mode

3 0.27382606 1898 andrew gelman stats-2013-06-14-Progress! (on the understanding of the role of randomization in Bayesian inference)

Introduction: Leading theoretical statistician Larry Wasserman in 2008: Some of the greatest contributions of statistics to science involve adding additional randomness and leveraging that randomness. Examples are randomized experiments, permutation tests, cross-validation and data-splitting. These are unabashedly frequentist ideas and, while one can strain to fit them into a Bayesian framework, they don’t really have a place in Bayesian inference. The fact that Bayesian methods do not naturally accommodate such a powerful set of statistical ideas seems like a serious deficiency. To which I responded on the second-to-last paragraph of page 8 here. Larry Wasserman in 2013: Some people say that there is no role for randomization in Bayesian inference. In other words, the randomization mechanism plays no role in Bayes’ theorem. But this is not really true. Without randomization, we can indeed derive a posterior for theta but it is highly sensitive to the prior. This is just a restat

4 0.23093338 1560 andrew gelman stats-2012-11-03-Statistical methods that work in some settings but not others

Introduction: David Hogg pointed me to this post by Larry Wasserman: 1. The Horvitz-Thompson estimator satisfies the following condition: for every , where — the parameter space — is the set of all functions . (There are practical improvements to the Horvitz-Thompson estimator that we discussed in our earlier posts but we won’t revisit those here.) 2. A Bayes estimator requires a prior for . In general, if is not a function of then (1) will not hold. . . . 3. If you let be a function of , (1) still, in general, does not hold. 4. If you make a function of in just the right way, then (1) will hold. . . . There is nothing wrong with doing this, but in our opinion this is not in the spirit of Bayesian inference. . . . 7. This example is only meant to show that Bayesian estimators do not necessarily have good frequentist properties. This should not be surprising. There is no reason why we should in general expect a Bayesian method to have a frequentist property

5 0.21762681 1273 andrew gelman stats-2012-04-20-Proposals for alternative review systems for scientific work

Introduction: I recently became aware of two new entries in the ever-popular genre of, Our Peer-Review System is in Trouble; How Can We Fix It? Political scientist Brendan Nyhan, commenting on experimental and empirical sciences more generally, focuses on the selection problem that positive rather than negative findings tend to get published, leading via the statistical significance filter to an overestimation of effect sizes. Nyhan recommends that data-collection protocols be published ahead of time, with the commitment to publish the eventual results: In the case of experimental data, a better practice would be for journals to accept articles before the study was conducted. The article should be written up to the point of the results section, which would then be populated using a pre-specified analysis plan submitted by the author. The journal would then allow for post-hoc analysis and interpretation by the author that would be labeled as such and distinguished from the previously submit

6 0.21531615 586 andrew gelman stats-2011-02-23-A statistical version of Arrow’s paradox

7 0.19689071 1610 andrew gelman stats-2012-12-06-Yes, checking calibration of probability forecasts is part of Bayesian statistics

8 0.15164611 1205 andrew gelman stats-2012-03-09-Coming to agreement on philosophy of statistics

9 0.1469159 6 andrew gelman stats-2010-04-27-Jelte Wicherts lays down the stats on IQ

10 0.14515591 498 andrew gelman stats-2011-01-02-Theoretical vs applied statistics

11 0.13459048 1832 andrew gelman stats-2013-04-29-The blogroll

12 0.12282474 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion

13 0.11983757 1459 andrew gelman stats-2012-08-15-How I think about mixture models

14 0.097995974 1705 andrew gelman stats-2013-02-04-Recently in the sister blog

15 0.097883031 659 andrew gelman stats-2011-04-13-Jim Campbell argues that Larry Bartels’s “Unequal Democracy” findings are not robust

16 0.097329356 1701 andrew gelman stats-2013-01-31-The name that fell off a cliff

17 0.09571024 431 andrew gelman stats-2010-11-26-One fun thing about physicists . . .

18 0.095696166 1629 andrew gelman stats-2012-12-18-It happened in Connecticut

19 0.095422 1469 andrew gelman stats-2012-08-25-Ways of knowing

20 0.093740322 932 andrew gelman stats-2011-09-30-Articles on the philosophy of Bayesian statistics by Cox, Mayo, Senn, and others!

similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.114), (1, 0.004), (2, -0.07), (3, 0.034), (4, -0.067), (5, -0.001), (6, -0.048), (7, 0.016), (8, 0.046), (9, -0.036), (10, 0.021), (11, -0.006), (12, 0.011), (13, 0.036), (14, -0.013), (15, 0.005), (16, -0.09), (17, 0.029), (18, -0.052), (19, 0.019), (20, 0.073), (21, -0.025), (22, -0.046), (23, 0.096), (24, 0.103), (25, 0.005), (26, -0.074), (27, 0.071), (28, -0.007), (29, 0.024), (30, 0.029), (31, 0.051), (32, 0.044), (33, -0.029), (34, 0.009), (35, -0.025), (36, -0.003), (37, 0.014), (38, -0.025), (39, 0.02), (40, 0.037), (41, -0.037), (42, 0.012), (43, -0.014), (44, 0.028), (45, -0.003), (46, 0.034), (47, -0.016), (48, -0.079), (49, -0.013)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.97201246 1389 andrew gelman stats-2012-06-23-Larry Wasserman’s statistics blog

2 0.67521232 890 andrew gelman stats-2011-09-05-Error statistics

Introduction: New blog from the philosopher Deborah Mayo who I think agrees with me about many statistical issues although from a non-Bayesian perspective. But I disagree with her when she writes that certain criticisms of frequentist statistical methods “keep popping up (verbatim) in every Bayesian textbook and article on philosophical foundations.” I’ve written a couple of Bayesian textbooks and some articles on philosophical foundations, and I don’t think I do this! That said, I think Mayo has a lot to say, so I wouldn’t judge her whole blog (let alone her published work) based on that one intemperate statement.

3 0.62333637 91 andrew gelman stats-2010-06-16-RSS mess

Introduction: Apparently some of our new blog entries are appearing as old entries on the RSS feed, meaning that those of you who read the blog using RSS may be missing a lot of good stuff. We’re working on this. But, in the meantime, I recommend you click on the blog itself to see what’s been posted in the last few weeks. Enjoy.

4 0.60874534 738 andrew gelman stats-2011-05-30-Works well versus well understood

Introduction: John Cook discusses the John Tukey quote, “The test of a good procedure is how well it works, not how well it is understood.” Cook writes: At some level, it’s hard to argue against this. Statistical procedures operate on empirical data, so it makes sense that the procedures themselves be evaluated empirically. But I [Cook] question whether we really know that a statistical procedure works well if it isn’t well understood. Specifically, I’m skeptical of complex statistical methods whose only credentials are a handful of simulations. “We don’t have any theoretical results, but hey, it works well in practice. Just look at the simulations.” Every method works well on the scenarios its author publishes, almost by definition. If the method didn’t handle a scenario well, the author would publish a different scenario. I agree with Cook but would give a slightly different emphasis. I’d say that a lot of methods can work when they are done well. See the second meta-principle liste

5 0.60700202 1859 andrew gelman stats-2013-05-16-How do we choose our default methods?

Introduction: I was asked to write an article for the Committee of Presidents of Statistical Societies (COPSS) 50th anniversary volume. Here it is (it’s labeled as “Chapter 1,” which isn’t right; that’s just what came out when I used the template that was supplied). The article begins as follows: The field of statistics continues to be divided into competing schools of thought. In theory one might imagine choosing the uniquely best method for each problem as it arises, but in practice we choose for ourselves (and recommend to others) default principles, models, and methods to be used in a wide variety of settings. This article briefly considers the informal criteria we use to decide what methods to use and what principles to apply in statistics problems. And then I follow up with these sections: Statistics: the science of defaults Ways of knowing The pluralist’s dilemma And here’s the concluding paragraph: Statistics is a young science in which progress is being made in many

6 0.60075802 1645 andrew gelman stats-2012-12-31-Statistical modeling, causal inference, and social science

7 0.59994239 1165 andrew gelman stats-2012-02-13-Philosophy of Bayesian statistics: my reactions to Wasserman

8 0.59926748 871 andrew gelman stats-2011-08-26-Be careful what you control for . . . you just might get it!

9 0.57972848 1508 andrew gelman stats-2012-09-23-Speaking frankly

10 0.56978518 120 andrew gelman stats-2010-06-30-You can’t put Pandora back in the box

11 0.56568277 1560 andrew gelman stats-2012-11-03-Statistical methods that work in some settings but not others

12 0.56418741 2002 andrew gelman stats-2013-08-30-Blogging

13 0.56061196 602 andrew gelman stats-2011-03-06-Assumptions vs. conditions

14 0.5526005 498 andrew gelman stats-2011-01-02-Theoretical vs applied statistics

15 0.54428393 856 andrew gelman stats-2011-08-16-Our new improved blog! Thanks to Cord Blomquist

16 0.54406482 2344 andrew gelman stats-2014-05-23-The gremlins did it? Iffy statistics drive strong policy recommendations

17 0.54160708 1832 andrew gelman stats-2013-04-29-The blogroll

18 0.53857762 586 andrew gelman stats-2011-02-23-A statistical version of Arrow’s paradox

19 0.53694457 539 andrew gelman stats-2011-01-26-Lies, Damn Lies…that’s pretty much it.

20 0.53141892 263 andrew gelman stats-2010-09-08-The China Study: fact or fallacy?

similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(15, 0.115), (16, 0.107), (21, 0.028), (24, 0.067), (78, 0.037), (84, 0.049), (99, 0.424)]
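The page does not state which similarity measure produced the simValue column. A common choice for comparing sparse topic vectors like the one above is cosine similarity, so the following is a minimal sketch under that assumption. The vector `this_post` is copied from the listing; `other_post` and the function name `sparse_cosine` are hypothetical, made up for illustration.

```python
import math


def sparse_cosine(a, b):
    """Cosine similarity between two sparse vectors given as (topicId, weight) pairs."""
    da, db = dict(a), dict(b)
    # Only topics present in both vectors contribute to the dot product.
    dot = sum(weight * db[topic] for topic, weight in da.items() if topic in db)
    norm_a = math.sqrt(sum(w * w for w in da.values()))
    norm_b = math.sqrt(sum(w * w for w in db.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention: an empty vector is similar to nothing
    return dot / (norm_a * norm_b)


# Topic weights copied from the listing above.
this_post = [(15, 0.115), (16, 0.107), (21, 0.028), (24, 0.067),
             (78, 0.037), (84, 0.049), (99, 0.424)]

# Hypothetical second post, invented for this example.
other_post = [(15, 0.2), (42, 0.1), (99, 0.5)]

similarity = sparse_cosine(this_post, other_post)
print(f"cosine similarity: {similarity:.4f}")
```

Under this measure a vector compared with itself scores 1 and vectors with no shared topics score 0. The same-blog rows in the LSI and LDA lists show values slightly below 1, which suggests the actual pipeline involves some approximation not described on the page.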

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.98859119 1389 andrew gelman stats-2012-06-23-Larry Wasserman’s statistics blog

2 0.98395377 1385 andrew gelman stats-2012-06-20-Reconciling different claims about working-class voters

Introduction: After our discussions of psychologist Jonathan Haidt’s opinions about working-class voters (see here and here), a question arose on how to reconcile the analyses of Alan Abramowitz and Tom Edsall (showing an increase in Republican voting among low-education working white southerners), with Larry Bartels’s finding that “there has been no discernible trend in presidential voting behavior among the ‘working white working class.’” Here is my resolution: All the statistics that have been posted seem reasonable to me. Also relevant to the discussion, I believe, are Figures 3.1, 4.2b, 10.1, and 10.2 of Red State Blue State. In short: Republicans continue to do about 20 percentage points better among upper-income voters compared to lower-income, but the compositions of these coalitions have changed over time. As has been noted, low-education white workers have moved toward the Republican party over the past few decades, and at the same time there have been compositional changes

3 0.98060971 1678 andrew gelman stats-2013-01-17-Wanted: 365 stories of statistics

Introduction: The American Statistical Association has a blog called the Statistics Forum that I edit but haven’t been doing much with. Originally I thought we’d get a bunch of bloggers and have a topic each week or each month and get discussions from lots of perspectives. But it was hard to get people to keep contributing, and the blog+comments approach didn’t seem to be working as a way to get wide-ranging discussion. I did organize a good roundtable discussion at one point, but it took a lot of work on my part. Recently I had another idea for the blog, based on something that Kaiser Fung wrote on three hours in the life of a statistician, along with a similar (if a bit more impressionistic) piece I wrote awhile back describing my experiences on a typical workday. So here’s the plan. 365 of you write vignettes about your statistical lives. Get into the nitty gritty—tell me what you do, and why you’re doing it. I’ll collect these and then post them at the Statistics Forum, one a day

4 0.9799124 371 andrew gelman stats-2010-10-26-Musical chairs in econ journals

Introduction: Tyler Cowen links to a paper by Bruno Frey on the lack of space for articles in economics journals. Frey writes: To further their careers, [academic economists] are required to publish in A-journals, but for the vast majority this is impossible because there are few slots open in such journals. Such academic competition may be useful to generate hard work, however, there may be serious negative consequences: the wrong output may be produced in an inefficient way, the wrong people may be selected, and losers may react in a harmful way. According to Frey, the consensus is that there are only five top economics journals–and one of those five is Econometrica, which is so specialized that I’d say that, for most academic economists, there are only four top places they can publish. The difficulty is that demand for these slots outpaces supply: for example, in 2007 there were only 275 articles in all these journals combined (or 224 if you exclude Econometrica), while “a rough estim

5 0.97628778 2217 andrew gelman stats-2014-02-19-The replication and criticism movement is not about suppressing speculative research; rather, it’s all about enabling science’s fabled self-correcting nature

Introduction: Jeff Leek points to a post by Alex Holcombe, who disputes the idea that science is self-correcting. Holcombe writes [scroll down to get to his part]: The pace of scientific production has quickened, and self-correction has suffered. Findings that might correct old results are considered less interesting than results from more original research questions. Potential corrections are also more contested. As the competition for space in prestigious journals has become increasingly frenzied, doing and publishing studies that would confirm the rapidly accumulating new discoveries, or would correct them, became a losing proposition. Holcombe picks up on some points that we’ve discussed a lot here in the past year. Here’s Holcombe: In certain subfields, almost all new work appears in only a very few journals, all associated with a single professional society. There is then no way around the senior gatekeepers, who may then suppress corrections with impunity. . . . The bias agai

6 0.9757576 1441 andrew gelman stats-2012-08-02-“Based on my experiences, I think you could make general progress by constructing a solution to your specific problem.”

7 0.97567946 838 andrew gelman stats-2011-08-04-Retraction Watch

8 0.97232211 2268 andrew gelman stats-2014-03-26-New research journal on observational studies

9 0.97214341 274 andrew gelman stats-2010-09-14-Battle of the Americans: Writer at the American Enterprise Institute disparages the American Political Science Association

10 0.97212929 1865 andrew gelman stats-2013-05-20-What happened that the journal Psychological Science published a paper with no identifiable strengths?

11 0.97095001 1998 andrew gelman stats-2013-08-25-A new Bem theory

12 0.96995014 1833 andrew gelman stats-2013-04-30-“Tragedy of the science-communication commons”

13 0.96896392 2191 andrew gelman stats-2014-01-29-“Questioning The Lancet, PLOS, And Other Surveys On Iraqi Deaths, An Interview With Univ. of London Professor Michael Spagat”

14 0.96881872 1458 andrew gelman stats-2012-08-14-1.5 million people were told that extreme conservatives are happier than political moderates. Approximately .0001 million Americans learned that the opposite is true.

15 0.96875 1652 andrew gelman stats-2013-01-03-“The Case for Inductive Theory Building”

16 0.96845353 2014 andrew gelman stats-2013-09-09-False memories and statistical analysis

17 0.96779513 1779 andrew gelman stats-2013-03-27-“Two Dogmas of Strong Objective Bayesianism”

18 0.96776748 2269 andrew gelman stats-2014-03-27-Beyond the Valley of the Trolls

19 0.96756017 1336 andrew gelman stats-2012-05-22-Battle of the Repo Man quotes: Reid Hastie’s turn

20 0.96741104 1139 andrew gelman stats-2012-01-26-Suggested resolution of the Bem paradox