andrew_gelman_stats andrew_gelman_stats-2011 andrew_gelman_stats-2011-854 knowledge-graph by maker-knowledge-mining

854 andrew gelman stats-2011-08-15-A silly paper that tries to make fun of multilevel models

meta infos for this blog

Source: html

Introduction: Torkild Hovde Lyngstad writes: I wondered what your reaction would be to this paper from a recent issue of European Political Science. It came out already in March this year, so you might have seen it or even commented on it before. Is is a joke at the expense of the whole polisci discipline, a joke the Editors did not catch, or the sequel to the Sokal affair, just with quanto social science as the target? My reply: Yes, several people pointed me to this article. I don’t think it’s a hoax, it’s more of a joke: the author is making the point that with fancy statistics you can discover all sorts of patterns that don’t make sense. The implication, I believe, is that many patterns that social scientists do find through statistical analysis are not actually meaningful. I agree with this point, which could be even more pithily stated as “correlation does not imply causation.” I am irritated, however, by the singling out of multilevel models here, as the point could be mad

Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Torkild Hovde Lyngstad writes: I wondered what your reaction would be to this paper from a recent issue of European Political Science. [sent-1, score-0.309]

2 It came out already in March this year, so you might have seen it or even commented on it before. [sent-2, score-0.199]

3 Is is a joke at the expense of the whole polisci discipline, a joke the Editors did not catch, or the sequel to the Sokal affair, just with quanto social science as the target? [sent-3, score-1.242]

4 My reply: Yes, several people pointed me to this article. [sent-4, score-0.081]

5 I don’t think it’s a hoax, it’s more of a joke: the author is making the point that with fancy statistics you can discover all sorts of patterns that don’t make sense. [sent-5, score-0.741]

6 The implication, I believe, is that many patterns that social scientists do find through statistical analysis are not actually meaningful. [sent-6, score-0.532]

7 I agree with this point, which could be even more pithily stated as “correlation does not imply causation. [sent-7, score-0.43]

8 ” I am irritated, however, by the singling out of multilevel models here, as the point could be made just as well using a simple correlation. [sent-8, score-0.575]

9 As my French friend said to me many years ago when I asked her what she thought of the Pink Panther movies, it’s not that I mind that this article makes fun of multilevel models, I just don’t find it funny. [sent-9, score-0.679]

10 Then again, I didn’t try to publish that one in a real journal, hence I was able to get to the point right away. [sent-11, score-0.391]

similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('joke', 0.356), ('sequel', 0.196), ('pithily', 0.196), ('sokal', 0.185), ('affair', 0.185), ('hoax', 0.185), ('patterns', 0.178), ('singling', 0.171), ('multilevel', 0.163), ('expense', 0.149), ('irritated', 0.145), ('point', 0.143), ('movies', 0.141), ('french', 0.139), ('discipline', 0.139), ('pink', 0.139), ('fancy', 0.137), ('zombies', 0.137), ('wondered', 0.13), ('european', 0.13), ('commented', 0.13), ('march', 0.129), ('implication', 0.125), ('discover', 0.122), ('stated', 0.122), ('target', 0.116), ('social', 0.113), ('imply', 0.112), ('catch', 0.111), ('editors', 0.108), ('friend', 0.102), ('models', 0.098), ('reaction', 0.097), ('find', 0.095), ('hence', 0.095), ('correlation', 0.093), ('mind', 0.089), ('fun', 0.085), ('publish', 0.084), ('author', 0.084), ('paper', 0.082), ('pointed', 0.081), ('sorts', 0.077), ('scientists', 0.074), ('asked', 0.073), ('away', 0.073), ('many', 0.072), ('whole', 0.072), ('seen', 0.069), ('able', 0.069)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 854 andrew gelman stats-2011-08-15-A silly paper that tries to make fun of multilevel models

2 0.13515405 2187 andrew gelman stats-2014-01-26-Twitter sucks, and people are gullible as f…

Introduction: Hey, and I did it in less than 140 characters! The above was my response to this item which David Hogg forwarded to me. The next thing you know, people are going to claim that women are three times as likely to wear red pink when . . . Naaah, forget about it, that would never happen. Hmmm, I think the above is not so savvy of me, to just go around insulting a whole bunch of people. So let me just say that becoming numerate is not as easy as it might seem. All of us can be gullible in areas outside of our expertise. Indeed, I’ve fallen for the occasional April Fool’s gag myself. And, maybe it’s not really right for me to say that “Twitter sucks.” Sure, the downside of Twitter is that people can just pass along a silly joke, not realizing it’s a joke at all. But the upside is, I hope, that once people have committed themselves and then realize they were mistaken, they’ll think harder the next time they see something like that. I hope the same thing goes with the “women

3 0.13357255 1457 andrew gelman stats-2012-08-13-Retro ethnic slurs

Introduction: From Watership Down: There is a rabbit saying, ‘In the warren, more stories than passages’; and a rabbit can no more refuse to tell a story than an Irishman can refuse to fight. Wow. OK, if someone made a joke about New Yorkers being argumentative or people from Iowa being boring (sorry, Tom!), I wouldn’t see it as being in poor taste. But somehow, to this non-U.K. reader, Adams’s remark about “Irishmen” seems a bit over the top. I’m not criticizing it as offensive, exactly; it just is a bit jarring, and it’s kind of hard for me to believe someone would just write that as a throwaway line anymore. Things have changed a lot since 1971, I guess, or maybe in England an Irish joke is no more offensive/awkward than a joke about corrupt Chicagoans, loopy Californians, or crazy Floridians would be here.

4 0.10981606 1737 andrew gelman stats-2013-02-25-Correlation of 1 . . . too good to be true?

Introduction: Alex Hoffman points me to this interview by Dylan Matthews of education researcher Thomas Kane, who at one point says, Once you corrected for measurement error, a teacher’s score on their chosen videos and on their unchosen videos were correlated at 1. They were perfectly correlated. Hoffman asks, “What do you think? Do you think that just maybe, perhaps, it’s possible we aught to consider, I’m just throwing out the possibility that it might be that the procedure for correcting measurement error might, you now, be a little too strong?” I don’t know exactly what’s happening here, but it might be something that I’ve seen on occasion when fitting multilevel models using a point estimate for the group-level variance. It goes like this: measurement-error models are multilevel models, they involve the estimation of a distribution of a latent variable. When fitting multilevel models, it is possible to estimate the group-level variance to be zero, even though the group-level varia

5 0.10937482 785 andrew gelman stats-2011-07-02-Experimental reasoning in social science

Introduction: As a statistician, I was trained to think of randomized experimentation as representing the gold standard of knowledge in the social sciences, and, despite having seen occasional arguments to the contrary, I still hold that view, expressed pithily by Box, Hunter, and Hunter (1978) that “To find out what happens when you change something, it is necessary to change it.” At the same time, in my capacity as a social scientist, I’ve published many applied research papers, almost none of which have used experimental data. In the present article, I’ll address the following questions: 1. Why do I agree with the consensus characterization of randomized experimentation as a gold standard? 2. Given point 1 above, why does almost all my research use observational data? In confronting these issues, we must consider some general issues in the strategy of social science research. We also take from the psychology methods literature a more nuanced perspective that considers several differen

6 0.10253008 400 andrew gelman stats-2010-11-08-Poli sci plagiarism update, and a note about the benefits of not caring

7 0.10251775 1435 andrew gelman stats-2012-07-30-Retracted articles and unethical behavior in economics journals?

8 0.098546013 2269 andrew gelman stats-2014-03-27-Beyond the Valley of the Trolls

9 0.096245825 295 andrew gelman stats-2010-09-25-Clusters with very small numbers of observations

10 0.092729077 2033 andrew gelman stats-2013-09-23-More on Bayesian methods and multilevel modeling

11 0.092484578 1248 andrew gelman stats-2012-04-06-17 groups, 6 group-level predictors: What to do?

12 0.092263587 2245 andrew gelman stats-2014-03-12-More on publishing in journals

13 0.091362894 1954 andrew gelman stats-2013-07-24-Too Good To Be True: The Scientific Mass Production of Spurious Statistical Significance

14 0.091214955 1690 andrew gelman stats-2013-01-23-When are complicated models helpful in psychology research and when are they overkill?

15 0.089876406 1934 andrew gelman stats-2013-07-11-Yes, worry about generalizing from data to population. But multilevel modeling is the solution, not the problem

16 0.08828155 1139 andrew gelman stats-2012-01-26-Suggested resolution of the Bem paradox

17 0.086570375 1695 andrew gelman stats-2013-01-28-Economists argue about Bayes

18 0.086142235 2255 andrew gelman stats-2014-03-19-How Americans vote

19 0.085585698 725 andrew gelman stats-2011-05-21-People kept emailing me this one so I think I have to blog something

20 0.08484596 202 andrew gelman stats-2010-08-12-Job openings in multilevel modeling in Bristol, England

similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.188), (1, -0.023), (2, -0.023), (3, -0.029), (4, -0.024), (5, -0.011), (6, -0.013), (7, -0.061), (8, 0.025), (9, 0.066), (10, 0.054), (11, -0.009), (12, -0.017), (13, 0.015), (14, 0.037), (15, -0.004), (16, -0.023), (17, 0.016), (18, -0.007), (19, -0.005), (20, -0.017), (21, -0.019), (22, -0.018), (23, 0.027), (24, -0.021), (25, -0.054), (26, -0.038), (27, -0.016), (28, -0.019), (29, -0.01), (30, -0.028), (31, 0.012), (32, 0.0), (33, -0.039), (34, -0.029), (35, -0.025), (36, 0.026), (37, 0.028), (38, 0.01), (39, 0.009), (40, -0.035), (41, 0.04), (42, 0.034), (43, -0.004), (44, -0.07), (45, -0.026), (46, 0.003), (47, 0.03), (48, -0.017), (49, -0.023)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.9758119 854 andrew gelman stats-2011-08-15-A silly paper that tries to make fun of multilevel models

2 0.81048006 952 andrew gelman stats-2011-10-11-More reason to like Sims besides just his name

Introduction: John Horton points to Sims ‘s comment on Angrist and Pischke : Top of page 8—he criticizes economists for using clustered standard errors—suggests using multilevel models instead. Awesome! So now there are at least two Nobel prize winners in economics who’ve expressed skepticism about controlled experiments. (I wonder if Sims is such a danger in a parking lot.) P.S. I’m still miffed that this journal didn’t invite me to comment on that article!

3 0.75651282 1585 andrew gelman stats-2012-11-20-“I know you aren’t the plagiarism police, but . . .”

Introduction: Someone I don’t know writes in: I have followed your thoughts on plagiarism rather closely, and I ran across something in the Economics literature that I felt might interest you (and if you were to share this, I’d rather remain anonymous as a junior faculty not looking to step on toes anywhere). I know you aren’t the plagiarism police, but figured you would have some input. I’ve been reading up on some literature regarding all-pay auctions for some research I have been working on and came across an interesting paper in J. Political Economy (1998) with the following intro: “Many economic allocations are decided by competition for a prize on the basis of costly activities. For example, monopoly licenses may be awarded to the person (or group) that lobbies the hardest (Tullock, 1967), or tickets may be given to those who wait in line the longest (Holt and Sherman 1982). In such contests, losers’ efforts are costly and are generally not compensated. These situations, which are esp

4 0.73451716 675 andrew gelman stats-2011-04-22-Arrow’s other theorem

Introduction: I received the following email from someone who’d like to remain anonymous: Lately I [the anonymous correspondent] witnessed that Bruno Frey has published two articles in two well known referreed journals on the Titanic disaster that try to explain survival rates of passenger on board. The articles were published in the Journal of Economic Perspectives and Rationality & Society . While looking up the name of the second journal where I stumbled across the article I even saw that they put the message in a third journal, the Proceedings of the National Academy of Sciences United States of America . To say it in Sopranos like style – with all due respect, I know Bruno Frey from conferences, I really appreciate his take on economics as a social science and he has really published more interesting stuff that most economists ever will. But putting the same message into three journals gives me headaches for at least two reasons: 1) When building a track record and scientific rep

5 0.72648954 2245 andrew gelman stats-2014-03-12-More on publishing in journals

Introduction: I’m postponing today’s scheduled post (“Empirical implications of Empirical Implications of Theoretical Models”) to continue the lively discussion from yesterday, What if I were to stop publishing in journals? . An example: my papers with Basbøll Thomas Basbøll and I got into a long discussion on our blogs about business school professor Karl Weick and other cases of plagiarism copying text without attribution. We felt it useful to take our ideas to the next level and write them up as a manuscript, which ended up being logical to split into two papers. At that point I put some effort into getting these papers published, which I eventually did: To throw away data: Plagiarism as a statistical crime went into American Scientist and When do stories work? Evidence and illustration in the social sciences will appear in Sociological Methods and Research. The second paper, in particular, took some effort to place; I got some advice from colleagues in sociology as to where

6 0.72609478 1035 andrew gelman stats-2011-11-29-“Tobin’s analysis here is methodologically old-fashioned in the sense that no attempt is made to provide microfoundations for the postulated adjustment processes”

7 0.72270608 1139 andrew gelman stats-2012-01-26-Suggested resolution of the Bem paradox

8 0.71452469 1747 andrew gelman stats-2013-03-03-More research on the role of puzzles in processing data graphics

9 0.70701617 1578 andrew gelman stats-2012-11-15-Outta control political incorrectness

10 0.70568967 2004 andrew gelman stats-2013-09-01-Post-publication peer review: How it (sometimes) really works

11 0.70424503 1987 andrew gelman stats-2013-08-18-A lot of statistical methods have this flavor, that they are a solution to a mathematical problem that has been posed without a careful enough sense of whether the problem is worth solving in the first place

12 0.70306569 848 andrew gelman stats-2011-08-11-That xkcd cartoon on multiple comparisons that all of you were sending me a couple months ago

13 0.70276934 1435 andrew gelman stats-2012-07-30-Retracted articles and unethical behavior in economics journals?

14 0.69932431 2218 andrew gelman stats-2014-02-20-Do differences between biology and statistics explain some of our diverging attitudes regarding criticism and replication of scientific claims?

15 0.69907546 601 andrew gelman stats-2011-03-05-Against double-blind reviewing: Political science and statistics are not like biology and physics

16 0.69832319 2137 andrew gelman stats-2013-12-17-Replication backlash

17 0.69697636 2269 andrew gelman stats-2014-03-27-Beyond the Valley of the Trolls

18 0.69548523 1889 andrew gelman stats-2013-06-08-Using trends in R-squared to measure progress in criminology??

19 0.69080281 1860 andrew gelman stats-2013-05-17-How can statisticians help psychologists do their research better?

20 0.68694234 128 andrew gelman stats-2010-07-05-The greatest works of statistics never published

similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(7, 0.015), (15, 0.014), (16, 0.041), (21, 0.412), (24, 0.075), (86, 0.017), (92, 0.016), (99, 0.311)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.96580505 1232 andrew gelman stats-2012-03-27-Banned in NYC school tests

Introduction: The list includes “hunting” but not “fishing,” so that’s cool. I wonder how they’d feel about a question involving different cuts of meat. In any case, I’m happy to see that “Bayes” is not on the banned list. P.S. Russell explains .

2 0.94561231 151 andrew gelman stats-2010-07-16-Wanted: Probability distributions for rank orderings

Introduction: Dietrich Stoyan writes: I asked the IMS people for an expert in statistics of voting/elections and they wrote me your name. I am a statistician, but never worked in the field voting/elections. It was my son-in-law who asked me for statistical theories in that field. He posed in particular the following problem: The aim of the voting is to come to a ranking of c candidates. Every vote is a permutation of these c candidates. The problem is to have probability distributions in the set of all permutations of c elements. Are there theories for such distributions? I should be very grateful for a fast answer with hints to literature. (I confess that I do not know your books.) My reply: Rather than trying to model the ranks directly, Iâ€™d recommend modeling a latent continuous outcome which then implies a distribution on ranks, if the ranks are of interest. There are lots of distributions of c-dimensional continuous outcomes. In political science, the usual way to start is

3 0.9261182 672 andrew gelman stats-2011-04-20-The R code for those time-use graphs

Introduction: By popular demand, hereâ€™s my R script for the time-use graphs : # The data a1 <- c(4.2,3.2,11.1,1.3,2.2,2.0) a2 <- c(3.9,3.2,10.0,0.8,3.1,3.1) a3 <- c(6.3,2.5,9.8,0.9,2.2,2.4) a4 <- c(4.4,3.1,9.8,0.8,3.3,2.7) a5 <- c(4.8,3.0,9.9,0.7,3.3,2.4) a6 <- c(4.0,3.4,10.5,0.7,3.3,2.1) a <- rbind(a1,a2,a3,a4,a5,a6) avg <- colMeans (a) avg.array <- t (array (avg, rev(dim(a)))) diff <- a - avg.array country.name <- c("France", "Germany", "Japan", "Britain", "USA", "Turkey") # The line plots par (mfrow=c(2,3), mar=c(4,4,2,.5), mgp=c(2,.7,0), tck=-.02, oma=c(3,0,4,0), bg="gray96", fg="gray30") for (i in 1:6){ plot (c(1,6), c(-1,1.7), xlab="", ylab="", xaxt="n", yaxt="n", bty="l", type="n") lines (1:6, diff[i,], col="blue") points (1:6, diff[i,], pch=19, col="black") if (i>3){ axis (1, c(1,3,5), c ("Work,\nstudy", "Eat,\nsleep", "Leisure"), mgp=c(2,1.5,0), tck=0, cex.axis=1.2) axis (1, c(2,4,6), c ("Unpaid\nwork", "Personal\nCare", "Other"), mgp=c(2,1.5,0),

4 0.91043866 1857 andrew gelman stats-2013-05-15-Does quantum uncertainty have a place in everyday applied statistics?

Introduction: Several months ago, Mike Betancourt and I wrote a discussion for the article, Can quantum probability provide a new direction for cognitive modeling?, by Emmanuel Pothos and Jerome Busemeyer, in Behavioral and Brain Sciences. We didn’t say much, but it was a milestone for me because, with this article, BBS became the 100th journal I’d published in. Anyway, the full article with its 34 discussions just appeared in the journal . Here it is. What surprised me, in reading the full discussion, was how supportive the commentary was. Given the topic of Pothos and Busemeyer’s article, I was expecting the discussions to range from gentle mockery to outright abuse. The discussion that Mike and I wrote was moderately encouraging, and I was expecting this to fall on the extreme positive end of the spectrum. Actually, though, most of the discussions were positive, and only a couple were purely negative (those would be “Quantum models of cognition as Orwellian newspeak” by Michael Lee a

5 0.90385234 432 andrew gelman stats-2010-11-27-Neumann update

Introduction: Steve Hsu, who started off this discussion, had some comments on my speculations on the personality of John von Neumann and others. Steve writes: I [Hsu] actually knew Feynman a bit when I was an undergrad, and found him to be very nice to students. Since then I have heard quite a few stories from people in theoretical physics which emphasize his nastier side, and I think in the end he was quite a complicated person like everyone else. There are a couple of pseudo-biographies of vN, but none as high quality as, e.g., Gleick’s book on Feynman or Hodges book about Turing. (Gleick studied physics as an undergrad at Harvard, and Hodges is a PhD in mathematical physics — pretty rare backgrounds for biographers!) For example, as mentioned on the comment thread to your post, Steve Heims wrote a book about both vN and Wiener (!), and Norman Macrae wrote a biography of vN. Both books are worth reading, but I think neither really do him justice. The breadth of vN’s work is just too m

same-blog 6 0.90275645 854 andrew gelman stats-2011-08-15-A silly paper that tries to make fun of multilevel models

7 0.90260869 2298 andrew gelman stats-2014-04-21-On deck this week

8 0.89254063 1275 andrew gelman stats-2012-04-22-Please stop me before I barf again

9 0.88423562 894 andrew gelman stats-2011-09-07-Hipmunk FAIL: Graphics without content is not enough

10 0.88169277 62 andrew gelman stats-2010-06-01-Two Postdoc Positions Available on Bayesian Hierarchical Modeling

11 0.87536591 2272 andrew gelman stats-2014-03-29-I agree with this comment

12 0.86871207 1826 andrew gelman stats-2013-04-26-“A Vast Graveyard of Undead Theories: Publication Bias and Psychological Science’s Aversion to the Null”

13 0.86073703 1675 andrew gelman stats-2013-01-15-“10 Things You Need to Know About Causal Effects”

14 0.85872519 1615 andrew gelman stats-2012-12-10-A defense of Tom Wolfe based on the impossibility of the law of small numbers in network structure

15 0.85384393 1401 andrew gelman stats-2012-06-30-David Hogg on statistics

16 0.84583116 1728 andrew gelman stats-2013-02-19-The grasshopper wins, and Greg Mankiw’s grandmother would be “shocked and appalled” all over again

17 0.83517146 514 andrew gelman stats-2011-01-13-News coverage of statistical issues…how did I do?

18 0.82492316 2306 andrew gelman stats-2014-04-26-Sleazy sock puppet can’t stop spamming our discussion of compressed sensing and promoting the work of Xiteng Liu

19 0.817945 900 andrew gelman stats-2011-09-11-Symptomatic innumeracy

20 0.81525159 1932 andrew gelman stats-2013-07-10-Don’t trust the Turk