andrew_gelman_stats andrew_gelman_stats-2012 andrew_gelman_stats-2012-1215 knowledge-graph by maker-knowledge-mining

1215 andrew gelman stats-2012-03-16-The “hot hand” and problems with hypothesis testing


meta infos for this blog

Source: html

Introduction: Gur Yaari writes : Anyone who has ever watched a sports competition is familiar with expressions like “on fire”, “in the zone”, “on a roll”, “momentum” and so on. But what do these expressions really mean? In 1985 when Thomas Gilovich, Robert Vallone and Amos Tversky studied this phenomenon for the first time, they defined it as: “. . . these phrases express a belief that the performance of a player during a particular period is significantly better than expected on the basis of the player’s overall record”. Their conclusion was that what people tend to perceive as a “hot hand” is essentially a cognitive illusion caused by a misperception of random sequences. Until recently there was little, if any, evidence to rule out their conclusion. Increased computing power and new data availability from various sports now provide surprising evidence of this phenomenon, thus reigniting the debate. Yaari goes on to some studies that have found time dependence in basketball, baseball, voll


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Gur Yaari writes : Anyone who has ever watched a sports competition is familiar with expressions like “on fire”, “in the zone”, “on a roll”, “momentum” and so on. [sent-1, score-0.575]

2 In 1985 when Thomas Gilovich, Robert Vallone and Amos Tversky studied this phenomenon for the first time, they defined it as: “. [sent-3, score-0.202]

3 these phrases express a belief that the performance of a player during a particular period is significantly better than expected on the basis of the player’s overall record”. [sent-6, score-0.288]

4 Their conclusion was that what people tend to perceive as a “hot hand” is essentially a cognitive illusion caused by a misperception of random sequences. [sent-7, score-0.367]

5 Until recently there was little, if any, evidence to rule out their conclusion. [sent-8, score-0.072]

6 Increased computing power and new data availability from various sports now provide surprising evidence of this phenomenon, thus reigniting the debate. [sent-9, score-0.407]

7 Yaari goes on to some studies that have found time dependence in basketball, baseball, volleyball, and bowling. [sent-10, score-0.078]

8 We are not machines, and anything that can affect our expectations (for example, our success in previous tries) should affect our performance. [sent-13, score-0.408]

9 The effects I’ve seen are small, on the order of 2 percentage points (for example, the probability of a success in some sports task might be 45% if you’re “hot” and 43% otherwise). [sent-15, score-0.496]

10 Yaari presents his results in terms of p-values but I think it’s best to look at effect sizes directly in terms of probabilities. [sent-16, score-0.242]

11 Sometimes if you succeed you will stay relaxed and focused, other times you can succeed and get overconfidence. [sent-19, score-0.485]

12 In summary, this is yet another problem where much is lost by going down the standard route of null hypothesis testing. [sent-22, score-0.08]

13 Better, in my view, to start with the admission of variation in the effect and go from there. [sent-23, score-0.245]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('yaari', 0.369), ('sports', 0.254), ('hot', 0.232), ('gilovich', 0.224), ('vallone', 0.224), ('phenomenon', 0.202), ('perceive', 0.173), ('expressions', 0.17), ('tversky', 0.163), ('succeed', 0.163), ('player', 0.143), ('affect', 0.116), ('misperception', 0.112), ('spectators', 0.112), ('success', 0.106), ('gur', 0.106), ('overturning', 0.106), ('variation', 0.101), ('momentum', 0.098), ('amos', 0.098), ('relaxed', 0.092), ('terms', 0.088), ('zone', 0.087), ('anyone', 0.086), ('watched', 0.085), ('illusion', 0.082), ('availability', 0.081), ('route', 0.08), ('roll', 0.08), ('consistency', 0.08), ('alike', 0.08), ('machines', 0.08), ('hand', 0.079), ('admission', 0.078), ('dependence', 0.078), ('fire', 0.076), ('overestimate', 0.076), ('basketball', 0.076), ('phrases', 0.075), ('dramatically', 0.073), ('evidence', 0.072), ('tries', 0.071), ('significantly', 0.07), ('expectations', 0.07), ('effects', 0.07), ('baseball', 0.068), ('stay', 0.067), ('effect', 0.066), ('task', 0.066), ('competition', 0.066)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1215 andrew gelman stats-2012-03-16-The “hot hand” and problems with hypothesis testing

Introduction: Gur Yaari writes : Anyone who has ever watched a sports competition is familiar with expressions like “on fire”, “in the zone”, “on a roll”, “momentum” and so on. But what do these expressions really mean? In 1985 when Thomas Gilovich, Robert Vallone and Amos Tversky studied this phenomenon for the first time, they defined it as: “. . . these phrases express a belief that the performance of a player during a particular period is significantly better than expected on the basis of the player’s overall record”. Their conclusion was that what people tend to perceive as a “hot hand” is essentially a cognitive illusion caused by a misperception of random sequences. Until recently there was little, if any, evidence to rule out their conclusion. Increased computing power and new data availability from various sports now provide surprising evidence of this phenomenon, thus reigniting the debate. Yaari goes on to some studies that have found time dependence in basketball, baseball, voll

2 0.68660319 2243 andrew gelman stats-2014-03-11-The myth of the myth of the myth of the hot hand

Introduction: Phil pointed me to this paper so I thought I probably better repeat what I wrote a couple years ago: 1. The effects are certainly not zero. We are not machines, and anything that can affect our expectations (for example, our success in previous tries) should affect our performance. 2. The effects I’ve seen are small, on the order of 2 percentage points (for example, the probability of a success in some sports task might be 45% if you’re “hot” and 43% otherwise). 3. There’s a huge amount of variation, not just between but also among players. Sometimes if you succeed you will stay relaxed and focused, other times you can succeed and get overconfidence. 4. Whatever the latest results on particular sports, I can’t see anyone overturning the basic finding of Gilovich, Vallone, and Tversky that players and spectators alike will perceive the hot hand even when it does not exist and dramatically overestimate the magnitude and consistency of any hot-hand phenomenon that does exist.

3 0.2128723 1173 andrew gelman stats-2012-02-17-Sports examples in class

Introduction: Karl Broman writes : I [Karl] personally would avoid sports entirely, as I view the subject to be insufficiently serious. . . . Certainly lots of statisticians are interested in sports. . . . And I’m not completely uninterested in sports: I like to watch football, particularly Nebraska, Green Bay, and Baltimore, and to see Notre Dame or any team from Florida or Texas lose. But statistics about sports? Yawn. As a person who loves sports, statistics, and sports statistics, I have a few thoughts: 1. Not everyone likes sports, and even fewer are interested in any particular sport. It’s ok to use sports examples, but don’t delude yourself into thinking that everyone in the class cares about it. 2. Don’t forget foreign students. A lot of them don’t even know the rules of kickball, fer chrissake! 3. Of the students who care about a sport, there will be a minority who really care. We had some serious basketball fans in our class last year. 4. I think the best solution

4 0.13891438 2336 andrew gelman stats-2014-05-16-How much can we learn about individual-level causal claims from state-level correlations?

Introduction: Hey, we all know the answer: “correlation does not imply causation”—but of course life is more complicated than that. As philosophers, economists, statisticians, and others have repeatedly noted, most of our information about the world is observational not experimental, yet we manage to draw causal conclusions all the time. Sure, some of these conclusions are wrong (more often than 5% of the time, I’m sure) but that’s an accepted part of life. Challenges in this regard arise in the design of a study, in the statistical analysis, in how you write it up for a peer-reviewed journal, and finally in how you present it to the world. School sports and life outcomes An interesting case of all this came up recently in a post on Freakonomics that pointed to a post on Deadspin that pointed to a research article . The claim was that “sports participation [in high school] causes women to be less likely to be religious . . . more likely to have children . . . more likely to be singl

5 0.12712954 1903 andrew gelman stats-2013-06-17-Weak identification provides partial information

Introduction: Matt Selove writes: My question is about Bayesian analysis of the linear regression model. It seems to me that in some cases this approach throws out useful information. As an example, imagine you have two basketball players randomly drawn from the pool of NBA players (which provides the prior). You’d like to estimate how many free throws each can make out of 100. You have two pieces of information: - Session 1: Each player shoots 100 shots, and you learn player A’s total minus player B’s total - Session 2: Player A does another session where he shoots 100 shots alone, and you learn his total If we take the regression approach: y_i = number of shots made beta_A = player A’s expected number out of 100 beta_B = player B’s expected number out of 100 x_i = vector of zeros and ones showing which player took shots In the above example, our datapoints are: y_1 (first number reported) = beta_A * 1 + beta_B * (-1) + epsilon_1 y_2 (second number reported) = beta_A * 1 +

6 0.087945364 263 andrew gelman stats-2010-09-08-The China Study: fact or fallacy?

7 0.086271323 813 andrew gelman stats-2011-07-21-Scrabble!

8 0.084455349 1400 andrew gelman stats-2012-06-29-Decline Effect in Linguistics?

9 0.081714258 466 andrew gelman stats-2010-12-13-“The truth wears off: Is there something wrong with the scientific method?”

10 0.079656139 1424 andrew gelman stats-2012-07-22-Extreme events as evidence for differences in distributions

11 0.079385132 2086 andrew gelman stats-2013-11-03-How best to compare effects measured in two different time periods?

12 0.077628225 364 andrew gelman stats-2010-10-22-Politics is not a random walk: Momentum and mean reversion in polling

13 0.075560458 506 andrew gelman stats-2011-01-06-That silly ESP paper and some silliness in a rebuttal as well

14 0.074997939 803 andrew gelman stats-2011-07-14-Subtleties with measurement-error models for the evaluation of wacky claims

15 0.07405629 697 andrew gelman stats-2011-05-05-A statistician rereads Bill James

16 0.073121957 1826 andrew gelman stats-2013-04-26-“A Vast Graveyard of Undead Theories: Publication Bias and Psychological Science’s Aversion to the Null”

17 0.072969727 1605 andrew gelman stats-2012-12-04-Write This Book

18 0.070911683 2072 andrew gelman stats-2013-10-21-The future (and past) of statistical sciences

19 0.070750333 2149 andrew gelman stats-2013-12-26-Statistical evidence for revised standards

20 0.069483168 1804 andrew gelman stats-2013-04-15-How effective are football coaches?


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.139), (1, -0.01), (2, 0.039), (3, -0.071), (4, 0.007), (5, -0.031), (6, 0.024), (7, 0.037), (8, 0.019), (9, -0.039), (10, -0.072), (11, 0.012), (12, 0.028), (13, -0.087), (14, -0.01), (15, 0.021), (16, -0.014), (17, -0.003), (18, -0.006), (19, 0.016), (20, -0.027), (21, 0.031), (22, -0.024), (23, 0.021), (24, -0.041), (25, 0.015), (26, -0.003), (27, 0.057), (28, -0.003), (29, -0.087), (30, 0.01), (31, -0.047), (32, -0.016), (33, -0.032), (34, -0.02), (35, 0.015), (36, 0.048), (37, -0.065), (38, 0.016), (39, -0.0), (40, 0.043), (41, 0.035), (42, -0.038), (43, -0.021), (44, -0.014), (45, 0.023), (46, 0.009), (47, -0.051), (48, 0.029), (49, -0.021)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.96009737 1215 andrew gelman stats-2012-03-16-The “hot hand” and problems with hypothesis testing

Introduction: Gur Yaari writes : Anyone who has ever watched a sports competition is familiar with expressions like “on fire”, “in the zone”, “on a roll”, “momentum” and so on. But what do these expressions really mean? In 1985 when Thomas Gilovich, Robert Vallone and Amos Tversky studied this phenomenon for the first time, they defined it as: “. . . these phrases express a belief that the performance of a player during a particular period is significantly better than expected on the basis of the player’s overall record”. Their conclusion was that what people tend to perceive as a “hot hand” is essentially a cognitive illusion caused by a misperception of random sequences. Until recently there was little, if any, evidence to rule out their conclusion. Increased computing power and new data availability from various sports now provide surprising evidence of this phenomenon, thus reigniting the debate. Yaari goes on to some studies that have found time dependence in basketball, baseball, voll

2 0.87631416 2243 andrew gelman stats-2014-03-11-The myth of the myth of the myth of the hot hand

Introduction: Phil pointed me to this paper so I thought I probably better repeat what I wrote a couple years ago: 1. The effects are certainly not zero. We are not machines, and anything that can affect our expectations (for example, our success in previous tries) should affect our performance. 2. The effects I’ve seen are small, on the order of 2 percentage points (for example, the probability of a success in some sports task might be 45% if you’re “hot” and 43% otherwise). 3. There’s a huge amount of variation, not just between but also among players. Sometimes if you succeed you will stay relaxed and focused, other times you can succeed and get overconfidence. 4. Whatever the latest results on particular sports, I can’t see anyone overturning the basic finding of Gilovich, Vallone, and Tversky that players and spectators alike will perceive the hot hand even when it does not exist and dramatically overestimate the magnitude and consistency of any hot-hand phenomenon that does exist.

3 0.73037165 1607 andrew gelman stats-2012-12-05-The p-value is not . . .

Introduction: From a recent email exchange: I agree that you should never compare p-values directly. The p-value is a strange nonlinear transformation of data that is only interpretable under the null hypothesis. Once you abandon the null (as we do when we observe something with a very low p-value), the p-value itself becomes irrelevant. To put it another way, the p-value is a measure of evidence, it is not an estimate of effect size (as it is often treated, with the idea that a p=.001 effect is larger than a p=.01 effect, etc). Even conditional on sample size, the p-value is not a measure of effect size.

4 0.71330428 813 andrew gelman stats-2011-07-21-Scrabble!

Introduction: AT writes : Sitting on my [AT's] to-do list for a while now has been an exploration of Scrabble from an experimental design point of view; how to better design a tournament to make the variance as small as possible while still preserving the appearance of the home game to its players. . . . I’m proud (relieved?) to say that I’ve finally finished the first draft of this work for two-player head-to-head games, with a duplication method that ensures that if the game were repeated, each player would receive tiles from the reserve in the same sequence: think of the tiles being laid out in order (but unseen to the players), so that one player draws from the front and the other draws from the back. . . . One goal of this was to figure out how much of the variance in score comes from the tile order and how much comes from the board, given that a tile order would be expected. It turns out to be about half-bag, half-board . . . Some other findings from the simulations: The blank

5 0.70923597 1400 andrew gelman stats-2012-06-29-Decline Effect in Linguistics?

Introduction: Josef Fruehwald writes : In the past few years, the empirical foundations of the social sciences, especially Psychology, have been coming under increased scrutiny and criticism. For example, there was the New Yorker piece from 2010 called “The Truth Wears Off” about the “decline effect,” or how the effect size of a phenomenon appears to decrease over time. . . . I [Fruehwald] am a linguist. Do the problems facing psychology face me? To really answer that, I first have to decide which explanation for the decline effect I think is most likely, and I think Andrew Gelman’s proposal is a good candidate: The short story is that if you screen for statistical significance when estimating small effects, you will necessarily overestimate the magnitudes of effects, sometimes by a huge amount. I’ve put together some R code to demonstrate this point. Let’s say I’m looking at two populations, and unknown to me as a researcher, there is a small difference between the two, even though they

6 0.70532203 629 andrew gelman stats-2011-03-26-Is it plausible that 1% of people pick a career based on their first name?

7 0.69698465 2090 andrew gelman stats-2013-11-05-How much do we trust a new claim that early childhood stimulation raised earnings by 42%?

8 0.69462061 1744 andrew gelman stats-2013-03-01-Why big effects are more important than small effects

9 0.69389039 797 andrew gelman stats-2011-07-11-How do we evaluate a new and wacky claim?

10 0.67277837 1746 andrew gelman stats-2013-03-02-Fishing for cherries

11 0.66851825 898 andrew gelman stats-2011-09-10-Fourteen magic words: an update

12 0.6592207 1760 andrew gelman stats-2013-03-12-Misunderstanding the p-value

13 0.65776771 1776 andrew gelman stats-2013-03-25-The harm done by tests of significance

14 0.65592217 2165 andrew gelman stats-2014-01-09-San Fernando Valley cityscapes: An example of the benefits of fractal devastation?

15 0.65585864 963 andrew gelman stats-2011-10-18-Question on Type M errors

16 0.65212339 511 andrew gelman stats-2011-01-11-One more time on that ESP study: The problem of overestimates and the shrinkage solution

17 0.64991593 256 andrew gelman stats-2010-09-04-Noooooooooooooooooooooooooooooooooooooooooooooooo!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

18 0.647044 1195 andrew gelman stats-2012-03-04-Multiple comparisons dispute in the tabloids

19 0.64372379 803 andrew gelman stats-2011-07-14-Subtleties with measurement-error models for the evaluation of wacky claims

20 0.64032525 1883 andrew gelman stats-2013-06-04-Interrogating p-values


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(5, 0.01), (15, 0.02), (16, 0.025), (17, 0.017), (21, 0.028), (24, 0.151), (27, 0.015), (89, 0.426), (95, 0.014), (99, 0.174)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.94761717 1685 andrew gelman stats-2013-01-21-Class on computational social science this semester, Fridays, 1:00-3:40pm

Introduction: Sharad Goel, Jake Hofman, and Sergei Vassilvitskii are teaching this awesome class on computational social science this semester in the applied math department at Columbia. Here’s the course info . You should take this course. These guys are amazing.

2 0.93393397 2243 andrew gelman stats-2014-03-11-The myth of the myth of the myth of the hot hand

Introduction: Phil pointed me to this paper so I thought I probably better repeat what I wrote a couple years ago: 1. The effects are certainly not zero. We are not machines, and anything that can affect our expectations (for example, our success in previous tries) should affect our performance. 2. The effects I’ve seen are small, on the order of 2 percentage points (for example, the probability of a success in some sports task might be 45% if you’re “hot” and 43% otherwise). 3. There’s a huge amount of variation, not just between but also among players. Sometimes if you succeed you will stay relaxed and focused, other times you can succeed and get overconfidence. 4. Whatever the latest results on particular sports, I can’t see anyone overturning the basic finding of Gilovich, Vallone, and Tversky that players and spectators alike will perceive the hot hand even when it does not exist and dramatically overestimate the magnitude and consistency of any hot-hand phenomenon that does exist.

same-blog 3 0.92295146 1215 andrew gelman stats-2012-03-16-The “hot hand” and problems with hypothesis testing

Introduction: Gur Yaari writes : Anyone who has ever watched a sports competition is familiar with expressions like “on fire”, “in the zone”, “on a roll”, “momentum” and so on. But what do these expressions really mean? In 1985 when Thomas Gilovich, Robert Vallone and Amos Tversky studied this phenomenon for the first time, they defined it as: “. . . these phrases express a belief that the performance of a player during a particular period is significantly better than expected on the basis of the player’s overall record”. Their conclusion was that what people tend to perceive as a “hot hand” is essentially a cognitive illusion caused by a misperception of random sequences. Until recently there was little, if any, evidence to rule out their conclusion. Increased computing power and new data availability from various sports now provide surprising evidence of this phenomenon, thus reigniting the debate. Yaari goes on to some studies that have found time dependence in basketball, baseball, voll

4 0.92137027 1477 andrew gelman stats-2012-08-30-Visualizing Distributions of Covariance Matrices

Introduction: Since we’ve been discussing prior distributions on covariance matrices, I will recommend this recent article (coauthored with Tomoki Tokuda, Ben Goodrich, Iven Van Mechelen, and Francis Tuerlinckx) on their visualization: We present some methods for graphing distributions of covariance matrices and demonstrate them on several models, including the Wishart, inverse-Wishart, and scaled inverse-Wishart families in different dimensions. Our visualizations follow the principle of decomposing a covariance matrix into scale parameters and correlations, pulling out marginal summaries where possible and using two and three-dimensional plots to reveal multivariate structure. Visualizing a distribution of covariance matrices is a step beyond visualizing a single covariance matrix or a single multivariate dataset. Our visualization methods are available through the R package VisCov.

5 0.90117538 1953 andrew gelman stats-2013-07-24-Recently in the sister blog

Introduction: Would You Accept DNA From A Murderer?

6 0.84679329 1160 andrew gelman stats-2012-02-09-Familial Linkage between Neuropsychiatric Disorders and Intellectual Interests

7 0.84293509 1756 andrew gelman stats-2013-03-10-He said he was sorry

8 0.81789291 1708 andrew gelman stats-2013-02-05-Wouldn’t it be cool if Glenn Hubbard were consulting for Herbalife and I were on the other side?

9 0.81616807 459 andrew gelman stats-2010-12-09-Solve mazes by starting at the exit

10 0.78892207 833 andrew gelman stats-2011-07-31-Untunable Metropolis

11 0.73642457 1032 andrew gelman stats-2011-11-28-Does Avastin work on breast cancer? Should Medicare be paying for it?

12 0.73441672 407 andrew gelman stats-2010-11-11-Data Visualization vs. Statistical Graphics

13 0.72288781 1855 andrew gelman stats-2013-05-13-Stan!

14 0.68609893 1991 andrew gelman stats-2013-08-21-BDA3 table of contents (also a new paper on visualization)

15 0.67595387 566 andrew gelman stats-2011-02-09-The boxer, the wrestler, and the coin flip, again

16 0.66603643 1320 andrew gelman stats-2012-05-14-Question 4 of my final exam for Design and Analysis of Sample Surveys

17 0.65517664 1572 andrew gelman stats-2012-11-10-I don’t like this cartoon

18 0.65031326 623 andrew gelman stats-2011-03-21-Baseball’s greatest fielders

19 0.64581996 231 andrew gelman stats-2010-08-24-Yet another Bayesian job opportunity

20 0.64229375 1628 andrew gelman stats-2012-12-17-Statistics in a world where nothing is random