803 andrew gelman stats-2011-07-14-Subtleties with measurement-error models for the evaluation of wacky claims

Introduction: A few days ago I discussed the evaluation of somewhat-plausible claims that are somewhat supported by theory and somewhat supported by statistical evidence. One point I raised was that an implausibly large estimate of effect size can be cause for concern: “Uri Simonsohn (the author of the recent rebuttal of the name-choice article by Pelham et al.) argued that the implied effects were too large to be believed (just as I was arguing above regarding the July 4th study), which makes more plausible his claims that the results arise from methodological artifacts. That calculation is straight Bayes: the distribution of systematic errors has much longer tails than the distribution of random errors, so the larger the estimated effect, the more likely it is to be a mistake. This little theoretical result is a bit annoying, because it is the larger effects that are the most interesting!” Larry Bartels notes that my reasoning above is a bit incoherent: I [Bartels] strongly agree with

1 A few days ago I discussed the evaluation of somewhat-plausible claims that are somewhat supported by theory and somewhat supported by statistical evidence. [sent-1, score-0.494]

2 One point I raised was that an implausibly large estimate of effect size can be cause for concern: “Uri Simonsohn (the author of the recent rebuttal of the name-choice article by Pelham et al.) [sent-2, score-0.926]

3 argued that the implied effects were too large to be believed (just as I was arguing above regarding the July 4th study), which makes more plausible his claims that the results arise from methodological artifacts. [sent-3, score-0.485]

4 That calculation is straight Bayes: the distribution of systematic errors has much longer tails than the distribution of random errors, so the larger the estimated effect, the more likely it is to be a mistake. [sent-4, score-1.017]

5 This little theoretical result is a bit annoying, because it is the larger effects that are the most interesting!” [sent-5, score-0.244]

6 Larry Bartels notes that my reasoning above is a bit incoherent: I [Bartels] strongly agree with your bottom line that our main aim should be “understanding effect sizes on a real scale.” [sent-6, score-0.456]

7 However, your paradoxical conclusion (“the larger the estimated effect, the more likely it is to be a mistake”) seems to distract attention from the effect size of primary interest: the magnitude of the “true” (causal) effect. [sent-7, score-1.403]

8 But the more important fact would seem to be that your posterior belief regarding the magnitude of the “true” (causal) effect, E(c|b), is also increasing in b (at least for plausible-seeming distributional assumptions). [sent-9, score-0.815]

9 Focusing on whether a surprising empirical result is “a mistake” (whatever that means) seems to concede too much to the simple-minded is-there-an-effect-or-isn’t-there perspective, while obscuring your more fundamental interest in “understanding [true] effect sizes on a real scale.” [sent-12, score-0.775]

10 Maybe a more correct statement would be that, given reasonable models for x, d, and e, if the estimate y gets implausibly large, the posterior estimate of x does not increase proportionally. [sent-15, score-0.546]

11 I think there will be some (non-Gaussian) models for which, as y gets larger, E(x|y) can actually go back toward zero. [sent-16, score-0.073]

12 But this will depend on the distributional form. [sent-17, score-0.148]

13 I agree that “how likely is it to be a mistake” is the wrong way to look at things. [sent-18, score-0.09]

14 No analysis is perfect, so the “mistake” framing is generally not so helpful. [sent-20, score-0.065]
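
The dependence on distributional form can be checked numerically. The sketch below is only an illustration under assumptions that the text here does not spell out: the estimate is taken to be y = x + d + e, with x the true effect, d the systematic error, and e the random error; x is given a N(0, 1) prior; and the two errors are lumped into one total error u = d + e. The function names, the prior scale, and the choice of a Cauchy distribution to stand in for "much longer tails" are all hypothetical choices made for this example, not the models used in the original discussion.

```python
import math


def normal_pdf(z, sd):
    """Density of N(0, sd^2) at z."""
    return math.exp(-0.5 * (z / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def cauchy_pdf(z, scale):
    """Density of Cauchy(0, scale) at z; a stand-in for long-tailed error."""
    return 1.0 / (math.pi * scale * (1.0 + (z / scale) ** 2))


def posterior_mean(y, prior_sd, error_pdf, half_width=60.0, n=12001):
    """E(x | y) for y = x + u, with x ~ N(0, prior_sd^2) and u ~ error_pdf.

    Computed as a plain Riemann sum over an evenly spaced grid of x values.
    """
    step = 2.0 * half_width / (n - 1)
    num = 0.0
    den = 0.0
    for i in range(n):
        x = -half_width + i * step
        # Unnormalized posterior weight: prior density times error density.
        w = normal_pdf(x, prior_sd) * error_pdf(y - x)
        num += x * w
        den += w
    return num / den


# ASSUMPTION: d and e are each N(0, 1), so the total error u is N(0, 2).
def gaussian_error(u):
    return normal_pdf(u, math.sqrt(2.0))


# ASSUMPTION: the total error u is Cauchy with scale 1 (long tails).
def long_tailed_error(u):
    return cauchy_pdf(u, 1.0)


for y in (1.0, 2.0, 5.0, 10.0, 20.0):
    g = posterior_mean(y, 1.0, gaussian_error)
    c = posterior_mean(y, 1.0, long_tailed_error)
    print(f"y = {y:5.1f}   Gaussian E(x|y) = {g:6.3f}   long-tailed E(x|y) = {c:6.3f}")
```

Under the all-Gaussian assumptions the posterior mean is y/3, so E(x|y) keeps increasing in y, which matches Bartels's point about E(c|b). Under the assumed Cauchy error, E(x|y) rises for moderate y and then falls back toward zero as y becomes extreme, which is the non-Gaussian behavior conjectured above. Other choices of prior and error distribution would give different curves.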


