andrew_gelman_stats andrew_gelman_stats-2013 andrew_gelman_stats-2013-1672 knowledge-graph by maker-knowledge-mining

1672 andrew gelman stats-2013-01-14-How do you think about the values in a confidence interval?


meta infos for this blog

Source: html

Introduction: Philip Jones writes: As an interested reader of your blog, I wondered if you might consider a blog entry sometime on the following question I posed on CrossValidated (StackExchange). I originally posed the question based on my uncertainty about 95% CIs: “Are all values within the 95% CI equally likely (probable), or are the values at the “tails” of the 95% CI less likely than those in the middle of the CI closer to the point estimate?” I posed this question based on discordant information I found at a couple of different web sources (I posted these sources in the body of the question). I received some interesting replies, and the replies were not unanimous, in fact there is some serious disagreement there! After seeing this disagreement, I naturally thought of you, and whether you might be able to clear this up. Please note I am not referring to credible intervals, but rather to the common medical journal reporting standard of confidence intervals. My response: First


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Philip Jones writes: As an interested reader of your blog, I wondered if you might consider a blog entry sometime on the following question I posed on CrossValidated (StackExchange). [sent-1, score-0.581]

2 I originally posed the question based on my uncertainty about 95% CIs: “Are all values within the 95% CI equally likely (probable), or are the values at the “tails” of the 95% CI less likely than those in the middle of the CI closer to the point estimate? [sent-2, score-1.394]

3 ” I posed this question based on discordant information I found at a couple of different web sources (I posted these sources in the body of the question). [sent-3, score-0.822]

4 I received some interesting replies, and the replies were not unanimous, in fact there is some serious disagreement there! [sent-4, score-0.289]

5 After seeing this disagreement, I naturally thought of you, and whether you might be able to clear this up. [sent-5, score-0.065]

6 Please note I am not referring to credible intervals, but rather to the common medical journal reporting standard of confidence intervals. [sent-6, score-0.417]

7 My response: First off, I’m going to forget about the official statistics-textbook interpretation, in which a 95% confidence interval is defined as a procedure that has a 95% chance of covering the true value. [sent-7, score-0.916]

8 For most of the examples I’ve ever seen, this interpretation is pretty useless because the goal is to learn about the situation we have right now in front of us, not merely to make a statement with certain average properties. [sent-8, score-0.287]

9 I would say that the usual interpretation of a confidence interval is as a set of parameter values that are consistent with the data. [sent-9, score-1.326]

10 Typically the values near the center of the interval are more consistent, and sometimes this idea is formalized by thinking about hypothetical nested 1%, 2%, 3%, …, 99% intervals, where the more central parameter values are in more of these intervals. [sent-10, score-1.285]

11 The real problem is that the interval will exclude the true value at least 5% of the time. [sent-11, score-0.534]

12 Also, when the sample size is small, the confidence interval can include lots of implausible values too. [sent-13, score-0.993]

13 Consider the notorious claim that beautiful parents were more likely to have girls. [sent-14, score-0.452]

14 Here, the confidence interval included all sorts of big numbers (for example, the data were consistent with beautiful parents being 10 percentage points more likely to have a girl, compared to ugly parents) that a quick literature review revealed were highly implausible. [sent-15, score-1.385]

15 This was a setting where the prior information was much stronger than the data. [sent-16, score-0.12]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('interval', 0.379), ('values', 0.276), ('confidence', 0.265), ('posed', 0.257), ('ci', 0.252), ('parents', 0.178), ('interpretation', 0.158), ('consistent', 0.152), ('replies', 0.151), ('likely', 0.142), ('disagreement', 0.138), ('beautiful', 0.132), ('intervals', 0.118), ('crossvalidated', 0.111), ('discordant', 0.111), ('unanimous', 0.111), ('sources', 0.111), ('question', 0.105), ('formalized', 0.104), ('cis', 0.1), ('parameter', 0.096), ('nested', 0.089), ('credible', 0.087), ('exclude', 0.087), ('jones', 0.087), ('tails', 0.086), ('probable', 0.086), ('covering', 0.079), ('sometime', 0.078), ('girl', 0.076), ('philip', 0.075), ('wondered', 0.073), ('implausible', 0.073), ('revealed', 0.071), ('body', 0.069), ('equally', 0.069), ('true', 0.068), ('consider', 0.068), ('useless', 0.067), ('ugly', 0.066), ('originally', 0.066), ('referring', 0.065), ('hypothetical', 0.065), ('naturally', 0.065), ('official', 0.064), ('stronger', 0.062), ('front', 0.062), ('forget', 0.061), ('closer', 0.061), ('information', 0.058)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1672 andrew gelman stats-2013-01-14-How do you think about the values in a confidence interval?

Introduction: Philip Jones writes: As an interested reader of your blog, I wondered if you might consider a blog entry sometime on the following question I posed on CrossValidated (StackExchange). I originally posed the question based on my uncertainty about 95% CIs: “Are all values within the 95% CI equally likely (probable), or are the values at the “tails” of the 95% CI less likely than those in the middle of the CI closer to the point estimate?” I posed this question based on discordant information I found at a couple of different web sources (I posted these sources in the body of the question). I received some interesting replies, and the replies were not unanimous, in fact there is some serious disagreement there! After seeing this disagreement, I naturally thought of you, and whether you might be able to clear this up. Please note I am not referring to credible intervals, but rather to the common medical journal reporting standard of confidence intervals. My response: First

2 0.41102442 480 andrew gelman stats-2010-12-21-Instead of “confidence interval,” let’s say “uncertainty interval”

Introduction: I’ve become increasingly uncomfortable with the term “confidence interval,” for several reasons: - The well-known difficulties in interpretation (officially the confidence statement can be interpreted only on average, but people typically implicitly give the Bayesian interpretation to each case), - The ambiguity between confidence intervals and predictive intervals. (See the footnote in BDA where we discuss the difference between “inference” and “prediction” in the classical framework.) - The awkwardness of explaining that confidence intervals are big in noisy situations where you have less confidence, and confidence intervals are small when you have more confidence. So here’s my proposal. Let’s use the term “uncertainty interval” instead. The uncertainty interval tells you how much uncertainty you have. That works pretty well, I think. P.S. As of this writing, “confidence interval” outGoogles “uncertainty interval” by the huge margin of 9.5 million to 54000. So we

3 0.3456158 1913 andrew gelman stats-2013-06-24-Why it doesn’t make sense in general to form confidence intervals by inverting hypothesis tests

Introduction: I’m reposing this classic from 2011 . . . Peter Bergman pointed me to this discussion from Cyrus of a presentation by Guido Imbens on design of randomized experiments. Cyrus writes: The standard analysis that Imbens proposes includes (1) a Fisher-type permutation test of the sharp null hypothesis–what Imbens referred to as “testing”–along with a (2) Neyman-type point estimate of the sample average treatment effect and confidence interval–what Imbens referred to as “estimation.” . . . Imbens claimed that testing and estimation are separate enterprises with separate goals and that the two should not be confused. I [Cyrus] took it as a warning against proposals that use “inverted” tests in order to produce point estimates and confidence intervals. There is no reason that such confidence intervals will have accurate coverage except under rather dire assumptions, meaning that they are not “confidence intervals” in the way that we usually think of them. I agree completely. T

4 0.32951355 870 andrew gelman stats-2011-08-25-Why it doesn’t make sense in general to form confidence intervals by inverting hypothesis tests

Introduction: Peter Bergman points me to this discussion from Cyrus of a presentation by Guido Imbens on design of randomized experiments. Cyrus writes: The standard analysis that Imbens proposes includes (1) a Fisher-type permutation test of the sharp null hypothesis–what Imbens referred to as “testing”–along with a (2) Neyman-type point estimate of the sample average treatment effect and confidence interval–what Imbens referred to as “estimation.” . . . Imbens claimed that testing and estimation are separate enterprises with separate goals and that the two should not be confused. I [Cyrus] took it as a warning against proposals that use “inverted” tests in order to produce point estimates and confidence intervals. There is no reason that such confidence intervals will have accurate coverage except under rather dire assumptions, meaning that they are not “confidence intervals” in the way that we usually think of them. I agree completely. This is something I’ve been saying for a long

5 0.23605365 1334 andrew gelman stats-2012-05-21-Question 11 of my final exam for Design and Analysis of Sample Surveys

Introduction: 11. Here is the result of fitting a logistic regression to Republican vote in the 1972 NES. Income is on a 1–5 scale. Approximately how much more likely is a person in income category 4 to vote Republican, compared to a person income category 2? Give an approximate estimate, standard error, and 95% interval. Solution to question 10 From yesterday : 10. Out of a random sample of 100 Americans, zero report having ever held political office. From this information, give a 95% confidence interval for the proportion of Americans who have ever held political office. Solution: Use the Agresti-Coull interval based on (y+2)/(n+4). Estimate is p.hat=2/104=0.02, se is sqrt(p.hat*(1-p.hat)/104)=0.013, 95% interval is [0.02 +/- 2*0.013] = [0,0.05].

6 0.21496633 2248 andrew gelman stats-2014-03-15-Problematic interpretations of confidence intervals

7 0.21284543 1333 andrew gelman stats-2012-05-20-Question 10 of my final exam for Design and Analysis of Sample Surveys

8 0.15904813 2201 andrew gelman stats-2014-02-06-Bootstrap averaging: Examples where it works and where it doesn’t work

9 0.15069553 1470 andrew gelman stats-2012-08-26-Graphs showing regression uncertainty: the code!

10 0.15043119 1941 andrew gelman stats-2013-07-16-Priors

11 0.13846748 2042 andrew gelman stats-2013-09-28-Difficulties of using statistical significance (or lack thereof) to sift through and compare research hypotheses

12 0.13800712 923 andrew gelman stats-2011-09-24-What is the normal range of values in a medical test?

13 0.13294704 1149 andrew gelman stats-2012-02-01-Philosophy of Bayesian statistics: my reactions to Cox and Mayo

14 0.13229771 1527 andrew gelman stats-2012-10-10-Another reason why you can get good inferences from a bad model

15 0.12877607 2142 andrew gelman stats-2013-12-21-Chasing the noise

16 0.12796226 1968 andrew gelman stats-2013-08-05-Evidence on the impact of sustained use of polynomial regression on causal inference (a claim that coal heating is reducing lifespan by 5 years for half a billion people)

17 0.12377623 1478 andrew gelman stats-2012-08-31-Watercolor regression

18 0.12077969 188 andrew gelman stats-2010-08-06-Fake newspaper headlines

19 0.11976662 1713 andrew gelman stats-2013-02-08-P-values and statistical practice

20 0.1195023 370 andrew gelman stats-2010-10-25-Who gets wedding announcements in the Times?


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.182), (1, 0.044), (2, 0.086), (3, -0.08), (4, 0.01), (5, -0.058), (6, 0.036), (7, 0.054), (8, -0.036), (9, -0.123), (10, -0.057), (11, 0.006), (12, 0.025), (13, -0.027), (14, -0.033), (15, -0.011), (16, -0.048), (17, -0.049), (18, 0.069), (19, -0.102), (20, 0.141), (21, 0.027), (22, 0.121), (23, -0.061), (24, 0.2), (25, -0.089), (26, -0.037), (27, -0.127), (28, -0.033), (29, 0.046), (30, -0.024), (31, -0.097), (32, -0.051), (33, -0.057), (34, 0.045), (35, 0.09), (36, 0.013), (37, 0.148), (38, 0.049), (39, -0.01), (40, 0.004), (41, -0.008), (42, 0.117), (43, -0.024), (44, 0.015), (45, 0.002), (46, -0.011), (47, -0.022), (48, 0.02), (49, 0.074)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.97643232 1672 andrew gelman stats-2013-01-14-How do you think about the values in a confidence interval?

Introduction: Philip Jones writes: As an interested reader of your blog, I wondered if you might consider a blog entry sometime on the following question I posed on CrossValidated (StackExchange). I originally posed the question based on my uncertainty about 95% CIs: “Are all values within the 95% CI equally likely (probable), or are the values at the “tails” of the 95% CI less likely than those in the middle of the CI closer to the point estimate?” I posed this question based on discordant information I found at a couple of different web sources (I posted these sources in the body of the question). I received some interesting replies, and the replies were not unanimous, in fact there is some serious disagreement there! After seeing this disagreement, I naturally thought of you, and whether you might be able to clear this up. Please note I am not referring to credible intervals, but rather to the common medical journal reporting standard of confidence intervals. My response: First

2 0.9226318 480 andrew gelman stats-2010-12-21-Instead of “confidence interval,” let’s say “uncertainty interval”

Introduction: I’ve become increasingly uncomfortable with the term “confidence interval,” for several reasons: - The well-known difficulties in interpretation (officially the confidence statement can be interpreted only on average, but people typically implicitly give the Bayesian interpretation to each case), - The ambiguity between confidence intervals and predictive intervals. (See the footnote in BDA where we discuss the difference between “inference” and “prediction” in the classical framework.) - The awkwardness of explaining that confidence intervals are big in noisy situations where you have less confidence, and confidence intervals are small when you have more confidence. So here’s my proposal. Let’s use the term “uncertainty interval” instead. The uncertainty interval tells you how much uncertainty you have. That works pretty well, I think. P.S. As of this writing, “confidence interval” outGoogles “uncertainty interval” by the huge margin of 9.5 million to 54000. So we

3 0.81891978 1913 andrew gelman stats-2013-06-24-Why it doesn’t make sense in general to form confidence intervals by inverting hypothesis tests

Introduction: I’m reposing this classic from 2011 . . . Peter Bergman pointed me to this discussion from Cyrus of a presentation by Guido Imbens on design of randomized experiments. Cyrus writes: The standard analysis that Imbens proposes includes (1) a Fisher-type permutation test of the sharp null hypothesis–what Imbens referred to as “testing”–along with a (2) Neyman-type point estimate of the sample average treatment effect and confidence interval–what Imbens referred to as “estimation.” . . . Imbens claimed that testing and estimation are separate enterprises with separate goals and that the two should not be confused. I [Cyrus] took it as a warning against proposals that use “inverted” tests in order to produce point estimates and confidence intervals. There is no reason that such confidence intervals will have accurate coverage except under rather dire assumptions, meaning that they are not “confidence intervals” in the way that we usually think of them. I agree completely. T

4 0.80086374 870 andrew gelman stats-2011-08-25-Why it doesn’t make sense in general to form confidence intervals by inverting hypothesis tests

Introduction: Peter Bergman points me to this discussion from Cyrus of a presentation by Guido Imbens on design of randomized experiments. Cyrus writes: The standard analysis that Imbens proposes includes (1) a Fisher-type permutation test of the sharp null hypothesis–what Imbens referred to as “testing”–along with a (2) Neyman-type point estimate of the sample average treatment effect and confidence interval–what Imbens referred to as “estimation.” . . . Imbens claimed that testing and estimation are separate enterprises with separate goals and that the two should not be confused. I [Cyrus] took it as a warning against proposals that use “inverted” tests in order to produce point estimates and confidence intervals. There is no reason that such confidence intervals will have accurate coverage except under rather dire assumptions, meaning that they are not “confidence intervals” in the way that we usually think of them. I agree completely. This is something I’ve been saying for a long

5 0.72205919 1333 andrew gelman stats-2012-05-20-Question 10 of my final exam for Design and Analysis of Sample Surveys

Introduction: 10. Out of a random sample of 100 Americans, zero report having ever held political office. From this information, give a 95% confidence interval for the proportion of Americans who have ever held political office. Solution to question 9 From yesterday : 9. Out of a population of 100 medical records, 40 are randomly sampled and then audited. 10 out of the 40 audits reveal fraud. From this information, give an estimate, standard error, and 95% confidence interval for the proportion of audits in the population with fraud. Solution: estimate is p.hat=10/40=0.25. Se is sqrt(1-f)*sqrt(p.hat*(1-.hat)/n)=sqrt(1-0.4)*sqrt(0.25*0.75/40)=0.053. 95% interval is [0.25 +/- 2*0.053] = [0.14,0.36].

6 0.70544052 2248 andrew gelman stats-2014-03-15-Problematic interpretations of confidence intervals

7 0.6614247 1334 andrew gelman stats-2012-05-21-Question 11 of my final exam for Design and Analysis of Sample Surveys

8 0.6608904 2201 andrew gelman stats-2014-02-06-Bootstrap averaging: Examples where it works and where it doesn’t work

9 0.60806322 1662 andrew gelman stats-2013-01-09-The difference between “significant” and “non-significant” is not itself statistically significant

10 0.60502541 1881 andrew gelman stats-2013-06-03-Boot

11 0.60386437 1206 andrew gelman stats-2012-03-10-95% intervals that I don’t believe, because they’re from a flat prior I don’t believe

12 0.58303994 1016 andrew gelman stats-2011-11-17-I got 99 comparisons but multiplicity ain’t one

13 0.5695172 1470 andrew gelman stats-2012-08-26-Graphs showing regression uncertainty: the code!

14 0.56826466 1331 andrew gelman stats-2012-05-19-Question 9 of my final exam for Design and Analysis of Sample Surveys

15 0.55971515 2142 andrew gelman stats-2013-12-21-Chasing the noise

16 0.53207517 410 andrew gelman stats-2010-11-12-The Wald method has been the subject of extensive criticism by statisticians for exaggerating results”

17 0.50819349 306 andrew gelman stats-2010-09-29-Statistics and the end of time

18 0.50740904 1021 andrew gelman stats-2011-11-21-Don’t judge a book by its title

19 0.5068965 775 andrew gelman stats-2011-06-21-Fundamental difficulty of inference for a ratio when the denominator could be positive or negative

20 0.50675797 1478 andrew gelman stats-2012-08-31-Watercolor regression


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(13, 0.14), (16, 0.063), (21, 0.06), (24, 0.104), (36, 0.041), (43, 0.019), (48, 0.013), (63, 0.02), (86, 0.029), (89, 0.028), (99, 0.356)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.98047239 971 andrew gelman stats-2011-10-25-Apply now for Earth Institute postdoctoral fellowships at Columbia University

Introduction: The economy isn’t going so well, but there are some interesting possibilities here at Columbia University. One such option that you should be thinking about is the Earth Institute Fellowship , which pays well, includes a research stipend, and puts you in an exciting interdisciplinary community of faculty and postdoctoral researchers. The Earth Institute at Columbia brings in several postdocs each year–it’s a two-year gig–and some of them have been statisticians (recently, Kenny Shirley and Leontine Alkema). We’re particularly interested in statisticians who have research interests in development and public health. It’s fine–not just fine, but ideal–if you are interested in statistical methods also. The EI postdoc can be a place to do interesting work and begin a research career. If you’re a statistician who’s interested in this fellowship, feel free to contact me directly–you have to apply to the Earth Institute directly (see link above), but I’m happy to give you advice about

2 0.97086304 597 andrew gelman stats-2011-03-02-RStudio – new cross-platform IDE for R

Introduction: The new R environment RStudio looks really great, especially for users new to R. In teaching, these are often people new to programming anything, much less statistical models. The R GUIs were different on each platform, with (sometimes modal) windows appearing and disappearing and no unified design. RStudio fixes that and has already found a happy home on my desktop. Initial impressions I’ve been using it for the past couple of days. For me, it replaces the niche that R.app held: looking at help, quickly doing something I don’t want to pollute a project workspace with; sometimes data munging, merging, and transforming; and prototyping plots. RStudio is better than R.app at all of these things. For actual development and papers, though, I remain wedded to emacs+ess (good old C-x M-c M-Butterfly ). Favorite features in no particular order plots seamlessly made in new graphics devices. This is huge— instead of one active plot window named something like quartz(1) t

3 0.96875942 172 andrew gelman stats-2010-07-30-Why don’t we have peer reviewing for oral presentations?

Introduction: Panos Ipeirotis writes in his blog post : Everyone who has attended a conference knows that the quality of the talks is very uneven. There are talks that are highly engaging, entertaining, and describe nicely the research challenges and solutions. And there are talks that are a waste of time. Either the presenter cannot present clearly, or the presented content is impossible to digest within the time frame of the presentation. We already have reviewing for the written part. The program committee examines the quality of the written paper and vouch for its technical content. However, by looking at a paper it is impossible to know how nicely it can be presented. Perhaps the seemingly solid but boring paper can be a very entertaining presentation. Or an excellent paper may be written by a horrible presenter. Why not having a second round of reviewing, where the authors of accepted papers submit their presentations (slides and a YouTube video) for presentation to the conference.

same-blog 4 0.966838 1672 andrew gelman stats-2013-01-14-How do you think about the values in a confidence interval?

Introduction: Philip Jones writes: As an interested reader of your blog, I wondered if you might consider a blog entry sometime on the following question I posed on CrossValidated (StackExchange). I originally posed the question based on my uncertainty about 95% CIs: “Are all values within the 95% CI equally likely (probable), or are the values at the “tails” of the 95% CI less likely than those in the middle of the CI closer to the point estimate?” I posed this question based on discordant information I found at a couple of different web sources (I posted these sources in the body of the question). I received some interesting replies, and the replies were not unanimous, in fact there is some serious disagreement there! After seeing this disagreement, I naturally thought of you, and whether you might be able to clear this up. Please note I am not referring to credible intervals, but rather to the common medical journal reporting standard of confidence intervals. My response: First

5 0.96645987 1852 andrew gelman stats-2013-05-12-Crime novels for economists

Introduction: Following up on this post by Noah Smith on economics in science fiction, Mark Palko writes on economics in crime fiction. Just as almost all science fiction is ultimately about politics, one could say that just about all crime fiction is about economics. But if I had to pick one crime novelist with an economics focus, I’d pick George V. Higgins. In one of his novels, his character Jerry Kennedy had a riff on the difference between guys who get a salary and guys who have to work for every dollar. But, really, almost all his novels are full of economics.

6 0.96606845 437 andrew gelman stats-2010-11-29-The mystery of the U-shaped relationship between happiness and age

7 0.96566749 1942 andrew gelman stats-2013-07-17-“Stop and frisk” statistics

8 0.96467972 1789 andrew gelman stats-2013-04-05-Elites have alcohol problems too!

9 0.96444535 234 andrew gelman stats-2010-08-25-Modeling constrained parameters

10 0.96279025 1916 andrew gelman stats-2013-06-27-The weirdest thing about the AJPH story

11 0.96194112 980 andrew gelman stats-2011-10-29-When people meet this guy, can they resist the temptation to ask him what he’s doing for breakfast??

12 0.961882 1648 andrew gelman stats-2013-01-02-A important new survey of Bayesian predictive methods for model assessment, selection and comparison

13 0.96065116 1137 andrew gelman stats-2012-01-24-Difficulties in publishing non-replications of implausible findings

14 0.95610285 1509 andrew gelman stats-2012-09-24-Analyzing photon counts

15 0.95226538 2309 andrew gelman stats-2014-04-28-Crowdstorming a dataset

16 0.95114374 1519 andrew gelman stats-2012-10-02-Job!

17 0.95083904 424 andrew gelman stats-2010-11-21-Data cleaning tool!

18 0.94819385 2069 andrew gelman stats-2013-10-19-R package for effect size calculations for psychology researchers

19 0.94431895 2159 andrew gelman stats-2014-01-04-“Dogs are sensitive to small variations of the Earth’s magnetic field”

20 0.94295812 1933 andrew gelman stats-2013-07-10-Please send all comments to -dev-ripley