andrew_gelman_stats andrew_gelman_stats-2011 andrew_gelman_stats-2011-583 knowledge-graph by maker-knowledge-mining

583 andrew gelman stats-2011-02-21-An interesting assignment for statistical graphics


meta infos for this blog

Source: html

Introduction: Antony Unwin writes: I [Unwin] find it an interesting exercise for students to ask them to write headlines (and subheadlines) for graphics, both for ones they have drawn themselves and for published ones. The results are sometimes depressing, often thought-provoking and occasionally highly entertaining. This seems like a great idea, both for teaching students how to read a graph and also for teaching how to make a graph. I’ve long said that when making a graph (or, for that matter, a table), you want to think about what message the reader will get out of it. “Displaying a bunch of numbers” doesn’t cut it.


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Antony Unwin writes: I [Unwin] find it an interesting exercise for students to ask them to write headlines (and subheadlines) for graphics, both for ones they have drawn themselves and for published ones. [sent-1, score-1.426]

2 The results are sometimes depressing, often thought-provoking and occasionally highly entertaining. [sent-2, score-0.59]

3 This seems like a great idea, both for teaching students how to read a graph and also for teaching how to make a graph. [sent-3, score-1.333]

4 I’ve long said that when making a graph (or, for that matter, a table), you want to think about what message the reader will get out of it. [sent-4, score-0.951]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('unwin', 0.414), ('teaching', 0.271), ('depressing', 0.26), ('headlines', 0.24), ('graph', 0.218), ('antony', 0.213), ('students', 0.209), ('displaying', 0.207), ('exercise', 0.189), ('drawn', 0.181), ('occasionally', 0.179), ('cut', 0.175), ('reader', 0.163), ('table', 0.158), ('ones', 0.153), ('message', 0.147), ('graphics', 0.145), ('highly', 0.137), ('bunch', 0.125), ('ask', 0.12), ('matter', 0.119), ('numbers', 0.109), ('sometimes', 0.103), ('making', 0.095), ('long', 0.095), ('published', 0.091), ('great', 0.09), ('said', 0.09), ('results', 0.087), ('write', 0.086), ('often', 0.084), ('read', 0.082), ('interesting', 0.081), ('doesn', 0.08), ('find', 0.076), ('idea', 0.071), ('want', 0.065), ('seems', 0.065), ('ve', 0.053), ('make', 0.051), ('writes', 0.048), ('get', 0.044), ('also', 0.042), ('like', 0.034), ('think', 0.034)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 583 andrew gelman stats-2011-02-21-An interesting assignment for statistical graphics

Introduction: Antony Unwin writes: I [Unwin] find it an interesting exercise for students to ask them to write headlines (and subheadlines) for graphics, both for ones they have drawn themselves and for published ones. The results are sometimes depressing, often thought-provoking and occasionally highly entertaining. This seems like a great idea, both for teaching students how to read a graph and also for teaching how to make a graph. I’ve long said that when making a graph (or, for that matter, a table), you want to think about what message the reader will get out of it. “Displaying a bunch of numbers” doesn’t cut it.

2 0.3848519 225 andrew gelman stats-2010-08-23-Getting into hot water over hot graphics

Introduction: I like what Antony Unwin has to say here (start on page 5).

3 0.19727066 546 andrew gelman stats-2011-01-31-Infovis vs. statistical graphics: My talk tomorrow (Tues) 1pm at Columbia

Introduction: Infovis vs. statistical graphics . Tues 1 Feb 2011 1pm, Avery Hall room 114. It’s for the Lectures in Planning Series at the School of Architecture, Planning, and Preservation. Background on the talk (joint with Antony Unwin) is here . And here are more of my thoughts on statistical graphics.

4 0.19253506 878 andrew gelman stats-2011-08-29-Infovis, infographics, and data visualization: Where I’m coming from, and where I’d like to go

Introduction: I continue to struggle to convey my thoughts on statistical graphics so I’ll try another approach, this time giving my own story. For newcomers to this discussion: the background is that Antony Unwin and I wrote an article on the different goals embodied in information visualization and statistical graphics, but I have difficulty communicating on this point with the infovis people. Maybe if I tell my own story, and then they tell their stories, this will point a way forward to a more constructive discussion. So here goes. I majored in physics in college and I worked in a couple of research labs during the summer. Physicists graph everything. I did most of my plotting on graph paper–this continued through my second year of grad school–and became expert at putting points at 1/5, 2/5, 3/5, and 4/5 between the x and y grid lines. In grad school in statistics, I continued my physics habits and graphed everything I could. I did notice, though, that the faculty and the other

5 0.15734014 816 andrew gelman stats-2011-07-22-“Information visualization” vs. “Statistical graphics”

Introduction: By now you all must be tired of my one-sided presentations of the differences between infovis and statgraphics (for example, this article with Antony Unwin). Today is something different. Courtesy of Martin Theus, editor of the Statistical Computing and Graphics Newsletter, we have two short articles offering competing perspectives: Robert Kosara writes from an Infovis view: Information visualization is a field that has had trouble defining its boundaries, and that consequently is often misunderstood. It doesn’t help that InfoVis, as it is also known, produces pretty pictures that people like to look at and link to or send around. But InfoVis is more than pretty pictures, and it is more than statistical graphics. The key to understanding InfoVis is to ignore the images for a moment and focus on the part that is often lost: interaction. When we use visualization tools, we don’t just create one image or one kind of visualization. In fact, most people would argue that there is

6 0.15335588 2266 andrew gelman stats-2014-03-25-A statistical graphics course and statistical graphics advice

7 0.14988592 855 andrew gelman stats-2011-08-16-Infovis and statgraphics update update

8 0.14813492 1604 andrew gelman stats-2012-12-04-An epithet I can live with

9 0.14655383 1517 andrew gelman stats-2012-10-01-“On Inspiring Students and Being Human”

10 0.14423098 2022 andrew gelman stats-2013-09-13-You heard it here first: Intense exercise can suppress appetite

11 0.13884437 1584 andrew gelman stats-2012-11-19-Tradeoffs in information graphics

12 0.13794984 847 andrew gelman stats-2011-08-10-Using a “pure infographic” to explore differences between information visualization and statistical graphics

13 0.13649943 1582 andrew gelman stats-2012-11-18-How to teach methods we don’t like?

14 0.1234505 2154 andrew gelman stats-2013-12-30-Bill Gates’s favorite graph of the year

15 0.12188067 1813 andrew gelman stats-2013-04-19-Grad students: Participate in an online survey on statistics education

16 0.11942372 2279 andrew gelman stats-2014-04-02-Am I too negative?

17 0.11885627 1745 andrew gelman stats-2013-03-02-Classification error

18 0.11477366 1268 andrew gelman stats-2012-04-18-Experimenting on your intro stat course, as a way of teaching experimentation in your intro stat course (and also to improve the course itself)

19 0.11331543 1848 andrew gelman stats-2013-05-09-A tale of two discussion papers

20 0.10957933 61 andrew gelman stats-2010-05-31-A data visualization manifesto


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.145), (1, -0.082), (2, -0.065), (3, 0.044), (4, 0.163), (5, -0.08), (6, -0.038), (7, 0.164), (8, -0.057), (9, -0.042), (10, 0.075), (11, 0.043), (12, -0.06), (13, -0.013), (14, 0.044), (15, -0.078), (16, -0.002), (17, 0.015), (18, -0.052), (19, 0.059), (20, -0.025), (21, -0.021), (22, 0.006), (23, 0.02), (24, 0.015), (25, -0.009), (26, 0.047), (27, 0.002), (28, -0.008), (29, -0.002), (30, -0.041), (31, 0.046), (32, -0.022), (33, 0.003), (34, -0.013), (35, -0.029), (36, -0.014), (37, 0.001), (38, 0.031), (39, -0.03), (40, -0.04), (41, -0.035), (42, 0.032), (43, -0.01), (44, 0.052), (45, -0.013), (46, 0.058), (47, 0.015), (48, -0.074), (49, 0.019)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.96796852 583 andrew gelman stats-2011-02-21-An interesting assignment for statistical graphics

Introduction: Antony Unwin writes: I [Unwin] find it an interesting exercise for students to ask them to write headlines (and subheadlines) for graphics, both for ones they have drawn themselves and for published ones. The results are sometimes depressing, often thought-provoking and occasionally highly entertaining. This seems like a great idea, both for teaching students how to read a graph and also for teaching how to make a graph. I’ve long said that when making a graph (or, for that matter, a table), you want to think about what message the reader will get out of it. “Displaying a bunch of numbers” doesn’t cut it.

2 0.71494204 1056 andrew gelman stats-2011-12-13-Drawing to Learn in Science

Introduction: Joshua Vogelstein points us to an article by Shaaron Ainsworth, Vaughan Prain, and Russell Tytler: Should science learners be challenged to draw more? Certainly making visualizations is integral to scientific thinking. Scientists do not use words only but rely on diagrams, graphs, videos, photographs, and other images to make discoveries, explain findings, and excite public interest. . . . However, in the science classroom, learners mainly focus on interpreting others’ visualizations; when drawing does occur, it is rare that learners are systematically encouraged to create their own visual forms to develop and show understanding (6). Drawing includes constructing a line graph from a table of values, sketching cells observed through a microscope, or inventing a way to show a scientific phenomenon (e.g., evaporation). Although interpretation of visualizations and other information is clearly critical to learning, becoming proficient in science also requires learners to develop many

3 0.71276355 2266 andrew gelman stats-2014-03-25-A statistical graphics course and statistical graphics advice

Introduction: Dean Eckles writes: Some of my coworkers at Facebook and I have worked with Udacity to create an online course on exploratory data analysis, including using data visualizations in R as part of EDA. The course has now launched at  https://www.udacity.com/course/ud651  so anyone can take it for free. And Kaiser Fung has  reviewed it . So definitely feel free to promote it! Criticism is also welcome (we are still fine-tuning things and adding more notes throughout). I wrote some more comments about the course  here , including highlighting the interviews with my great coworkers. I didn’t have a chance to look at the course so instead I responded with some generic comments about eda and visualization (in no particular order): - Think of a graph as a comparison. All graphs are comparison (indeed, all statistical analyses are comparisons). If you already have the graph in mind, think of what comparisons it’s enabling. Or if you haven’t settled on the graph yet, think of what

4 0.68716919 878 andrew gelman stats-2011-08-29-Infovis, infographics, and data visualization: Where I’m coming from, and where I’d like to go

Introduction: I continue to struggle to convey my thoughts on statistical graphics so I’ll try another approach, this time giving my own story. For newcomers to this discussion: the background is that Antony Unwin and I wrote an article on the different goals embodied in information visualization and statistical graphics, but I have difficulty communicating on this point with the infovis people. Maybe if I tell my own story, and then they tell their stories, this will point a way forward to a more constructive discussion. So here goes. I majored in physics in college and I worked in a couple of research labs during the summer. Physicists graph everything. I did most of my plotting on graph paper–this continued through my second year of grad school–and became expert at putting points at 1/5, 2/5, 3/5, and 4/5 between the x and y grid lines. In grad school in statistics, I continued my physics habits and graphed everything I could. I did notice, though, that the faculty and the other

5 0.66385323 1584 andrew gelman stats-2012-11-19-Tradeoffs in information graphics

Introduction: The visual display of quantitative information (to use Edward Tufte’s wonderful term) is a diverse field or set of fields, and its practitioners have different goals. The goals of software designers, applied statisticians, biologists, graphic designers, and journalists (to list just a few of the important creators of data graphics) often overlap—but not completely. One of our aims in writing our article [on Infovis and Statistical Graphics] was to emphasize the diversity of graphical goals, as it seems to us that even experts tend to consider one aspect of a graph and not others. Our main practical suggestion was that, in the internet age, we should not have to choose between attractive graphs and informational graphs: it should be possible to display both, via interactive displays. But to follow this suggestion, one must first accept that not every beautiful graph is informative, and not every informative graph is beautiful. . . . Yes, it can sometimes be possible for a graph to

6 0.66244274 1308 andrew gelman stats-2012-05-08-chartsnthings !

7 0.66072363 319 andrew gelman stats-2010-10-04-“Who owns Congress”

8 0.65193313 1604 andrew gelman stats-2012-12-04-An epithet I can live with

9 0.64641833 847 andrew gelman stats-2011-08-10-Using a “pure infographic” to explore differences between information visualization and statistical graphics

10 0.64612204 61 andrew gelman stats-2010-05-31-A data visualization manifesto

11 0.64368302 798 andrew gelman stats-2011-07-12-Sometimes a graph really is just ugly

12 0.64321989 1764 andrew gelman stats-2013-03-15-How do I make my graphs?

13 0.64174342 855 andrew gelman stats-2011-08-16-Infovis and statgraphics update update

14 0.64091611 1669 andrew gelman stats-2013-01-12-The power of the puzzlegraph

15 0.63835162 2154 andrew gelman stats-2013-12-30-Bill Gates’s favorite graph of the year

16 0.63644367 1811 andrew gelman stats-2013-04-18-Psychology experiments to understand what’s going on with data graphics?

17 0.63591599 1078 andrew gelman stats-2011-12-22-Tables as graphs: The Ramanujan principle

18 0.63369846 1894 andrew gelman stats-2013-06-12-How to best graph the Beveridge curve, relating the vacancy rate in jobs to the unemployment rate?

19 0.63367122 2146 andrew gelman stats-2013-12-24-NYT version of birthday graph

20 0.63055497 2038 andrew gelman stats-2013-09-25-Great graphs of names


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(16, 0.063), (21, 0.032), (24, 0.2), (53, 0.029), (82, 0.064), (87, 0.094), (99, 0.381)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.98035312 1868 andrew gelman stats-2013-05-23-Validation of Software for Bayesian Models Using Posterior Quantiles

Introduction: Every once in awhile I get a question that I can directly answer from my published research. When that happens it makes me so happy. Here’s an example. Patrick Lam wrote, Suppose one develops a Bayesian model to estimate a parameter theta. Now suppose one wants to evaluate the model via simulation by generating fake data where you know the value of theta and see how well you recover theta with your model, assuming that you use the posterior mean as the estimate. The traditional frequentist way of evaluating it might be to generate many datasets and see how well your estimator performs each time in terms of unbiasedness or mean squared error or something. But given that unbiasedness means nothing to a Bayesian and there is no repeated sampling interpretation in a Bayesian model, how would you suggest one would evaluate a Bayesian model? My reply: I actually have a paper on this ! It is by Cook, Gelman, and Rubin. The idea is to draw theta from the prior distribution.

2 0.97962642 294 andrew gelman stats-2010-09-23-Thinking outside the (graphical) box: Instead of arguing about how best to fix a bar chart, graph it as a time series lineplot instead

Introduction: John Kastellec points me to this blog by Ezra Klein criticizing the following graph from a recent Republican Party report: Klein (following Alexander Hart ) slams the graph for not going all the way to zero on the y-axis, thus making the projected change seem bigger than it really is. I agree with Klein and Hart that, if you’re gonna do a bar chart, you want the bars to go down to 0. On the other hand, a projected change from 19% to 23% is actually pretty big, and I don’t see the point of using a graphical display that hides it. The solution: Ditch the bar graph entirely and replace it by a lineplot , in particular, a time series with year-by-year data. The time series would have several advantages: 1. Data are placed in context. You’d see every year, instead of discrete averages, and you’d get to see the changes in the context of year-to-year variation. 2. With the time series, you can use whatever y-axis works with the data. No need to go to zero. P.S. I l

same-blog 3 0.97942871 583 andrew gelman stats-2011-02-21-An interesting assignment for statistical graphics

Introduction: Antony Unwin writes: I [Unwin] find it an interesting exercise for students to ask them to write headlines (and subheadlines) for graphics, both for ones they have drawn themselves and for published ones. The results are sometimes depressing, often thought-provoking and occasionally highly entertaining. This seems like a great idea, both for teaching students how to read a graph and also for teaching how to make a graph. I’ve long said that when making a graph (or, for that matter, a table), you want to think about what message the reader will get out of it. “Displaying a bunch of numbers” doesn’t cut it.

4 0.97262764 355 andrew gelman stats-2010-10-20-Andy vs. the Ideal Point Model of Voting

Introduction: Last week, as I walked into Andrew’s office for a meeting, he was formulating some misgivings about applying an ideal-point model to budgetary bills in the U.S. Senate. Andrew didn’t like that the model of a senator’s position was an indifference point rather than at their optimal point, and that the effect of moving away from a position was automatically modeled as increasing in one direction and decreasing in the other. Executive Summary The monotonicity of inverse logit entails that the expected vote for a bill among any fixed collection of senators’ ideal points is monotonically increasing (or decreasing) with the bill’s position, with direction determined by the outcome coding. The Ideal-Point Model The ideal-point model’s easy to write down, but hard to reason about because of all the polarity shifting going on. To recapitulate from Gelman and Hill’s Regression book (p. 317), using the U.S. Senate instead of the Supreme Court, and ignoring the dis

5 0.97031105 918 andrew gelman stats-2011-09-21-Avoiding boundary estimates in linear mixed models

Introduction: Pablo Verde sends in this letter he and Daniel Curcio just published in the Journal of Antimicrobial Chemotherapy. They had published a meta-analysis with a boundary estimate which, he said, gave nonsense results. Here’s Curcio and Verde’s key paragraph: The authors [of the study they are criticizing] performed a test of heterogeneity between studies. Given that the test result was not significant at 5%, they decided to pool all the RRs by using a fixed-effect meta-analysis model. Unfortunately, this is a common practice in meta-analysis, which usually leads to very misleading results. First of all, the pooled RR as well as its standard error are sensitive to 2 the estimation of the between-studies standard deviation (SD). SD is difficult to estimate with a small number of studies. On the other hand, it is very well known that the significant test of hetero- geneity lacks statistical power to detect values of SD greater than zero. In addition, the statistically non-significant re

6 0.96976542 1963 andrew gelman stats-2013-07-31-Response by Jessica Tracy and Alec Beall to my critique of the methods in their paper, “Women Are More Likely to Wear Red or Pink at Peak Fertility”

7 0.96944845 1725 andrew gelman stats-2013-02-17-“1.7%” ha ha ha

8 0.96830082 783 andrew gelman stats-2011-06-30-Don’t stop being a statistician once the analysis is done

9 0.96747375 342 andrew gelman stats-2010-10-14-Trying to be precise about vagueness

10 0.96738267 1263 andrew gelman stats-2012-04-13-Question of the week: Will the authors of a controversial new study apologize to busy statistician Don Berry for wasting his time reading and responding to their flawed article?

11 0.96653634 1788 andrew gelman stats-2013-04-04-When is there “hidden structure in data” to be discovered?

12 0.96449721 1661 andrew gelman stats-2013-01-08-Software is as software does

13 0.96320951 2355 andrew gelman stats-2014-05-31-Jessica Tracy and Alec Beall (authors of the fertile-women-wear-pink study) comment on our Garden of Forking Paths paper, and I comment on their comments

14 0.96292269 247 andrew gelman stats-2010-09-01-How does Bayes do it?

15 0.96261489 326 andrew gelman stats-2010-10-07-Peer pressure, selection, and educational reform

16 0.96229053 1553 andrew gelman stats-2012-10-30-Real rothko, fake rothko

17 0.96176529 2351 andrew gelman stats-2014-05-28-Bayesian nonparametric weighted sampling inference

18 0.96152323 855 andrew gelman stats-2011-08-16-Infovis and statgraphics update update

19 0.96137357 2042 andrew gelman stats-2013-09-28-Difficulties of using statistical significance (or lack thereof) to sift through and compare research hypotheses

20 0.96137178 2266 andrew gelman stats-2014-03-25-A statistical graphics course and statistical graphics advice