
1450 andrew gelman stats-2012-08-08-My upcoming talk for the data visualization meetup


meta info for this blog

Source: html

Introduction: Somebody asked me to speak sometime at a data visualization meetup. I think I spoke there a year or two ago but I could do it again. Last time I spoke on Infovis vs Statistical Graphics; this time I could just go thru the choices involved in a few zillion graphs I’ve published over the years, to give a sense of the options and choices involved in graphical communication. For this talk there would be no single theme (except, perhaps, my usual “Graphs as comparisons,” “All of statistics as comparisons,” and “Exploratory data analysis as hypothesis testing”), just a bunch of open discussion about what I tried, why I tried it, what worked and what didn’t work, etc. I’ve discussed these sorts of decisions on occasion (and am now writing a paper with Yair about some of this for our voting models), but I’ve never tried to make a talk out of it before. Could be fun.


Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 Somebody asked me to speak sometime at a data visualization meetup. [sent-1, score-0.579]

2 I think I spoke there a year or two ago but I could do it again. [sent-2, score-0.562]

3 Last time I spoke on Infovis vs Statistical Graphics; this time I could just go thru the choices involved in a few zillion graphs I’ve published over the years, to give a sense of the options and choices involved in graphical communication. [sent-3, score-2.627]

4 For this talk there would be no single theme (except, perhaps, my usual “Graphs as comparisons,” “All of statistics as comparisons,” and “Exploratory data analysis as hypothesis testing”), just a bunch of open discussion about what I tried, why I tried it, what worked and what didn’t work, etc. [sent-4, score-1.268]

5 I’ve discussed these sorts of decisions on occasion (and am now writing a paper with Yair about some of this for our voting models), but I’ve never tried to make a talk out of it before. [sent-5, score-1.188]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('spoke', 0.312), ('tried', 0.312), ('choices', 0.25), ('involved', 0.211), ('comparisons', 0.205), ('thru', 0.2), ('graphs', 0.179), ('zillion', 0.173), ('sometime', 0.171), ('vs', 0.171), ('yair', 0.162), ('talk', 0.161), ('occasion', 0.16), ('infovis', 0.16), ('theme', 0.155), ('options', 0.144), ('exploratory', 0.139), ('graphical', 0.128), ('visualization', 0.127), ('speak', 0.125), ('ve', 0.123), ('somebody', 0.119), ('voting', 0.116), ('decisions', 0.115), ('graphics', 0.111), ('testing', 0.11), ('except', 0.108), ('could', 0.106), ('fun', 0.105), ('worked', 0.099), ('hypothesis', 0.098), ('usual', 0.097), ('bunch', 0.096), ('open', 0.095), ('sorts', 0.095), ('asked', 0.09), ('discussed', 0.09), ('single', 0.089), ('time', 0.079), ('year', 0.075), ('writing', 0.073), ('published', 0.07), ('last', 0.069), ('ago', 0.069), ('never', 0.066), ('data', 0.066), ('didn', 0.065), ('give', 0.064), ('perhaps', 0.061), ('models', 0.061)]
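
The page does not say how these word weights were computed. As a minimal sketch, assuming a standard tf-idf with smoothed idf and L2 normalization (an assumption, not the site's confirmed method), a top-N list of (word, weight) pairs in the same shape could be produced as follows. The function name `tfidf_top_words` and the whitespace tokenizer are hypothetical choices for illustration.

```python
import math
from collections import Counter


def tfidf_top_words(docs, doc_index, top_n=5):
    """Return the top_n (word, weight) pairs for docs[doc_index].

    Assumptions (not taken from the page): lowercase whitespace
    tokenization, smoothed idf, and L2 normalization of the vector.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each word appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Raw term counts for the target document.
    tf = Counter(tokenized[doc_index])

    # Smoothed idf keeps every weight strictly positive.
    weights = {
        word: count * (math.log((1 + n_docs) / (1 + df[word])) + 1.0)
        for word, count in tf.items()
    }

    # L2-normalize so the weights are comparable across documents.
    norm = math.sqrt(sum(v * v for v in weights.values()))
    ranked = sorted(weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(word, round(value / norm, 3)) for word, value in ranked[:top_n]]


# Toy corpus, invented for illustration only.
toy_docs = ["spoke tried spoke graphs", "graphs data", "data talk"]
top_words = tfidf_top_words(toy_docs, doc_index=0, top_n=2)
```

Words that are frequent in this post but rare elsewhere rank highest, which is consistent with terms such as 'spoke' and 'tried' leading the list above.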

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1450 andrew gelman stats-2012-08-08-My upcoming talk for the data visualization meetup


2 0.84297049 1668 andrew gelman stats-2013-01-11-My talk at the NY data visualization meetup this Monday!

Introduction: It’s in midtown at 7pm (on Mon 14 Jan 2013). Last time I talked for this group, I spoke on Infovis vs. Statistical Graphics. This time I plan to just go thru the choices involved in a few zillion graphs I’ve published over the years, to give a sense of the options and choices involved in graphical communication. For this talk there will be no single theme (except, perhaps, my usual “Graphs as comparisons,” “All of statistics as comparisons,” and “Exploratory data analysis as hypothesis testing”), just a bunch of open discussion about what I tried, why I tried it, what worked and what didn’t work, etc. I’ve discussed these sorts of decisions on occasion (and am now writing a paper with Yair about some of this for our voting models), but I’ve never tried to make a talk out of it before. Could be fun.

3 0.20883358 855 andrew gelman stats-2011-08-16-Infovis and statgraphics update update

Introduction: To continue our discussion from last week, consider three positions regarding the display of information: (a) The traditional tabular approach. This is how most statisticians, econometricians, political scientists, sociologists, etc., seem to operate. They understand the appeal of a pretty graph, and they’re willing to plot some data as part of an exploratory data analysis, but they see their serious research as leading to numerical estimates, p-values, tables of numbers. These people might use a graph to illustrate their points but they don’t see them as necessary in their research. (b) Statistical graphics as performed by Howard Wainer, Bill Cleveland, Dianne Cook, etc. They–we–see graphics as central to the process of statistical modeling and data analysis and are interested in graphs (static and dynamic) that display every data point as transparently as possible. (c) Information visualization or infographics, as performed by graphics designers and statisticians who are

4 0.19951101 878 andrew gelman stats-2011-08-29-Infovis, infographics, and data visualization: Where I’m coming from, and where I’d like to go

Introduction: I continue to struggle to convey my thoughts on statistical graphics so I’ll try another approach, this time giving my own story. For newcomers to this discussion: the background is that Antony Unwin and I wrote an article on the different goals embodied in information visualization and statistical graphics, but I have difficulty communicating on this point with the infovis people. Maybe if I tell my own story, and then they tell their stories, this will point a way forward to a more constructive discussion. So here goes. I majored in physics in college and I worked in a couple of research labs during the summer. Physicists graph everything. I did most of my plotting on graph paper–this continued through my second year of grad school–and became expert at putting points at 1/5, 2/5, 3/5, and 4/5 between the x and y grid lines. In grad school in statistics, I continued my physics habits and graphed everything I could. I did notice, though, that the faculty and the other

5 0.18540956 2266 andrew gelman stats-2014-03-25-A statistical graphics course and statistical graphics advice

Introduction: Dean Eckles writes: Some of my coworkers at Facebook and I have worked with Udacity to create an online course on exploratory data analysis, including using data visualizations in R as part of EDA. The course has now launched at https://www.udacity.com/course/ud651 so anyone can take it for free. And Kaiser Fung has reviewed it. So definitely feel free to promote it! Criticism is also welcome (we are still fine-tuning things and adding more notes throughout). I wrote some more comments about the course here, including highlighting the interviews with my great coworkers. I didn’t have a chance to look at the course so instead I responded with some generic comments about eda and visualization (in no particular order): - Think of a graph as a comparison. All graphs are comparisons (indeed, all statistical analyses are comparisons). If you already have the graph in mind, think of what comparisons it’s enabling. Or if you haven’t settled on the graph yet, think of what

6 0.1754251 548 andrew gelman stats-2011-02-01-What goes around . . .

7 0.16653585 2279 andrew gelman stats-2014-04-02-Am I too negative?

8 0.15750523 816 andrew gelman stats-2011-07-22-“Information visualization” vs. “Statistical graphics”

9 0.14962384 847 andrew gelman stats-2011-08-10-Using a “pure infographic” to explore differences between information visualization and statistical graphics

10 0.14625357 1594 andrew gelman stats-2012-11-28-My talk on statistical graphics at Mit this Thurs aft

11 0.14566752 2275 andrew gelman stats-2014-03-31-Just gave a talk

12 0.14564888 1764 andrew gelman stats-2013-03-15-How do I make my graphs?

13 0.14063305 1811 andrew gelman stats-2013-04-18-Psychology experiments to understand what’s going on with data graphics?

14 0.13925925 1848 andrew gelman stats-2013-05-09-A tale of two discussion papers

15 0.13775182 319 andrew gelman stats-2010-10-04-“Who owns Congress”

16 0.13334325 1584 andrew gelman stats-2012-11-19-Tradeoffs in information graphics

17 0.12677915 1143 andrew gelman stats-2012-01-29-G+ > Skype

18 0.12632374 1039 andrew gelman stats-2011-12-02-I just flew in from the econ seminar, and boy are my arms tired

19 0.12177591 290 andrew gelman stats-2010-09-22-Data Thief

20 0.11997341 492 andrew gelman stats-2010-12-30-That puzzle-solving feeling


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.182), (1, -0.035), (2, -0.089), (3, 0.064), (4, 0.115), (5, -0.141), (6, -0.133), (7, 0.058), (8, -0.029), (9, -0.049), (10, 0.036), (11, 0.059), (12, 0.012), (13, -0.026), (14, 0.034), (15, -0.117), (16, -0.063), (17, -0.119), (18, 0.059), (19, 0.084), (20, -0.052), (21, -0.069), (22, 0.087), (23, 0.018), (24, -0.142), (25, -0.046), (26, -0.11), (27, -0.117), (28, 0.06), (29, -0.055), (30, -0.049), (31, 0.019), (32, 0.075), (33, 0.02), (34, -0.012), (35, -0.043), (36, 0.122), (37, 0.092), (38, 0.034), (39, 0.057), (40, -0.06), (41, 0.069), (42, -0.046), (43, -0.061), (44, -0.036), (45, -0.165), (46, -0.015), (47, -0.006), (48, 0.012), (49, -0.065)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.98056918 1668 andrew gelman stats-2013-01-11-My talk at the NY data visualization meetup this Monday!


same-blog 2 0.97606939 1450 andrew gelman stats-2012-08-08-My upcoming talk for the data visualization meetup


3 0.70003086 546 andrew gelman stats-2011-01-31-Infovis vs. statistical graphics: My talk tomorrow (Tues) 1pm at Columbia

Introduction: Infovis vs. statistical graphics. Tues 1 Feb 2011 1pm, Avery Hall room 114. It’s for the Lectures in Planning Series at the School of Architecture, Planning, and Preservation. Background on the talk (joint with Antony Unwin) is here. And here are more of my thoughts on statistical graphics.

4 0.68152142 407 andrew gelman stats-2010-11-11-Data Visualization vs. Statistical Graphics

Introduction: I have this great talk on the above topic but nowhere to give it. Here’s the story. Several months ago, I was invited to speak at IEEE VisWeek. It sounded like a great opportunity. The organizer told me that there were typically about 700 people in the audience, and these are people in the visualization community whom I’d like to reach but normally wouldn’t have the opportunity to encounter. It sounded great, but I didn’t want to fly most of the way across the country by myself, so I offered to give the talk by videolink. I was surprised to get a No response: I’d think that a visualization conference, of all things, would welcome a video talk. In the meantime, though, I’d thought a lot about what I’d talk about and had started preparing something. Once I found out I wouldn’t be giving the talk, I channeled the efforts into an article which, with the collaboration of Antony Unwin, was completed about a month ago. It would take very little effort to adapt this graph-laden a

5 0.67094624 2275 andrew gelman stats-2014-03-31-Just gave a talk

Introduction: I just gave a talk in Milan. Actually I was sitting at my desk, it was a g+ hangout which was a bit more convenient for me. The audience was a bunch of astronomers so I figured they could handle a satellite link. . . . Anyway, the talk didn’t go so well. Two reasons: first, it’s just hard to get the connection with the audience without being able to see their faces. Next time I think I’ll try to get several people in the audience to open up their laptops and connect to the hangout, so that I can see a mosaic of faces instead of just a single image from the front of the room. The second problem with the talk was the topic. I asked the people who invited me to choose a topic, and they picked Can we use Bayesian methods to resolve the current crisis of statistically-significant research findings that don’t hold up? But I don’t think this was right for this audience. I think that it would’ve been better to give them the Stan talk or the little data talk or the statistic

6 0.67050147 548 andrew gelman stats-2011-02-01-What goes around . . .

7 0.66189539 1598 andrew gelman stats-2012-11-30-A graphics talk with no visuals!

8 0.65188718 438 andrew gelman stats-2010-11-30-I just skyped in from Kentucky, and boy are my arms tired

9 0.64912999 794 andrew gelman stats-2011-07-09-The quest for the holy graph

10 0.64024568 1824 andrew gelman stats-2013-04-25-Fascinating graphs from facebook data

11 0.63641745 855 andrew gelman stats-2011-08-16-Infovis and statgraphics update update

12 0.62630904 492 andrew gelman stats-2010-12-30-That puzzle-solving feeling

13 0.61926359 847 andrew gelman stats-2011-08-10-Using a “pure infographic” to explore differences between information visualization and statistical graphics

14 0.61641216 319 andrew gelman stats-2010-10-04-“Who owns Congress”

15 0.60428399 1584 andrew gelman stats-2012-11-19-Tradeoffs in information graphics

16 0.60165727 1594 andrew gelman stats-2012-11-28-My talk on statistical graphics at Mit this Thurs aft

17 0.59531069 1066 andrew gelman stats-2011-12-17-Ripley on model selection, and some links on exploratory model analysis

18 0.5895564 1806 andrew gelman stats-2013-04-16-My talk in Chicago this Thurs 6:30pm

19 0.58812028 2319 andrew gelman stats-2014-05-05-Can we make better graphs of global temperature history?

20 0.58722484 1275 andrew gelman stats-2012-04-22-Please stop me before I barf again


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(0, 0.039), (16, 0.102), (24, 0.171), (47, 0.141), (48, 0.023), (51, 0.038), (79, 0.023), (85, 0.021), (99, 0.327)]
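
The page does not state which measure produces the simValue column below. Cosine similarity between topic-weight vectors is a common choice, and the sketch here assumes it. The function name `cosine_sparse` is hypothetical. The input format mirrors the sparse (topicId, topicWeight) pairs shown above, with missing topics treated as weight zero.

```python
import math


def cosine_sparse(a, b):
    """Cosine similarity of two sparse vectors given as (id, weight) pairs.

    Topics absent from a list are treated as having weight 0.
    Returns 0.0 if either vector is empty or all-zero.
    """
    weights_a, weights_b = dict(a), dict(b)

    # Only topics present in both vectors contribute to the dot product.
    dot = sum(w * weights_b[t] for t, w in weights_a.items() if t in weights_b)

    norm_a = math.sqrt(sum(w * w for w in weights_a.values()))
    norm_b = math.sqrt(sum(w * w for w in weights_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# The lda topic weights listed above for this post.
lda_this_post = [(0, 0.039), (16, 0.102), (24, 0.171), (47, 0.141), (48, 0.023),
                 (51, 0.038), (79, 0.023), (85, 0.021), (99, 0.327)]

# A post compared with itself should score (numerically close to) 1.0.
self_similarity = cosine_sparse(lda_this_post, lda_this_post)
```

Under this assumption, two posts score high whenever their weight is concentrated on the same few topics, regardless of subject-matter overlap in the text itself. That may explain why the lda list below ranks several unrelated posts almost as high as the near-duplicate announcement.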

similar blogs list:

simIndex simValue blogId blogTitle

1 0.9663434 1668 andrew gelman stats-2013-01-11-My talk at the NY data visualization meetup this Monday!


2 0.96509337 1285 andrew gelman stats-2012-04-27-“How to Lie with Statistics” guy worked for the tobacco industry to mock studies of the risks of smoking statistics

Introduction: Remember How to Lie With Statistics? It turns out that the author worked for the cigarette companies. John Mashey points to this, from Robert Proctor’s book, “Golden Holocaust: Origins of the Cigarette Catastrophe and the Case for Abolition”: Darrell Huff, author of the wildly popular (and aptly named) How to Lie With Statistics, was paid to testify before Congress in the 1950s and then again in the 1960s, with the assigned task of ridiculing any notion of a cigarette-disease link. On March 22, 1965, Huff testified at hearings on cigarette labeling and advertising, accusing the recent Surgeon General’s report of myriad failures and “fallacies.” Huff peppered his attack with amusing asides and anecdotes, lampooning spurious correlations like that between the size of Dutch families and the number of storks nesting on rooftops–which proves not that storks bring babies but rather that people with large families tend to have larger houses (which therefore attract more storks).

3 0.96271336 95 andrew gelman stats-2010-06-17-“Rewarding Strivers: Helping Low-Income Students Succeed in College”

Introduction: Several years ago, I heard about a project at the Educational Testing Service to identify “strivers”: students from disadvantaged backgrounds who did unexpectedly well on the SAT (the college admissions exam formerly known as the “Scholastic Aptitude Test” but apparently now just “the SAT,” in the same way that Exxon is just “Exxon” and that Harry Truman’s middle name is just “S”), at least 200 points above a predicted score based on demographic and neighborhood information. My ETS colleague and I agreed that this was a silly idea: From a statistical point of view, if student A is expected ahead of time to do better than student B, and then they get identical test scores, then you’d expect student A (the non-”striver”) to do better than student B (the “striver”) later on. Just basic statistics: if a student does much better than expected, then probably some of that improvement is noise. The idea of identifying these “strivers” seemed misguided and not the best use of the SAT.

4 0.95980883 1261 andrew gelman stats-2012-04-12-The Naval Research Lab

Introduction: I worked at the U.S. Naval Research Laboratory for four summers during high school and college. I spent much of my time writing a computer program to do thermal analysis for an experiment that we put on the space shuttle. The facility I developed with the finite-element method came in handy in my job at Bell Labs the following summers. I was working for C. H. Tsao and Jim Adams in the Laboratory for Cosmic Ray Physics. We were estimating the distribution of isotopes in cosmic rays using a pile of track detectors. To get accurate measurements, you want these plastic disks to be as close as possible to a constant temperature, so we designed an elaborate wrapping of thermal blankets. My program computed the temperature of the detectors during the year that the Long Duration Exposure Facility (including our experiment and a bunch of others) was scheduled to be in orbit. The input is the heat from solar radiation (easy enough to compute given the trajectory). On the computer I tr

5 0.9595412 1897 andrew gelman stats-2013-06-13-When’s that next gamma-ray blast gonna come, already?

Introduction: Phil Plait writes: Earth May Have Been Hit by a Cosmic Blast 1200 Years Ago . . . this is nothing to panic about. If it happened at all, it was a long time ago, and unlikely to happen again for hundreds of thousands of years. This left me confused. If it really did happen 1200 years ago, basic statistics would suggest it would occur approximately once every 1200 years or so (within half an order of magnitude). So where does “hundreds of thousands of years” come from? I emailed astronomer David Hogg to see if I was missing something here, and he replied: Yeah, if we think this hit us 1200 years ago, we should imagine that this happens every few thousand years at least. Now that said, if there are *other* reasons for thinking it is exceedingly rare, then that would be a strong a priori argument against believing in the result. So you should either believe that it didn’t happen 1200 years ago, or else you should believe it will happen again in the next few thousan

6 0.95758843 1055 andrew gelman stats-2011-12-13-Data sharing update

same-blog 7 0.9560498 1450 andrew gelman stats-2012-08-08-My upcoming talk for the data visualization meetup

8 0.95497912 275 andrew gelman stats-2010-09-14-Data visualization at the American Evaluation Association

9 0.95435339 716 andrew gelman stats-2011-05-17-Is the internet causing half the rapes in Norway? I wanna see the scatterplot.

10 0.95099729 1218 andrew gelman stats-2012-03-18-Check your missing-data imputations using cross-validation

11 0.94970101 2275 andrew gelman stats-2014-03-31-Just gave a talk

12 0.94787538 548 andrew gelman stats-2011-02-01-What goes around . . .

13 0.94768465 1730 andrew gelman stats-2013-02-20-Unz on Unz

14 0.94709337 1050 andrew gelman stats-2011-12-10-Presenting at the econ seminar

15 0.94694352 1143 andrew gelman stats-2012-01-29-G+ > Skype

16 0.94396245 2183 andrew gelman stats-2014-01-23-Discussion on preregistration of research studies

17 0.94086081 2270 andrew gelman stats-2014-03-28-Creating a Lenin-style democracy

18 0.93921489 1486 andrew gelman stats-2012-09-07-Prior distributions for regression coefficients

19 0.93574333 79 andrew gelman stats-2010-06-10-What happens when the Democrats are “fighting Wall Street with one hand, unions with the other,” while the Republicans are fighting unions with two hands?

20 0.9337998 438 andrew gelman stats-2010-11-30-I just skyped in from Kentucky, and boy are my arms tired