andrew_gelman_stats andrew_gelman_stats-2011 andrew_gelman_stats-2011-800 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: These particular lineplots are called parallel coordinate plots.
sentIndex sentText sentNum sentScore
1 These particular lineplots are called parallel coordinate plots. [sent-1, score-1.928]
wordName wordTfidf (topN-words)
[('coordinate', 0.552), ('lineplots', 0.552), ('parallel', 0.42), ('plots', 0.362), ('called', 0.233), ('particular', 0.171)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 800 andrew gelman stats-2011-07-13-I like lineplots
Introduction: These particular lineplots are called parallel coordinate plots.
2 0.23682709 319 andrew gelman stats-2010-10-04-“Who owns Congress”
Introduction: Curt Yeske pointed me to this . Wow–these graphs are really hard to read! The old me would’ve said that each of these graphs would be better replaced by a dotplot (or, better still, a series of lineplots showing time trends). The new me would still like the dotplots and lineplots, but I’d say it’s fine to have the eye-grabbing but hard-to-read graphs as is, and then to have the more informative statistical graphics underneath, as it were. The idea is, you’d click on the pretty but hard-to-read “infovis” graphs, and this would then reveal informative “full Cleveland” graphs. And then if you click again you’d get a spreadsheet with the raw numbers. That I’d like to see, as a new model for graphical presentation.
3 0.16066885 252 andrew gelman stats-2010-09-02-R needs a good function to make line plots
Introduction: More and more I’m thinking that line plots are great. More specifically, two-way grids of line plots on common scales, with one, two, or three lines per plot (enough to show comparisons but not so many that you can’t tell the lines apart). Also dot plots, of the sort that have been masterfully used by Lax and Phillips to show comparisons and trends in support for gay rights. There’s a big step missing, though, and that is to be able to make these graphs as a default. We have to figure out the right way to structure the data so these graphs come naturally. Then when it’s all working, we can talk the Excel people into implementing our ideas. I’m not asking to be paid here; all our ideas are in the public domain and I’m happy for Microsoft or Google or whoever to copy us. P.S. Drew Conway writes: This could be accomplished with ggplot2 using various combinations of the grammar. If I am understanding what you mean by line plots, here are some examples with code . In fact,
4 0.12140629 61 andrew gelman stats-2010-05-31-A data visualization manifesto
Introduction: Details matter (at least, they do for me), but we don’t yet have a systematic way of going back and forth between the structure of a graph, its details, and the underlying questions that motivate our visualizations. (Cleveland, Wilkinson, and others have written a bit on how to formalize these connections, and I’ve thought about it too, but we have a ways to go.) I was thinking about this difficulty after reading an article on graphics by some computer scientists that was well-written but to me lacked a feeling for the linkages between substantive/statistical goals and graphical details. I have problems with these issues too, and my point here is not to criticize but to move the discussion forward. When thinking about visualization, how important are the details? Aleks pointed me to this article by Jeffrey Heer, Michael Bostock, and Vadim Ogievetsky, “A Tour through the Visualization Zoo: A survey of powerful visualization techniques, from the obvious to the obscure.” Th
5 0.10681077 1531 andrew gelman stats-2012-10-12-Elderpedia
Introduction: It’s good to remember that wikis aren’t just for looking up Dylan lyrics and the plots of old Three’s Company episodes.
6 0.10544588 816 andrew gelman stats-2011-07-22-“Information visualization” vs. “Statistical graphics”
7 0.10253579 2285 andrew gelman stats-2014-04-07-On deck this week
8 0.088275298 743 andrew gelman stats-2011-06-03-An argument that can’t possibly make sense
9 0.079846591 778 andrew gelman stats-2011-06-24-New ideas on DIC from Martyn Plummer and Sumio Watanabe
10 0.075799137 1896 andrew gelman stats-2013-06-13-Against the myth of the heroic visualization
11 0.073421799 1478 andrew gelman stats-2012-08-31-Watercolor regression
12 0.067736216 324 andrew gelman stats-2010-10-07-Contest for developing an R package recommendation system
13 0.067334332 1800 andrew gelman stats-2013-04-12-Too tired to mock
14 0.065020867 1509 andrew gelman stats-2012-09-24-Analyzing photon counts
15 0.06056134 965 andrew gelman stats-2011-10-19-Web-friendly visualizations in R
16 0.06033209 1497 andrew gelman stats-2012-09-15-Our blog makes connections!
17 0.059475973 363 andrew gelman stats-2010-10-22-Graphing Likert scale responses
18 0.058813605 1011 andrew gelman stats-2011-11-15-World record running times vs. distance
19 0.058804303 878 andrew gelman stats-2011-08-29-Infovis, infographics, and data visualization: Where I’m coming from, and where I’d like to go
20 0.05232086 2245 andrew gelman stats-2014-03-12-More on publishing in journals
topicId topicWeight
[(0, 0.021), (1, 0.001), (2, -0.013), (3, 0.019), (4, 0.03), (5, -0.038), (6, -0.031), (7, 0.011), (8, -0.013), (9, -0.0), (10, -0.004), (11, 0.006), (12, -0.009), (13, 0.012), (14, -0.008), (15, -0.011), (16, 0.002), (17, -0.018), (18, -0.003), (19, -0.002), (20, 0.006), (21, 0.004), (22, 0.003), (23, 0.002), (24, 0.007), (25, -0.009), (26, 0.011), (27, 0.02), (28, 0.011), (29, 0.02), (30, -0.006), (31, -0.006), (32, 0.018), (33, 0.013), (34, 0.017), (35, -0.01), (36, 0.01), (37, 0.028), (38, 0.012), (39, 0.001), (40, 0.006), (41, -0.003), (42, -0.025), (43, 0.027), (44, -0.008), (45, 0.003), (46, -0.028), (47, 0.0), (48, 0.024), (49, -0.008)]
simIndex simValue blogId blogTitle
same-blog 1 0.9753837 800 andrew gelman stats-2011-07-13-I like lineplots
Introduction: These particular lineplots are called parallel coordinate plots.
2 0.65951133 252 andrew gelman stats-2010-09-02-R needs a good function to make line plots
Introduction: More and more I’m thinking that line plots are great. More specifically, two-way grids of line plots on common scales, with one, two, or three lines per plot (enough to show comparisons but not so many that you can’t tell the lines apart). Also dot plots, of the sort that have been masterfully used by Lax and Phillips to show comparisons and trends in support for gay rights. There’s a big step missing, though, and that is to be able to make these graphs as a default. We have to figure out the right way to structure the data so these graphs come naturally. Then when it’s all working, we can talk the Excel people into implementing our ideas. I’m not asking to be paid here; all our ideas are in the public domain and I’m happy for Microsoft or Google or whoever to copy us. P.S. Drew Conway writes: This could be accomplished with ggplot2 using various combinations of the grammar. If I am understanding what you mean by line plots, here are some examples with code . In fact,
3 0.63197017 319 andrew gelman stats-2010-10-04-“Who owns Congress”
Introduction: Curt Yeske pointed me to this . Wow–these graphs are really hard to read! The old me would’ve said that each of these graphs would be better replaced by a dotplot (or, better still, a series of lineplots showing time trends). The new me would still like the dotplots and lineplots, but I’d say it’s fine to have the eye-grabbing but hard-to-read graphs as is, and then to have the more informative statistical graphics underneath, as it were. The idea is, you’d click on the pretty but hard-to-read “infovis” graphs, and this would then reveal informative “full Cleveland” graphs. And then if you click again you’d get a spreadsheet with the raw numbers. That I’d like to see, as a new model for graphical presentation.
4 0.62332648 372 andrew gelman stats-2010-10-27-A use for tables (really)
Introduction: After our recent discussion of semigraphic displays, Jay Ulfelder sent along a semigraphic table from his recent book. He notes, “When countries are the units of analysis, it’s nice that you can use three-letter codes, so all the proper names have the same visual weight.” Ultimately I think that graphs win over tables for display. However in our work we spend a lot of time looking at raw data, often simply to understand what data we have. This use of tables has, I think, been forgotten in the statistical graphics literature. So I’d like to refocus the eternal tables vs. graphs discussion. If the goal is to present information, comparisons, relationships, models, data, etc etc, graphs win. Forget about tables. But . . . when you’re looking at your data, it can often help to see the raw numbers. Once you’re looking at numbers, it makes sense to organize them. Even a displayed matrix in R is a form of table, after all. And once you’re making a table, it can be sensible to
5 0.62028176 1584 andrew gelman stats-2012-11-19-Tradeoffs in information graphics
Introduction: The visual display of quantitative information (to use Edward Tufte’s wonderful term) is a diverse field or set of fields, and its practitioners have different goals. The goals of software designers, applied statisticians, biologists, graphic designers, and journalists (to list just a few of the important creators of data graphics) often overlap—but not completely. One of our aims in writing our article [on Infovis and Statistical Graphics] was to emphasize the diversity of graphical goals, as it seems to us that even experts tend to consider one aspect of a graph and not others. Our main practical suggestion was that, in the internet age, we should not have to choose between attractive graphs and informational graphs: it should be possible to display both, via interactive displays. But to follow this suggestion, one must first accept that not every beautiful graph is informative, and not every informative graph is beautiful. . . . Yes, it can sometimes be possible for a graph to
6 0.6161902 1775 andrew gelman stats-2013-03-23-In which I disagree with John Maynard Keynes
7 0.58282053 1604 andrew gelman stats-2012-12-04-An epithet I can live with
8 0.57983458 855 andrew gelman stats-2011-08-16-Infovis and statgraphics update update
9 0.55502993 324 andrew gelman stats-2010-10-07-Contest for developing an R package recommendation system
10 0.55271667 736 andrew gelman stats-2011-05-29-Response to “Why Tables Are Really Much Better Than Graphs”
12 0.54939765 1275 andrew gelman stats-2012-04-22-Please stop me before I barf again
14 0.52793527 1896 andrew gelman stats-2013-06-13-Against the myth of the heroic visualization
15 0.5269258 672 andrew gelman stats-2011-04-20-The R code for those time-use graphs
16 0.52512807 37 andrew gelman stats-2010-05-17-Is chartjunk really “more useful” than plain graphs? I don’t think so.
17 0.52085048 1594 andrew gelman stats-2012-11-28-My talk on statistical graphics at Mit this Thurs aft
18 0.52068388 1606 andrew gelman stats-2012-12-05-The Grinch Comes Back
19 0.51799577 126 andrew gelman stats-2010-07-03-Graphical presentation of risk ratios
20 0.51730675 2038 andrew gelman stats-2013-09-25-Great graphs of names
topicId topicWeight
[(13, 0.339), (24, 0.256), (99, 0.11)]
simIndex simValue blogId blogTitle
same-blog 1 0.93555367 800 andrew gelman stats-2011-07-13-I like lineplots
Introduction: These particular lineplots are called parallel coordinate plots.
2 0.90061778 1514 andrew gelman stats-2012-09-28-AdviseStat 47% Campaign Ad
Introduction: Lee Wilkinson sends me this amusing ad for his new software, AdviseStat: The ad is a parody, but the software is real !
3 0.86109906 19 andrew gelman stats-2010-05-06-OK, so this is how I ended up working with three different guys named Matt
Introduction: Really we need the data on babies born 30 years ago, but this is still pretty stunning: Argentina: Matías, #3; Mateo, #13 Australia/New South Wales: Matthew, #21 Australia/Victoria: Matthew, #21 Austria: Matthias, #19 Belgium: Mathis, #9; Matteo, #22; Mathias, #23; Mathéo, #35; Mats, #89; Mathieu, #90; Matthias, #97 Brazil: Matheus, #4 Canada/Alberta: Matthew, #8 Canada/British Columbia: Matthew, #6 Canada/Ontario: Matthew, #2 Canada/Quebec: Mathis, #11; Mathieu, #35; Mathias, #47; Matthew, #76; Mathys, #78; Matis, #84 Canada/Saskatchewan: Matthew, #10 Chile: Matias, #4 Czech Republic: Matej, #7; Matyas, #17; Matous, #25 Denmark: Mathias, #11, Mads, #12 England: Matthew, #24 Finland: Matias, #4 France: Mathis, #3 Georgia: Mate, #8 Germany: Matthis, #87 Hungary: Máté, #2; Matyas, #53 Iceland: Matthias, #32 Ireland: Matthew, #17 Italy: Matteo, #4; Mattia, #7 Lithuania: Matas, #1 Netherlands: Thijs, #13 New Zealand: Matthew, #21 Northern Ireland: Matthew,
4 0.83762532 373 andrew gelman stats-2010-10-27-It’s better than being forwarded the latest works of you-know-who
Introduction: In the inbox today: From Jimmy. From Kieran. The relevant references are here and, of course, here .
5 0.73958766 345 andrew gelman stats-2010-10-15-Things we do on sabbatical instead of actually working
Introduction: Frank Fischer, a political scientist at Rutgers U., says his alleged plagiarism was mere sloppiness and not all that uncommon in scholarship. I’ve heard about plagiarism but I had no idea it occurred in political science.
6 0.72544748 234 andrew gelman stats-2010-08-25-Modeling constrained parameters
7 0.70093715 1559 andrew gelman stats-2012-11-02-The blog is back
8 0.6761992 172 andrew gelman stats-2010-07-30-Why don’t we have peer reviewing for oral presentations?
9 0.67416656 437 andrew gelman stats-2010-11-29-The mystery of the U-shaped relationship between happiness and age
10 0.67252576 1789 andrew gelman stats-2013-04-05-Elites have alcohol problems too!
11 0.64557296 1509 andrew gelman stats-2012-09-24-Analyzing photon counts
12 0.63506275 240 andrew gelman stats-2010-08-29-ARM solutions
13 0.63470256 643 andrew gelman stats-2011-04-02-So-called Bayesian hypothesis testing is just as bad as regular hypothesis testing
14 0.63399792 545 andrew gelman stats-2011-01-30-New innovations in spam
15 0.63294446 1851 andrew gelman stats-2013-05-11-Actually, I have no problem with this graph
16 0.63223082 38 andrew gelman stats-2010-05-18-Breastfeeding, infant hyperbilirubinemia, statistical graphics, and modern medicine
17 0.63130295 1907 andrew gelman stats-2013-06-20-Amazing retro gnu graphics!
18 0.62959123 1046 andrew gelman stats-2011-12-07-Neutral noninformative and informative conjugate beta and gamma prior distributions
19 0.62833619 241 andrew gelman stats-2010-08-29-Ethics and statistics in development research
20 0.62827784 1437 andrew gelman stats-2012-07-31-Paying survey respondents