andrew_gelman_stats andrew_gelman_stats-2013 andrew_gelman_stats-2013-2001 knowledge-graph by maker-knowledge-mining

2001 andrew gelman stats-2013-08-29-Edgar Allan Poe was a statistician

meta infos for this blog

Source: html

Introduction: Antony Unwin writes: Rereading Edgar Allan Poe’s “Murder in the Rue Morgue” reminded me of his astute remarks on analysis. For instance But it is in matters beyond the limits of mere rule that the skill of the analyst is evinced. He makes, in silence, a host of observations and inferences. and and the difference in the extent of the information obtained, lies not so much in the validity of the inference as in the quality of the observation. The necessary knowledge is that of what to observe. and He impaired his vision by holding the object too close. He might see, perhaps, one or two points with unusual clearness, but in so doing he, necessarily, lost sight of the matter as a whole. However, I had forgotten his following comment, which rang all sorts of bells in connection with some scientific articles I have seen recently: what is only complex is mistaken (a not unusual error) for what is profound. How about asking referees to rate articles on their complex

Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Antony Unwin writes: Rereading Edgar Allan Poe’s “Murder in the Rue Morgue” reminded me of his astute remarks on analysis. [sent-1, score-0.418]

2 For instance But it is in matters beyond the limits of mere rule that the skill of the analyst is evinced. [sent-2, score-0.968]

3 He makes, in silence, a host of observations and inferences. [sent-3, score-0.263]

4 and and the difference in the extent of the information obtained, lies not so much in the validity of the inference as in the quality of the observation. [sent-4, score-0.45]

5 The necessary knowledge is that of what to observe. [sent-5, score-0.184]

6 and He impaired his vision by holding the object too close. [sent-6, score-0.612]

7 He might see, perhaps, one or two points with unusual clearness, but in so doing he, necessarily, lost sight of the matter as a whole. [sent-7, score-0.616]

8 However, I had forgotten his following comment, which rang all sorts of bells in connection with some scientific articles I have seen recently: what is only complex is mistaken (a not unusual error) for what is profound. [sent-8, score-1.427]

9 How about asking referees to rate articles on their complexity and their profundity? [sent-9, score-0.599]

similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('unusual', 0.248), ('edgar', 0.207), ('rang', 0.195), ('allan', 0.195), ('astute', 0.195), ('bells', 0.187), ('rereading', 0.187), ('impaired', 0.18), ('sight', 0.18), ('silence', 0.166), ('vision', 0.16), ('articles', 0.154), ('lies', 0.15), ('host', 0.15), ('skill', 0.148), ('mistaken', 0.148), ('murder', 0.146), ('analyst', 0.146), ('holding', 0.141), ('forgotten', 0.14), ('antony', 0.14), ('mere', 0.138), ('referees', 0.137), ('unwin', 0.135), ('limits', 0.133), ('object', 0.131), ('obtained', 0.129), ('complexity', 0.128), ('remarks', 0.119), ('validity', 0.117), ('instance', 0.116), ('matters', 0.115), ('observations', 0.113), ('lost', 0.11), ('connection', 0.105), ('reminded', 0.104), ('asking', 0.098), ('necessary', 0.097), ('necessarily', 0.097), ('rule', 0.096), ('complex', 0.096), ('quality', 0.093), ('extent', 0.09), ('knowledge', 0.087), ('rate', 0.082), ('sorts', 0.081), ('matter', 0.078), ('beyond', 0.076), ('seen', 0.073), ('error', 0.073)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0000001 2001 andrew gelman stats-2013-08-29-Edgar Allan Poe was a statistician

2 0.16824041 225 andrew gelman stats-2010-08-23-Getting into hot water over hot graphics

Introduction: I like what Antony Unwin has to say here (start on page 5).

3 0.096466668 583 andrew gelman stats-2011-02-21-An interesting assignment for statistical graphics

Introduction: Antony Unwin writes: I [Unwin] find it an interesting exercise for students to ask them to write headlines (and subheadlines) for graphics, both for ones they have drawn themselves and for published ones. The results are sometimes depressing, often thought-provoking and occasionally highly entertaining. This seems like a great idea, both for teaching students how to read a graph and also for teaching how to make a graph. I’ve long said that when making a graph (or, for that matter, a table), you want to think about what message the reader will get out of it. “Displaying a bunch of numbers” doesn’t cut it.

4 0.086790189 816 andrew gelman stats-2011-07-22-“Information visualization” vs. “Statistical graphics”

Introduction: By now you all must be tired of my one-sided presentations of the differences between infovis and statgraphics (for example, this article with Antony Unwin). Today is something different. Courtesy of Martin Theus, editor of the Statistical Computing and Graphics Newsletter, we have two short articles offering competing perspectives: Robert Kosara writes from an Infovis view: Information visualization is a field that has had trouble defining its boundaries, and that consequently is often misunderstood. It doesn’t help that InfoVis, as it is also known, produces pretty pictures that people like to look at and link to or send around. But InfoVis is more than pretty pictures, and it is more than statistical graphics. The key to understanding InfoVis is to ignore the images for a moment and focus on the part that is often lost: interaction. When we use visualization tools, we don’t just create one image or one kind of visualization. In fact, most people would argue that there is

5 0.082605153 822 andrew gelman stats-2011-07-26-Any good articles on the use of error bars?

Introduction: Hadley Wickham asks: I was wondering if you knew of any good articles on the use of error bars. I’m particularly looking for articles that discuss the difference between error of means and error of difference in the context of models (e.g. mixed models) where they are very different. I suspect every applied field has a couple of good articles, but it’s really hard to search for them. Can anyone help on this? My only advice is to get rid of those horrible crossbars at the ends of the error bars. The crossbars draw attention to the error bars’ endpoints, which are generally not important at all. See, for example, my Anova paper , for some examples of how I like error bars to look.

6 0.077860452 1745 andrew gelman stats-2013-03-02-Classification error

7 0.077423967 1892 andrew gelman stats-2013-06-10-I don’t think we get much out of framing politics as the Tragic Vision vs. the Utopian Vision

8 0.066841573 2279 andrew gelman stats-2014-04-02-Am I too negative?

9 0.065872073 546 andrew gelman stats-2011-01-31-Infovis vs. statistical graphics: My talk tomorrow (Tues) 1pm at Columbia

10 0.065358415 438 andrew gelman stats-2010-11-30-I just skyped in from Kentucky, and boy are my arms tired

11 0.062881038 1522 andrew gelman stats-2012-10-05-High temperatures cause violent crime and implications for climate change, also some suggestions about how to better summarize these claims

12 0.059780262 2006 andrew gelman stats-2013-09-03-Evaluating evidence from published research

13 0.058087282 855 andrew gelman stats-2011-08-16-Infovis and statgraphics update update

14 0.054659151 1139 andrew gelman stats-2012-01-26-Suggested resolution of the Bem paradox

15 0.054088421 1848 andrew gelman stats-2013-05-09-A tale of two discussion papers

16 0.053779475 120 andrew gelman stats-2010-06-30-You can’t put Pandora back in the box

17 0.05314678 1835 andrew gelman stats-2013-05-02-7 ways to separate errors from statistics

18 0.052944154 787 andrew gelman stats-2011-07-05-Different goals, different looks: Infovis and the Chris Rock effect

19 0.051477119 1246 andrew gelman stats-2012-04-04-Data visualization panel at the New York Public Library this evening!

20 0.049840145 1604 andrew gelman stats-2012-12-04-An epithet I can live with

similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.087), (1, -0.005), (2, -0.013), (3, -0.006), (4, -0.011), (5, -0.032), (6, -0.01), (7, 0.013), (8, -0.003), (9, 0.005), (10, -0.003), (11, -0.011), (12, -0.011), (13, 0.008), (14, -0.004), (15, -0.02), (16, 0.005), (17, -0.004), (18, -0.008), (19, 0.028), (20, -0.023), (21, -0.01), (22, 0.017), (23, 0.008), (24, 0.032), (25, 0.016), (26, 0.013), (27, 0.03), (28, -0.003), (29, -0.01), (30, -0.005), (31, 0.04), (32, -0.001), (33, 0.008), (34, 0.026), (35, 0.004), (36, 0.006), (37, 0.011), (38, 0.049), (39, -0.012), (40, -0.028), (41, -0.018), (42, -0.014), (43, 0.009), (44, 0.009), (45, 0.011), (46, 0.013), (47, 0.002), (48, -0.028), (49, 0.013)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.94673228 2001 andrew gelman stats-2013-08-29-Edgar Allan Poe was a statistician

2 0.66919905 1096 andrew gelman stats-2012-01-02-Graphical communication for legal scholarship

Introduction: Following my talk on infovis and statistical graphics at the Empirical Legal Studies conference , Dan Kahan writes: The legal academy, which is making strides toward sensible integration of a variety of empirical methods into its scholarship, is horribly ignorant of the utility of graphic reporting of data, a likely influence of the formative influence that econometric methods has exerted on expectations and habits of mind among legal scholars. Lee Epstein has written a pair of wonderful articles on graphic reporting – 1. Epstein, L., Martin, A. & Boyd, C. On the Effective Communication of the Results of Empirical Studies, Part II. Vand. L. Rev. 60, 798-846 (2007). 2. Epstein, L., Martin, A. & Schneider, M. On the Effective Communication of the Results of Empirical Studies, Part I. Vand. L. Rev. 59, 1811-1871 (2007). – but her efforts haven’t gotten the attention they deserve, and reinforcement, particularly at a venue like CELS is very important. But the main issue there

3 0.66674411 2284 andrew gelman stats-2014-04-07-How literature is like statistical reasoning: Kosara on stories. Gelman and Basbøll on stories.

Introduction: In “Story: A Definition,” visual analysis researcher Robert Kosara writes : A story ties facts together. There is a reason why this particular collection of facts is in this story, and the story gives you that reason. provides a narrative path through those facts. In other words, it guides the viewer/reader through the world, rather than just throwing them in there. presents a particular interpretation of those facts. A story is always a particular path through a world, so it favors one way of seeing things over all others. The relevance of these ideas to statistical graphics is apparent. From a completely different direction, in “When do stories work? Evidence and illustration in the social sciences,” Thomas Basbøll and I write : Storytelling has long been recognized as central to human cognition and communication. Here we explore a more active role of stories in social science research, not merely to illustrate concepts but also to develop new ideas and evalu

4 0.66273421 38 andrew gelman stats-2010-05-18-Breastfeeding, infant hyperbilirubinemia, statistical graphics, and modern medicine

Introduction: Dan Lakeland asks : When are statistical graphics potentially life threatening? When they’re poorly designed, and used to make decisions on potentially life threatening topics, like medical decision making, engineering design, and the like. The American Academy of Pediatrics has dropped the ball on communicating to physicians about infant jaundice. Another message in this post is that bad decisions can compound each other. It’s an interesting story (follow the link above for the details), would be great for a class in decision analysis or statistical communication. I have no idea how to get from A to B here, in the sense of persuading hospitals to do this sort of thing better. I’d guess the first step is to carefully lay out costs and benefits. When doctors and nurses make extra precautions for safety, it could be useful to lay out the ultimate goals and estimate the potential costs and benefits of different approaches.

5 0.65548658 1848 andrew gelman stats-2013-05-09-A tale of two discussion papers

Introduction: Over the years I’ve written a dozen or so journal articles that have appeared with discussions, and I’ve participated in many published discussions of others’ articles as well. I get a lot out of these article-discussion-rejoinder packages, in all three of my roles as reader, writer, and discussant. Part 1: The story of an unsuccessful discussion The first time I had a discussion article was the result of an unfortunate circumstance. I had a research idea that resulted in an article with Don Rubin on monitoring the mixing of Markov chain simulations. I new the idea was great, but back then we worked pretty slowly so it was awhile before we had a final version to submit to a journal. (In retrospect I wish I’d just submitted the draft version as it was.) In the meantime I presented the paper at a conference. Our idea was very well received (I had a sheet of paper so people could write their names and addresses to get preprints, and we got either 50 or 150 (I can’t remembe

6 0.63028067 1775 andrew gelman stats-2013-03-23-In which I disagree with John Maynard Keynes

7 0.62544632 1336 andrew gelman stats-2012-05-22-Battle of the Repo Man quotes: Reid Hastie’s turn

8 0.62344384 116 andrew gelman stats-2010-06-29-How to grab power in a democracy – in 5 easy non-violent steps

9 0.61743808 1975 andrew gelman stats-2013-08-09-Understanding predictive information criteria for Bayesian models

10 0.61263019 721 andrew gelman stats-2011-05-20-Non-statistical thinking in the US foreign policy establishment

11 0.6122027 1420 andrew gelman stats-2012-07-18-The treatment, the intermediate outcome, and the ultimate outcome: Leverage and the financial crisis

12 0.61062342 1278 andrew gelman stats-2012-04-23-“Any old map will do” meets “God is in every leaf of every tree”

13 0.61017507 816 andrew gelman stats-2011-07-22-“Information visualization” vs. “Statistical graphics”

14 0.60728514 1404 andrew gelman stats-2012-07-03-Counting gays

15 0.60610467 847 andrew gelman stats-2011-08-10-Using a “pure infographic” to explore differences between information visualization and statistical graphics

16 0.60496259 1463 andrew gelman stats-2012-08-19-It is difficult to convey intonation in typed speech

17 0.59701073 1319 andrew gelman stats-2012-05-14-I hate to get all Gerd Gigerenzer on you here, but . . .

18 0.59667999 2199 andrew gelman stats-2014-02-04-Widening the goalposts in medical trials

19 0.58872533 778 andrew gelman stats-2011-06-24-New ideas on DIC from Martyn Plummer and Sumio Watanabe

20 0.58716685 399 andrew gelman stats-2010-11-07-Challenges of experimental design; also another rant on the practice of mentioning the publication of an article but not naming its author

similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(0, 0.026), (13, 0.039), (15, 0.017), (16, 0.073), (21, 0.063), (22, 0.104), (24, 0.093), (51, 0.098), (52, 0.024), (55, 0.047), (66, 0.024), (76, 0.08), (84, 0.043), (87, 0.034), (99, 0.135)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.93352866 2001 andrew gelman stats-2013-08-29-Edgar Allan Poe was a statistician

2 0.76943505 448 andrew gelman stats-2010-12-03-This is a footnote in one of my papers

Introduction: In the annals of hack literature, it is sometimes said that if you aim to write best-selling crap, all you’ll end up with is crap. To truly produce best-selling crap, you have to have a conviction, perhaps misplaced, that your writing has integrity. Whether or not this is a good generalization about writing, I have seen an analogous phenomenon in statistics: If you try to do nothing but model the data, you can be in for a wild and unpleasant ride: real data always seem to have one more twist beyond our ability to model (von Neumann’s elephant’s trunk notwithstanding). But if you model the underlying process, sometimes your model can fit surprisingly well as well as inviting openings for future research progress.

3 0.75513923 1037 andrew gelman stats-2011-12-01-Lamentably common misunderstanding of meritocracy

Introduction: Tyler Cowen pointed to an article by business-school professor Luigi Zingales about meritocracy. I’d expect a b-school prof to support the idea of meritocracy, and Zingales does not disappoint. But he says a bunch of other things that to me represent a confused conflation of ideas. Here’s Zingales: America became known as a land of opportunity—a place whose capitalist system benefited the hardworking and the virtuous [emphasis added]. In a word, it was a meritocracy. That’s interesting—and revealing. Here’s what I get when I look up “meritocracy” in the dictionary : 1 : a system in which the talented are chosen and moved ahead on the basis of their achievement 2 : leadership selected on the basis of intellectual criteria Nothing here about “hardworking” or “virtuous.” In a meritocracy, you can be as hardworking as John Kruk or as virtuous as Kobe Bryant and you’ll still get ahead—if you have the talent and achievement. Throwing in “hardworking” and “virtuous”

4 0.74076295 1543 andrew gelman stats-2012-10-21-Model complexity as a function of sample size

Introduction: As we get more data, we can fit more model. But at some point we become so overwhelmed by data that, for computational reasons, we can barely do anything at all. Thus, the curve above could be thought of as the product of two curves: a steadily increasing curve showing the statistical ability to fit more complex models with more data, and a steadily decreasing curve showing the computational feasibility of doing so.

5 0.72591662 145 andrew gelman stats-2010-07-13-Statistical controversy regarding human rights violations in Colomnbia

Introduction: Megan Price wrote in that she and Daniel Guzmán of the Benetech Human Rights Program released a paper today entitled “Comments to the article ‘Is Violence Against Union Members in Colombia Systematic and Targeted?’” (o aqui en español), which examines an article written by Colombian academics Daniel Mejía and María José Uribe. Price writes [in the third person]: The paper reviewed by Price and Guzmán concluded that “. . . on average, violence against unionists in Colombia is neither systematic nor targeted.” However, in their response, Price and Guzmán present – in technical and methodological detail – the reasons they find the conclusions in Mejía and Uribe’s study to be overstated. Price and Guzmán believe that weaknesses in the data, in the choice of the statistical model, and the interpretation of the model used in Mejía and Uribe’s study, all raise serious questions about the authors’ strong causal conclusions. Price and Guzmán point out that unchecked, those conclusio

6 0.72185063 1755 andrew gelman stats-2013-03-09-Plaig

7 0.72085869 385 andrew gelman stats-2010-10-31-Wacky surveys where they don’t tell you the questions they asked

8 0.7203418 1594 andrew gelman stats-2012-11-28-My talk on statistical graphics at Mit this Thurs aft

9 0.71588629 1161 andrew gelman stats-2012-02-10-If an entire article in Computational Statistics and Data Analysis were put together from other, unacknowledged, sources, would that be a work of art?

10 0.7145046 879 andrew gelman stats-2011-08-29-New journal on causal inference

11 0.71170038 1216 andrew gelman stats-2012-03-17-Modeling group-level predictors in a multilevel regression

12 0.70970666 2246 andrew gelman stats-2014-03-13-An Economist’s Guide to Visualizing Data

13 0.70922059 988 andrew gelman stats-2011-11-02-Roads, traffic, and the importance in decision analysis of carefully examining your goals

14 0.70877928 2317 andrew gelman stats-2014-05-04-Honored oldsters write about statistics

15 0.70810318 1609 andrew gelman stats-2012-12-06-Stephen Kosslyn’s principles of graphics and one more: There’s no need to cram everything into a single plot

16 0.7078402 1147 andrew gelman stats-2012-01-30-Statistical Murder

17 0.70589793 477 andrew gelman stats-2010-12-20-Costless false beliefs

18 0.70582247 586 andrew gelman stats-2011-02-23-A statistical version of Arrow’s paradox

19 0.70168841 1398 andrew gelman stats-2012-06-28-Every time you take a sample, you’ll have to pay this guy a quarter

20 0.69992471 337 andrew gelman stats-2010-10-12-Election symposium at Columbia Journalism School