andrew_gelman_stats andrew_gelman_stats-2011 andrew_gelman_stats-2011-733 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Somebody named Justin writes: Check this out for some probably bad statistics and bad graphs. It looks like they tallied the most frequent names of CEOs, professions, geography, etc. What inference they are trying to make from this, I have no clue. I agree this is pretty horrible isn’t the sort of graph that I would make. But, readers, please! You don’t have to just send me the bad stuff!
sentIndex sentText sentNum sentScore
1 Somebody named Justin writes: Check this out for some probably bad statistics and bad graphs. [sent-1, score-1.014]
2 It looks like they tallied the most frequent names of CEOs, professions, geography, etc. [sent-2, score-0.963]
3 What inference they are trying to make from this, I have no clue. [sent-3, score-0.276]
4 I agree this is pretty horrible isn’t the sort of graph that I would make. [sent-4, score-0.626]
wordName wordTfidf (topN-words)
[('ceos', 0.357), ('tallied', 0.357), ('bad', 0.316), ('professions', 0.302), ('geography', 0.271), ('frequent', 0.252), ('justin', 0.226), ('named', 0.194), ('horrible', 0.19), ('names', 0.185), ('somebody', 0.176), ('send', 0.162), ('please', 0.146), ('check', 0.142), ('stuff', 0.138), ('looks', 0.13), ('readers', 0.128), ('graph', 0.123), ('isn', 0.12), ('inference', 0.114), ('probably', 0.113), ('agree', 0.105), ('trying', 0.104), ('pretty', 0.087), ('sort', 0.082), ('statistics', 0.075), ('make', 0.058), ('writes', 0.054), ('would', 0.039), ('like', 0.039)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 733 andrew gelman stats-2011-05-27-Another silly graph
Introduction: Somebody named Justin writes: Check this out for some probably bad statistics and bad graphs. It looks like they tallied the most frequent names of CEOs, professions, geography, etc. What inference they are trying to make from this, I have no clue. I agree this is pretty horrible isn’t the sort of graph that I would make. But, readers, please! You don’t have to just send me the bad stuff!
2 0.14141864 1733 andrew gelman stats-2013-02-22-Krugman sets the bar too high
Introduction: If being cantankerous and potty-mouthed is a bad thing, I’m in big trouble !
3 0.12656088 6 andrew gelman stats-2010-04-27-Jelte Wicherts lays down the stats on IQ
Introduction: Good stuff.
4 0.11998457 2211 andrew gelman stats-2014-02-14-The popularity of certain baby names is falling off the clifffffffffffff
Introduction: Ubs writes: I was looking at baby name data last night and I stumbled upon something curious. I follow the baby names blog occasionally but not regularly, so I’m not sure if it’s been noticed before. Let me present it like this: Take the statement… Of the top 100 boys and top 100 girls names, only ___% contain the letter __. I’m using the SSA baby names page, so that’s U.S. births, and I’m looking at the decade of 2000-2009 (so kids currently aged 4 to 13). Which letters would you expect to have the lowest rate of occurrence? As expected, the lowest score is for Q, which appears zero times. (Jacqueline ranks #104 for girls.) It’s the second lowest that surprised me. (… You can pause and try to guess now. Spoilers to follow.) Of the other big-point Scrabble letters, Z appears in four names (Elizabeth, Zachary, Mackenzie, Zoe) and X in six, of which five are closely related (Alexis, Alexander, Alexandra, Alexa, Alex, Xavier). J is heavily overrepresented, especial
5 0.11703444 153 andrew gelman stats-2010-07-17-Tenure-track position at U. North Carolina in survey methods and social statistics
Introduction: See here . Cool–it looks like they’re doing interesting stuff, and it’s great to see this sort of support for applied research.
6 0.11152029 2013 andrew gelman stats-2013-09-08-What we need here is some peer review for statistical graphics
8 0.10270544 2212 andrew gelman stats-2014-02-15-Mary, Mary, why ya buggin
9 0.095798358 658 andrew gelman stats-2011-04-11-Statistics in high schools: Towards more accessible conceptions of statistical inference
10 0.095124625 1919 andrew gelman stats-2013-06-29-R sucks
11 0.091918722 1701 andrew gelman stats-2013-01-31-The name that fell off a cliff
12 0.091049194 332 andrew gelman stats-2010-10-10-Proposed new section of the American Statistical Association on Imaging Sciences
13 0.090482891 1296 andrew gelman stats-2012-05-03-Google Translate for code, and an R help-list bot
14 0.088077128 798 andrew gelman stats-2011-07-12-Sometimes a graph really is just ugly
15 0.083713107 2006 andrew gelman stats-2013-09-03-Evaluating evidence from published research
16 0.083554246 1862 andrew gelman stats-2013-05-18-uuuuuuuuuuuuugly
17 0.080862515 1347 andrew gelman stats-2012-05-27-Macromuddle
18 0.079242386 1006 andrew gelman stats-2011-11-12-Val’s Number Scroll: Helping kids visualize math
19 0.073258676 199 andrew gelman stats-2010-08-11-Note to semi-spammers
20 0.073055111 2279 andrew gelman stats-2014-04-02-Am I too negative?
topicId topicWeight
[(0, 0.097), (1, -0.041), (2, -0.054), (3, 0.051), (4, 0.034), (5, -0.044), (6, -0.024), (7, 0.005), (8, 0.016), (9, -0.012), (10, 0.016), (11, -0.035), (12, 0.003), (13, 0.022), (14, -0.011), (15, 0.038), (16, -0.002), (17, 0.009), (18, 0.003), (19, -0.009), (20, 0.039), (21, -0.002), (22, 0.023), (23, -0.024), (24, 0.045), (25, -0.019), (26, -0.028), (27, 0.032), (28, -0.055), (29, 0.025), (30, 0.051), (31, -0.009), (32, -0.021), (33, -0.083), (34, -0.065), (35, 0.02), (36, 0.006), (37, -0.042), (38, -0.047), (39, 0.04), (40, 0.031), (41, -0.011), (42, 0.025), (43, -0.038), (44, 0.069), (45, 0.054), (46, 0.01), (47, -0.002), (48, 0.063), (49, 0.009)]
simIndex simValue blogId blogTitle
same-blog 1 0.97688556 733 andrew gelman stats-2011-05-27-Another silly graph
Introduction: Somebody named Justin writes: Check this out for some probably bad statistics and bad graphs. It looks like they tallied the most frequent names of CEOs, professions, geography, etc. What inference they are trying to make from this, I have no clue. I agree this is pretty horrible isn’t the sort of graph that I would make. But, readers, please! You don’t have to just send me the bad stuff!
2 0.71568674 1193 andrew gelman stats-2012-03-03-“Do you guys pay your bills?”
Introduction: I’ve had Love the Liberry on the blogroll forever. I hadn’t checked the site for awhile and was impressed to see that they’re still at it. Great stuff—don’t ever quit! P.S. It seems that there are other librarian blogs. Pretty scary, actually! One’s enough for me.
3 0.69848776 153 andrew gelman stats-2010-07-17-Tenure-track position at U. North Carolina in survey methods and social statistics
Introduction: See here . Cool–it looks like they’re doing interesting stuff, and it’s great to see this sort of support for applied research.
4 0.65624726 1810 andrew gelman stats-2013-04-17-Subway series
Introduction: Abby points us to a spare but cool visualization . I don’t like the curvy connect-the-dots line, but my main suggested improvement would be a closer link to the map . Showing median income on census tracts along subway lines is cool, but ultimately it’s a clever gimmick that pulls me in and makes me curious about what the map looks like. (And, thanks to google, the map was easy to find.)
5 0.64892727 1862 andrew gelman stats-2013-05-18-uuuuuuuuuuuuugly
Introduction: Hamdan Azhar writes: I came across this graphic of vaccine-attributed decreases in mortality and was curious if you found it as unattractive and unintuitive as I did. Hope all is well with you! My reply: All’s well with me. And yes, that’s one horrible graph. It has all the problems with a bad infographic with none of the virtues. Compared to this monstrosity, the typical USA Today graph is a stunning, beautiful masterpiece. I don’t think I want to soil this webpage with the image. In fact, I don’t even want to link to it.
6 0.64591265 1701 andrew gelman stats-2013-01-31-The name that fell off a cliff
7 0.63838971 2333 andrew gelman stats-2014-05-13-Personally, I’d rather go with Teragram
8 0.61832434 2079 andrew gelman stats-2013-10-27-Uncompressing the concept of compressed sensing
9 0.61719525 1283 andrew gelman stats-2012-04-26-Let’s play “Guess the smoother”!
10 0.61461169 353 andrew gelman stats-2010-10-19-The violent crime rate was about 75% higher in Detroit than in Minneapolis in 2009
11 0.61411673 2038 andrew gelman stats-2013-09-25-Great graphs of names
12 0.61243802 1785 andrew gelman stats-2013-04-02-So much artistic talent
13 0.61090881 664 andrew gelman stats-2011-04-16-Dilbert update: cartooning can give you the strength to open jars with your bare hands
14 0.60794342 208 andrew gelman stats-2010-08-15-When Does a Name Become Androgynous?
15 0.60688227 2211 andrew gelman stats-2014-02-14-The popularity of certain baby names is falling off the clifffffffffffff
17 0.59629649 204 andrew gelman stats-2010-08-12-Sloppily-written slam on moderately celebrated writers is amusing nonetheless
18 0.59079623 1733 andrew gelman stats-2013-02-22-Krugman sets the bar too high
19 0.59078336 875 andrew gelman stats-2011-08-28-Better than Dennis the dentist or Laura the lawyer
topicId topicWeight
[(16, 0.093), (24, 0.106), (53, 0.361), (99, 0.274)]
simIndex simValue blogId blogTitle
1 0.96568149 413 andrew gelman stats-2010-11-14-Statistics of food consumption
Introduction: Visual Economics shows statistics on average food consumption in America: My brief feedback is that water is confounded with these results. They should have subtracted water content from the weight of all dietary items, as it inflates the proportion of milk, vegetable and fruit items that contain more water. They did that for soda (which is represented as sugar/corn syrup), amplifying the inconsistency. Time Magazine had a beautiful gallery that visualizes diets around the world in a more appealing way.
2 0.95359182 1589 andrew gelman stats-2012-11-25-Life as a blogger: the emails just get weirder and weirder
Introduction: In the email the other day, subject line “Casting blogger, writer, journalist to host cable series”: Hi there Andrew, I’m casting a male journalist, writer, blogger, documentary filmmaker or comedian with a certain type personality for a television pilot along with production company, Pipeline39. See below: A certain type of character – no cockiness, no ego, a person who is smart, savvy, dry humor, but someone who isn’t imposing, who can infiltrate these organizations. This person will be hosting his own show and covering alternative lifestyles and secret societies around the world. If you’re interested in hearing more or would like to be considered for this project, please email me a photo and a bio of yourself, along with contact information. I’ll respond to you ASAP. I’m looking forward to hearing from you. *** Casting Producer (646) ***.**** ***@gmail.com I was with them until I got to the “no ego” part. . . . Also, I don’t think I could infiltrate any org
Introduction: Tapen Sinha writes: Living in Mexico, I have been witness to many strange (and beautiful) things. Perhaps the strangest happened during the first outbreak of A(H1N1) in Mexico City. We had our university closed, football (soccer) was played in empty stadiums (or should it be stadia) because the government feared a spread of the virus. The Metro was operating and so were the private/public buses and taxis. Since the university was closed, we took the opportunity to collect data on facemask use in the public transport systems. It was a simple (but potentially deadly!) exercise in first hand statistical data collection that we teach our students (Although I must admit that I did not dare sending my research assistant to collect data – what if she contracted the virus?). I believe it was a unique experiment never to be repeated. The paper appeared in the journal Health Policy. From the abstract: At the height of the influenza epidemic in Mexico City in the spring of 2009, the f
4 0.91926682 1856 andrew gelman stats-2013-05-14-GPstuff: Bayesian Modeling with Gaussian Processes
Introduction: I think it’s part of my duty as a blogger to intersperse, along with the steady flow of jokes, rants, and literary criticism, some material that will actually be useful to you. So here goes. Jarno Vanhatalo, Jaakko Riihimäki, Jouni Hartikainen, Pasi Jylänki, Ville Tolvanen, and Aki Vehtari write : The GPstuff toolbox is a versatile collection of Gaussian process models and computational tools required for Bayesian inference. The tools include, among others, various inference methods, sparse approximations and model assessment methods. We can actually now fit Gaussian processes in Stan . But for big problems (or even moderately-sized problems), full Bayes can be slow. GPstuff uses EP, which is faster. At some point we’d like to implement EP in Stan. (Right now we’re working with Dave Blei to implement VB.) GPstuff really works. I saw Aki use it to fit a nonparametric version of the Bangladesh well-switching example in ARM. He was sitting in his office and just whip
5 0.91514468 1677 andrew gelman stats-2013-01-16-Greenland is one tough town
Introduction: Americans (including me) don’t know much about other countries. Jeff Lax sent me to this blog post by Myrddin pointing out that Belgium has a higher murder rate than the rest of Western Europe. I have no particular take on this, but it’s a good reminder that other countries differ from each other. Here in the U.S., we tend to think all western European countries are the same, all eastern European countries are the same, etc. In reality, Sweden is not Finland . P.S. According to the Wiki , Greenland is one tough town. I guess there’s nothing much to do out there but watch satellite TV, chew the blubber, and kill people.
6 0.89428449 1468 andrew gelman stats-2012-08-24-Multilevel modeling and instrumental variables
7 0.89261997 46 andrew gelman stats-2010-05-21-Careers, one-hit wonders, and an offer of a free book
8 0.8904444 1802 andrew gelman stats-2013-04-14-Detecting predictability in complex ecosystems
9 0.87461227 1905 andrew gelman stats-2013-06-18-There are no fat sprinters
10 0.86303186 991 andrew gelman stats-2011-11-04-Insecure researchers aren’t sharing their data
same-blog 11 0.85971081 733 andrew gelman stats-2011-05-27-Another silly graph
12 0.85588849 1902 andrew gelman stats-2013-06-17-Job opening at new “big data” consulting firm!
13 0.84601843 2022 andrew gelman stats-2013-09-13-You heard it here first: Intense exercise can suppress appetite
15 0.83351535 1047 andrew gelman stats-2011-12-08-I Am Too Absolutely Heteroskedastic for This Probit Model
16 0.82254875 495 andrew gelman stats-2010-12-31-“Threshold earners” and economic inequality
17 0.81684113 880 andrew gelman stats-2011-08-30-Annals of spam
18 0.81525713 2067 andrew gelman stats-2013-10-18-EP and ABC
19 0.79527622 354 andrew gelman stats-2010-10-19-There’s only one Amtrak
20 0.79501867 237 andrew gelman stats-2010-08-27-Bafumi-Erikson-Wlezien predict a 50-seat loss for Democrats in November