andrew_gelman_stats andrew_gelman_stats-2010 andrew_gelman_stats-2010-321 knowledge-graph by maker-knowledge-mining

321 andrew gelman stats-2010-10-05-Racism!


meta infos for this blog

Source: html

Introduction: Last night I spoke at the Columbia Club of New York, along with some of my political science colleagues, in a panel about politics, the economy, and the forthcoming election. The discussion was fine . . . until one guy in the audience accused us of bias based on what he imputed as our ethnicity. One of the panelists replied by asking the questioner what of all the things we had said was biased, and the questioner couldn’t actually supply any examples. It makes sense that the questioner couldn’t come up with a single example of bias on our part, considering that we were actually presenting facts . At some level, the questioner’s imputation of our ethnicity and accusation of bias isn’t so horrible. When talking with my friends, I engage in casual ethnic stereotyping all the time–hey, it’s a free country!–and one can certainly make the statistical argument that you can guess people’s ethnicities from their names, appearance, and speech patterns, and in turn you can infer a lot


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Last night I spoke at the Columbia Club of New York, along with some of my political science colleagues, in a panel about politics, the economy, and the forthcoming election. [sent-1, score-0.397]

2 until one guy in the audience accused us of bias based on what he imputed as our ethnicity. [sent-5, score-0.577]

3 One of the panelists replied by asking the questioner what of all the things we had said was biased, and the questioner couldn’t actually supply any examples. [sent-6, score-1.262]

4 It makes sense that the questioner couldn’t come up with a single example of bias on our part, considering that we were actually presenting facts . [sent-7, score-0.962]

5 At some level, the questioner’s imputation of our ethnicity and accusation of bias isn’t so horrible. [sent-8, score-0.448]

6 When talking with my friends, I engage in casual ethnic stereotyping all the time–hey, it’s a free country! [sent-9, score-0.346]

7 –and one can certainly make the statistical argument that you can guess people’s ethnicities from their names, appearance, and speech patterns, and in turn you can infer a lot about people’s political attitudes from their occupations, ethnicities, and so on. [sent-10, score-0.493]

8 Still, I think it was a pretty rude comment and pretty pointless. [sent-11, score-0.089]

9 Maybe he thought we’d break down under the pressure and admit that we were all being programmed by our KGB handlers? [sent-13, score-0.239]

10 Then, later on, someone asked a truly racist question–a rant, really–that clearly had a close relation to his personal experiences even while having essentially zero connection to the real world as we understand it statistically. [sent-15, score-0.307]

11 I’ve seen the polls and I know that there are a lot of racists out there, of all stripes. [sent-16, score-0.183]

12 Still, I don’t encounter this sort of thing much in my everyday life, and it was a bit upsetting to see it in the flesh. [sent-17, score-0.274]

13 Yes, I realize that women and minorities have to deal with this all the time. [sent-22, score-0.089]

14 This was the first time in my professional life that I’ve been accused of bias based on my (imputed) ethnicity, but I’m sure that if you’re a member of a traditionally-disparaged group, it happens all over. [sent-23, score-0.61]

15 So I’m not complaining, exactly, but it still upsets me a bit. [sent-24, score-0.206]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('questioner', 0.537), ('ethnicities', 0.244), ('bias', 0.219), ('imputed', 0.183), ('accused', 0.175), ('ethnicity', 0.152), ('life', 0.145), ('panelists', 0.122), ('upsets', 0.122), ('stereotyping', 0.115), ('racists', 0.115), ('couldn', 0.111), ('racist', 0.103), ('upsetting', 0.103), ('occupations', 0.098), ('programmed', 0.096), ('infer', 0.094), ('complaining', 0.093), ('appearance', 0.091), ('encounter', 0.089), ('club', 0.089), ('forthcoming', 0.089), ('speech', 0.089), ('rude', 0.089), ('minorities', 0.089), ('night', 0.086), ('rant', 0.084), ('still', 0.084), ('everyday', 0.082), ('ethnic', 0.079), ('expecting', 0.079), ('spoke', 0.079), ('panel', 0.077), ('casual', 0.077), ('imputation', 0.077), ('engage', 0.075), ('pressure', 0.075), ('biased', 0.072), ('member', 0.071), ('come', 0.071), ('truly', 0.069), ('break', 0.068), ('polls', 0.068), ('relation', 0.068), ('facts', 0.068), ('experiences', 0.067), ('presenting', 0.067), ('supply', 0.066), ('political', 0.066), ('names', 0.063)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.99999994 321 andrew gelman stats-2010-10-05-Racism!

Introduction: Last night I spoke at the Columbia Club of New York, along with some of my political science colleagues, in a panel about politics, the economy, and the forthcoming election. The discussion was fine . . . until one guy in the audience accused us of bias based on what he imputed as our ethnicity. One of the panelists replied by asking the questioner what of all the things we had said was biased, and the questioner couldn’t actually supply any examples. It makes sense that the questioner couldn’t come up with a single example of bias on our part, considering that we were actually presenting facts . At some level, the questioner’s imputation of our ethnicity and accusation of bias isn’t so horrible. When talking with my friends, I engage in casual ethnic stereotyping all the time–hey, it’s a free country!–and one can certainly make the statistical argument that you can guess people’s ethnicities from their names, appearance, and speech patterns, and in turn you can infer a lot

2 0.15658425 935 andrew gelman stats-2011-10-01-When should you worry about imputed data?

Introduction: Majid Ezzati writes: My research group is increasingly focusing on a series of problems that involve data that either have missingness or measurements that may have bias/error. We have at times developed our own approaches to imputation (as simple as interpolating a missing unit and as sophisticated as a problem-specific Bayesian hierarchical model) and at other times, other groups impute the data. The outputs are being used to investigate the basic associations between pairs of variables, Xs and Ys, in regressions; we may or may not interpret these as causal. I am contacting colleagues with relevant expertise to suggest good references on whether having imputed X and/or Y in a subsequent regression is correct or if it could somehow lead to biased/spurious associations. Thinking about this, we can have at least the following situations (these could all be Bayesian or not): 1) X and Y both measured (perhaps with error) 2) Y imputed using some data and a model and X measur

3 0.14147528 799 andrew gelman stats-2011-07-13-Hypothesis testing with multiple imputations

Introduction: Vincent Yip writes: I have read your paper [with Kobi Abayomi and Marc Levy] regarding multiple imputation application. In order to diagnostic my imputed data, I used Kolmogorov-Smirnov (K-S) tests to compare the distribution differences between the imputed and observed values of a single attribute as mentioned in your paper. My question is: For example I have this attribute X with the following data: (NA = missing) Original dataset: 1, NA, 3, 4, 1, 5, NA Imputed dataset: 1, 2 , 3, 4, 1, 5, 6 a) in order to run the KS test, will I treat the observed data as 1, 3, 4,1, 5? b) and for the observed data, will I treat 1, 2 , 3, 4, 1, 5, 6 as the imputed dataset for the K-S test? or just 2 ,6? c) if I used m=5, I will have 5 set of imputed data sets. How would I apply K-S test to 5 of them and compare to the single observed distribution? Do I combine the 5 imputed data set into one by averaging each imputed values so I get one single imputed data and compare with the ob

4 0.10229289 1291 andrew gelman stats-2012-04-30-Systematic review of publication bias in studies on publication bias

Introduction: Via Yalda Afshar , a 2005 paper by Hans-Hermann Dubben and Hans-Peter Beck-Bornholdt: Publication bias is a well known phenomenon in clinical literature, in which positive results have a better chance of being published, are published earlier, and are published in journals with higher impact factors. Conclusions exclusively based on published studies, therefore, can be misleading. Selective under-reporting of research might be more widespread and more likely to have adverse consequences for patients than publication of deliberately falsified data. We investigated whether there is preferential publication of positive papers on publication bias. They conclude, “We found no evidence of publication bias in reports on publication bias.” But of course that’s the sort of finding regarding publication bias of findings on publication bias that you’d expect would get published. What we really need is a careful meta-analysis to estimate the level of publication bias in studies of publi

5 0.1022555 960 andrew gelman stats-2011-10-15-The bias-variance tradeoff

Introduction: Joshua Vogelstein asks for my thoughts as a Bayesian on the above topic. So here they are (briefly): The concept of the bias-variance tradeoff can be useful if you don’t take it too seriously. The basic idea is as follows: if you’re estimating something, you can slice your data finer and finer, or perform more and more adjustments, each time getting a purer—and less biased—estimate. But each subdivision or each adjustment reduces your sample size or increases potential estimation error, hence the variance of your estimate goes up. That story is real. In lots and lots of examples, there’s a continuum between a completely unadjusted general estimate (high bias, low variance) and a specific, focused, adjusted estimate (low bias, high variance). Suppose, for example, you’re using data from a large experiment to estimate the effect of a treatment on a fairly narrow group, say, white men between the ages of 45 and 50. At one extreme, you could just take the estimated treatment e

6 0.098213822 828 andrew gelman stats-2011-07-28-Thoughts on Groseclose book on media bias

7 0.096939407 683 andrew gelman stats-2011-04-28-Asymmetry in Political Bias

8 0.093492441 2255 andrew gelman stats-2014-03-19-How Americans vote

9 0.082720086 1418 andrew gelman stats-2012-07-16-Long discussion about causal inference and the use of hierarchical models to bridge between different inferential settings

10 0.078351893 1330 andrew gelman stats-2012-05-19-Cross-validation to check missing-data imputation

11 0.077404745 812 andrew gelman stats-2011-07-21-Confusion about “rigging the numbers,” the support of ideological opposites, who’s a 501(c)(3), and the asymmetry of media bias

12 0.076315016 1039 andrew gelman stats-2011-12-02-I just flew in from the econ seminar, and boy are my arms tired

13 0.073772721 868 andrew gelman stats-2011-08-24-Blogs vs. real journalism

14 0.072516382 604 andrew gelman stats-2011-03-08-More on the missing conservative psychology researchers

15 0.072308965 210 andrew gelman stats-2010-08-16-What I learned from those tough 538 commenters

16 0.072147474 125 andrew gelman stats-2010-07-02-The moral of the story is, Don’t look yourself up on Google

17 0.071553029 719 andrew gelman stats-2011-05-19-Everything is Obvious (once you know the answer)

18 0.071069546 1968 andrew gelman stats-2013-08-05-Evidence on the impact of sustained use of polynomial regression on causal inference (a claim that coal heating is reducing lifespan by 5 years for half a billion people)

19 0.070719413 601 andrew gelman stats-2011-03-05-Against double-blind reviewing: Political science and statistics are not like biology and physics

20 0.069493949 124 andrew gelman stats-2010-07-02-Note to the quals


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.154), (1, -0.081), (2, 0.026), (3, 0.021), (4, -0.017), (5, 0.006), (6, 0.024), (7, -0.006), (8, 0.031), (9, 0.011), (10, -0.023), (11, 0.004), (12, 0.019), (13, 0.024), (14, -0.012), (15, 0.02), (16, -0.019), (17, -0.023), (18, 0.01), (19, 0.014), (20, -0.008), (21, -0.021), (22, -0.03), (23, 0.008), (24, 0.01), (25, 0.004), (26, -0.021), (27, -0.045), (28, 0.01), (29, 0.006), (30, 0.018), (31, 0.029), (32, 0.025), (33, 0.024), (34, -0.001), (35, -0.017), (36, 0.01), (37, 0.056), (38, -0.007), (39, 0.022), (40, -0.057), (41, 0.012), (42, 0.008), (43, -0.003), (44, -0.026), (45, 0.0), (46, 0.06), (47, 0.029), (48, 0.008), (49, 0.016)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.9481166 321 andrew gelman stats-2010-10-05-Racism!

Introduction: Last night I spoke at the Columbia Club of New York, along with some of my political science colleagues, in a panel about politics, the economy, and the forthcoming election. The discussion was fine . . . until one guy in the audience accused us of bias based on what he imputed as our ethnicity. One of the panelists replied by asking the questioner what of all the things we had said was biased, and the questioner couldn’t actually supply any examples. It makes sense that the questioner couldn’t come up with a single example of bias on our part, considering that we were actually presenting facts . At some level, the questioner’s imputation of our ethnicity and accusation of bias isn’t so horrible. When talking with my friends, I engage in casual ethnic stereotyping all the time–hey, it’s a free country!–and one can certainly make the statistical argument that you can guess people’s ethnicities from their names, appearance, and speech patterns, and in turn you can infer a lot

2 0.80346078 1417 andrew gelman stats-2012-07-15-Some decision analysis problems are pretty easy, no?

Introduction: Cassie Murdoch reports : A 47-year-old woman in Uxbridge, Massachusetts, got behind the wheel of her car after having a bit too much to drink, but instead of wreaking havoc on the road, she ended up lodged in a sand trap at a local golf course. Why? Because her GPS made her do it—obviously! She said the GPS told her to turn left, and she did, right into a cornfield. That didn’t faze her, and she just kept on going until she ended up on the golf course and got stuck in the sand. There were people on the course at the time, but thankfully nobody was injured. Police found a cup full of alcohol in her car and arrested her for driving drunk. Here’s the punchline: This is the fourth time she’s been arrested for a DUI. Assuming this story is accurate, I guess they don’t have one of those “three strikes” laws in Massachusetts? Personally, I’m a lot more afraid of a dangerous driver than of some drug dealer. I’d think a simple cost-benefit calculation would recommend taking away

3 0.79448992 195 andrew gelman stats-2010-08-09-President Carter

Introduction: This assessment by Tyler Cowen reminded me that, in 1980, I and just about all my friends hated Jimmy Carter. Most of us much preferred him to Reagan but still hated Carter. I wouldn’t associate this with any particular ideological feeling—it’s not that we thought he was too liberal, or too conservative. He just seemed completely ineffectual. I remember feeling at the time that he had no principles, that he’d do anything to get elected. In retrospect, I think of this as an instance of uniform partisan swing: the president was unpopular nationally, and attitudes about him were negative, relatively speaking, among just about every group. My other Carter story comes from a conversation I had a couple years ago with an economist who’s about my age, a man who said that one reason he and his family moved from town A to town B in his metropolitan area was that, in town B, they didn’t feel like they were the only Republicans on their block. Anyway, this guy described himself as a “

4 0.78149599 1707 andrew gelman stats-2013-02-05-Glenn Hubbard and I were on opposite sides of a court case and I didn’t even know it!

Introduction: Matt Taibbi writes : Glenn Hubbard, Leading Academic and Mitt Romney Advisor, Took $1200 an Hour to Be Countrywide’s Expert Witness . . . Hidden among the reams of material recently filed in connection with the lawsuit of monoline insurer MBIA against Bank of America and Countrywide is a deposition of none other than Columbia University’s Glenn Hubbard. . . . Hubbard testified on behalf of Countrywide in the MBIA suit. He conducted an “analysis” that essentially concluded that Countrywide’s loans weren’t any worse than the loans produced by other mortgage originators, and that therefore the monstrous losses that investors in those loans suffered were due to other factors related to the economic crisis – and not caused by the serial misrepresentations and fraud in Countrywide’s underwriting. That’s interesting, because I worked on the other side of this case! I was hired by MBIA’s lawyers. It wouldn’t be polite of me to reveal my consulting rate, and I never actually got depose

5 0.77987474 1789 andrew gelman stats-2013-04-05-Elites have alcohol problems too!

Introduction: Speaking of Tyler Cowen, I’m puzzled by this paragraph of his: Guns, like alcohol, have many legitimate uses, and they are enjoyed by many people in a responsible manner. In both cases, there is an elite which has absolutely no problems handling the institution in question, but still there is the question of whether the nation really can have such bifurcated social norms, namely one set of standards for the elite and another set for everybody else. I don’t know anything about guns so I’ll set that part aside. My bafflement is with the claim that “there is an elite which has absolutely no problem handling [alcohol].” Is he kidding? Unless Cowen is circularly defining “an elite” as the subset of elites who don’t have an alcohol problem, I don’t buy this claim. And I actually think it’s a serious problem, that various “elites” are so sure that they have “absolutely no problem” that they do dangerous, dangerous things. Consider the notorious incident when Dick Cheney shot a

6 0.77964914 1621 andrew gelman stats-2012-12-13-Puzzles of criminal justice

7 0.76571214 707 andrew gelman stats-2011-05-12-Human nature can’t be changed (except when it can)

8 0.75470549 812 andrew gelman stats-2011-07-21-Confusion about “rigging the numbers,” the support of ideological opposites, who’s a 501(c)(3), and the asymmetry of media bias

9 0.75035387 1370 andrew gelman stats-2012-06-07-Duncan Watts and the Titanic

10 0.74680042 594 andrew gelman stats-2011-02-28-Behavioral economics doesn’t seem to have much to say about marriage

11 0.74662358 688 andrew gelman stats-2011-04-30-Why it’s so relaxing to think about social issues

12 0.74288273 1039 andrew gelman stats-2011-12-02-I just flew in from the econ seminar, and boy are my arms tired

13 0.74286747 708 andrew gelman stats-2011-05-12-Improvement of 5 MPG: how many more auto deaths?

14 0.7346943 2341 andrew gelman stats-2014-05-20-plus ça change, plus c’est la même chose

15 0.73385227 335 andrew gelman stats-2010-10-11-How to think about Lou Dobbs

16 0.73228991 1646 andrew gelman stats-2013-01-01-Back when fifty years was a long time ago

17 0.73062533 416 andrew gelman stats-2010-11-16-Is parenting a form of addiction?

18 0.72918236 683 andrew gelman stats-2011-04-28-Asymmetry in Political Bias

19 0.72913069 2053 andrew gelman stats-2013-10-06-Ideas that spread fast and slow

20 0.7279678 284 andrew gelman stats-2010-09-18-Continuing efforts to justify false “death panels” claim


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(0, 0.025), (16, 0.454), (21, 0.015), (24, 0.075), (31, 0.013), (50, 0.013), (55, 0.02), (77, 0.021), (99, 0.247)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.98876488 1115 andrew gelman stats-2012-01-12-Where are the larger-than-life athletes?

Introduction: Jonathan Cantor points to this poll estimating rifle-armed QB Tim Tebow as America’s favorite pro athlete: In an ESPN survey of 1,502 Americans age 12 or older, three percent identified Tebow as their favorite professional athlete. Tebow finished in front of Kobe Bryant (2 percent), Aaron Rodgers (1.9 percent), Peyton Manning (1.8 percent), and Tom Brady (1.5 percent). Amusing. What this survey says to me is that there are no super-popular athletes who are active in America today. Which actually sounds about right. No Tiger Woods, no Magic Johnson, Muhammed Ali, John Elway, Pete Rose, Billie Jean King, etc etc. Tebow is an amusing choice, people might as well pick him now while he’s still on top. As a sports celeb, he’s like Bill Lee or the Refrigerator: colorful and a solid pro athlete, but no superstar. When you think about all the colorful superstar athletes of times gone by, it’s perhaps surprising that there’s nobody out there right now to play the role. I supp

2 0.9877528 572 andrew gelman stats-2011-02-14-Desecration of valuable real estate

Introduction: Malecki asks: Is this the worst infographic ever to appear in NYT? USA Today is not something to aspire to. To connect to some of our recent themes , I agree this is a pretty horrible data display. But it’s not bad as a series of images. Considering the competition to be a cartoon or series of photos, these images aren’t so bad. One issue, I think, is that designers get credit for creativity and originality (unusual color combinations! Histogram bars shaped like mosques!) , which is often the opposite of what we want in a clear graph. It’s Martin Amis vs. George Orwell all over again.

3 0.98482943 1366 andrew gelman stats-2012-06-05-How do segregation measures change when you change the level of aggregation?

Introduction: In a discussion of workplace segregation, Philip Cohen posts some graphs that led me to a statistical question. I’ll pose my question below, but first the graphs: In a world of zero segregation of jobs by sex, the top graph above would have a spike at 50% (or, whatever the actual percentage is of women in the labor force) and, in the bottom graph, the pink and blue lines would be in the same place and would look like very steep S curves. The difference between the pink and blue lines represents segregation by job. One thing I wonder is how these graphs would change if we redefine occupation. (For example, is my occupation “mathematical scientist,” “statistician,” “teacher,” “university professor,” “statistics professor,” or “tenured statistics professor”?) Finer or coarser classification would give different results, and I wonder how this would work. This is not at all meant as a criticism of Cohen’s claims, it’s just a statistical question. I’m guessing that

4 0.98345649 1014 andrew gelman stats-2011-11-16-Visualizations of NYPD stop-and-frisk data

Introduction: Cathy O’Neil organized this visualization project with NYPD stop-and-frisk data. It’s part of the Data Without Borders project. Unfortunately, because of legal restrictions I couldn’t send them the data Jeff, Alex, and I used in our project several years ago.

5 0.98318899 528 andrew gelman stats-2011-01-21-Elevator shame is a two-way street

Introduction: Tyler Cowen links a blog by Samuel Arbesman mocking people who are so lazy that they take the elevator from 1 to 2. This reminds me of my own annoyance about a guy who worked in my building and did not take the elevator. (For the full story, go here and search on “elevator.”)

6 0.98004377 1279 andrew gelman stats-2012-04-24-ESPN is looking to hire a research analyst

7 0.97921312 1659 andrew gelman stats-2013-01-07-Some silly things you (didn’t) miss by not reading the sister blog

8 0.97889256 1304 andrew gelman stats-2012-05-06-Picking on Stephen Wolfram

9 0.97724271 1180 andrew gelman stats-2012-02-22-I’m officially no longer a “rogue”

10 0.97161317 1487 andrew gelman stats-2012-09-08-Animated drought maps

11 0.96399283 1330 andrew gelman stats-2012-05-19-Cross-validation to check missing-data imputation

12 0.95663744 1025 andrew gelman stats-2011-11-24-Always check your evidence

13 0.95658088 1598 andrew gelman stats-2012-11-30-A graphics talk with no visuals!

14 0.95170671 445 andrew gelman stats-2010-12-03-Getting a job in pro sports… as a statistician

15 0.95015293 398 andrew gelman stats-2010-11-06-Quote of the day

16 0.94432378 700 andrew gelman stats-2011-05-06-Suspicious pattern of too-strong replications of medical research

17 0.9351173 1156 andrew gelman stats-2012-02-06-Bayesian model-building by pure thought: Some principles and examples

18 0.92343843 1026 andrew gelman stats-2011-11-25-Bayes wikipedia update

19 0.92015231 609 andrew gelman stats-2011-03-13-Coauthorship norms

20 0.91733021 1168 andrew gelman stats-2012-02-14-The tabloids strike again