andrew_gelman_stats andrew_gelman_stats-2013 andrew_gelman_stats-2013-1931 knowledge-graph by maker-knowledge-mining
Introduction: Mike Jordan sends along this National Academies report on “big data.” This is not a research report but it could be interesting in that it conveys what are believed to be important technical challenges.
sentIndex sentText sentNum sentScore
1 Mike Jordan sends along this National Academies report on “big data.” [sent-1, score-0.716]
2 This is not a research report but it could be interesting in that it conveys what are believed to be important technical challenges. [sent-2, score-1.566]
wordName wordTfidf (topN-words)
[('academies', 0.462), ('conveys', 0.357), ('jordan', 0.345), ('report', 0.324), ('believed', 0.273), ('mike', 0.272), ('challenges', 0.259), ('sends', 0.24), ('technical', 0.219), ('national', 0.186), ('along', 0.152), ('big', 0.123), ('interesting', 0.118), ('important', 0.117), ('research', 0.091), ('could', 0.067)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 1931 andrew gelman stats-2013-07-09-“Frontiers in Massive Data Analysis”
2 0.26153511 686 andrew gelman stats-2011-04-29-What are the open problems in Bayesian statistics??
Introduction: Follow the discussion (originated by Mike Jordan) at the Statistics Forum.
3 0.1602208 357 andrew gelman stats-2010-10-20-Sas and R
Introduction: Xian sends along this link that might be of interest to some of you.
4 0.154351 1660 andrew gelman stats-2013-01-08-Bayesian, Permutable Symmetries
Introduction: Mike Betancourt sends along this paper. Could be interesting, no? Note the heavy tail on the CDF in Figure 3, exhibiting weakened median time since 1999. And, as you can see from the bibliography, the work draws on a variety of sources:
5 0.13008526 1725 andrew gelman stats-2013-02-17-“1.7%” ha ha ha
Introduction: Jordan Ellenberg writes: Lots of people sharing this today. Isn’t this exactly the kind of situation where they should have done some kind of shrinkage towards the national mean, as in that thing you wrote about kidney cancer rates by county? i.e. you see, just as you might expect, the extreme values of “proportion of people who said they were gay” are disproportionately taken by small states. My reply: If I don’t have the individual-level survey data that would allow me to do full-scale Mister P, yes, I’d fit a multilevel model to the state-level averages. I wouldn’t quite just partially pool toward the national mean; I think it would make sense to include some state-level predictors. In any case, I think it’s tacky to report poll numbers to fractional percentage points. That kind of precision simply isn’t there. P.S. More discussion of variances of large and small states in the comments.
6 0.12964478 1405 andrew gelman stats-2012-07-04-“Titanic Thompson: The Man Who Would Bet on Everything”
7 0.12859634 1106 andrew gelman stats-2012-01-08-Intro to splines—with cool graphs
8 0.11824794 1487 andrew gelman stats-2012-09-08-Animated drought maps
9 0.10924346 2101 andrew gelman stats-2013-11-15-BDA class 4 G+ hangout on air is on air
10 0.10361481 290 andrew gelman stats-2010-09-22-Data Thief
11 0.090966202 1158 andrew gelman stats-2012-02-07-The more likely it is to be X, the more likely it is to be Not X?
12 0.090018295 869 andrew gelman stats-2011-08-24-Mister P in Stata
13 0.083036676 522 andrew gelman stats-2011-01-18-Problems with Haiti elections?
14 0.082193717 1798 andrew gelman stats-2013-04-11-Continuing conflict over conflict statistics
15 0.081909545 1003 andrew gelman stats-2011-11-11-$
16 0.081896529 749 andrew gelman stats-2011-06-06-“Sampling: Design and Analysis”: a course for political science graduate students
17 0.075072609 1152 andrew gelman stats-2012-02-03-Web equation
18 0.073168129 307 andrew gelman stats-2010-09-29-“Texting bans don’t reduce crashes; effects are slight crash increases”
19 0.069165595 531 andrew gelman stats-2011-01-22-Third-party Dream Ticket
20 0.068710625 563 andrew gelman stats-2011-02-07-Evaluating predictions of political events
topicId topicWeight
[(0, 0.059), (1, -0.033), (2, 0.0), (3, -0.013), (4, 0.01), (5, 0.003), (6, -0.03), (7, -0.01), (8, -0.028), (9, 0.004), (10, 0.013), (11, -0.033), (12, 0.022), (13, 0.011), (14, -0.041), (15, 0.005), (16, 0.008), (17, 0.038), (18, 0.01), (19, -0.055), (20, -0.031), (21, 0.027), (22, 0.009), (23, -0.016), (24, -0.025), (25, -0.015), (26, 0.002), (27, -0.07), (28, 0.029), (29, 0.01), (30, -0.017), (31, 0.068), (32, 0.01), (33, -0.059), (34, 0.001), (35, -0.022), (36, 0.034), (37, -0.039), (38, 0.066), (39, 0.066), (40, -0.0), (41, 0.017), (42, 0.051), (43, 0.003), (44, 0.036), (45, -0.017), (46, -0.07), (47, 0.042), (48, 0.028), (49, 0.015)]
simIndex simValue blogId blogTitle
same-blog 1 0.96824306 1931 andrew gelman stats-2013-07-09-“Frontiers in Massive Data Analysis”
2 0.7071659 1798 andrew gelman stats-2013-04-11-Continuing conflict over conflict statistics
Introduction: Mike Spagat sends along a serious presentation with an ironic title: 18.7 MILLION ANNIHILATED SAYS LEADING EXPERT IN PEER–REVIEWED JOURNAL: AN APPROVED, AUTHORITATIVE, SCIENTIFIC PRESENTATION MADE BY AN EXPERT He’ll be speaking on it at tomorrow’s meeting of the Catastrophes and Conflict Forum of the Royal Society of Medicine in London. All I can say is, it’s a long time since I’ve seen a slide presentation in portrait form. It brings me back to the days of transparency sheets.
3 0.69620621 1660 andrew gelman stats-2013-01-08-Bayesian, Permutable Symmetries
Introduction: Mike Betancourt sends along this paper. Could be interesting, no? Note the heavy tail on the CDF in Figure 3, exhibiting weakened median time since 1999. And, as you can see from the bibliography, the work draws on a variety of sources:
4 0.66857862 357 andrew gelman stats-2010-10-20-Sas and R
Introduction: Xian sends along this link that might be of interest to some of you.
5 0.63615561 1106 andrew gelman stats-2012-01-08-Intro to splines—with cool graphs
Introduction: Ido Rosen pointed me to this page by Mike Kamermans.
6 0.61294186 869 andrew gelman stats-2011-08-24-Mister P in Stata
7 0.61138016 290 andrew gelman stats-2010-09-22-Data Thief
8 0.58980006 531 andrew gelman stats-2011-01-22-Third-party Dream Ticket
9 0.57278991 141 andrew gelman stats-2010-07-12-Dispute over counts of child deaths in Iraq due to sanctions
10 0.56542605 403 andrew gelman stats-2010-11-09-Society for Industrial and Applied Mathematics startup-math meetup
11 0.56203014 1440 andrew gelman stats-2012-08-02-“A Christmas Carol” as applied to plagiarism
13 0.50965542 522 andrew gelman stats-2011-01-18-Problems with Haiti elections?
14 0.50698125 1160 andrew gelman stats-2012-02-09-Familial Linkage between Neuropsychiatric Disorders and Intellectual Interests
15 0.49634945 406 andrew gelman stats-2010-11-10-Translating into Votes: The Electoral Impact of Spanish-Language Ballots
16 0.48810729 450 andrew gelman stats-2010-12-04-The Joy of Stats
17 0.47597858 1689 andrew gelman stats-2013-01-23-MLB Hall of Fame Voting Trajectories
18 0.47183892 1973 andrew gelman stats-2013-08-08-For chrissake, just make up an analysis already! We have a lab here to run, y’know?
19 0.46884254 1288 andrew gelman stats-2012-04-29-Clueless Americans think they’ll never get sick
20 0.4662891 1487 andrew gelman stats-2012-09-08-Animated drought maps
topicId topicWeight
[(10, 0.054), (16, 0.121), (24, 0.05), (29, 0.104), (66, 0.133), (82, 0.078), (99, 0.257)]
simIndex simValue blogId blogTitle
same-blog 1 0.90802568 1931 andrew gelman stats-2013-07-09-“Frontiers in Massive Data Analysis”
2 0.87929773 536 andrew gelman stats-2011-01-24-Trends in partisanship by state
Introduction: Matthew Yglesias discusses how West Virginia used to be a Democratic state but is now solidly Republican. I thought it would be helpful to expand this to look at trends since 1948 (rather than just 1988) and all 50 states (rather than just one). This would represent a bit of work, except that I already did it a couple years ago, so here it is (right-click on the image to see the whole thing): I cheated a bit to get reasonable-looking groupings, for example putting Indiana in the Border South rather than Midwest, and putting Alaska in Mountain West and Hawaii in West Coast. Also, it would help to distinguish states by color (to be able to disentangle New Jersey and Delaware, for example) but we didn’t do this because the book is mostly black and white. In any case, the picture makes it clear that there have been strong regional trends all over during the past sixty years. P.S. My graph comes from Red State Blue State so no 2008 data, but 2008 was pretty much a shift
3 0.87416142 651 andrew gelman stats-2011-04-06-My talk at Northwestern University tomorrow (Thursday)
Introduction: Of Beauty, Sex, and Power: Statistical Challenges in Estimating Small Effects. At the Institute of Policy Research, Thurs 7 Apr 2011, 3.30pm. Regular blog readers know all about this topic. (Here are the slides.) But, rest assured, I don’t just mock. I also offer constructive suggestions. My last talk at Northwestern was fifteen years ago. Actually, I gave two lectures then, in the process of being turned down for a job enjoying their chilly Midwestern hospitality. P.S. I searched on the web and also found this announcement which gives the wrong title.
4 0.8734237 1010 andrew gelman stats-2011-11-14-“Free energy” and economic resources
Introduction: By “free energy” I don’t mean perpetual motion machines, cars that run on water and get 200 mpg, or the latest cold-fusion hype. No, I’m referring to the term from physics. The free energy of a system is, roughly, the amount of energy that can be directly extracted from it. For example, a rock at room temperature is just full of energy—not just the energy locked in its nuclei, but basic thermal energy—but at room temperature you can’t extract any of it. To the physicists in the audience: Yes, I realize that free energy has a technical meaning in statistical mechanics and that my above definition is sloppy. Please bear with me. And, to the non-physicists: feel free to head to Wikipedia or a physics textbook for a more careful treatment. I was thinking about free energy the other day when hearing someone on the radio say something about China bailing out the E.U. I did a double-take. Huh? The E.U. is rich, China’s not so rich. How can a middle-income country bail out a
5 0.87084305 680 andrew gelman stats-2011-04-26-My talk at Berkeley on Wednesday
Introduction: Something on Applied Bayesian Statistics April 27, 4:10-5 p.m., 1011 Evans Hall I will deliver one of the following three talks: 1. Of beauty, sex, and power: Statistical challenges in estimating small effects 2. Why we (usually) don’t worry about multiple comparisons 3. Parameterization and Bayesian modeling Whoever shows up on time to the seminar gets to vote, and I’ll give the talk that gets the most votes.
6 0.86933219 1192 andrew gelman stats-2012-03-02-These people totally don’t know what Chance magazine is all about
7 0.85980451 1271 andrew gelman stats-2012-04-20-Education could use some systematic evaluation
8 0.85111445 193 andrew gelman stats-2010-08-09-Besag
9 0.8503058 1200 andrew gelman stats-2012-03-06-Some economists are skeptical about microfoundations
10 0.8465786 204 andrew gelman stats-2010-08-12-Sloppily-written slam on moderately celebrated writers is amusing nonetheless
11 0.84345788 1344 andrew gelman stats-2012-05-25-Question 15 of my final exam for Design and Analysis of Sample Surveys
12 0.84293175 474 andrew gelman stats-2010-12-18-The kind of frustration we could all use more of
13 0.84273368 1958 andrew gelman stats-2013-07-27-Teaching is hard
14 0.84205496 1034 andrew gelman stats-2011-11-29-World Class Speakers and Entertainers
15 0.84193915 814 andrew gelman stats-2011-07-21-The powerful consumer?
16 0.84127045 1322 andrew gelman stats-2012-05-15-Question 5 of my final exam for Design and Analysis of Sample Surveys
17 0.84086359 1533 andrew gelman stats-2012-10-14-If x is correlated with y, then y is correlated with x
18 0.83975995 1440 andrew gelman stats-2012-08-02-“A Christmas Carol” as applied to plagiarism
19 0.83441007 935 andrew gelman stats-2011-10-01-When should you worry about imputed data?
20 0.8342104 606 andrew gelman stats-2011-03-10-It’s no fun being graded on a curve