andrew_gelman_stats andrew_gelman_stats-2011 andrew_gelman_stats-2011-987 knowledge-graph by maker-knowledge-mining

987 andrew gelman stats-2011-11-02-How Khan Academy is using Machine Learning to Assess Student Mastery

meta infos for this blog

Source: html

Introduction: This is sooooo cool. The actual statistical methods they are using are pretty crude, but that’s fine. What’s important is their focus on the important goal. It’s sort of like Bill James or Nate Silver: if you’re using good information, and you’re focused on good questions, then the fancy statistics can come later (or from others). In most educational efforts I know of (including my own), very little is done to target assessments to improvements for individual students. I really like what they’re doing here and it reminds me how I want to figure out how to do something similar in my own teaching and course materials.

Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 The actual statistical methods they are using are pretty crude, but that’s fine. [sent-2, score-0.52]

2 What’s important is their focus on the important goal. [sent-3, score-0.476]

3 It’s sort of like Bill James or Nate Silver: if you’re using good information, and you’re focused on good questions, then the fancy statistics can come later (or from others). [sent-4, score-1.22]

4 In most educational efforts I know of (including my own), very little is done to target assessments to improvements for individual students. [sent-5, score-1.426]

5 I really like what they’re doing here and it reminds me how I want to figure out how to do something similar in my own teaching and course materials. [sent-6, score-0.913]

similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('sooooo', 0.293), ('assessments', 0.251), ('fancy', 0.235), ('silver', 0.218), ('crude', 0.217), ('improvements', 0.215), ('nate', 0.213), ('materials', 0.213), ('educational', 0.201), ('target', 0.199), ('efforts', 0.181), ('re', 0.181), ('focused', 0.176), ('important', 0.171), ('reminds', 0.166), ('james', 0.159), ('bill', 0.155), ('teaching', 0.145), ('using', 0.14), ('actual', 0.135), ('focus', 0.134), ('figure', 0.129), ('later', 0.129), ('individual', 0.124), ('similar', 0.113), ('questions', 0.113), ('good', 0.111), ('including', 0.107), ('done', 0.102), ('little', 0.099), ('methods', 0.098), ('come', 0.098), ('course', 0.096), ('others', 0.096), ('information', 0.089), ('pretty', 0.082), ('sort', 0.077), ('like', 0.073), ('statistics', 0.07), ('want', 0.069), ('something', 0.065), ('statistical', 0.065), ('really', 0.057), ('know', 0.054)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 987 andrew gelman stats-2011-11-02-How Khan Academy is using Machine Learning to Assess Student Mastery

2 0.20698065 1634 andrew gelman stats-2012-12-21-Two reviews of Nate Silver’s new book, from Kaiser Fung and Cathy O’Neil

Introduction: People keep asking me what I think of Nate’s book, and I keep replying that, as a blogger, I’m spoiled. I’m so used to getting books for free that I wouldn’t go out and buy a book just for the purpose of reviewing it. (That reminds me that I should post reviews of some of those books I’ve received in the mail over the past few months.) I have, however, encountered a couple of reviews of The Signal and the Noise so I thought I’d pass them on to you. Both these reviews are by statisticians / data scientists who work here in NYC in the non-academic “real world” so in that sense they are perhaps better situated than me to review the book (also, they have not collaborated with Nate so they have no conflict of interest). Kaiser Fung gives a positive review : It is in the subtitle—“why so many predictions fail – but some don’t”—that one learns the core philosophy of Silver: he is most concerned with the honest evaluation of the performance of predictive models. The failure to look

3 0.18137936 131 andrew gelman stats-2010-07-07-A note to John

Introduction: Jeff the Productivity Sapper points me to this insulting open letter to Nate Silver written by pollster John Zogby. I’ll go through bits of Zogby’s note line by line. (Conflict of interest warning: I have collaborated with Nate and I blog on his site). Zogby writes: Here is some advice from someone [Zogby] who has been where you [Silver] are today. Sorry, John. (I can call you that, right? Since you’re calling Nate “Nate”?). Yes, you were once the hot pollster. But, no, you were never where Nate is today. Don’t kid yourself. Zogby writes: You [Nate] are hot right now – using an aggregate of other people’s work, you got 49 of 50 states right in 2008. Yes, Nate used other people’s work. That’s what’s called “making use of available data.” Or, to use a more technical term employed in statistics, it’s called “not being an idiot.” Only in the wacky world of polling are you supposed to draw inferences about the U.S.A. using only a single survey organization. I do

4 0.13758206 697 andrew gelman stats-2011-05-05-A statistician rereads Bill James

Introduction: Ben Lindbergh invited me to write an article for Baseball Prospectus. I first sent him this item on the differences between baseball and politics but he said it was too political for them. I then sent him this review of a book on baseball’s greatest fielders but he said they already had someone slotted to review that book. Then I sent him some reflections on the great Bill James and he published it ! If anybody out there knows Bill James, please send this on to him: I have some questions at the end that I’m curious about. Here’s how it begins: I read my first Bill James book in 1984, took my first statistics class in 1985, and began graduate study in statistics the next year. Besides giving me the opportunity to study with the best applied statistician of the late 20th century (Don Rubin) and the best theoretical statistician of the early 21st (Xiao-Li Meng), going to graduate school at Harvard in 1986 gave me the opportunity to sit in a basement room one evening that

5 0.13687511 364 andrew gelman stats-2010-10-22-Politics is not a random walk: Momentum and mean reversion in polling

Introduction: Nate Silver and Justin Wolfers are having a friendly blog-dispute about momentum in political polling. Nate and Justin each make good points but are also missing parts of the picture. These questions relate to my own research so I thought I’d discuss them here. There ain’t no mo’ Nate led off the discussion by writing that pundits are always talking about “momentum” in the polls: Turn on the news or read through much of the analysis put out by some of our friends, and you’re likely to hear a lot of talk about “momentum”: the term is used about 60 times per day by major media outlets in conjunction with articles about polling. When people say a particular candidate has momentum, what they are implying is that present trends are likely to perpetuate themselves into the future. Say, for instance, that a candidate trailed by 10 points in a poll three weeks ago — and now a new poll comes out showing the candidate down by just 5 points. It will frequently be said that this

6 0.13527994 391 andrew gelman stats-2010-11-03-Some thoughts on election forecasting

7 0.12489952 541 andrew gelman stats-2011-01-27-Why can’t I be more like Bill James, or, The use of default and default-like models

8 0.12262681 1419 andrew gelman stats-2012-07-17-“Faith means belief in something concerning which doubt is theoretically possible.” — William James

9 0.11634918 300 andrew gelman stats-2010-09-28-A calibrated Cook gives Dems the edge in Nov, sez Sandy

10 0.11284347 270 andrew gelman stats-2010-09-12-Comparison of forecasts for the 2010 congressional elections

11 0.11148772 440 andrew gelman stats-2010-12-01-In defense of jargon

12 0.1091035 611 andrew gelman stats-2011-03-14-As the saying goes, when they argue that you’re taking over, that’s when you know you’ve won

13 0.10410376 1544 andrew gelman stats-2012-10-22-Is it meaningful to talk about a probability of “65.7%” that Obama will win the election?

14 0.10066773 61 andrew gelman stats-2010-05-31-A data visualization manifesto

15 0.100575 1610 andrew gelman stats-2012-12-06-Yes, checking calibration of probability forecasts is part of Bayesian statistics

16 0.098193042 2033 andrew gelman stats-2013-09-23-More on Bayesian methods and multilevel modeling

17 0.093454078 423 andrew gelman stats-2010-11-20-How to schedule projects in an introductory statistics course?

18 0.090823501 642 andrew gelman stats-2011-04-02-Bill James and the base-rate fallacy

19 0.090445146 33 andrew gelman stats-2010-05-14-Felix Salmon wins the American Statistical Association’s Excellence in Statistical Reporting Award

20 0.090117954 2172 andrew gelman stats-2014-01-14-Advice on writing research articles

similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.154), (1, -0.019), (2, -0.029), (3, 0.039), (4, 0.039), (5, 0.06), (6, -0.025), (7, 0.049), (8, 0.0), (9, 0.002), (10, 0.047), (11, -0.001), (12, 0.037), (13, -0.064), (14, -0.019), (15, -0.016), (16, -0.015), (17, 0.032), (18, 0.031), (19, -0.061), (20, -0.019), (21, 0.035), (22, -0.015), (23, 0.083), (24, 0.007), (25, 0.026), (26, -0.004), (27, -0.037), (28, -0.01), (29, -0.002), (30, 0.039), (31, 0.043), (32, 0.059), (33, -0.051), (34, -0.024), (35, 0.086), (36, 0.033), (37, 0.017), (38, -0.006), (39, -0.049), (40, 0.084), (41, 0.082), (42, -0.06), (43, 0.001), (44, -0.05), (45, 0.042), (46, -0.011), (47, -0.037), (48, -0.005), (49, 0.046)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.94917887 987 andrew gelman stats-2011-11-02-How Khan Academy is using Machine Learning to Assess Student Mastery

2 0.75258285 697 andrew gelman stats-2011-05-05-A statistician rereads Bill James

3 0.72048807 623 andrew gelman stats-2011-03-21-Baseball’s greatest fielders

Introduction: Someone just stopped by and dropped off a copy of the book Wizardry: Baseball’s All-time Greatest Fielders Revealed, by Michael Humphreys. I don’t have much to say about the topic–I did see Brooks Robinson play, but I don’t remember any fancy plays. I must have seen Mark Belanger but I don’t really recall. Ozzie Smith was cool but I saw only him on TV. The most impressive thing I ever saw live was Rickey Henderson stealing a base. The best thing about that was that everyone was expecting him to steal the base, and he still was able to do it. But that wasn’t fielding either. Anyway, Humphreys was nice enough to give me a copy of his book, and since I can’t say much (I didn’t have it in me to study the formulas in detail, nor do I know enough to be able to evaluate them), I might as well say what I can say right away. (Note: Humphreys replies to some of these questions in a comment .) 1. Near the beginning, Humphreys says that 10 runs are worth about 1 win. I’ve always b

4 0.69932199 131 andrew gelman stats-2010-07-07-A note to John

5 0.69831282 642 andrew gelman stats-2011-04-02-Bill James and the base-rate fallacy

Introduction: I was recently rereading and enjoying Bill James’s Historical Baseball Abstract (the second edition, from 2001). But even the Master is not perfect. Here he is, in the context of the all-time 20th-greatest shortstop (in his reckoning): Are athletes special people? In general, no, but occasionally, yes. Johnny Pesky at 75 was trim, youthful, optimistic, and practically exploding with energy. You rarely meet anybody like that who isn’t an ex-athlete–and that makes athletes seem special. [italics in the original] Hey, I’ve met 75-year-olds like that–and none of them are ex-athletes! That’s probably because I don’t know a lot of ex-athletes. But Bill James . . . he knows a lot of athletes. He went to the bathroom with Tim Raines once! The most I can say is that I saw Rickey Henderson steal a couple bases when he was playing against the Orioles once. Cognitive psychologists talk about the base-rate fallacy , which is the mistake of estimating probabilities without accou

6 0.68055302 1419 andrew gelman stats-2012-07-17-“Faith means belief in something concerning which doubt is theoretically possible.” — William James

7 0.67728907 541 andrew gelman stats-2011-01-27-Why can’t I be more like Bill James, or, The use of default and default-like models

8 0.66562104 1637 andrew gelman stats-2012-12-24-Textbook for data visualization?

9 0.65974021 364 andrew gelman stats-2010-10-22-Politics is not a random walk: Momentum and mean reversion in polling

10 0.63987583 481 andrew gelman stats-2010-12-22-The Jumpstart financial literacy survey and the different purposes of tests

11 0.62898898 1625 andrew gelman stats-2012-12-15-“I coach the jumpers here at Boise State . . .”

12 0.62596744 499 andrew gelman stats-2011-01-03-5 books

13 0.62095404 949 andrew gelman stats-2011-10-10-Grrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr

14 0.61827815 652 andrew gelman stats-2011-04-07-Minor-league Stats Predict Major-league Performance, Sarah Palin, and Some Differences Between Baseball and Politics

15 0.60705113 509 andrew gelman stats-2011-01-09-Chartjunk, but in a good cause!

16 0.60381365 45 andrew gelman stats-2010-05-20-Domain specificity: Does being really really smart or really really rich qualify you to make economic policy?

17 0.59868348 300 andrew gelman stats-2010-09-28-A calibrated Cook gives Dems the edge in Nov, sez Sandy

18 0.59846896 76 andrew gelman stats-2010-06-09-Both R and Stata

19 0.59787208 440 andrew gelman stats-2010-12-01-In defense of jargon

20 0.59704471 173 andrew gelman stats-2010-07-31-Editing and clutch hitting

similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(0, 0.031), (5, 0.062), (16, 0.054), (21, 0.059), (24, 0.133), (61, 0.043), (65, 0.082), (99, 0.407)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.98297572 987 andrew gelman stats-2011-11-02-How Khan Academy is using Machine Learning to Assess Student Mastery

2 0.97316319 416 andrew gelman stats-2010-11-16-Is parenting a form of addiction?

Introduction: The last time we encountered Slate columnist Shankar Vedantam was when he puzzled over why slightly more than half of voters planned to vote for Republican candidates, given that polls show that Americans dislike the Republican Party even more than they dislike the Democrats. Vedantam attributed the new Republican majority to irrationality and “unconscious bias.” But, actually, this voting behavior is perfectly consistent with there being some moderate voters who prefer divided government. The simple, direct explanation (which Vedantam mistakenly dismisses) actually works fine. I was flipping through Slate today and noticed a new article by Vedantam headlined, “If parenthood sucks, why do we love it? Because we’re addicted.” I don’t like this one either. Vedantam starts by reviewing the evidence that people with kids are less happy than people without kids and that parents report that they are unhappy when they are around their young children. Given this, Vedantam asks

3 0.97269428 1819 andrew gelman stats-2013-04-23-Charles Murray’s “Coming Apart” and the measurement of social and political divisions

Introduction: Following up on our blog discussions a year ago, I published a review of Charles Murray’s recent book, “Coming Apart,” for the journal Statistics, Politics, and Policy. I invited Murray to publish a response, and he did so. Here’s the abstract to my review : This article examines some claims made in a recent popular book of political sociology, with the intent not being to debunk any claims but rather to connect some important social and policy positions to statistical data on income, social class, and political attitudes. The thesis of Charles Murray’s book is that America’s upper and lower classes have become increasingly separate, with elites living more disciplined, orderly lives (characterized by marriage, work, and stable families) while being largely unaware of the lifestyles of the majority of Americans. I argue that some of Murray’s conclusions are sensitive to particular choices of whom to label as elite or upper-class. From my analysis of survey data, I see the big

4 0.97074395 1811 andrew gelman stats-2013-04-18-Psychology experiments to understand what’s going on with data graphics?

Introduction: Ricardo Pietrobon writes, regarding my post from last year on attitudes toward data graphics, Wouldn’t it be the case to start formally studying the usability of graphics from a cognitive perspective? with platforms such as the mechanical turk it should be fairly straightforward to test alternative methods and come to some conclusions about what might be more informative and what might better assist in supporting decisions. btw, my guess is that these two constructs might not necessarily agree with each other. And Jessica Hullman provides some background: Measuring success for the different goals that you hint at in your article is indeed challenging, and I don’t think that most visualization researchers would claim to have met this challenge (myself included). Visualization researchers may know the user psychology well when it comes to certain dimensions of a graph’s effectiveness (such as quick and accurate responses), but I wouldn’t agree with this statement as a gene

5 0.97064728 2061 andrew gelman stats-2013-10-14-More on Mister P and how it does what it does

Introduction: Following up on our discussion the other day, Matt Buttice and Ben Highton write: It was nice to see our article mentioned and discussed by Andrew, Jeff Lax, Justin Phillips, and Yair Ghitza on Andrew’s blog in this post on Wednesday. As noted in the post, we recently published an article in Political Analysis on how well multilevel regression and poststratification (MRP) performs at producing estimates of state opinion with conventional national surveys where N≈1,500. Our central claims are that (i) the performance of MRP is highly variable, (ii) in the absence of knowing the true values, it is difficult to determine the quality of the MRP estimates produced on the basis of a single national sample, and, (iii) therefore, our views about the usefulness of MRP in instances where a researcher has a single sample of N≈1,500 are less optimistic than the ones expressed in previous research on the topic. Obviously we were interested in the blog posts. We found them stimulating

6 0.96996272 2056 andrew gelman stats-2013-10-09-Mister P: What’s its secret sauce?

7 0.96942031 1527 andrew gelman stats-2012-10-10-Another reason why you can get good inferences from a bad model

8 0.96811879 2251 andrew gelman stats-2014-03-17-In the best alternative histories, the real world is what’s ultimately real

9 0.96710557 10 andrew gelman stats-2010-04-29-Alternatives to regression for social science predictions

10 0.96572798 2292 andrew gelman stats-2014-04-15-When you believe in things that you don’t understand

11 0.96530956 100 andrew gelman stats-2010-06-19-Unsurprisingly, people are more worried about the economy and jobs than about deficits

12 0.96512121 2142 andrew gelman stats-2013-12-21-Chasing the noise

13 0.9649502 1634 andrew gelman stats-2012-12-21-Two reviews of Nate Silver’s new book, from Kaiser Fung and Cathy O’Neil

14 0.96461838 1452 andrew gelman stats-2012-08-09-Visually weighting regression displays

15 0.9639833 2091 andrew gelman stats-2013-11-06-“Marginally significant”

16 0.96332765 2286 andrew gelman stats-2014-04-08-Understanding Simpson’s paradox using a graph

17 0.96218336 1735 andrew gelman stats-2013-02-24-F-f-f-fake data

18 0.96199965 654 andrew gelman stats-2011-04-09-There’s no evidence that voters choose presidential candidates based on their looks

19 0.96192622 131 andrew gelman stats-2010-07-07-A note to John

20 0.96189451 1021 andrew gelman stats-2011-11-21-Don’t judge a book by its title