andrew_gelman_stats andrew_gelman_stats-2010 andrew_gelman_stats-2010-230 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.
sentIndex sentText sentNum sentScore
1 Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). [sent-1, score-0.631]
2 Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. [sent-2, score-0.721]
3 And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting. [sent-3, score-1.121]
wordName wordTfidf (topN-words)
[('elo', 0.503), ('rating', 0.332), ('forecasting', 0.294), ('intuition', 0.29), ('tourism', 0.251), ('goldbloom', 0.237), ('team', 0.23), ('benchmark', 0.194), ('system', 0.19), ('submission', 0.188), ('anthony', 0.175), ('microsoft', 0.163), ('threshold', 0.154), ('competition', 0.149), ('submitted', 0.147), ('international', 0.136), ('publication', 0.111), ('position', 0.109), ('journal', 0.081), ('best', 0.068), ('better', 0.055), ('using', 0.052), ('writes', 0.038)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 230 andrew gelman stats-2010-08-24-Kaggle forcasting update
Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.
2 0.48820877 216 andrew gelman stats-2010-08-18-More forecasting competitions
Introduction: Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. Already nine teams have created systems that make more accurate predictions than Elo. It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . This part requires participants to forecast the results of 518 different time series. Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper
3 0.093916334 1395 andrew gelman stats-2012-06-27-Cross-validation (What is it good for?)
Introduction: I think cross-validation is a good way to estimate a model’s forecasting error but I don’t think it’s always such a great tool for comparing models. I mean, sure, if the differences are dramatic, ok. But you can easily have a few candidate models, and one model makes a lot more sense than the others (even from a purely predictive sense, I’m not talking about causality here). The difference between the model doesn’t show up in a xval measure of total error but in the patterns of the predictions. For a simple example, imagine using a linear model with positive slope to model a function that is constrained to be increasing. If the constraint isn’t in the model, the predicted/imputed series will sometimes be nonmonotonic. The effect on the prediction error can be so tiny as to be undetectable (or it might even increase avg prediction error to include the constraint); nonetheless, the predictions will be clearly nonsensical. That’s an extreme example but I think the general point h
4 0.0898396 1995 andrew gelman stats-2013-08-23-“I mean, what exact buttons do I have to hit?”
Introduction: This American Life reporter Gabriel Rhodes says : This is one of the big differences between Jon and Anthony, between scientist and non-scientist. For Jon, having a year’s worth of work suddenly thrown into question is a normal day at the office. But for Anthony, that’s not normal. And it’s not OK. The time in Jon’s lab was a year of his life, where he felt like Jon kept moving the goal posts. . . . But now, Anthony wants to know, before he starts turning his life upside down again, what will count as proof enough for Jon? How many experiments? Anthony Holland: So let’s say I do three weeks of experiment, and I only concentrate on these leukemia cells. And if I can kill at least 20% every single time, every week, will that do it? Would that be enough? Or do you want to see pancreatic die, or do you want to see—I mean, what exact buttons do I have to hit? This captures a big problem with the research enterprise, as I see it. There’s this attitude that if you can reach som
5 0.088772558 1174 andrew gelman stats-2012-02-18-Not as ugly as you look
Introduction: Kaiser asks the interesting question: How do you measure what restaurants are “overrated”? You can’t just ask people, right? There’s some sort of social element here, that “overrated” implies that someone’s out there doing the rating.
6 0.084691003 1911 andrew gelman stats-2013-06-23-AI Stats conference on Stan etc.
7 0.084259331 2054 andrew gelman stats-2013-10-07-Bing is preferred to Google by people who aren’t like me
8 0.077616587 1291 andrew gelman stats-2012-04-30-Systematic review of publication bias in studies on publication bias
9 0.069878735 218 andrew gelman stats-2010-08-20-I think you knew this already
10 0.069327012 559 andrew gelman stats-2011-02-06-Bidding for the kickoff
11 0.069201633 2168 andrew gelman stats-2014-01-12-Things that I like that almost nobody else is interested in
12 0.069179691 32 andrew gelman stats-2010-05-14-Causal inference in economics
13 0.069154844 402 andrew gelman stats-2010-11-09-Kaggle: forecasting competitions in the classroom
14 0.069071911 1886 andrew gelman stats-2013-06-07-Robust logistic regression
15 0.068893299 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion
16 0.067803256 2244 andrew gelman stats-2014-03-11-What if I were to stop publishing in journals?
17 0.063783526 1134 andrew gelman stats-2012-01-21-Lessons learned from a recent R package submission
19 0.062920451 1219 andrew gelman stats-2012-03-18-Tips on “great design” from . . . Microsoft!
20 0.061702527 1008 andrew gelman stats-2011-11-13-Student project competition
topicId topicWeight
[(0, 0.044), (1, -0.014), (2, -0.016), (3, -0.017), (4, -0.005), (5, 0.018), (6, -0.01), (7, -0.047), (8, -0.029), (9, 0.02), (10, 0.049), (11, -0.0), (12, -0.027), (13, -0.028), (14, -0.048), (15, -0.002), (16, 0.024), (17, -0.007), (18, 0.005), (19, -0.011), (20, -0.009), (21, 0.064), (22, 0.011), (23, 0.034), (24, 0.014), (25, 0.024), (26, 0.03), (27, 0.004), (28, 0.01), (29, -0.028), (30, -0.011), (31, -0.062), (32, 0.013), (33, 0.002), (34, 0.004), (35, -0.027), (36, 0.007), (37, 0.032), (38, -0.002), (39, 0.032), (40, -0.038), (41, 0.036), (42, -0.012), (43, 0.071), (44, -0.005), (45, -0.016), (46, 0.023), (47, -0.01), (48, 0.001), (49, 0.0)]
simIndex simValue blogId blogTitle
same-blog 1 0.9777633 230 andrew gelman stats-2010-08-24-Kaggle forcasting update
Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.
2 0.68470132 216 andrew gelman stats-2010-08-18-More forecasting competitions
Introduction: Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. Already nine teams have created systems that make more accurate predictions than Elo. It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . This part requires participants to forecast the results of 518 different time series. Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper
3 0.57259458 1911 andrew gelman stats-2013-06-23-AI Stats conference on Stan etc.
Introduction: Jaakko Peltonen writes: The Seventeenth International Conference on Artificial Intelligence and Statistics (http://www.aistats.org) will be next April in Reykjavik, Iceland. AISTATS is an interdisciplinary conference at the intersection of computer science, artificial intelligence, machine learning, statistics, and related areas. ============================================================================== AISTATS 2014 Call for Papers Seventeenth International Conference on Artificial Intelligence and Statistics April 22 – 25, 2014, Reykjavik, Iceland http://www.aistats.org Colocated with a MLSS Machine Learning Summer School ============================================================================== AISTATS is an interdisciplinary gathering of researchers at the intersection of computer science, artificial intelligence, machine learning, statistics, and related areas. Since its inception in 1985, the primary goal of AISTATS has been to broaden research in the
4 0.54682088 1638 andrew gelman stats-2012-12-25-Diving chess
Introduction: Knowing of my interest in Turing run-around-the-house chess , David Lockhart points me to this : Diving Chess is a chess variant, which is played in a swimming pool. Instead of using chess clocks, each player must submerge themselves underwater during their turn, only to resurface when they are ready to make a move. Players must make a move within 5 seconds of resurfacing (they will receive a warning if not, and three warnings will result in a forfeit). Diving Chess was invented by American Chess Master Etan Ilfeld; the very first exhibition game took place between Ilfeld and former British Chess Champion William Hartston at the Thirdspace gym in Soho on August 2nd, 2011. Hartston won the match which lasted almost two hours such that each player was underwater for an entire hour.
5 0.52347058 1818 andrew gelman stats-2013-04-22-Goal: Rules for Turing chess
Introduction: Daniel Murell has more thoughts on Turing chess (last discussed here ): When I played with my brother, we had it that if you managed to lap someone while running around the house, then you got an additional move. This means that if you had the option to take the king on your additional move, you could, and doing so won you the game. He was fitter at the time so he slipped in two additional moves over the course of the game. I still won :) I am much better at him at chess though, so I’m sure he would have beaten me had we been more even. W.r.t. dsquared’s comment and your response, I’m not overly concerned about the first move, because you can enforce that white must reach a halfway point or that some time interval elapse before black makes his first move. This version though does have one significant weakness that is evident to me. If you wait a little for your opponent to return to make his second move in a row against you, you get your breath back. He couldn’t plan for th
6 0.51558095 1118 andrew gelman stats-2012-01-14-A model rejection letter
7 0.50947559 1473 andrew gelman stats-2012-08-28-Turing chess run update
8 0.4961893 1923 andrew gelman stats-2013-07-03-Bayes pays!
9 0.49376857 559 andrew gelman stats-2011-02-06-Bidding for the kickoff
10 0.48981702 1137 andrew gelman stats-2012-01-24-Difficulties in publishing non-replications of implausible findings
11 0.48625863 1828 andrew gelman stats-2013-04-27-Time-Sharing Experiments for the Social Sciences
12 0.47713035 615 andrew gelman stats-2011-03-16-Chess vs. checkers
13 0.47526202 2268 andrew gelman stats-2014-03-26-New research journal on observational studies
14 0.47328797 1774 andrew gelman stats-2013-03-22-Likelihood Ratio ≠ 1 Journal
15 0.46685869 1915 andrew gelman stats-2013-06-27-Huh?
16 0.46472219 634 andrew gelman stats-2011-03-29-A.I. is Whatever We Can’t Yet Automate
17 0.45064464 2239 andrew gelman stats-2014-03-09-Reviewing the peer review process?
18 0.43633458 1272 andrew gelman stats-2012-04-20-More proposals to reform the peer-review system
19 0.43552104 813 andrew gelman stats-2011-07-21-Scrabble!
20 0.43395045 1393 andrew gelman stats-2012-06-26-The reverse-journal-submission system
topicId topicWeight
[(15, 0.09), (16, 0.031), (24, 0.023), (53, 0.053), (77, 0.484), (99, 0.137)]
simIndex simValue blogId blogTitle
same-blog 1 0.96923143 230 andrew gelman stats-2010-08-24-Kaggle forcasting update
Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.
2 0.93962008 74 andrew gelman stats-2010-06-08-“Extreme views weakly held”
Introduction: Alan and Felix .
3 0.76216888 911 andrew gelman stats-2011-09-15-More data tools worth using from Google
Introduction: Speaking of open data and google tools, see this post from Revolution R: How to use a Google Spreadsheet as data in R .
4 0.69613534 1006 andrew gelman stats-2011-11-12-Val’s Number Scroll: Helping kids visualize math
Introduction: This looks cool.
5 0.64005351 1071 andrew gelman stats-2011-12-19-“NYU Professor Claims He Was Fired for Giving James Franco a D”
Introduction: One advantage of teaching statistics is that you don’t have to worry about any celebrities taking your class.
6 0.63738739 1784 andrew gelman stats-2013-04-01-Wolfram on Mandelbrot
8 0.60616875 1684 andrew gelman stats-2013-01-20-Ugly ugly ugly
9 0.60121775 380 andrew gelman stats-2010-10-29-“Bluntly put . . .”
10 0.58995271 978 andrew gelman stats-2011-10-28-Cool job opening with brilliant researchers at Yahoo
12 0.55331278 57 andrew gelman stats-2010-05-29-Roth and Amsterdam
13 0.54622018 1124 andrew gelman stats-2012-01-17-How to map geographically-detailed survey responses?
14 0.53831297 1604 andrew gelman stats-2012-12-04-An epithet I can live with
15 0.49725932 1561 andrew gelman stats-2012-11-04-Someone is wrong on the internet
16 0.47481591 2059 andrew gelman stats-2013-10-12-Visualization, “big data”, and EDA
17 0.46149623 93 andrew gelman stats-2010-06-17-My proposal for making college admissions fairer
18 0.45574027 562 andrew gelman stats-2011-02-06-Statistician cracks Toronto lottery
19 0.44375277 1297 andrew gelman stats-2012-05-03-New New York data research organizations
20 0.43140125 2054 andrew gelman stats-2013-10-07-Bing is preferred to Google by people who aren’t like me