andrew_gelman_stats andrew_gelman_stats-2010 andrew_gelman_stats-2010-230 knowledge-graph by maker-knowledge-mining

230 andrew gelman stats-2010-08-24-Kaggle forcasting update


meta infos for this blog

Source: html

Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). [sent-1, score-0.631]

2 Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. [sent-2, score-0.721]

3 And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting. [sent-3, score-1.121]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('elo', 0.503), ('rating', 0.332), ('forecasting', 0.294), ('intuition', 0.29), ('tourism', 0.251), ('goldbloom', 0.237), ('team', 0.23), ('benchmark', 0.194), ('system', 0.19), ('submission', 0.188), ('anthony', 0.175), ('microsoft', 0.163), ('threshold', 0.154), ('competition', 0.149), ('submitted', 0.147), ('international', 0.136), ('publication', 0.111), ('position', 0.109), ('journal', 0.081), ('best', 0.068), ('better', 0.055), ('using', 0.052), ('writes', 0.038)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 230 andrew gelman stats-2010-08-24-Kaggle forcasting update

Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.

2 0.48820877 216 andrew gelman stats-2010-08-18-More forecasting competitions

Introduction: Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. Already nine teams have created systems that make more accurate predictions than Elo. It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . This part requires participants to forecast the results of 518 different time series. Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper

3 0.093916334 1395 andrew gelman stats-2012-06-27-Cross-validation (What is it good for?)

Introduction: I think cross-validation is a good way to estimate a model’s forecasting error but I don’t think it’s always such a great tool for comparing models. I mean, sure, if the differences are dramatic, ok. But you can easily have a few candidate models, and one model makes a lot more sense than the others (even from a purely predictive sense, I’m not talking about causality here). The difference between the model doesn’t show up in a xval measure of total error but in the patterns of the predictions. For a simple example, imagine using a linear model with positive slope to model a function that is constrained to be increasing. If the constraint isn’t in the model, the predicted/imputed series will sometimes be nonmonotonic. The effect on the prediction error can be so tiny as to be undetectable (or it might even increase avg prediction error to include the constraint); nonetheless, the predictions will be clearly nonsensical. That’s an extreme example but I think the general point h

4 0.0898396 1995 andrew gelman stats-2013-08-23-“I mean, what exact buttons do I have to hit?”

Introduction: This American Life reporter Gabriel Rhodes says : This is one of the big differences between Jon and Anthony, between scientist and non-scientist. For Jon, having a year’s worth of work suddenly thrown into question is a normal day at the office. But for Anthony, that’s not normal. And it’s not OK. The time in Jon’s lab was a year of his life, where he felt like Jon kept moving the goal posts. . . . But now, Anthony wants to know, before he starts turning his life upside down again, what will count as proof enough for Jon? How many experiments? Anthony Holland: So let’s say I do three weeks of experiment, and I only concentrate on these leukemia cells. And if I can kill at least 20% every single time, every week, will that do it? Would that be enough? Or do you want to see pancreatic die, or do you want to see—I mean, what exact buttons do I have to hit? This captures a big problem with the research enterprise, as I see it. There’s this attitude that if you can reach som

5 0.088772558 1174 andrew gelman stats-2012-02-18-Not as ugly as you look

Introduction: Kaiser asks the interesting question: How do you measure what restaurants are “overrated”? You can’t just ask people, right? There’s some sort of social element here, that “overrated” implies that someone’s out there doing the rating.

6 0.084691003 1911 andrew gelman stats-2013-06-23-AI Stats conference on Stan etc.

7 0.084259331 2054 andrew gelman stats-2013-10-07-Bing is preferred to Google by people who aren’t like me

8 0.077616587 1291 andrew gelman stats-2012-04-30-Systematic review of publication bias in studies on publication bias

9 0.069878735 218 andrew gelman stats-2010-08-20-I think you knew this already

10 0.069327012 559 andrew gelman stats-2011-02-06-Bidding for the kickoff

11 0.069201633 2168 andrew gelman stats-2014-01-12-Things that I like that almost nobody else is interested in

12 0.069179691 32 andrew gelman stats-2010-05-14-Causal inference in economics

13 0.069154844 402 andrew gelman stats-2010-11-09-Kaggle: forecasting competitions in the classroom

14 0.069071911 1886 andrew gelman stats-2013-06-07-Robust logistic regression

15 0.068893299 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion

16 0.067803256 2244 andrew gelman stats-2014-03-11-What if I were to stop publishing in journals?

17 0.063783526 1134 andrew gelman stats-2012-01-21-Lessons learned from a recent R package submission

18 0.0634901 1875 andrew gelman stats-2013-05-28-Simplify until your fake-data check works, then add complications until you can figure out where the problem is coming from

19 0.062920451 1219 andrew gelman stats-2012-03-18-Tips on “great design” from . . . Microsoft!

20 0.061702527 1008 andrew gelman stats-2011-11-13-Student project competition


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.044), (1, -0.014), (2, -0.016), (3, -0.017), (4, -0.005), (5, 0.018), (6, -0.01), (7, -0.047), (8, -0.029), (9, 0.02), (10, 0.049), (11, -0.0), (12, -0.027), (13, -0.028), (14, -0.048), (15, -0.002), (16, 0.024), (17, -0.007), (18, 0.005), (19, -0.011), (20, -0.009), (21, 0.064), (22, 0.011), (23, 0.034), (24, 0.014), (25, 0.024), (26, 0.03), (27, 0.004), (28, 0.01), (29, -0.028), (30, -0.011), (31, -0.062), (32, 0.013), (33, 0.002), (34, 0.004), (35, -0.027), (36, 0.007), (37, 0.032), (38, -0.002), (39, 0.032), (40, -0.038), (41, 0.036), (42, -0.012), (43, 0.071), (44, -0.005), (45, -0.016), (46, 0.023), (47, -0.01), (48, 0.001), (49, 0.0)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.9777633 230 andrew gelman stats-2010-08-24-Kaggle forcasting update

Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.

2 0.68470132 216 andrew gelman stats-2010-08-18-More forecasting competitions

Introduction: Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. Already nine teams have created systems that make more accurate predictions than Elo. It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . This part requires participants to forecast the results of 518 different time series. Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper

3 0.57259458 1911 andrew gelman stats-2013-06-23-AI Stats conference on Stan etc.

Introduction: Jaakko Peltonen writes: The Seventeenth International Conference on Artificial Intelligence and Statistics (http://www.aistats.org) will be next April in Reykjavik, Iceland. AISTATS is an interdisciplinary conference at the intersection of computer science, artificial intelligence, machine learning, statistics, and related areas. ============================================================================== AISTATS 2014 Call for Papers Seventeenth International Conference on Artificial Intelligence and Statistics April 22 – 25, 2014, Reykjavik, Iceland http://www.aistats.org Colocated with a MLSS Machine Learning Summer School ============================================================================== AISTATS is an interdisciplinary gathering of researchers at the intersection of computer science, artificial intelligence, machine learning, statistics, and related areas. Since its inception in 1985, the primary goal of AISTATS has been to broaden research in the

4 0.54682088 1638 andrew gelman stats-2012-12-25-Diving chess

Introduction: Knowing of my interest in Turing run-around-the-house chess , David Lockhart points me to this : Diving Chess is a chess variant, which is played in a swimming pool. Instead of using chess clocks, each player must submerge themselves underwater during their turn, only to resurface when they are ready to make a move. Players must make a move within 5 seconds of resurfacing (they will receive a warning if not, and three warnings will result in a forfeit). Diving Chess was invented by American Chess Master Etan Ilfeld; the very first exhibition game took place between Ilfeld and former British Chess Champion William Hartston at the Thirdspace gym in Soho on August 2nd, 2011. Hartston won the match which lasted almost two hours such that each player was underwater for an entire hour.

5 0.52347058 1818 andrew gelman stats-2013-04-22-Goal: Rules for Turing chess

Introduction: Daniel Murell has more thoughts on Turing chess (last discussed here ): When I played with my brother, we had it that if you managed to lap someone while running around the house, then you got an additional move. This means that if you had the option to take the king on your additional move, you could, and doing so won you the game. He was fitter at the time so he slipped in two additional moves over the course of the game. I still won :) I am much better at him at chess though, so I’m sure he would have beaten me had we been more even. W.r.t. dsquared’s comment and your response, I’m not overly concerned about the first move, because you can enforce that white must reach a halfway point or that some time interval elapse before black makes his first move. This version though does have one significant weakness that is evident to me. If you wait a little for your opponent to return to make his second move in a row against you, you get your breath back. He couldn’t plan for th

6 0.51558095 1118 andrew gelman stats-2012-01-14-A model rejection letter

7 0.50947559 1473 andrew gelman stats-2012-08-28-Turing chess run update

8 0.4961893 1923 andrew gelman stats-2013-07-03-Bayes pays!

9 0.49376857 559 andrew gelman stats-2011-02-06-Bidding for the kickoff

10 0.48981702 1137 andrew gelman stats-2012-01-24-Difficulties in publishing non-replications of implausible findings

11 0.48625863 1828 andrew gelman stats-2013-04-27-Time-Sharing Experiments for the Social Sciences

12 0.47713035 615 andrew gelman stats-2011-03-16-Chess vs. checkers

13 0.47526202 2268 andrew gelman stats-2014-03-26-New research journal on observational studies

14 0.47328797 1774 andrew gelman stats-2013-03-22-Likelihood Ratio ≠ 1 Journal

15 0.46685869 1915 andrew gelman stats-2013-06-27-Huh?

16 0.46472219 634 andrew gelman stats-2011-03-29-A.I. is Whatever We Can’t Yet Automate

17 0.45064464 2239 andrew gelman stats-2014-03-09-Reviewing the peer review process?

18 0.43633458 1272 andrew gelman stats-2012-04-20-More proposals to reform the peer-review system

19 0.43552104 813 andrew gelman stats-2011-07-21-Scrabble!

20 0.43395045 1393 andrew gelman stats-2012-06-26-The reverse-journal-submission system


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(15, 0.09), (16, 0.031), (24, 0.023), (53, 0.053), (77, 0.484), (99, 0.137)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.96923143 230 andrew gelman stats-2010-08-24-Kaggle forcasting update

Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.

2 0.93962008 74 andrew gelman stats-2010-06-08-“Extreme views weakly held”

Introduction: Alan and Felix .

3 0.76216888 911 andrew gelman stats-2011-09-15-More data tools worth using from Google

Introduction: Speaking of open data and google tools, see this post from Revolution R: How to use a Google Spreadsheet as data in R .

4 0.69613534 1006 andrew gelman stats-2011-11-12-Val’s Number Scroll: Helping kids visualize math

Introduction: This looks cool.

5 0.64005351 1071 andrew gelman stats-2011-12-19-“NYU Professor Claims He Was Fired for Giving James Franco a D”

Introduction: One advantage of teaching statistics is that you don’t have to worry about any celebrities taking your class.

6 0.63738739 1784 andrew gelman stats-2013-04-01-Wolfram on Mandelbrot

7 0.61783254 1373 andrew gelman stats-2012-06-09-Cognitive psychology research helps us understand confusion of Jonathan Haidt and others about working-class voters

8 0.60616875 1684 andrew gelman stats-2013-01-20-Ugly ugly ugly

9 0.60121775 380 andrew gelman stats-2010-10-29-“Bluntly put . . .”

10 0.58995271 978 andrew gelman stats-2011-10-28-Cool job opening with brilliant researchers at Yahoo

11 0.58066481 1481 andrew gelman stats-2012-09-04-Cool one-day miniconference at Columbia Fri 12 Oct on computational and online social science

12 0.55331278 57 andrew gelman stats-2010-05-29-Roth and Amsterdam

13 0.54622018 1124 andrew gelman stats-2012-01-17-How to map geographically-detailed survey responses?

14 0.53831297 1604 andrew gelman stats-2012-12-04-An epithet I can live with

15 0.49725932 1561 andrew gelman stats-2012-11-04-Someone is wrong on the internet

16 0.47481591 2059 andrew gelman stats-2013-10-12-Visualization, “big data”, and EDA

17 0.46149623 93 andrew gelman stats-2010-06-17-My proposal for making college admissions fairer

18 0.45574027 562 andrew gelman stats-2011-02-06-Statistician cracks Toronto lottery

19 0.44375277 1297 andrew gelman stats-2012-05-03-New New York data research organizations

20 0.43140125 2054 andrew gelman stats-2013-10-07-Bing is preferred to Google by people who aren’t like me