andrew_gelman_stats andrew_gelman_stats-2010 andrew_gelman_stats-2010-216 knowledge-graph by maker-knowledge-mining

216 andrew gelman stats-2010-08-18-More forecasting competitions


meta infos for this blog

Source: html

Introduction: Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. Already nine teams have created systems that make more accurate predictions than Elo. It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . This part requires participants to forecast the results of 518 different time series. Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. [sent-1, score-0.077]

2 Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. [sent-2, score-1.283]

3 Already nine teams have created systems that make more accurate predictions than Elo. [sent-3, score-0.294]

4 It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. [sent-4, score-0.713]

5 However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). [sent-5, score-0.636]

6 Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . [sent-6, score-0.616]

7 This part requires participants to forecast the results of 518 different time series. [sent-7, score-0.205]

8 Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper to the journal describing their methodology and giving their results (provided the winner achieves a certain level of predictive accuracy). [sent-8, score-1.043]

9 Chris Raimondi describes his winning method on the Kaggle blog. [sent-10, score-0.069]

10 On the chess example, I’m not at all surprised that Elo has been outperformed. [sent-12, score-0.26]

11 Published alternatives have been out there for years . [sent-13, score-0.078]

12 We even have a chess example in Bayesian Data Analysis (in the first edition, from 1995), and that in turn is based on earlier work by Glickman. [sent-14, score-0.26]

13 I like this competition idea and would like to propose some ideas of our own, through the Applied Statistics Center. [sent-15, score-0.369]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('elo', 0.485), ('competition', 0.287), ('chess', 0.26), ('kaggle', 0.242), ('surprise', 0.189), ('rob', 0.178), ('winner', 0.162), ('rating', 0.16), ('forecasting', 0.142), ('outperformed', 0.121), ('tourism', 0.121), ('achieves', 0.114), ('crunch', 0.114), ('goldbloom', 0.114), ('creator', 0.109), ('hiv', 0.109), ('promised', 0.1), ('invented', 0.094), ('system', 0.092), ('launched', 0.089), ('nine', 0.087), ('amounts', 0.087), ('anthony', 0.085), ('invite', 0.084), ('performs', 0.084), ('propose', 0.082), ('alternatives', 0.078), ('journal', 0.078), ('recently', 0.077), ('contribute', 0.076), ('teams', 0.075), ('century', 0.073), ('thesis', 0.073), ('forecast', 0.072), ('product', 0.072), ('edition', 0.072), ('official', 0.07), ('methodology', 0.069), ('winning', 0.069), ('systems', 0.067), ('results', 0.067), ('describing', 0.067), ('international', 0.066), ('part', 0.066), ('quickly', 0.065), ('created', 0.065), ('accuracy', 0.065), ('historical', 0.064), ('editor', 0.064), ('algorithm', 0.064)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0000001 216 andrew gelman stats-2010-08-18-More forecasting competitions

Introduction: Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. Already nine teams have created systems that make more accurate predictions than Elo. It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . This part requires participants to forecast the results of 518 different time series. Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper

2 0.48820877 230 andrew gelman stats-2010-08-24-Kaggle forcasting update

Introduction: Anthony Goldbloom writes: The Elo rating system is now in 47th position (team Elo Benchmark on the leaderboard). Team Intuition submitted using Microsoft’s Trueskill rating system – Intuition is in 38th position. And for the tourism forecasting competition, the best submission is doing better than the threshold for publication in the International Journal of Forecasting.

3 0.2621215 402 andrew gelman stats-2010-11-09-Kaggle: forecasting competitions in the classroom

Introduction: Anthony Goldbloom writes: For those who haven’t come across Kaggle, we are a new platform for data prediction competitions. Companies and researchers put up a dataset and a problem and data scientists compete to produce the best solutions. We’ve just launched a new initiative called Kaggle in Class, allowing instructors to host competitions for their students. Competitions are a neat way to engage students, giving them the opportunity to put into practice what they learn. The platform offers live leaderboards, so students get instant feedback on the accuracy of their work. And since competitions are judged on objective criteria (predictions are compared with outcomes), the platform offers unique assessment opportunities. The first Kaggle in Class competition is being hosted by Stanford University’s Stats 202 class and requires students to predict the price of different wines based on vintage, country, ratings and other information. Those interested in hosting a competition f

4 0.17902273 1638 andrew gelman stats-2012-12-25-Diving chess

Introduction: Knowing of my interest in Turing run-around-the-house chess , David Lockhart points me to this : Diving Chess is a chess variant, which is played in a swimming pool. Instead of using chess clocks, each player must submerge themselves underwater during their turn, only to resurface when they are ready to make a move. Players must make a move within 5 seconds of resurfacing (they will receive a warning if not, and three warnings will result in a forfeit). Diving Chess was invented by American Chess Master Etan Ilfeld; the very first exhibition game took place between Ilfeld and former British Chess Champion William Hartston at the Thirdspace gym in Soho on August 2nd, 2011. Hartston won the match which lasted almost two hours such that each player was underwater for an entire hour.

5 0.16617803 615 andrew gelman stats-2011-03-16-Chess vs. checkers

Introduction: Mark Palko writes : Chess derives most of its complexity through differentiated pieces; with checkers the complexity comes from the interaction between pieces. The result is a series of elegant graph problems where the viable paths change with each move of your opponent. To draw an analogy with chess, imagine if moving your knight could allow your opponent’s bishop to move like a rook. Add to that the potential for traps and manipulation that come with forced capture and you have one of the most remarkable games of all time. . . . It’s not unusual to hear masters of both chess and checkers (draughts) to admit that they prefer the latter. So why does chess get all the respect? Why do you never see a criminal mastermind or a Bond villain playing in a checkers tournament? Part of the problem is that we learn the game as children so we tend to think of it as a children’s game. We focus on how simple the rules are and miss how much complexity and subtlety you can get out of those ru

6 0.14749141 1473 andrew gelman stats-2012-08-28-Turing chess run update

7 0.14227733 1008 andrew gelman stats-2011-11-13-Student project competition

8 0.1269805 1818 andrew gelman stats-2013-04-22-Goal: Rules for Turing chess

9 0.1213647 1847 andrew gelman stats-2013-05-08-Of parsing and chess

10 0.11299425 1290 andrew gelman stats-2012-04-30-I suppose it’s too late to add Turing’s run-around-the-house-chess to the 2012 London Olympics?

11 0.093343914 218 andrew gelman stats-2010-08-20-I think you knew this already

12 0.091809958 557 andrew gelman stats-2011-02-05-Call for book proposals

13 0.091622204 1585 andrew gelman stats-2012-11-20-“I know you aren’t the plagiarism police, but . . .”

14 0.085949183 1495 andrew gelman stats-2012-09-13-Win $5000 in the Economist’s data visualization competition

15 0.083946303 1875 andrew gelman stats-2013-05-28-Simplify until your fake-data check works, then add complications until you can figure out where the problem is coming from

16 0.0790537 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion

17 0.078438215 32 andrew gelman stats-2010-05-14-Causal inference in economics

18 0.076313883 982 andrew gelman stats-2011-10-30-“There’s at least as much as an 80 percent chance . . .”

19 0.074545696 1292 andrew gelman stats-2012-05-01-Colorless green facts asserted resolutely

20 0.074469477 1634 andrew gelman stats-2012-12-21-Two reviews of Nate Silver’s new book, from Kaiser Fung and Cathy O’Neil


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.141), (1, -0.021), (2, -0.041), (3, -0.006), (4, -0.004), (5, 0.031), (6, -0.011), (7, -0.015), (8, -0.013), (9, 0.019), (10, 0.068), (11, -0.002), (12, -0.031), (13, -0.036), (14, -0.05), (15, 0.001), (16, 0.024), (17, -0.021), (18, -0.007), (19, -0.008), (20, -0.032), (21, 0.084), (22, -0.006), (23, 0.062), (24, 0.038), (25, 0.053), (26, 0.023), (27, 0.03), (28, 0.031), (29, -0.097), (30, 0.02), (31, -0.098), (32, 0.011), (33, -0.023), (34, 0.023), (35, -0.047), (36, -0.024), (37, 0.043), (38, 0.015), (39, 0.066), (40, -0.065), (41, 0.09), (42, 0.033), (43, 0.007), (44, 0.016), (45, -0.1), (46, 0.053), (47, -0.033), (48, 0.027), (49, 0.024)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.92541194 216 andrew gelman stats-2010-08-18-More forecasting competitions

Introduction: Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. Already nine teams have created systems that make more accurate predictions than Elo. It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . This part requires participants to forecast the results of 518 different time series. Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper

2 0.80725831 1638 andrew gelman stats-2012-12-25-Diving chess

Introduction: Knowing of my interest in Turing run-around-the-house chess , David Lockhart points me to this : Diving Chess is a chess variant, which is played in a swimming pool. Instead of using chess clocks, each player must submerge themselves underwater during their turn, only to resurface when they are ready to make a move. Players must make a move within 5 seconds of resurfacing (they will receive a warning if not, and three warnings will result in a forfeit). Diving Chess was invented by American Chess Master Etan Ilfeld; the very first exhibition game took place between Ilfeld and former British Chess Champion William Hartston at the Thirdspace gym in Soho on August 2nd, 2011. Hartston won the match which lasted almost two hours such that each player was underwater for an entire hour.

3 0.79838759 1473 andrew gelman stats-2012-08-28-Turing chess run update

Introduction: In honor of the Olympics , I got my butt over to the park and played run-around-the-house chess for the first time ever. As was discussed in the comments thread awhile ago , there seem to be three possible ways to play Turing chess: 1. You make your move and run around the house. The other player has to move before you return. Once you sit down to the table, the other player runs around the house. Then you have to move, etc. You lose if you are checkmated or if you fail to move before your opponent returns to his chair. 2. You make your move and run around the house. The other player has to move before you return, but he does not have to wait until you return to start running. He can start running once he’s moved. Then when you get back you have to move before he gets back, but you can start to run once you’ve moved, etc. 3. You make your move and run around the house. The other player takes as long as he wants and makes his move, then he runs. When you return, yo

4 0.79039532 615 andrew gelman stats-2011-03-16-Chess vs. checkers

Introduction: Mark Palko writes : Chess derives most of its complexity through differentiated pieces; with checkers the complexity comes from the interaction between pieces. The result is a series of elegant graph problems where the viable paths change with each move of your opponent. To draw an analogy with chess, imagine if moving your knight could allow your opponent’s bishop to move like a rook. Add to that the potential for traps and manipulation that come with forced capture and you have one of the most remarkable games of all time. . . . It’s not unusual to hear masters of both chess and checkers (draughts) to admit that they prefer the latter. So why does chess get all the respect? Why do you never see a criminal mastermind or a Bond villain playing in a checkers tournament? Part of the problem is that we learn the game as children so we tend to think of it as a children’s game. We focus on how simple the rules are and miss how much complexity and subtlety you can get out of those ru

5 0.78651869 1818 andrew gelman stats-2013-04-22-Goal: Rules for Turing chess

Introduction: Daniel Murell has more thoughts on Turing chess (last discussed here ): When I played with my brother, we had it that if you managed to lap someone while running around the house, then you got an additional move. This means that if you had the option to take the king on your additional move, you could, and doing so won you the game. He was fitter at the time so he slipped in two additional moves over the course of the game. I still won :) I am much better at him at chess though, so I’m sure he would have beaten me had we been more even. W.r.t. dsquared’s comment and your response, I’m not overly concerned about the first move, because you can enforce that white must reach a halfway point or that some time interval elapse before black makes his first move. This version though does have one significant weakness that is evident to me. If you wait a little for your opponent to return to make his second move in a row against you, you get your breath back. He couldn’t plan for th

6 0.73616481 230 andrew gelman stats-2010-08-24-Kaggle forcasting update

7 0.69589382 1847 andrew gelman stats-2013-05-08-Of parsing and chess

8 0.6669488 218 andrew gelman stats-2010-08-20-I think you knew this already

9 0.64705873 813 andrew gelman stats-2011-07-21-Scrabble!

10 0.63674057 1467 andrew gelman stats-2012-08-23-The pinch-hitter syndrome again

11 0.63429981 634 andrew gelman stats-2011-03-29-A.I. is Whatever We Can’t Yet Automate

12 0.63141972 1290 andrew gelman stats-2012-04-30-I suppose it’s too late to add Turing’s run-around-the-house-chess to the 2012 London Olympics?

13 0.5999459 2105 andrew gelman stats-2013-11-18-What’s my Kasparov number?

14 0.56079865 1903 andrew gelman stats-2013-06-17-Weak identification provides partial information

15 0.53371674 1708 andrew gelman stats-2013-02-05-Wouldn’t it be cool if Glenn Hubbard were consulting for Herbalife and I were on the other side?

16 0.52695179 559 andrew gelman stats-2011-02-06-Bidding for the kickoff

17 0.52414167 1773 andrew gelman stats-2013-03-21-2.15

18 0.52060181 29 andrew gelman stats-2010-05-12-Probability of successive wins in baseball

19 0.51873863 665 andrew gelman stats-2011-04-17-Yes, your wish shall be granted (in 25 years)

20 0.5144015 1443 andrew gelman stats-2012-08-04-Bayesian Learning via Stochastic Gradient Langevin Dynamics


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(7, 0.036), (9, 0.039), (15, 0.015), (16, 0.097), (17, 0.077), (20, 0.019), (24, 0.096), (32, 0.013), (53, 0.01), (54, 0.029), (77, 0.133), (86, 0.031), (96, 0.01), (99, 0.288)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.95561123 1784 andrew gelman stats-2013-04-01-Wolfram on Mandelbrot

Introduction: The most perfect pairing of author and subject since Nicholson Baker and John Updike. Here’s Wolfram on the great researcher of fractals : In his way, Mandelbrot paid me some great compliments. When I was in my 20s, and he in his 60s, he would ask about my scientific work: “How can so many people take someone so young so seriously?” In 2002, my book “A New Kind of Science”—in which I argued that many phenomena across science are the complex results of relatively simple, program-like rules—appeared. Mandelbrot seemed to see it as a direct threat, once declaring that “Wolfram’s ‘science’ is not new except when it is clearly wrong; it deserves to be completely disregarded.” In private, though, several mutual friends told me, he fretted that in the long view of history it would overwhelm his work. In retrospect, I don’t think Mandelbrot had much to worry about on this account. The link from the above review came from Peter Woit, who also points to a review by Brian Hayes wit

2 0.95393109 1684 andrew gelman stats-2013-01-20-Ugly ugly ugly

Introduction: Denis Cote sends the following , under the heading, “Some bad graphs for your enjoyment”: To start with, they don’t know how to spell “color.” Seriously, though, the graph is a mess. The circular display implies a circular or periodic structure that isn’t actually in the data, the cramped display requires the use of an otherwise-unnecessary color code that makes it difficult to find or make sense of the information, the alphabetical ordering (without even supplying state names, only abbreviations) makes it further difficult to find any patterns. It would be so much better, and even easier, to just display a set of small maps shading states on whether they have different laws. But that’s part of the problem—the clearer graph would also be easier to make! To get a distinctive graph, there needs to be some degree of difficulty. The designers continue with these monstrosities: Here they decide to display only 5 states at a time so that it’s really hard to see any big pi

3 0.95382297 978 andrew gelman stats-2011-10-28-Cool job opening with brilliant researchers at Yahoo

Introduction: Duncan Watts writes: The Human Social Dynamics Group in Yahoo Research is seeking highly qualified candidates for a post-doctoral research scientist position. The Human and Social Dynamics group is devoted to understanding the interplay between individual-level behavior (e.g. how people make decisions about what music they like, which dates to go on, or which groups to join) and the social environment in which individual behavior necessarily plays itself out. In particular, we are interested in: * Structure and evolution of social groups and networks * Decision making, social influence, diffusion, and collective decisions * Networking and collaborative problem solving. The intrinsically multi-disciplinary and cross-cutting nature of the subject demands an eclectic range of researchers, both in terms of domain-expertise (e.g. decision sciences, social psychology, sociology) and technical skills (e.g. statistical analysis, mathematical modeling, computer simulations, design o

same-blog 4 0.94444185 216 andrew gelman stats-2010-08-18-More forecasting competitions

Introduction: Anthony Goldbloom from Kaggle writes : We’ve recently put up some interesting new competitions. Last week, Jeff Sonas, the creator of the Chessmetrics rating system, launched a competition to find a chess rating algorithm that performs better than the official Elo system. Already nine teams have created systems that make more accurate predictions than Elo. It’s not a surprise that Elo has been outdone – the system was invented half a century ago before we could easily crunch large amounts of historical data. However, it is a big surprise that Elo has been outperformed so quickly given that it is the product of many years’ work (at least it was a surprise to me). Rob Hyndman from Monash University has put up the first part of a tourism forecasting competition . This part requires participants to forecast the results of 518 different time series. Rob is the editor of the International Journal of Forecasting and has promised to invite the winner to contribute a discussion paper

5 0.94228137 1373 andrew gelman stats-2012-06-09-Cognitive psychology research helps us understand confusion of Jonathan Haidt and others about working-class voters

Introduction: Here’s some psychology research that’s relevant to yesterday’s discussion on working-class voting. In a paper to appear in the journal Cognitive Science , Andrei Cimpian, Amanda Brandone, and Susan Gelman write: Generic statements (e.g., “Birds lay eggs”) express generalizations about categories. In this paper, we hypothesized that there is a paradoxical asymmetry at the core of generic meaning, such that these sentences have extremely strong implications but require little evidence to be judged true. Four experiments confirmed the hypothesized asymmetry: Participants interpreted novel generics such as “Lorches have purple feathers” as referring to nearly all lorches, but they judged the same novel generics to be true given a wide range of prevalence levels (e.g., even when only 10% or 30% of lorches had purple feathers). A second hypothesis, also confirmed by the results, was that novel generic sentences about dangerous or distinctive properties would be more acceptable than

6 0.94006896 380 andrew gelman stats-2010-10-29-“Bluntly put . . .”

7 0.93527526 57 andrew gelman stats-2010-05-29-Roth and Amsterdam

8 0.93452144 1124 andrew gelman stats-2012-01-17-How to map geographically-detailed survey responses?

9 0.92616212 562 andrew gelman stats-2011-02-06-Statistician cracks Toronto lottery

10 0.92461431 93 andrew gelman stats-2010-06-17-My proposal for making college admissions fairer

11 0.92444968 1481 andrew gelman stats-2012-09-04-Cool one-day miniconference at Columbia Fri 12 Oct on computational and online social science

12 0.92321765 1604 andrew gelman stats-2012-12-04-An epithet I can live with

13 0.91891241 2122 andrew gelman stats-2013-12-03-Objects of the class “Lawrence Summers”: Arne Duncan edition

14 0.91783071 128 andrew gelman stats-2010-07-05-The greatest works of statistics never published

15 0.91782004 2054 andrew gelman stats-2013-10-07-Bing is preferred to Google by people who aren’t like me

16 0.91392225 378 andrew gelman stats-2010-10-28-World Economic Forum Data Visualization Challenge

17 0.91302413 1948 andrew gelman stats-2013-07-21-Bayes related

18 0.91174972 1561 andrew gelman stats-2012-11-04-Someone is wrong on the internet

19 0.91129291 1071 andrew gelman stats-2011-12-19-“NYU Professor Claims He Was Fired for Giving James Franco a D”

20 0.91002584 207 andrew gelman stats-2010-08-14-Pourquoi Google search est devenu plus raisonnable?