andrew_gelman_stats andrew_gelman_stats-2012 andrew_gelman_stats-2012-1264 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: I was talking with education researcher Bob Boruch about my frustrations in teaching, the idea that as statisticians we tell people to do formal experimentation but in our own teaching practice we typically just try different things without even measuring outcomes, let alone performing any formal evaluation. Boruch showed me this article with Alan Ruby about learning from failure. Unfortunately I’ve forgotten all my other thoughts from our conversation but I’m posting the article here.
sentIndex sentText sentNum sentScore
1 Boruch showed me this article with Alan Ruby about learning from failure. [sent-2, score-0.377]
2 Unfortunately I’ve forgotten all my other thoughts from our conversation but I’m posting the article here. [sent-3, score-0.703]
wordName wordTfidf (topN-words)
[('boruch', 0.555), ('formal', 0.311), ('frustrations', 0.262), ('teaching', 0.239), ('experimentation', 0.201), ('forgotten', 0.187), ('alan', 0.184), ('measuring', 0.168), ('performing', 0.165), ('conversation', 0.155), ('alone', 0.155), ('bob', 0.152), ('showed', 0.151), ('posting', 0.146), ('outcomes', 0.132), ('unfortunately', 0.123), ('researcher', 0.119), ('learning', 0.118), ('education', 0.117), ('statisticians', 0.113), ('practice', 0.11), ('article', 0.108), ('thoughts', 0.107), ('typically', 0.106), ('tell', 0.104), ('talking', 0.102), ('try', 0.082), ('let', 0.078), ('without', 0.073), ('idea', 0.062), ('things', 0.062), ('different', 0.055), ('ve', 0.047), ('even', 0.042), ('people', 0.038)]
simIndex simValue blogId blogTitle
same-blog 1 1.0000001 1264 andrew gelman stats-2012-04-14-Learning from failure
Introduction: I was talking with education researcher Bob Boruch about my frustrations in teaching, the idea that as statisticians we tell people to do formal experimentation but in our own teaching practice we typically just try different things without even measuring outcomes, let alone performing any formal evaluation. Boruch showed me this article with Alan Ruby about learning from failure. Unfortunately I’ve forgotten all my other thoughts from our conversation but I’m posting the article here.
Introduction: While visiting the education school at the University of Pennsylvania a couple months ago, I had a long conversation with Bob Boruch, a prominent researcher in the field of evidence-based education. We shared Fred Mosteller stories and talked about a lot of other things too. Boruch sent me an article about teaching randomized controlled trials to education students, which gave me the following idea which connects to my longstanding embarrassment (and subject of my next column on ethics, forthcoming in Chance magazine) about the lack of systematic measurement, sampling, or experimentation in our own teaching efforts. Anyway, here’s my idea for experimentation in statistics teaching, an idea that I think could work particularly well in classes with education students. Each class could, as part of the course, design an educational experiment to be performed on next year’s class. Easier said than done, I know, but perhaps ed school students would be particularly motivated to do t
3 0.13042523 2127 andrew gelman stats-2013-12-08-The never-ending (and often productive) race between theory and practice
Introduction: Commenter Wonks Anonymous writes : After the recent EconNobel announcement I decided to check Dimensional’s Fama-French blog to see if it had much new content recently, and while it was dissapointingly sparse it did have an interesting bit where Fama linked to the best advice he’d ever gotten , from his statistics professor Harry Roberts: With formal statistics, you say something — a hypothesis — and then you test it. Harry always said that your criterion should be not whether or not you can reject or accept the hypothesis, but what you can learn from the data. The best thing you can do is use the data to enhance your description of the world. I responded: That’s a great quote. Except that I disagree with what Fama says about “formal statistics.” Or, should I say, he has an old-fashioned view of formal statistics. (See this paper by X and me for some discussion of old-fashioned views.) Nowadays, lots of formal statistics is all about what you can learn from the data, no
4 0.12912923 74 andrew gelman stats-2010-06-08-“Extreme views weakly held”
Introduction: Alan and Felix .
5 0.095412433 785 andrew gelman stats-2011-07-02-Experimental reasoning in social science
Introduction: As a statistician, I was trained to think of randomized experimentation as representing the gold standard of knowledge in the social sciences, and, despite having seen occasional arguments to the contrary, I still hold that view, expressed pithily by Box, Hunter, and Hunter (1978) that “To find out what happens when you change something, it is necessary to change it.” At the same time, in my capacity as a social scientist, I’ve published many applied research papers, almost none of which have used experimental data. In the present article, I’ll address the following questions: 1. Why do I agree with the consensus characterization of randomized experimentation as a gold standard? 2. Given point 1 above, why does almost all my research use observational data? In confronting these issues, we must consider some general issues in the strategy of social science research. We also take from the psychology methods literature a more nuanced perspective that considers several differen
6 0.094977237 325 andrew gelman stats-2010-10-07-Fitting discrete-data regression models in social science
7 0.093730599 1394 andrew gelman stats-2012-06-27-99!
9 0.090990484 1131 andrew gelman stats-2012-01-20-Stan: A (Bayesian) Directed Graphical Model Compiler
10 0.082305163 65 andrew gelman stats-2010-06-03-How best to learn R?
11 0.080478922 1813 andrew gelman stats-2013-04-19-Grad students: Participate in an online survey on statistics education
12 0.079537481 1965 andrew gelman stats-2013-08-02-My course this fall on l’analyse bayésienne de données
13 0.079476155 1071 andrew gelman stats-2011-12-19-“NYU Professor Claims He Was Fired for Giving James Franco a D”
14 0.078703791 1760 andrew gelman stats-2013-03-12-Misunderstanding the p-value
15 0.077813037 709 andrew gelman stats-2011-05-13-D. Kahneman serves up a wacky counterfactual
16 0.073334768 1740 andrew gelman stats-2013-02-26-“Is machine learning a subset of statistics?”
17 0.071726076 583 andrew gelman stats-2011-02-21-An interesting assignment for statistical graphics
18 0.064254284 1582 andrew gelman stats-2012-11-18-How to teach methods we don’t like?
20 0.063541934 1140 andrew gelman stats-2012-01-27-Educational monoculture
topicId topicWeight
[(0, 0.085), (1, -0.022), (2, -0.037), (3, 0.002), (4, 0.001), (5, 0.047), (6, -0.003), (7, 0.065), (8, -0.006), (9, -0.003), (10, 0.012), (11, 0.032), (12, 0.005), (13, -0.015), (14, 0.016), (15, -0.013), (16, -0.025), (17, 0.009), (18, -0.047), (19, 0.013), (20, 0.006), (21, -0.051), (22, 0.005), (23, 0.022), (24, -0.038), (25, -0.0), (26, -0.005), (27, -0.01), (28, -0.001), (29, -0.003), (30, 0.052), (31, 0.017), (32, 0.005), (33, -0.009), (34, -0.001), (35, -0.024), (36, -0.001), (37, -0.022), (38, -0.006), (39, -0.012), (40, -0.051), (41, -0.035), (42, -0.007), (43, 0.013), (44, -0.005), (45, 0.024), (46, -0.032), (47, 0.004), (48, -0.019), (49, -0.005)]
simIndex simValue blogId blogTitle
same-blog 1 0.95650268 1264 andrew gelman stats-2012-04-14-Learning from failure
Introduction: I was talking with education researcher Bob Boruch about my frustrations in teaching, the idea that as statisticians we tell people to do formal experimentation but in our own teaching practice we typically just try different things without even measuring outcomes, let alone performing any formal evaluation. Boruch showed me this article with Alan Ruby about learning from failure. Unfortunately I’ve forgotten all my other thoughts from our conversation but I’m posting the article here.
Introduction: While visiting the education school at the University of Pennsylvania a couple months ago, I had a long conversation with Bob Boruch, a prominent researcher in the field of evidence-based education. We shared Fred Mosteller stories and talked about a lot of other things too. Boruch sent me an article about teaching randomized controlled trials to education students, which gave me the following idea which connects to my longstanding embarrassment (and subject of my next column on ethics, forthcoming in Chance magazine) about the lack of systematic measurement, sampling, or experimentation in our own teaching efforts. Anyway, here’s my idea for experimentation in statistics teaching, an idea that I think could work particularly well in classes with education students. Each class could, as part of the course, design an educational experiment to be performed on next year’s class. Easier said than done, I know, but perhaps ed school students would be particularly motivated to do t
3 0.73626041 1582 andrew gelman stats-2012-11-18-How to teach methods we don’t like?
Introduction: April Galyardt writes: I’m teaching my first graduate class this semester. It’s intro stats for graduate students in the college of education. Most of the students are first year PhD students. Though, there are a number of master’s students who are primarily in-service teachers. The difficulties with teaching an undergraduate intro stats course are still present, in that mathematical preparation and phobia vary widely across the class. I’ve been enjoying the class and the students, but I’d like your take on an issue I’ve been thinking about. How do I balance teaching the standard methods, like hypothesis testing, that these future researchers have to know because they are so standard, with discussing the problems with those methods (e.g. p-value as a measure of sample size , and the decline effect , not to mention multiple testing and other common mistakes). It feels a bit like saying “Ok here’s what everybody does, but really it’s broken” and then there’s not enough time to tal
Introduction: Joe Blitzstein and Xiao-Li Meng write : An effectively designed examination process goes far beyond revealing students’ knowledge or skills. It also serves as a great teaching and learning tool, incentivizing the students to think more deeply and to connect the dots at a higher level. This extends throughout the entire process: pre-exam preparation, the exam itself, and the post-exam period (the aftermath or, more appropriately, afterstat of the exam). As in the publication process, the first submission is essential but still just one piece in the dialogue. Viewing the entire exam process as an extended dialogue between students and faculty, we discuss ideas for making this dialogue induce more inspiration than perspiration, and thereby making it a memorable deep-learning triumph rather than a wish-to-forget test-taking trauma. We illustrate such a dialogue through a recently introduced course in the Harvard Statistics Department, Stat 399: Problem Solving in Statistics, and tw
Introduction: For “humanity, devotion to truth and inspiring leadership” at Columbia College. Reading Jenny’s remarks (“my hugest and most helpful pool of colleagues was to be found not among the ranks of my fellow faculty but in the classroom. . . . we shared a sense of the excitement of the enterprise on which we were all embarked”) reminds me of the comment Seth made once, that the usual goal of university teaching is to make the students into carbon copies of the instructor, and that he found it to me much better to make use of the students’ unique strengths. This can’t always be true–for example, in learning to speak a foreign language, I just want to be able to do it, and my own experiences in other domains is not so relevant. But for a worldly subject such as literature or statistics or political science, then, yes, I do think it would be good for students to get involved and use their own knowledge and experiences. One other statement of Jenny’s caught my eye. She wrote: I [Je
6 0.67703706 1752 andrew gelman stats-2013-03-06-Online Education and Jazz
7 0.67078549 516 andrew gelman stats-2011-01-14-A new idea for a science core course based entirely on computer simulation
8 0.66739577 1864 andrew gelman stats-2013-05-20-Evaluating Columbia University’s Frontiers of Science course
9 0.66139299 1071 andrew gelman stats-2011-12-19-“NYU Professor Claims He Was Fired for Giving James Franco a D”
11 0.64822245 1173 andrew gelman stats-2012-02-17-Sports examples in class
12 0.64818978 462 andrew gelman stats-2010-12-10-Who’s holding the pen?, The split screen, and other ideas for one-on-one instruction
13 0.64606839 2104 andrew gelman stats-2013-11-17-Big bad education bureaucracy does big bad things
14 0.64463699 1068 andrew gelman stats-2011-12-18-Faculty who don’t like teaching and hate working with students
15 0.63848215 1056 andrew gelman stats-2011-12-13-Drawing to Learn in Science
16 0.6347118 390 andrew gelman stats-2010-11-02-Fragment of statistical autobiography
17 0.62719834 1517 andrew gelman stats-2012-10-01-“On Inspiring Students and Being Human”
18 0.62713069 956 andrew gelman stats-2011-10-13-Hey, you! Don’t take that class!
19 0.62120134 236 andrew gelman stats-2010-08-26-Teaching yourself mathematics
20 0.61277997 1611 andrew gelman stats-2012-12-07-Feedback on my Bayesian Data Analysis class at Columbia
topicId topicWeight
[(16, 0.023), (24, 0.139), (35, 0.182), (55, 0.071), (76, 0.033), (77, 0.033), (86, 0.079), (95, 0.036), (99, 0.251)]
simIndex simValue blogId blogTitle
1 0.91757756 473 andrew gelman stats-2010-12-17-Why a bonobo won’t play poker with you
Introduction: Sciencedaily has posted an article titled Apes Unwilling to Gamble When Odds Are Uncertain : The apes readily distinguished between the different probabilities of winning: they gambled a lot when there was a 100 percent chance, less when there was a 50 percent chance, and only rarely when there was no chance In some trials, however, the experimenter didn’t remove a lid from the bowl, so the apes couldn’t assess the likelihood of winning a banana The odds from the covered bowl were identical to those from the risky option: a 50 percent chance of getting the much sought-after banana. But apes of both species were less likely to choose this ambiguous option. Like humans, they showed “ambiguity aversion” — preferring to gamble more when they knew the odds than when they didn’t. Given some of the other differences between chimps and bonobos, Hare and Rosati had expected to find the bonobos to be more averse to ambiguity, but that didn’t turn out to be the case. Thanks to Sta
2 0.91456902 837 andrew gelman stats-2011-08-04-Is it rational to vote?
Introduction: Hear me interviewed on the topic here . P.S. The interview was fine but I don’t agree with everything on the linked website. For example, this bit: Global warming is not the first case of a widespread fear based on incomplete knowledge turned out to be false or at least greatly exaggerated. Global warming has many of the characteristics of a popular delusion, an irrational fear or cause that is embraced by millions of people because, well, it is believed by millions of people! All right, then.
3 0.91050518 1443 andrew gelman stats-2012-08-04-Bayesian Learning via Stochastic Gradient Langevin Dynamics
Introduction: Burak Bayramli writes: In this paper by Sunjin Ahn, Anoop Korattikara, and Max Welling and this paper by Welling and Yee Whye The, there are some arguments on big data and the use of MCMC. Both papers have suggested improvements to speed up MCMC computations. I was wondering what your thoughts were, especially on this paragraph: When a dataset has a billion data-cases (as is not uncommon these days) MCMC algorithms will not even have generated a single (burn-in) sample when a clever learning algorithm based on stochastic gradients may already be making fairly good predictions. In fact, the intriguing results of Bottou and Bousquet (2008) seem to indicate that in terms of “number of bits learned per unit of computation”, an algorithm as simple as stochastic gradient descent is almost optimally efficient. We therefore argue that for Bayesian methods to remain useful in an age when the datasets grow at an exponential rate, they need to embrace the ideas of the stochastic optimiz
same-blog 4 0.89336693 1264 andrew gelman stats-2012-04-14-Learning from failure
Introduction: I was talking with education researcher Bob Boruch about my frustrations in teaching, the idea that as statisticians we tell people to do formal experimentation but in our own teaching practice we typically just try different things without even measuring outcomes, let alone performing any formal evaluation. Boruch showed me this article with Alan Ruby about learning from failure. Unfortunately I’ve forgotten all my other thoughts from our conversation but I’m posting the article here.
5 0.89233673 1516 andrew gelman stats-2012-09-30-Computational problems with glm etc.
Introduction: John Mount provides some useful background and follow-up on our discussion from last year on computational instability of the usual logistic regression solver. Just to refresh your memory, here’s a simple logistic regression with only a constant term and no separation, nothing pathological at all: > y <- rep (c(1,0),c(10,5)) > display (glm (y ~ 1, family=binomial(link="logit"))) glm(formula = y ~ 1, family = binomial(link = "logit")) coef.est coef.se (Intercept) 0.69 0.55 --- n = 15, k = 1 residual deviance = 19.1, null deviance = 19.1 (difference = 0.0) And here’s what happens when we give it the not-outrageous starting value of -2: > display (glm (y ~ 1, family=binomial(link="logit"), start=-2)) glm(formula = y ~ 1, family = binomial(link = "logit"), start = -2) coef.est coef.se (Intercept) 71.97 17327434.18 --- n = 15, k = 1 residual deviance = 360.4, null deviance = 19.1 (difference = -341.3) Warning message:
7 0.88264227 881 andrew gelman stats-2011-08-30-Rickey Henderson and Peter Angelos, together again
8 0.88237441 942 andrew gelman stats-2011-10-04-45% hitting, 25% fielding, 25% pitching, and 100% not telling us how they did it
9 0.87367463 2049 andrew gelman stats-2013-10-03-On house arrest for p-hacking
10 0.87178159 895 andrew gelman stats-2011-09-08-How to solve the Post Office’s problems?
11 0.87002075 1926 andrew gelman stats-2013-07-05-More plain old everyday Bayesianism
12 0.86241043 296 andrew gelman stats-2010-09-26-A simple semigraphic display
13 0.84936523 388 andrew gelman stats-2010-11-01-The placebo effect in pharma
14 0.8324762 1130 andrew gelman stats-2012-01-20-Prior beliefs about locations of decision boundaries
15 0.83209968 392 andrew gelman stats-2010-11-03-Taleb + 3.5 years
16 0.83173442 331 andrew gelman stats-2010-10-10-Bayes jumps the shark
17 0.8293854 1406 andrew gelman stats-2012-07-05-Xiao-Li Meng and Xianchao Xie rethink asymptotics
18 0.82846993 582 andrew gelman stats-2011-02-20-Statisticians vs. everybody else
19 0.82812762 269 andrew gelman stats-2010-09-10-R vs. Stata, or, Different ways to estimate multilevel models
20 0.82394779 1299 andrew gelman stats-2012-05-04-Models, assumptions, and data summaries