andrew_gelman_stats andrew_gelman_stats-2012 andrew_gelman_stats-2012-1135 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Dustin Palmer writes: I am a recent graduate looking for a bit of advice. While I took intro classes on math and statistics in my undergraduate degree as a political science major, I find myself university-less and seeking to develop my statistics toolkit. I work for an NGO in the international development field. I think that a solid statistics foundation would offer me not only more career opportunities, but more importantly, a deeper and more nuanced understanding of the processes and problems that interest me. I’m talking about field experiments and practical quantitative and qualitative data analysis. I have plenty of free time, ambition, and enthusiasm to improve this part of my toolbox, but I lack an attachment to an institution and much in the way of financial resources. How would you go about making a concentrated effort at acquiring an understanding of the field and its actual application in something like R or Stata, which I admit to never having used? Perhaps I am
sentIndex sentText sentNum sentScore
1 Dustin Palmer writes: I am a recent graduate looking for a bit of advice. [sent-1, score-0.099]
2 While I took intro classes on math and statistics in my undergraduate degree as a political science major, I find myself university-less and seeking to develop my statistics toolkit. [sent-2, score-0.992]
3 I work for an NGO in the international development field. [sent-3, score-0.193]
4 I think that a solid statistics foundation would offer me not only more career opportunities, but more importantly, a deeper and more nuanced understanding of the processes and problems that interest me. [sent-4, score-0.953]
5 I’m talking about field experiments and practical quantitative and qualitative data analysis. [sent-5, score-0.457]
6 I have plenty of free time, ambition, and enthusiasm to improve this part of my toolbox, but I lack an attachment to an institution and much in the way of financial resources. [sent-6, score-0.787]
7 How would you go about making a concentrated effort at acquiring an understanding of the field and its actual application in something like R or Stata, which I admit to never having used? [sent-7, score-0.8]
8 Perhaps I am simply asking about web resources or best texts, but any broad advice would be much appreciated too. [sent-8, score-0.541]
9 My gut recommendation is to start with a problem you care about and figure out what you need to get a reasonable solution, then go to the next problem, and so forth. [sent-9, score-0.418]
10 For books, you could start with The Statistical Sleuth and my book with Jennifer. [sent-10, score-0.125]
11 If you want to learn R, just try to make some pretty and useful graphs, that will motivate you to be able to do more. [sent-11, score-0.121]
wordName wordTfidf (topN-words)
[('dustin', 0.189), ('acquiring', 0.189), ('gut', 0.189), ('toolbox', 0.178), ('ngo', 0.171), ('enthusiasm', 0.171), ('palmer', 0.171), ('attachment', 0.171), ('nuanced', 0.156), ('texts', 0.152), ('field', 0.148), ('appreciated', 0.144), ('concentrated', 0.141), ('understanding', 0.137), ('institution', 0.134), ('importantly', 0.131), ('qualitative', 0.129), ('plenty', 0.128), ('start', 0.125), ('motivate', 0.121), ('stata', 0.12), ('seeking', 0.12), ('statistics', 0.119), ('undergraduate', 0.119), ('intro', 0.117), ('opportunities', 0.114), ('deeper', 0.114), ('solid', 0.112), ('broad', 0.109), ('foundation', 0.108), ('resources', 0.107), ('processes', 0.106), ('recommendation', 0.104), ('international', 0.103), ('classes', 0.102), ('degree', 0.102), ('develop', 0.101), ('career', 0.101), ('graduate', 0.099), ('suggestions', 0.094), ('math', 0.093), ('admit', 0.093), ('improve', 0.093), ('quantitative', 0.092), ('application', 0.092), ('web', 0.091), ('financial', 0.09), ('development', 0.09), ('asking', 0.09), ('practical', 0.088)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 1135 andrew gelman stats-2012-01-22-Advice on do-it-yourself stats education?
Introduction: Dustin Palmer writes: I am a recent graduate looking for a bit of advice. While I took intro classes on math and statistics in my undergraduate degree as a political science major, I find myself university-less and seeking to develop my statistics toolkit. I work for an NGO in the international development field. I think that a solid statistics foundation would offer me not only more career opportunities, but more importantly, a deeper and more nuanced understanding of the processes and problems that interest me. I’m talking about field experiments and practical quantitative and qualitative data analysis. I have plenty of free time, ambition, and enthusiasm to improve this part of my toolbox, but I lack an attachment to an institution and much in the way of financial resources. How would you go about making a concentrated effort at acquiring an understanding of the field and its actual application in something like R or Stata, which I admit to never having used? Perhaps I am
2 0.12840897 236 andrew gelman stats-2010-08-26-Teaching yourself mathematics
Introduction: Some thoughts from Mark Palko: Of all the subjects a student is likely to encounter after elementary school, mathematics is by far the easiest to teach yourself. . . . What is it that makes math teachers so expendable? . . . At some point all disciplines require the transition from passive to active and that transition can be challenging. In courses like high school history and science, the emphasis on passively acquiring knowledge (yes, I realize that students write essays in history classes and apply formulas in science classes but that represents a relatively small portion of their time and, more importantly, the work those students do is fundamentally different from the day-to-day work done by historians and scientists). By comparison, junior high students playing in an orchestra, writing short stories or solving math problems are almost entirely focused on processes and those processes are essentially the same as those engaged in by professional musicians, writers and mat
3 0.11438041 390 andrew gelman stats-2010-11-02-Fragment of statistical autobiography
Introduction: I studied math and physics at MIT. To be more precise, I started in math as default–ever since I was two years old, I’ve thought of myself as a mathematician, and I always did well in math class, so it seemed like a natural fit. But I was concerned. In high school I’d been in the U.S. Mathematical Olympiad training program, and there I’d met kids who were clearly much much better at math than I was. In retrospect, I don’t think I was as bad as I’d thought at the time: there were 24 kids in the program, and I was probably around #20, if that, but I think a lot of the other kids had more practice working on “math olympiad”-type problems. Maybe I was really something like the tenth-best in the group. Tenth-best or twentieth-best, whatever it was, I reached a crisis of confidence around my sophomore or junior year in college. At MIT, I started right off taking advanced math classes, and somewhere along the way I realized I wasn’t seeing the big picture. I was able to do the homework pr
4 0.10828213 2106 andrew gelman stats-2013-11-19-More on “data science” and “statistics”
Introduction: After reading Rachel and Cathy’s book , I wrote that “Statistics is the least important part of data science . . . I think it would be fair to consider statistics as a subset of data science. . . . it’s not the most important part of data science, or even close.” But then I received “Data Science for Business,” by Foster Provost and Tom Fawcett, in the mail. I might not have opened the book at all (as I’m hardly in the target audience) but for seeing a blurb by Chris Volinsky, a statistician whom I respect a lot. So I flipped through the book and it indeed looked pretty good. It moves slowly but that’s appropriate for an intro book. But what surprised me, given the book’s title and our recent discussion on the nature of data science, was that the book was 100% statistics! It had some math (for example, definitions of various distance measures), some simple algebra, some conceptual graphs such as ROC curve, some tables and graphs of low-dimensional data summaries—but almost
5 0.10714881 1611 andrew gelman stats-2012-12-07-Feedback on my Bayesian Data Analysis class at Columbia
Introduction: In one of the final Jitts, we asked the students how the course could be improved. Some of their suggestions would work, some would not. I’m putting all the suggestions below, interpolating my responses. (Overall, I think the course went well. Please remember that the remarks below are not course evaluations; they are answers to my specific question of how the course could be better. If we’d had a Jitt asking all the ways the course was good, you’d be seeing lots of positive remarks. But that wouldn’t be particularly useful or interesting.) The best thing about the course is that the kids worked hard each week on their homeworks. OK, here are the comments and my replies: Could have been better if we did less amount but more in detail. I don’t know if this would’ve been possible. I wanted to get to the harder stuff (HMC, VB, nonparametric models) which required a certain amount of preparation. And, even so, there was not time for everything. And also, needs solut
8 0.10196777 2347 andrew gelman stats-2014-05-25-Why I decided not to be a physicist
9 0.096803501 869 andrew gelman stats-2011-08-24-Mister P in Stata
10 0.094468005 1816 andrew gelman stats-2013-04-21-Exponential increase in the number of stat majors
11 0.091708109 76 andrew gelman stats-2010-06-09-Both R and Stata
12 0.091258064 537 andrew gelman stats-2011-01-25-Postdoc Position #1: Missing-Data Imputation, Diagnostics, and Applications
13 0.089677177 2245 andrew gelman stats-2014-03-12-More on publishing in journals
14 0.089005396 61 andrew gelman stats-2010-05-31-A data visualization manifesto
15 0.084912576 1864 andrew gelman stats-2013-05-20-Evaluating Columbia University’s Frontiers of Science course
16 0.083034024 1582 andrew gelman stats-2012-11-18-How to teach methods we don’t like?
17 0.082653686 2368 andrew gelman stats-2014-06-11-Bayes in the research conversation
18 0.082350016 32 andrew gelman stats-2010-05-14-Causal inference in economics
19 0.081630908 1848 andrew gelman stats-2013-05-09-A tale of two discussion papers
20 0.081441745 855 andrew gelman stats-2011-08-16-Infovis and statgraphics update update
topicId topicWeight
[(0, 0.176), (1, -0.042), (2, -0.064), (3, 0.022), (4, 0.046), (5, 0.049), (6, -0.036), (7, 0.042), (8, -0.02), (9, 0.032), (10, 0.016), (11, -0.012), (12, 0.012), (13, -0.017), (14, 0.006), (15, -0.023), (16, -0.031), (17, -0.006), (18, 0.011), (19, -0.039), (20, 0.038), (21, 0.0), (22, 0.006), (23, 0.058), (24, -0.031), (25, 0.014), (26, 0.05), (27, -0.01), (28, -0.027), (29, -0.006), (30, -0.001), (31, -0.014), (32, -0.03), (33, 0.014), (34, -0.023), (35, 0.005), (36, -0.049), (37, 0.057), (38, -0.012), (39, 0.015), (40, 0.03), (41, -0.011), (42, -0.033), (43, 0.03), (44, -0.019), (45, -0.053), (46, -0.029), (47, 0.013), (48, 0.024), (49, 0.019)]
simIndex simValue blogId blogTitle
same-blog 1 0.97798675 1135 andrew gelman stats-2012-01-22-Advice on do-it-yourself stats education?
Introduction: Dustin Palmer writes: I am a recent graduate looking for a bit of advice. While I took intro classes on math and statistics in my undergraduate degree as a political science major, I find myself university-less and seeking to develop my statistics toolkit. I work for an NGO in the international development field. I think that a solid statistics foundation would offer me not only more career opportunities, but more importantly, a deeper and more nuanced understanding of the processes and problems that interest me. I’m talking about field experiments and practical quantitative and qualitative data analysis. I have plenty of free time, ambition, and enthusiasm to improve this part of my toolbox, but I lack an attachment to an institution and much in the way of financial resources. How would you go about making a concentrated effort at acquiring an understanding of the field and its actual application in something like R or Stata, which I admit to never having used? Perhaps I am
2 0.81804937 1721 andrew gelman stats-2013-02-13-A must-read paper on statistical analysis of experimental data
Introduction: Russ Lyons points to an excellent article on statistical experimentation by Ron Kohavi, Alex Deng, Brian Frasca, Roger Longbotham, Toby Walker, Ya Xu, a group of software engineers (I presume) at Microsoft. Kohavi et al. write: Online controlled experiments are often utilized to make data-driven decisions at Amazon, Microsoft . . . deployment and mining of online controlled experiments at scale—thousands of experiments now—has taught us many lessons. The paper is well written and has excellent examples (unfortunately the substantive topics are unexciting things like clicks and revenue per user, but the general principles remain important). The ideas will be familiar to anyone with experience in practical statistics but don’t always make it into textbooks or courses, so I think many people could learn a lot from this article. I was disappointed that they didn’t cite much of the statistics literature— not even the classic Box, Hunter, and Hunter book on industrial experimentat
Introduction: For “humanity, devotion to truth and inspiring leadership” at Columbia College. Reading Jenny’s remarks (“my hugest and most helpful pool of colleagues was to be found not among the ranks of my fellow faculty but in the classroom. . . . we shared a sense of the excitement of the enterprise on which we were all embarked”) reminds me of the comment Seth made once, that the usual goal of university teaching is to make the students into carbon copies of the instructor, and that he found it to me much better to make use of the students’ unique strengths. This can’t always be true–for example, in learning to speak a foreign language, I just want to be able to do it, and my own experiences in other domains is not so relevant. But for a worldly subject such as literature or statistics or political science, then, yes, I do think it would be good for students to get involved and use their own knowledge and experiences. One other statement of Jenny’s caught my eye. She wrote: I [Je
4 0.77702481 316 andrew gelman stats-2010-10-03-Suggested reading for a prospective statistician?
Introduction: Sam Jessup writes: I am writing to ask you to recommend papers, books–anything that comes to mind that might give a prospective statistician some sense of what the future holds for statistics (and statisticians). I have a liberal arts background with an emphasis in mathematics. It seems like this is an exciting time to be a statistician, but that’s just from the outside looking in. I’m curious about your perspective on the future of the discipline. Any recommendations? My favorite is still the book, “Statistics: A Guide to the Unknown,” first edition. (I actually have a chapter in the latest (fourth) edition, but I think the first edition (from 1972, I believe) is still the best.
5 0.74389887 1722 andrew gelman stats-2013-02-14-Statistics for firefighters: update
Introduction: Following up on our earlier discussion, Daniel Rubenson from Ryerson University in Toronto writes: The course went really well (it was a couple of years ago now). The course was run through a partnership my department has with the Ontario Fire College. Basically, firefighters can do a certificate and sometimes a degree in public administration and part of that is a course on methods. It was a small group — about 8 or so — very motivated guys (all guys). Some of them were chiefs or deputy chiefs from small towns, others captains who were doing the certificate in order to improve their chances for promotion or as a step into a broader public admin career. I had asked them ahead of time to bring with them whatever data they could get their hands on and that they thought would be interesting. This included response times, data on professional v voluntary firefighters, some insurance data and the like. I should mention that is was an intensive mode course. So we had 4.5 days toge
6 0.74347711 236 andrew gelman stats-2010-08-26-Teaching yourself mathematics
7 0.73264879 1960 andrew gelman stats-2013-07-28-More on that machine learning course
8 0.72743487 793 andrew gelman stats-2011-07-09-R on the cloud
9 0.72689945 2151 andrew gelman stats-2013-12-27-Should statistics have a Nobel prize?
10 0.71842182 395 andrew gelman stats-2010-11-05-Consulting: how do you figure out what to charge?
11 0.71768367 390 andrew gelman stats-2010-11-02-Fragment of statistical autobiography
15 0.70563501 1616 andrew gelman stats-2012-12-10-John McAfee is a Heinlein hero
16 0.70235693 1519 andrew gelman stats-2012-10-02-Job!
17 0.69883817 2282 andrew gelman stats-2014-04-05-Bizarre academic spam
18 0.69629914 76 andrew gelman stats-2010-06-09-Both R and Stata
19 0.69362384 1640 andrew gelman stats-2012-12-26-What do people do wrong? WSJ columnist is looking for examples!
topicId topicWeight
[(15, 0.079), (16, 0.098), (21, 0.031), (22, 0.013), (24, 0.117), (42, 0.014), (45, 0.034), (48, 0.016), (50, 0.018), (55, 0.053), (74, 0.121), (84, 0.014), (99, 0.309)]
simIndex simValue blogId blogTitle
1 0.96015322 2261 andrew gelman stats-2014-03-23-Greg Mankiw’s utility function
Introduction: From 2010 : Greg Mankiw writes (link from Tyler Cowen ): Without any taxes, accepting that editor’s assignment would have yielded my children an extra $10,000. With taxes, it yields only $1,000. In effect, once the entire tax system is taken into account, my family’s marginal tax rate is about 90 percent. Is it any wonder that I [Mankiw] turn down most of the money-making opportunities I am offered? By contrast, without the tax increases advocated by the Obama administration, the numbers would look quite different. I would face a lower income tax rate, a lower Medicare tax rate, and no deduction phaseout or estate tax. Taking that writing assignment would yield my kids about $2,000. I would have twice the incentive to keep working. First, the good news Obama’s tax rates are much lower than Mankiw had anticipated! According to the above quote, his marginal tax rate is currently 80% but threatens to rise to 90%. But, in October 2008, Mankiw calculated that Obama’s
Introduction: Greg Mankiw writes (link from Tyler Cowen ): Without any taxes, accepting that editor’s assignment would have yielded my children an extra $10,000. With taxes, it yields only $1,000. In effect, once the entire tax system is taken into account, my family’s marginal tax rate is about 90 percent. Is it any wonder that I [Mankiw] turn down most of the money-making opportunities I am offered? By contrast, without the tax increases advocated by the Obama administration, the numbers would look quite different. I would face a lower income tax rate, a lower Medicare tax rate, and no deduction phaseout or estate tax. Taking that writing assignment would yield my kids about $2,000. I would have twice the incentive to keep working. First, the good news Obama’s tax rates are much lower than Mankiw had anticipated! According to the above quote, his marginal tax rate is currently 80% but threatens to rise to 90%. But, in October 2008, Mankiw calculated that Obama’s would tax his m
same-blog 3 0.95374614 1135 andrew gelman stats-2012-01-22-Advice on do-it-yourself stats education?
Introduction: Dustin Palmer writes: I am a recent graduate looking for a bit of advice. While I took intro classes on math and statistics in my undergraduate degree as a political science major, I find myself university-less and seeking to develop my statistics toolkit. I work for an NGO in the international development field. I think that a solid statistics foundation would offer me not only more career opportunities, but more importantly, a deeper and more nuanced understanding of the processes and problems that interest me. I’m talking about field experiments and practical quantitative and qualitative data analysis. I have plenty of free time, ambition, and enthusiasm to improve this part of my toolbox, but I lack an attachment to an institution and much in the way of financial resources. How would you go about making a concentrated effort at acquiring an understanding of the field and its actual application in something like R or Stata, which I admit to never having used? Perhaps I am
4 0.95360994 338 andrew gelman stats-2010-10-12-Update on Mankiw’s work incentives
Introduction: Tyler Cowen links to a blog by Greg Mankiw with further details on his argument that his anticipated 90% marginal tax rate will reduce his work level. Having already given my thoughts on Mankiw’s column, I merely have a few things to add/emphasize. 1. Cowen frames the arguments in terms of the “status” of George Bush, Greg Mankiw, Barack Obama, and their proposed policies. I hadn’t thought of the arguments as being about status, but I think I see what Cowen is saying. By being a well-known economist and having a column in the New York Times, Mankiw is trading some of his status for political advocacy (just as Krugman does, from the opposite direction). If Mankiw didn’t have the pre-existing status, I doubt this particular column would’ve made it into the newspaper. (Again, ditto with many of Krugman’s columns.) So it makes sense that arguments about the substance of Mankiw’s remarks will get tied into disputes about his status. 2. Neither Cowen nor Mankiw address
5 0.94789469 1324 andrew gelman stats-2012-05-16-Wikipedia author confronts Ed Wegman
Introduction: Wegman: “It’s not reprinted 100 percent like you had it.” Wikipedia guy: “No, you added another paragraph at the end and you changed the headline. . . . You even copied the typos that I’ve corrected on my website. It was taken verbatim and reprinted in your paper.” The original author got a check for $500 but, unfortunately, no free subscription to “Wiley Interdisciplinary Reviews: Computational Statistics” (a $1400-$2800 value ). P.S. To those who think I’m being mean to Wegman: I haven’t yet heard that he’s apologized to the people whose work he copied without attribution, or to the people who spent their time tracking all this down, or to the U.S. Congress for misrepresenting his expertise in his official report. Everyone makes mistakes, and just about everyone has ethical lapses at times. But when you get caught you’re supposed to make apology and restitution.
6 0.94652963 2239 andrew gelman stats-2014-03-09-Reviewing the peer review process?
7 0.94495595 285 andrew gelman stats-2010-09-18-Fiction is not for tirades? Tell that to Saul Bellow!
8 0.93865538 836 andrew gelman stats-2011-08-03-Another plagiarism mystery
9 0.93562788 140 andrew gelman stats-2010-07-10-SeeThroughNY
10 0.92896283 1612 andrew gelman stats-2012-12-08-The Case for More False Positives in Anti-doping Testing
12 0.92320812 2353 andrew gelman stats-2014-05-30-I posted this as a comment on a sociology blog
13 0.92306453 1262 andrew gelman stats-2012-04-12-“Not only defended but also applied”: The perceived absurdity of Bayesian inference
14 0.92281413 2227 andrew gelman stats-2014-02-27-“What Can we Learn from the Many Labs Replication Project?”
15 0.92276102 1878 andrew gelman stats-2013-05-31-How to fix the tabloids? Toward replicable social science research
16 0.92241287 1085 andrew gelman stats-2011-12-27-Laws as expressive
19 0.92145407 2244 andrew gelman stats-2014-03-11-What if I were to stop publishing in journals?
20 0.92052609 1779 andrew gelman stats-2013-03-27-“Two Dogmas of Strong Objective Bayesianism”