
1642 andrew gelman stats-2012-12-28-New book by Stef van Buuren on missing-data imputation looks really good!


meta info for this blog

Source: html

Introduction: Ben points us to a new book, Flexible Imputation of Missing Data. It’s excellent and I highly recommend it. Definitely worth the $89.95. Van Buuren’s book is great even if you don’t end up using the algorithm described in the book (I actually like their approach but I do think there are some limitations with their particular implementation, which is one reason we’re developing our own package); he supplies lots of intuition, examples, and graphs. P.S. Stef’s book features an introduction by Don Rubin, which gets me thinking: if Don can find the time to write an introduction to somebody else’s book, he surely should be willing to read and comment on the third edition of his own book, no?


Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 Ben points us to a new book, Flexible Imputation of Missing Data. [sent-1, score-0.188]

2 Stef’s book features an introduction by Don Rubin, which gets me thinking: if Don can find the time to write an introduction to somebody else’s book, he surely should be willing to read and comment on the third edition of his own book, no? [sent-8, score-2.439]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('book', 0.425), ('introduction', 0.355), ('supplies', 0.229), ('van', 0.189), ('flexible', 0.176), ('imputation', 0.17), ('ben', 0.169), ('limitations', 0.169), ('implementation', 0.161), ('edition', 0.16), ('surely', 0.158), ('intuition', 0.156), ('definitely', 0.15), ('developing', 0.149), ('rubin', 0.143), ('algorithm', 0.143), ('package', 0.142), ('features', 0.139), ('willing', 0.134), ('somebody', 0.133), ('third', 0.129), ('described', 0.125), ('excellent', 0.124), ('recommend', 0.118), ('highly', 0.118), ('missing', 0.114), ('gets', 0.105), ('worth', 0.1), ('else', 0.099), ('examples', 0.093), ('end', 0.092), ('comment', 0.092), ('approach', 0.086), ('reason', 0.083), ('thinking', 0.082), ('lots', 0.079), ('great', 0.077), ('write', 0.074), ('particular', 0.071), ('points', 0.071), ('read', 0.07), ('us', 0.068), ('find', 0.066), ('actually', 0.059), ('using', 0.057), ('new', 0.049), ('re', 0.049), ('time', 0.044), ('even', 0.041), ('data', 0.037)]
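
The page does not say how the simValue scores in the lists below are computed. One common choice for comparing tfidf vectors like the one above is cosine similarity, and the following is a minimal sketch under that assumption. The function name `cosine_similarity` and the weights in `other_post` are hypothetical and made up for illustration; only the four weights in `this_post` are taken from the list above.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as {term: weight} dicts."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(weight * weight for weight in a.values()))
    norm_b = math.sqrt(sum(weight * weight for weight in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        # An empty vector has no direction; treat it as dissimilar to everything.
        return 0.0
    return dot / (norm_a * norm_b)


# Four of the top weights listed above for this post.
this_post = dict([('book', 0.425), ('introduction', 0.355),
                  ('imputation', 0.17), ('missing', 0.114)])

# Hypothetical weights for some other post (invented for illustration only).
other_post = {'book': 0.3, 'imputation': 0.4, 'survey': 0.5}

print(cosine_similarity(this_post, this_post))   # a post compared with itself
print(cosine_similarity(this_post, other_post))  # a partially overlapping post
```

Under this assumption, a post compared with itself scores 1 up to floating-point rounding, which would be consistent with the same-blog entries below scoring just under 1.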

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.99999994 1642 andrew gelman stats-2012-12-28-New book by Stef van Buuren on missing-data imputation looks really good!


2 0.19982049 1782 andrew gelman stats-2013-03-30-“Statistical Modeling: A Fresh Approach”

Introduction: Ben Hansen recommended to me this book and course by Daniel Kaplan. It looks pretty good. I’ve only looked at the website, not the book itself, and I’m sure I’d find lots of places to disagree with it on details, but the general flow seemed reasonable, also I liked that there’s lots of course materials to go with it. Does anyone have any experience with this book? Is it the way to go (for now)?

3 0.16482349 8 andrew gelman stats-2010-04-28-Advice to help the rich get richer

Introduction: Tyler Cowen reviews a recent book, “Lifecycle Investing,” by Ian Ayres and Barry Nalebuff, two professors of management at Yale. The book recommends that young adults take out loans to buy stocks and then hold these stocks for many years to prepare for retirement. What I’m wondering is: What’s the goal of writing this sort of book? The main audience has got to be young adults (and their parents) who are already pretty well fixed, financially. Students at Yale, for example. And the book must be intended for people who are already beyond the standard recommendations of personal-investment books (pay off your credit card debt, don’t waste so much money on restaurant meals and fancy clothes, buy 1 used car instead of 2 new ones, etc). Basically it sounds like they’re talking to people who have a lot of money but want to make sure that they retire rich rather than merely middle-class. I can’t say that I’m morally opposed to helping the rich get richer. After all, I’m not out the

4 0.13204417 608 andrew gelman stats-2011-03-12-Single or multiple imputation?

Introduction: Vishnu Ganglani writes: It appears that multiple imputation is the best way to impute missing data because of the more accurate quantification of variance. However, when imputing missing data for income values in national household surveys, would you recommend maintaining the multiple datasets associated with multiple imputations, or would a single imputation method suffice? I have worked on household survey projects (in Scotland) and in the past have suggested single methods for ease of implementation, but with the availability of open source R software I am thinking of performing multiple imputation, though I am a bit apprehensive because of the complexity and the need to maintain multiple datasets (ease of implementation). My reply: In many applications I’ve just used a single random imputation to avoid the awkwardness of working with multiple datasets. But if there’s any concern, I’d recommend doing parallel analyses on multipl

5 0.12989923 1021 andrew gelman stats-2011-11-21-Don’t judge a book by its title

Introduction: A correspondent writes: I just want to spend a few words to point you to this book I have just found on Amazon: “Understanding The New Statistics: Effect Sizes, Confidence Intervals, and Meta-Analysis” by G. Cumming. I have been attracted by the rather unusual and ‘sexy’ title but it seems to be nothing more than an attempt at alerting the psychology community on considering point estimation procedures and confidence intervals, in place of hypothesis testing, the latter being ‘a terrible idea!’ in the author’s own words. Some more quotes here. Then he says: “‘These are hardly new techniques, but I label them ‘The New Statistics’ because using them would for many researchers be quite new, as well as a highly beneficial change!’” Of course the latter is not stated on the book cover. That’s about as bad as writing a book with subtitle, “Why Americans vote the way they do,” but not actually telling the reader why Americans vote the way they do. I guess what I’m saying is:

6 0.12816325 2021 andrew gelman stats-2013-09-13-Swiss Jonah Lehrer

7 0.12530133 1948 andrew gelman stats-2013-07-21-Bayes related

8 0.12272792 1783 andrew gelman stats-2013-03-31-He’s getting ready to write a book

9 0.11746679 125 andrew gelman stats-2010-07-02-The moral of the story is, Don’t look yourself up on Google

10 0.11339948 316 andrew gelman stats-2010-10-03-Suggested reading for a prospective statistician?

11 0.11284796 1984 andrew gelman stats-2013-08-16-BDA at 40% off!

12 0.11273876 1382 andrew gelman stats-2012-06-17-How to make a good fig?

13 0.10915111 590 andrew gelman stats-2011-02-25-Good introductory book for statistical computation?

14 0.10783486 1682 andrew gelman stats-2013-01-19-R package for Bayes factors

15 0.10625432 1634 andrew gelman stats-2012-12-21-Two reviews of Nate Silver’s new book, from Kaiser Fung and Cathy O’Neil

16 0.10383356 258 andrew gelman stats-2010-09-05-A review of a review of a review of a decade

17 0.10349887 1843 andrew gelman stats-2013-05-05-The New York Times Book of Mathematics

18 0.10320158 1436 andrew gelman stats-2012-07-31-A book on presenting numbers from spreadsheets

19 0.10293096 25 andrew gelman stats-2010-05-10-Two great tastes that taste great together

20 0.10084995 1338 andrew gelman stats-2012-05-23-Advice on writing research articles


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.143), (1, -0.015), (2, -0.054), (3, 0.082), (4, 0.05), (5, 0.06), (6, 0.046), (7, -0.016), (8, 0.105), (9, 0.031), (10, 0.065), (11, -0.064), (12, 0.023), (13, -0.03), (14, 0.172), (15, -0.009), (16, -0.051), (17, 0.04), (18, 0.089), (19, -0.121), (20, 0.032), (21, 0.021), (22, 0.045), (23, 0.062), (24, 0.028), (25, 0.025), (26, 0.07), (27, 0.047), (28, 0.102), (29, 0.014), (30, -0.124), (31, -0.031), (32, 0.005), (33, 0.053), (34, 0.001), (35, 0.061), (36, 0.019), (37, -0.037), (38, 0.009), (39, -0.05), (40, -0.054), (41, -0.011), (42, 0.011), (43, 0.02), (44, 0.016), (45, -0.004), (46, -0.031), (47, 0.043), (48, 0.016), (49, 0.022)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.98967785 1642 andrew gelman stats-2012-12-28-New book by Stef van Buuren on missing-data imputation looks really good!


2 0.92093652 31 andrew gelman stats-2010-05-13-Visualization in 1939

Introduction: Willard Cope Brinton’s second book Graphic Presentation (1939) surprised me with the quality of its graphics. Prof. Michael Stoll has some scans at Flickr. For example: The whole book can be downloaded (in a worse resolution) from Archive.Org.

3 0.90499461 1179 andrew gelman stats-2012-02-21-“Readability” as freedom from the actual sensation of reading

Introduction: In her essay on Margaret Mitchell and Gone With the Wind, Claudia Roth Pierpoint writes: The much remarked “readability” of the book must have played a part in this smooth passage from the page to the screen, since “readability” has to do not only with freedom from obscurity but, paradoxically, with freedom from the actual sensation of reading [emphasis added]—of the tug and traction of words as they move thoughts into place in the mind. Requiring, in fact, the least reading, the most “readable” book allows its characters to slip easily through nets of words and into other forms. Popular art has been well defined by just this effortless movement from medium to medium, which is carried out, as Leslie Fiedler observed in relation to Uncle Tom’s Cabin, “without loss of intensity or alteration of meaning.” Isabel Archer rises from the page only in the hanging garments of Henry James’s prose, but Scarlett O’Hara is a free woman. Well put. I wish Pierpoint would come out with ano

4 0.88576329 1984 andrew gelman stats-2013-08-16-BDA at 40% off!

Introduction: Our publisher informs me of the exciting news that Amazon is now selling the 3rd edition of our book at 40% off! Enjoy.

5 0.88489312 1782 andrew gelman stats-2013-03-30-“Statistical Modeling: A Fresh Approach”

Introduction: Ben Hansen recommended to me this book and course by Daniel Kaplan. It looks pretty good. I’ve only looked at the website, not the book itself, and I’m sure I’d find lots of places to disagree with it on details, but the general flow seemed reasonable, also I liked that there’s lots of course materials to go with it. Does anyone have any experience with this book? Is it the way to go (for now)?

6 0.8707391 1188 andrew gelman stats-2012-02-28-Reference on longitudinal models?

7 0.85236549 1970 andrew gelman stats-2013-08-06-New words of 1917

8 0.85044175 1843 andrew gelman stats-2013-05-05-The New York Times Book of Mathematics

9 0.84910911 2021 andrew gelman stats-2013-09-13-Swiss Jonah Lehrer

10 0.83431387 1382 andrew gelman stats-2012-06-17-How to make a good fig?

11 0.8342728 127 andrew gelman stats-2010-07-04-Inequality and health

12 0.83152449 1783 andrew gelman stats-2013-03-31-He’s getting ready to write a book

13 0.81557316 1436 andrew gelman stats-2012-07-31-A book on presenting numbers from spreadsheets

14 0.80226529 115 andrew gelman stats-2010-06-28-Whassup with those crappy thrillers?

15 0.79323184 1895 andrew gelman stats-2013-06-12-Peter Thiel is writing another book!

16 0.78569698 8 andrew gelman stats-2010-04-28-Advice to help the rich get richer

17 0.78537118 16 andrew gelman stats-2010-05-04-Burgess on Kipling

18 0.78431833 2168 andrew gelman stats-2014-01-12-Things that I like that almost nobody else is interested in

19 0.78261411 986 andrew gelman stats-2011-11-01-MacKay update: where 12 comes from

20 0.77741021 590 andrew gelman stats-2011-02-25-Good introductory book for statistical computation?


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(0, 0.033), (2, 0.025), (16, 0.141), (20, 0.025), (21, 0.042), (24, 0.101), (34, 0.05), (82, 0.021), (96, 0.071), (99, 0.367)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.97647387 1642 andrew gelman stats-2012-12-28-New book by Stef van Buuren on missing-data imputation looks really good!


2 0.96883446 2083 andrew gelman stats-2013-10-31-Value-added modeling in education: Gaming the system by sending kids on a field trip at test time

Introduction: Just in time for Halloween, here’s a horror story for you . . . Howard Wainer writes: In my book “Uneducated Guesses” in the chapter on value-added models, I discuss how the treatment of missing data can have a profound effect on the estimates of teacher scores. I made up how a principal might send the best students on a field trip at the beginning of the year when the ‘pre-test’ was given (and their scores would be imputed from the students who showed up) and that the bottom half of the class would have a matching field trip on the day of the post test. Everyone laughed. But apparently someone decided to take it seriously. http://www.amren.com/news/2012/10/el-paso-schools-confront-scandal-of-students-who-disappeared-at-test-time/ http://www.elpasotimes.com/episd/ci_20848628/former-episd-superintendent-lorenzo-garcia-enter-plea-aggreement You can’t make this stuff up. This sort of thing is not surprising but it’s worth keeping in mind. That a measurement system c

3 0.96767002 2280 andrew gelman stats-2014-04-03-As the boldest experiment in journalism history, you admit you made a mistake

Introduction: The pre-NYT David Brooks liked to make fun of the NYT. Here’s one from 1997: I’m not sure I’d like to be one of the people featured on the New York Times wedding page, but I know I’d like to be the father of one of them. Imagine how happy Stanley J. Kogan must have been, for example, when his daughter Jamie got into Yale. Then imagine his pride when Jamie made Phi Beta Kappa and graduated summa cum laude. . . . he must have enjoyed a gloat or two when his daughter put on that cap and gown. And things only got better. Jamie breezed through Stanford Law School. And then she met a man—Thomas Arena—who appears to be exactly the sort of son-in-law that pediatric urologists dream about. . . . These two awesome resumes collided at a wedding ceremony . . . It must have been one of the happiest days in Stanley J. Kogan’s life. The rest of us got to read about it on the New York Times wedding page. Brooks is reputed to be Jewish himself so I think it’s ok for him to mock Jewish peop

4 0.96578431 859 andrew gelman stats-2011-08-18-Misunderstanding analysis of covariance

Introduction: Jeremy Miles writes: Are you familiar with Miller and Chapman’s (2001) article, Misunderstanding Analysis of Covariance, saying that ANCOVA (and therefore, I suppose, regression) should not be used when groups differ on a covariate? It has caused a moderate splash in psychology circles. I wondered if you had any thoughts on it. I had not heard of the article so I followed the link . . . ugh! Already on the very first column of the very first page they confuse nonadditivity with nonlinearity. I could probably continue with, “and it gets worse,” but since nobody’s paying me to read this one, I’ll stop reading right there on the first page! I prefer when people point me to good papers to read. . . .

5 0.96478504 935 andrew gelman stats-2011-10-01-When should you worry about imputed data?

Introduction: Majid Ezzati writes: My research group is increasingly focusing on a series of problems that involve data that either have missingness or measurements that may have bias/error. We have at times developed our own approaches to imputation (as simple as interpolating a missing unit and as sophisticated as a problem-specific Bayesian hierarchical model) and at other times, other groups impute the data. The outputs are being used to investigate the basic associations between pairs of variables, Xs and Ys, in regressions; we may or may not interpret these as causal. I am contacting colleagues with relevant expertise to suggest good references on whether having imputed X and/or Y in a subsequent regression is correct or if it could somehow lead to biased/spurious associations. Thinking about this, we can have at least the following situations (these could all be Bayesian or not): 1) X and Y both measured (perhaps with error) 2) Y imputed using some data and a model and X measur

6 0.96438563 205 andrew gelman stats-2010-08-13-Arnold Zellner

7 0.96387058 886 andrew gelman stats-2011-09-02-The new Helen DeWitt novel

8 0.9638145 814 andrew gelman stats-2011-07-21-The powerful consumer?

9 0.96369934 2186 andrew gelman stats-2014-01-26-Infoviz on top of stat graphic on top of spreadsheet

10 0.96224022 1452 andrew gelman stats-2012-08-09-Visually weighting regression displays

11 0.96216017 2301 andrew gelman stats-2014-04-22-Ticket to Baaaaarf

12 0.96197522 2106 andrew gelman stats-2013-11-19-More on “data science” and “statistics”

13 0.96187234 302 andrew gelman stats-2010-09-28-This is a link to a news article about a scientific paper

14 0.96083397 2368 andrew gelman stats-2014-06-11-Bayes in the research conversation

15 0.96083188 2065 andrew gelman stats-2013-10-17-Cool dynamic demographic maps provide beautiful illustration of Chris Rock effect

16 0.96081012 54 andrew gelman stats-2010-05-27-Hype about conditional probability puzzles

17 0.96033466 690 andrew gelman stats-2011-05-01-Peter Huber’s reflections on data analysis

18 0.95938629 722 andrew gelman stats-2011-05-20-Why no Wegmania?

19 0.9592272 430 andrew gelman stats-2010-11-25-The von Neumann paradox

20 0.95919889 1887 andrew gelman stats-2013-06-07-“Happy Money: The Science of Smarter Spending”