andrew_gelman_stats andrew_gelman_stats-2011 andrew_gelman_stats-2011-595 knowledge-graph by maker-knowledge-mining

595 andrew gelman stats-2011-02-28-What Zombies see in Scatterplots


meta infos for this blog

Source: html

Introduction: This video caught my interest – news video clip (from this post2 ) http://www.stat.columbia.edu/~cook/movabletype/archives/2011/02/on_summarizing.html The news commentator did seem to be trying to point out what a couple of states had to say about the claimed relationship – almost on their own. Some methods have been worked out for zombies to do just this! So I grabbed the data as close as I quickly could, modified the code slightly and here’s the zombie veiw of it. PoliticInt.pdf North Carolina is the bolded red curve, Idaho the bolded green curve. Missisipi and New York are the bolded blue. As ugly as it is this is the Bayasian marginal picture – exactly (given MCMC errror). K? p.s. you will get a very confusing picture if you forget to centre the x (i.e. see chapter 4 of Gelman and Hill book)


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 This video caught my interest – news video clip (from this post2 ) http://www. [sent-1, score-0.975]

2 html The news commentator did seem to be trying to point out what a couple of states had to say about the claimed relationship – almost on their own. [sent-5, score-0.901]

3 Some methods have been worked out for zombies to do just this! [sent-6, score-0.264]

4 So I grabbed the data as close as I quickly could, modified the code slightly and here’s the zombie veiw of it. [sent-7, score-0.805]

5 pdf North Carolina is the bolded red curve, Idaho the bolded green curve. [sent-9, score-1.457]

6 As ugly as it is this is the Bayasian marginal picture – exactly (given MCMC errror). [sent-11, score-0.478]

7 you will get a very confusing picture if you forget to centre the x (i. [sent-15, score-0.593]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('bolded', 0.625), ('video', 0.256), ('picture', 0.188), ('commentator', 0.179), ('zombie', 0.171), ('idaho', 0.171), ('clip', 0.165), ('carolina', 0.16), ('centre', 0.149), ('grabbed', 0.147), ('modified', 0.138), ('news', 0.135), ('zombies', 0.132), ('north', 0.126), ('confusing', 0.124), ('green', 0.122), ('hill', 0.117), ('curve', 0.115), ('mcmc', 0.114), ('ugly', 0.113), ('forget', 0.105), ('marginal', 0.104), ('relationship', 0.103), ('claimed', 0.102), ('quickly', 0.102), ('caught', 0.098), ('slightly', 0.091), ('red', 0.085), ('http', 0.085), ('gelman', 0.084), ('code', 0.083), ('chapter', 0.08), ('york', 0.079), ('worked', 0.077), ('states', 0.075), ('exactly', 0.073), ('close', 0.073), ('almost', 0.066), ('interest', 0.065), ('couple', 0.065), ('trying', 0.055), ('methods', 0.055), ('book', 0.05), ('seem', 0.05), ('given', 0.046), ('say', 0.036), ('point', 0.035), ('new', 0.034), ('could', 0.028), ('get', 0.027)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 595 andrew gelman stats-2011-02-28-What Zombies see in Scatterplots

Introduction: This video caught my interest – news video clip (from this post2 ) http://www.stat.columbia.edu/~cook/movabletype/archives/2011/02/on_summarizing.html The news commentator did seem to be trying to point out what a couple of states had to say about the claimed relationship – almost on their own. Some methods have been worked out for zombies to do just this! So I grabbed the data as close as I quickly could, modified the code slightly and here’s the zombie veiw of it. PoliticInt.pdf North Carolina is the bolded red curve, Idaho the bolded green curve. Missisipi and New York are the bolded blue. As ugly as it is this is the Bayasian marginal picture – exactly (given MCMC errror). K? p.s. you will get a very confusing picture if you forget to centre the x (i.e. see chapter 4 of Gelman and Hill book)

2 0.18487681 396 andrew gelman stats-2010-11-05-Journalism in the age of data

Introduction: Journalism in the age of data is a video report including interviews with many visualization people. It’s also a great example of how citations, and further information appear alongside with the video – showing us the future of video content online.

3 0.10216583 725 andrew gelman stats-2011-05-21-People kept emailing me this one so I think I have to blog something

Introduction: Here and here , for example. I just hope they’re using our survey methods and aren’t trying to contact the zombies face-to-face!

4 0.087441206 148 andrew gelman stats-2010-07-15-“Gender Bias Still Exists in Modern Children’s Literature, Say Centre Researchers”

Introduction: You know that expression, “Not from the Onion”? How did we say that, all those years before the Onion existed? I was thinking about this after encountering (amidst a Google search for something else) this article on a website called “College News”: DANVILLE, KY., March 8, 2007–Two Centre College professors spent the past six years reading and analyzing 200 children’s books to discover a disturbing trend: gender bias still exists in much of modern children’s literature. Dr. David Anderson, professor of economics, and Dr. Mykol Hamilton, professor of psychology, have documented that gender bias is common today in many children’s books in their research published recently in Sex Roles: A Journal of Research titled “Gender Stereotyping and Under-Representation of Female Characters in 200 Popular Children’s Picture Books: A 21st Century Update.” . . . “Centre College,” huh? That’s where Area Man is studying, right? According to the materials on its website, Centre College is

5 0.072529212 857 andrew gelman stats-2011-08-17-Bayes pays

Introduction: George Leckie writes: The Centre for Multilevel Modelling at the University of Bristol is seeking to appoint an applied statistician to work on a new ESRC-funded project, Longitudinal Effects, Multilevel Modelling and Applications (LEMMA 3). LEMMA 3 is one of six Nodes of the National Centre for Research Methods (NCRM). The LEMMA 3 Node will focus on methods for the analysis of longitudinal data. The appointment, at Research Assistant or Research Associate level, will be for 2.5 years with likelihood of extension to the end of September 2014. For further details, including information on how to apply online, please go to http://www.bris.ac.uk/boris/jobs/feeds/ads?ID=100571 By “modelling,” I think he means “modeling.” And by “centre,” I think he means “center.” But I think you get the basic idea. It looks like a great place to do research.

6 0.071245059 376 andrew gelman stats-2010-10-28-My talk at American University

7 0.068161264 612 andrew gelman stats-2011-03-14-Uh-oh

8 0.064755976 1543 andrew gelman stats-2012-10-21-Model complexity as a function of sample size

9 0.061728992 1277 andrew gelman stats-2012-04-23-Infographic of the year

10 0.059686959 1698 andrew gelman stats-2013-01-30-The spam just gets weirder and weirder

11 0.057460234 1256 andrew gelman stats-2012-04-10-Our data visualization panel at the New York Public Library

12 0.055046719 126 andrew gelman stats-2010-07-03-Graphical presentation of risk ratios

13 0.053552393 1972 andrew gelman stats-2013-08-07-When you’re planning on fitting a model, build up to it by fitting simpler models first. Then, once you have a model you like, check the hell out of it

14 0.053278692 1983 andrew gelman stats-2013-08-15-More on AIC, WAIC, etc

15 0.053180542 125 andrew gelman stats-2010-07-02-The moral of the story is, Don’t look yourself up on Google

16 0.05213609 452 andrew gelman stats-2010-12-06-Followup questions

17 0.051414788 1518 andrew gelman stats-2012-10-02-Fighting a losing battle

18 0.051045492 683 andrew gelman stats-2011-04-28-Asymmetry in Political Bias

19 0.050957393 1347 andrew gelman stats-2012-05-27-Macromuddle

20 0.050734725 1735 andrew gelman stats-2013-02-24-F-f-f-fake data


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.073), (1, -0.012), (2, -0.006), (3, 0.032), (4, 0.017), (5, 0.007), (6, -0.009), (7, -0.015), (8, -0.014), (9, 0.011), (10, -0.004), (11, -0.028), (12, 0.017), (13, -0.007), (14, 0.032), (15, 0.022), (16, 0.008), (17, -0.01), (18, 0.006), (19, -0.02), (20, 0.035), (21, 0.01), (22, 0.015), (23, -0.007), (24, 0.013), (25, -0.009), (26, -0.026), (27, 0.009), (28, 0.053), (29, -0.017), (30, -0.02), (31, -0.005), (32, 0.004), (33, 0.014), (34, 0.033), (35, 0.008), (36, 0.003), (37, -0.022), (38, -0.014), (39, -0.011), (40, -0.023), (41, 0.038), (42, 0.054), (43, 0.033), (44, -0.0), (45, 0.021), (46, 0.006), (47, 0.0), (48, 0.019), (49, -0.016)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.95581985 595 andrew gelman stats-2011-02-28-What Zombies see in Scatterplots

Introduction: This video caught my interest – news video clip (from this post2 ) http://www.stat.columbia.edu/~cook/movabletype/archives/2011/02/on_summarizing.html The news commentator did seem to be trying to point out what a couple of states had to say about the claimed relationship – almost on their own. Some methods have been worked out for zombies to do just this! So I grabbed the data as close as I quickly could, modified the code slightly and here’s the zombie veiw of it. PoliticInt.pdf North Carolina is the bolded red curve, Idaho the bolded green curve. Missisipi and New York are the bolded blue. As ugly as it is this is the Bayasian marginal picture – exactly (given MCMC errror). K? p.s. you will get a very confusing picture if you forget to centre the x (i.e. see chapter 4 of Gelman and Hill book)

2 0.60385299 739 andrew gelman stats-2011-05-31-When Did Girls Start Wearing Pink?

Introduction: That cute picture is of toddler FDR in a dress, from 1884. Jeanne Maglaty writes : A Ladies’ Home Journal article [or maybe from a different source, according to a commenter] in June 1918 said, “The generally accepted rule is pink for the boys, and blue for the girls. The reason is that pink, being a more decided and stronger color, is more suitable for the boy, while blue, which is more delicate and dainty, is prettier for the girl.” Other sources said blue was flattering for blonds, pink for brunettes; or blue was for blue-eyed babies, pink for brown-eyed babies, according to Paoletti. In 1927, Time magazine printed a chart showing sex-appropriate colors for girls and boys according to leading U.S. stores. In Boston, Filene’s told parents to dress boys in pink. So did Best & Co. in New York City, Halle’s in Cleveland and Marshall Field in Chicago. Today’s color dictate wasn’t established until the 1940s . . . When the women’s liberation movement arrived in the mid-1960s, w

3 0.57948267 927 andrew gelman stats-2011-09-26-R and Google Visualization

Introduction: Eric Tassone writes: Here’s something that may be of interest and useful to your readers, and which I [Tassone] am just now checking out myself. It links R and the Google Visualization API/Google Chart Tools to make Motion Charts (as used in the well known Hans Rosling TED talk) easier to create directly in R. The website is here , and here ‘s a blog about how to use it, including some R code that actually works (if the user has all the requisite libraries, of course) in your own browser.

4 0.56898963 2277 andrew gelman stats-2014-03-31-The most-cited statistics papers ever

Introduction: Robert Grant has a list . I’ll just give the ones with more than 10,000 Google Scholar cites: Cox (1972) Regression and life tables: 35,512 citations. Dempster, Laird, Rubin (1977) Maximum likelihood from incomplete data via the EM algorithm: 34,988 Bland & Altman (1986) Statistical methods for assessing agreement between two methods of clinical measurement: 27,181 Geman & Geman (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images: 15,106 We can find some more via searching Google scholar for familiar names and topics; thus: Metropolis et al. (1953) Equation of state calculations by fast computing machines: 26,000 Benjamini and Hochberg (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing: 21,000 White (1980) A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity: 18,000 Heckman (1977) Sample selection bias as a specification error:

5 0.563411 396 andrew gelman stats-2010-11-05-Journalism in the age of data

Introduction: Journalism in the age of data is a video report including interviews with many visualization people. It’s also a great example of how citations, and further information appear alongside with the video – showing us the future of video content online.

6 0.56212956 1172 andrew gelman stats-2012-02-17-Rare name analysis and wealth convergence

7 0.55944818 760 andrew gelman stats-2011-06-12-How To Party Your Way Into a Multi-Million Dollar Facebook Job

8 0.55276167 824 andrew gelman stats-2011-07-26-Milo and Milo

9 0.53657424 1783 andrew gelman stats-2013-03-31-He’s getting ready to write a book

10 0.53208596 1788 andrew gelman stats-2013-04-04-When is there “hidden structure in data” to be discovered?

11 0.52895808 289 andrew gelman stats-2010-09-21-“How segregated is your city?”: A story of why every graph, no matter how clear it seems to be, needs a caption to anchor the reader in some numbers

12 0.52823079 1286 andrew gelman stats-2012-04-28-Agreement Groups in US Senate and Dynamic Clustering

13 0.52724379 2063 andrew gelman stats-2013-10-16-My talk 19h this evening

14 0.52666515 1477 andrew gelman stats-2012-08-30-Visualizing Distributions of Covariance Matrices

15 0.52611995 2260 andrew gelman stats-2014-03-22-Postdoc at Rennes on multilevel missing data imputation

16 0.52144217 198 andrew gelman stats-2010-08-11-Multilevel modeling in R on a Mac

17 0.51773983 1542 andrew gelman stats-2012-10-20-A statistical model for underdispersion

18 0.51715678 2228 andrew gelman stats-2014-02-28-Combining two of my interests

19 0.51706684 1177 andrew gelman stats-2012-02-20-Joshua Clover update

20 0.51220828 1297 andrew gelman stats-2012-05-03-New New York data research organizations


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(15, 0.015), (16, 0.086), (20, 0.019), (24, 0.047), (43, 0.024), (61, 0.024), (63, 0.038), (64, 0.253), (65, 0.054), (74, 0.017), (76, 0.02), (87, 0.024), (89, 0.017), (92, 0.022), (99, 0.212)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.93038791 1109 andrew gelman stats-2012-01-09-Google correlate links statistics with minorities

Introduction: John Eppley asks what I make of this : Eppley is guessing the negative spikes are searches getting swamped by holiday season shoppers.

2 0.8792882 985 andrew gelman stats-2011-11-01-Doug Schoen has 2 poll reports

Introduction: According to Chris Wilson , there are two versions of the report of the Occupy Wall Street poll from so-called hack pollster Doug Schoen. Here’s the report that Azi Paybarah says that Schoen sent to him, and here’s the final question from the poll: And here’s what’s on Schoen’s own website: Very similar, except for that last phrase, “no matter what the cost.” I have no idea which was actually asked to the survey participants, but it’s a reminder of the difficulties of public opinion research—sometimes you don’t even know what question was asked! I’m not implying anything sinister on Schoen’s part, it’s just interesting to see these two documents floating around. P.S. More here from Kaiser Fung on fundamental flaws with Schoen’s poll.

same-blog 3 0.87356615 595 andrew gelman stats-2011-02-28-What Zombies see in Scatterplots

Introduction: This video caught my interest – news video clip (from this post2 ) http://www.stat.columbia.edu/~cook/movabletype/archives/2011/02/on_summarizing.html The news commentator did seem to be trying to point out what a couple of states had to say about the claimed relationship – almost on their own. Some methods have been worked out for zombies to do just this! So I grabbed the data as close as I quickly could, modified the code slightly and here’s the zombie veiw of it. PoliticInt.pdf North Carolina is the bolded red curve, Idaho the bolded green curve. Missisipi and New York are the bolded blue. As ugly as it is this is the Bayasian marginal picture – exactly (given MCMC errror). K? p.s. you will get a very confusing picture if you forget to centre the x (i.e. see chapter 4 of Gelman and Hill book)

4 0.82723701 724 andrew gelman stats-2011-05-21-New search engine for data & statistics

Introduction: Jon Goldhill points us to a new search engine, Zanran , which is for finding data and statistics. Goldhill writes: It’s useful when you’re looking for a graph/table rather than a single number. For example, if you look for ‘teenage births rates in the united states’ in Zanran you’ll see a series of graphs. If you check in Google, there’s plenty of material – but you’d have to open everything up to see if it had any real numbers. (I hope you’ll appreciate Zanran’s preview capability as well – hovering over the icons gives a useful preview of the content.)

5 0.79900324 1521 andrew gelman stats-2012-10-04-Columbo does posterior predictive checks

Introduction: I’m already on record as saying that Ronald Reagan was a statistician so I think this is ok too . . . Here’s what Columbo does. He hears the killer’s story and he takes it very seriously (it’s murder, and Columbo never jokes about murder), examines all its implications, and finds where it doesn’t fit the data. Then Columbo carefully examines the discrepancies, tries some model expansion, and eventually concludes that he’s proved there’s a problem. OK, now you’re saying: Yeah, yeah, sure, but how does that differ from any other fictional detective? The difference, I think, is that the tradition is for the detective to find clues and use these to come up with hypotheses, or to trap the killer via internal contradictions in his or her statement. I see Columbo is different—and more in keeping with chapter 6 of Bayesian Data Analysis—in that he is taking the killer’s story seriously and exploring all its implications. That’s the essence of predictive model checking: you t

6 0.78078544 1058 andrew gelman stats-2011-12-14-Higgs bozos: Rosencrantz and Guildenstern are spinning in their graves

7 0.77427959 118 andrew gelman stats-2010-06-30-Question & Answer Communities

8 0.76823735 1653 andrew gelman stats-2013-01-04-Census dotmap

9 0.74759328 11 andrew gelman stats-2010-04-29-Auto-Gladwell, or Can fractals be used to predict human history?

10 0.71062064 977 andrew gelman stats-2011-10-27-Hack pollster Doug Schoen illustrates a general point: The #1 way to lie with statistics is . . . to just lie!

11 0.69779551 1637 andrew gelman stats-2012-12-24-Textbook for data visualization?

12 0.67775464 304 andrew gelman stats-2010-09-29-Data visualization marathon

13 0.67478025 1008 andrew gelman stats-2011-11-13-Student project competition

14 0.67239964 1554 andrew gelman stats-2012-10-31-It not necessary that Bayesian methods conform to the likelihood principle

15 0.67139441 1845 andrew gelman stats-2013-05-07-Is Felix Salmon wrong on free TV?

16 0.66873747 1761 andrew gelman stats-2013-03-13-Lame Statistics Patents

17 0.66834068 1949 andrew gelman stats-2013-07-21-Defensive political science responds defensively to an attack on social science

18 0.66775894 2249 andrew gelman stats-2014-03-15-Recently in the sister blog

19 0.6671949 2197 andrew gelman stats-2014-02-04-Peabody here.

20 0.66410345 100 andrew gelman stats-2010-06-19-Unsurprisingly, people are more worried about the economy and jobs than about deficits