
817 andrew gelman stats-2011-07-23-New blog home


meta info for this blog

Source: html

Introduction: Hi all. We’ve moved the blog and are still working out some bugs. For example, we delete spam comments but sometimes they remain on the blog. A few other things. We should be cleaning it up more in the next few days.


Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 We’ve moved the blog and are still working out some bugs. [sent-2, score-0.676]

2 For example, we delete spam comments but sometimes they remain on the blog. [sent-3, score-1.433]

3 We should be cleaning it up more in the next few days. [sent-5, score-0.563]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('delete', 0.467), ('cleaning', 0.384), ('hi', 0.384), ('spam', 0.348), ('remain', 0.274), ('moved', 0.273), ('days', 0.233), ('next', 0.179), ('sometimes', 0.174), ('comments', 0.17), ('working', 0.161), ('still', 0.123), ('blog', 0.119), ('ve', 0.091), ('example', 0.089)]
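The sentence scores in the summary are consistent with a simple rule: split each sentence on whitespace and add up the tfidf weights of the tokens that exactly match a listed word. This is an inference from the numbers on this page, not a documented description of the model. Under that assumption, tokens with attached punctuation such as "blog." or "example," contribute nothing. The function name `sentence_score` is hypothetical.

```python
# Top-N tfidf weights as listed for this post.
weights = {
    "delete": 0.467, "cleaning": 0.384, "hi": 0.384, "spam": 0.348,
    "remain": 0.274, "moved": 0.273, "days": 0.233, "next": 0.179,
    "sometimes": 0.174, "comments": 0.17, "working": 0.161,
    "still": 0.123, "blog": 0.119, "ve": 0.091, "example": 0.089,
}


def sentence_score(sentence, word_weights):
    """Sum the weights of whitespace-separated tokens found in word_weights.

    Assumption: no lowercasing and no punctuation stripping, so a token
    like "blog." does not match the key "blog".
    """
    total = sum(word_weights.get(token, 0.0) for token in sentence.split())
    return round(total, 3)


sentences = [
    "We’ve moved the blog and are still working out some bugs.",
    "For example, we delete spam comments but sometimes they remain on the blog.",
    "We should be cleaning it up more in the next few days.",
]

scores = [sentence_score(s, weights) for s in sentences]
print(scores)
```

With the weights above, this sketch reproduces the three listed scores (0.676, 1.433, 0.563), which is the only evidence for the rule.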

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 817 andrew gelman stats-2011-07-23-New blog home


2 0.25849161 425 andrew gelman stats-2010-11-21-If your comment didn’t get through . . .

Introduction: It probably got caught in the spam filter. We get tons and tons of spam (including the annoying spam that I have to remove by hand). If your comment was accompanied by an ad or a spam link, then maybe I just deleted it.

3 0.24492717 619 andrew gelman stats-2011-03-19-If a comment is flagged as spam, it will disappear forever

Introduction: A commenter wrote (by email): I’ve noticed that you’ve quit approving my comments on your blog. I hope I didn’t anger you in some way or write something you felt was inappropriate. My reply: I have not been unapproving any comments. If you have comments that have not appeared, they have probably been going into the spam filter. I get literally thousands of spam comments a day and so anything that hits the spam filter is gone forever. I think there is a way to register as a commenter; that could help.

4 0.23794456 424 andrew gelman stats-2010-11-21-Data cleaning tool!

Introduction: Hal Varian writes: You might find this a useful tool for cleaning data. I haven’t tried it out yet, but data cleaning is a hugely important topic and so this could be a big deal.

5 0.21567863 771 andrew gelman stats-2011-06-16-30 days of statistics

Introduction: I was talking with a colleague about one of our research projects and said that I would write something up, if blogging didn’t get in the way. She suggested that for the next month I just blog about my research ideas. So I think I’ll do that. This means no mocking of plagiarists, no reflections on literature, no answers to miscellaneous questions about how many groups you need in a multilevel model, no rants about economists, no links to pretty graphs, etc., for 30 days. Meanwhile, I have a roughly 30-day backlog. So after my next 30 days of stat blogging, the backlog will gradually appear. There’s some good stuff there, including reflections on Milos, a (sincere) tribute to the haters, an updated Twitteo Killed the Bloggio Star, a question about acupuncture, and some remote statistical modeling advice I gave that actually worked! I’m sure you’ll enjoy it. But you’ll have to wait for all that fun stuff. For the next thirty days, it’s statistics research every day. P.S. I

6 0.21094407 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”

7 0.20753759 1488 andrew gelman stats-2012-09-08-Annals of spam

8 0.19636701 790 andrew gelman stats-2011-07-08-Blog in motion

9 0.15537655 856 andrew gelman stats-2011-08-16-Our new improved blog! Thanks to Cord Blomquist

10 0.14900102 27 andrew gelman stats-2010-05-11-Update on the spam email study

11 0.14414424 839 andrew gelman stats-2011-08-04-To commenters who are trying to sell something

12 0.1306887 523 andrew gelman stats-2011-01-18-Spam is out of control

13 0.1202144 220 andrew gelman stats-2010-08-20-Why I blog?

14 0.10362935 1330 andrew gelman stats-2012-05-19-Cross-validation to check missing-data imputation

15 0.10349446 2063 andrew gelman stats-2013-10-16-My talk 19h this evening

16 0.096988089 1694 andrew gelman stats-2013-01-26-Reflections on ethicsblogging

17 0.095887735 2160 andrew gelman stats-2014-01-06-Spam names

18 0.09386383 2236 andrew gelman stats-2014-03-07-Selection bias in the reporting of shaky research

19 0.093155265 1709 andrew gelman stats-2013-02-06-The fractal nature of scientific revolutions

20 0.086849891 545 andrew gelman stats-2011-01-30-New innovations in spam
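The page does not say how simValue is computed. A common choice for comparing tfidf vectors is cosine similarity, and the same-blog value of 1.0 in the list above is what cosine similarity gives for a vector compared with itself. The sketch below assumes that measure. The second vector, `other_post`, is made up for illustration and does not come from any listed post.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# A few of the tfidf weights listed for this post.
this_post = {"delete": 0.467, "cleaning": 0.384, "spam": 0.348, "comments": 0.17}

# Hypothetical weights for another post that also mentions spam and comments.
other_post = {"spam": 0.6, "comments": 0.3, "filter": 0.5}

print(cosine_similarity(this_post, this_post))   # a vector against itself
print(cosine_similarity(this_post, other_post))  # partial term overlap
```

Posts that share heavily weighted terms such as "spam" score higher under this measure, which matches the spam-related titles near the top of the list.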


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.065), (1, -0.034), (2, -0.043), (3, 0.031), (4, 0.025), (5, -0.002), (6, 0.045), (7, -0.055), (8, 0.031), (9, -0.051), (10, 0.027), (11, 0.032), (12, 0.151), (13, 0.033), (14, -0.028), (15, 0.087), (16, -0.047), (17, -0.081), (18, -0.057), (19, 0.085), (20, 0.09), (21, -0.075), (22, -0.057), (23, -0.107), (24, 0.0), (25, -0.003), (26, 0.033), (27, 0.078), (28, -0.025), (29, -0.05), (30, 0.012), (31, 0.044), (32, -0.018), (33, -0.012), (34, -0.038), (35, 0.091), (36, 0.014), (37, 0.092), (38, -0.022), (39, -0.02), (40, -0.142), (41, 0.081), (42, -0.046), (43, 0.027), (44, -0.004), (45, -0.022), (46, 0.057), (47, 0.003), (48, 0.004), (49, -0.022)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.96898836 817 andrew gelman stats-2011-07-23-New blog home


2 0.92653674 619 andrew gelman stats-2011-03-19-If a comment is flagged as spam, it will disappear forever


3 0.90411669 425 andrew gelman stats-2010-11-21-If your comment didn’t get through . . .


4 0.90051591 523 andrew gelman stats-2011-01-18-Spam is out of control

Introduction: I just took a look at the spam folder . . . 600 messages in the past hour! Seems pretty ridiculous to me.

5 0.89382011 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”

Introduction: To the person who posted an apparently non-spam comment with a URL link to a “cheap cigarettes” website: In case you’re wondering, no, your comment didn’t get caught by the spam filter–I’m not sure why not, given that URL. I put it in the spam file manually. If you’d like to participate in blog discussion in the future, please refrain from including spam links. Thank you. Also, it’s “John Tukey,” not “John Turkey.”

6 0.86990839 1488 andrew gelman stats-2012-09-08-Annals of spam

7 0.82950133 839 andrew gelman stats-2011-08-04-To commenters who are trying to sell something

8 0.80328029 1709 andrew gelman stats-2013-02-06-The fractal nature of scientific revolutions

9 0.76471299 771 andrew gelman stats-2011-06-16-30 days of statistics

10 0.71229643 790 andrew gelman stats-2011-07-08-Blog in motion

11 0.69945651 220 andrew gelman stats-2010-08-20-Why I blog?

12 0.68515015 9 andrew gelman stats-2010-04-28-But it all goes to pay for gas, car insurance, and tolls on the turnpike

13 0.67284852 876 andrew gelman stats-2011-08-28-Vaguely related to the coke-dumping story

14 0.65265363 1168 andrew gelman stats-2012-02-14-The tabloids strike again

15 0.64566666 2160 andrew gelman stats-2014-01-06-Spam names

16 0.63381559 1791 andrew gelman stats-2013-04-07-Scatterplot charades!

17 0.56878072 27 andrew gelman stats-2010-05-11-Update on the spam email study

18 0.56559855 545 andrew gelman stats-2011-01-30-New innovations in spam

19 0.55691701 856 andrew gelman stats-2011-08-16-Our new improved blog! Thanks to Cord Blomquist

20 0.54610986 1202 andrew gelman stats-2012-03-08-Between and within-Krugman correlation


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(13, 0.335), (98, 0.08), (99, 0.368)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.96350098 345 andrew gelman stats-2010-10-15-Things we do on sabbatical instead of actually working

Introduction: Frank Fischer, a political scientist at Rutgers U., says his alleged plagiarism was mere sloppiness and not all that uncommon in scholarship. I’ve heard about plagiarism but I had no idea it occurred in political science.

2 0.9367007 1559 andrew gelman stats-2012-11-02-The blog is back

Introduction: We had some security problem: not an actual virus or anything, but a potential leak which caused Google to blacklist us. Cord fixed us and now we’re fine. Good job, Google! Better to find the potential problem before there is any harm!

same-blog 3 0.92039001 817 andrew gelman stats-2011-07-23-New blog home


4 0.88603622 424 andrew gelman stats-2010-11-21-Data cleaning tool!


5 0.87896818 1519 andrew gelman stats-2012-10-02-Job!

Introduction: Faten Sabry writes: We are looking to hire full time analysts at the undergraduate and graduate levels. The work involves extensive econometric analysis and handling of large databases. The analysts will be part of a team working to address various empirical microeconomic issues. I worked with Faten and her colleagues on a consulting project once, and they seemed like reasonable people to me.

6 0.87450445 2011 andrew gelman stats-2013-09-07-Here’s what happened when I finished my PhD thesis

7 0.87240767 1514 andrew gelman stats-2012-09-28-AdviseStat 47% Campaign Ad

8 0.86428392 234 andrew gelman stats-2010-08-25-Modeling constrained parameters

9 0.84812295 1852 andrew gelman stats-2013-05-12-Crime novels for economists

10 0.8417542 1789 andrew gelman stats-2013-04-05-Elites have alcohol problems too!

11 0.83943558 172 andrew gelman stats-2010-07-30-Why don’t we have peer reviewing for oral presentations?

12 0.83826065 597 andrew gelman stats-2011-03-02-RStudio – new cross-platform IDE for R

13 0.83383691 1916 andrew gelman stats-2013-06-27-The weirdest thing about the AJPH story

14 0.81477666 971 andrew gelman stats-2011-10-25-Apply now for Earth Institute postdoctoral fellowships at Columbia University

15 0.80555177 1137 andrew gelman stats-2012-01-24-Difficulties in publishing non-replications of implausible findings

16 0.79868436 2369 andrew gelman stats-2014-06-11-“I can’t drive home now. Not just yet. First I need to go to Utrecht.”

17 0.78729844 800 andrew gelman stats-2011-07-13-I like lineplots

18 0.78659058 1509 andrew gelman stats-2012-09-24-Analyzing photon counts

19 0.78490233 2309 andrew gelman stats-2014-04-28-Crowdstorming a dataset

20 0.78465092 1942 andrew gelman stats-2013-07-17-“Stop and frisk” statistics