andrew_gelman_stats andrew_gelman_stats-2011 andrew_gelman_stats-2011-523 knowledge-graph by maker-knowledge-mining

523 andrew gelman stats-2011-01-18-Spam is out of control


meta infos for this blog

Source: html

Introduction: I just took a look at the spam folder . . . 600 messages in the past hour ! Seems pretty ridiculous to me.


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('folder', 0.578), ('spam', 0.375), ('messages', 0.372), ('ridiculous', 0.357), ('hour', 0.328), ('took', 0.229), ('past', 0.206), ('look', 0.149), ('pretty', 0.142), ('seems', 0.119)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.99999994 523 andrew gelman stats-2011-01-18-Spam is out of control

Introduction: I just took a look at the spam folder . . . 600 messages in the past hour ! Seems pretty ridiculous to me.

2 0.27868626 425 andrew gelman stats-2010-11-21-If your comment didn’t get through . . .

Introduction: It probably got caught in the spam filter. We get tons and tons of spam (including the annoying spam that I have to remove by hand). If your comment was accompanied by an ad or a spam link, then maybe I just deleted it.

3 0.21891959 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”

Introduction: To the person who posted an apparently non-spam comment with a URL link to a “cheap cigarettes” website: In case you’re wondering, no, your comment didn’t get caught by the spam filter–I’m not sure why not, given that URL. I put it in the spam file manually. If you’d like to participate in blog discussion in the future, please refrain from including spam links. Thank you. Also, it’s “John Tukey,” not “John Turkey.”

4 0.20560491 619 andrew gelman stats-2011-03-19-If a comment is flagged as spam, it will disappear forever

Introduction: A commenter wrote (by email): I’ve noticed that you’ve quit approving my comments on your blog. I hope I didn’t anger you in some way or write something you felt was inappropriate. My reply: I have not been unapproving any comments. If you have comments that have not appeared, they have probably been going into the spam filter. I get literally thousands of spam comments a day and so anything that hits the spam filter is gone forever. I think there is a way to register as a commenter; that could help.

5 0.19333923 1488 andrew gelman stats-2012-09-08-Annals of spam

Introduction: I have to go through the inbox to approve new comments. When I set to auto-approve, I get overwhelmed with spam. As is, I still get spam but it’s manageable. Usually the spam is uninteresting but this one caught my eye: At first this seemed reasonable enough: law firm is desperate for business, spams blogs to raise its Google ranking. But what’s with the writing in the actual comment? It’s incoherent but it doesn’t look computer-generated. My guess is that the law firm in Massachusetts hired a company that promised to raise their Google rankings, and that this company hired some non-English-speaking foreigners to search through the web and write some spam comments. If anyone actually reads the comments, they might get the impression that this law firm is staffed by illiterates . . . but, as we all know, nobody reads blog comments! P.S. I followed the link (sorry!) and came across this: I guess if they’re going to use a tragedy as an excuse to troll for Faceb

6 0.15540549 839 andrew gelman stats-2011-08-04-To commenters who are trying to sell something

7 0.15239604 771 andrew gelman stats-2011-06-16-30 days of statistics

8 0.14080882 27 andrew gelman stats-2010-05-11-Update on the spam email study

9 0.1306887 817 andrew gelman stats-2011-07-23-New blog home

10 0.1254798 220 andrew gelman stats-2010-08-20-Why I blog?

11 0.11527087 2160 andrew gelman stats-2014-01-06-Spam names

12 0.09363503 545 andrew gelman stats-2011-01-30-New innovations in spam

13 0.089449942 635 andrew gelman stats-2011-03-29-Bayesian spam!

14 0.085055582 1933 andrew gelman stats-2013-07-10-Please send all comments to -dev-ripley

15 0.080539592 1561 andrew gelman stats-2012-11-04-Someone is wrong on the internet

16 0.07469257 1050 andrew gelman stats-2011-12-10-Presenting at the econ seminar

17 0.069369525 2068 andrew gelman stats-2013-10-18-G+ hangout for Bayesian Data Analysis course now! (actually, in 5 minutes)

18 0.068227328 1709 andrew gelman stats-2013-02-06-The fractal nature of scientific revolutions

19 0.066907741 2276 andrew gelman stats-2014-03-31-On deck this week

20 0.06496302 2282 andrew gelman stats-2014-04-05-Bizarre academic spam


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.038), (1, -0.033), (2, -0.02), (3, 0.018), (4, 0.023), (5, 0.011), (6, 0.026), (7, -0.02), (8, 0.009), (9, -0.039), (10, 0.004), (11, 0.006), (12, 0.106), (13, 0.02), (14, -0.016), (15, 0.053), (16, 0.007), (17, -0.051), (18, -0.039), (19, 0.031), (20, 0.054), (21, -0.056), (22, -0.016), (23, -0.098), (24, 0.01), (25, -0.009), (26, 0.033), (27, 0.065), (28, -0.043), (29, -0.017), (30, -0.001), (31, 0.063), (32, -0.016), (33, -0.01), (34, -0.055), (35, 0.111), (36, -0.012), (37, 0.069), (38, -0.004), (39, -0.01), (40, -0.11), (41, 0.079), (42, -0.061), (43, 0.014), (44, -0.002), (45, -0.04), (46, 0.073), (47, 0.036), (48, 0.003), (49, -0.023)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.98533314 523 andrew gelman stats-2011-01-18-Spam is out of control

Introduction: I just took a look at the spam folder . . . 600 messages in the past hour ! Seems pretty ridiculous to me.

2 0.95962977 425 andrew gelman stats-2010-11-21-If your comment didn’t get through . . .

Introduction: It probably got caught in the spam filter. We get tons and tons of spam (including the annoying spam that I have to remove by hand). If your comment was accompanied by an ad or a spam link, then maybe I just deleted it.

3 0.90580601 619 andrew gelman stats-2011-03-19-If a comment is flagged as spam, it will disappear forever

Introduction: A commenter wrote (by email): I’ve noticed that you’ve quit approving my comments on your blog. I hope I didn’t anger you in some way or write something you felt was inappropriate. My reply: I have not been unapproving any comments. If you have comments that have not appeared, they have probably been going into the spam filter. I get literally thousands of spam comments a day and so anything that hits the spam filter is gone forever. I think there is a way to register as a commenter; that could help.

4 0.88512337 1488 andrew gelman stats-2012-09-08-Annals of spam

Introduction: I have to go through the inbox to approve new comments. When I set to auto-approve, I get overwhelmed with spam. As is, I still get spam but it’s manageable. Usually the spam is uninteresting but this one caught my eye: At first this seemed reasonable enough: law firm is desperate for business, spams blogs to raise its Google ranking. But what’s with the writing in the actual comment? It’s incoherent but it doesn’t look computer-generated. My guess is that the law firm in Massachusetts hired a company that promised to raise their Google rankings, and that this company hired some non-English-speaking foreigners to search through the web and write some spam comments. If anyone actually reads the comments, they might get the impression that this law firm is staffed by illiterates . . . but, as we all know, nobody reads blog comments! P.S. I followed the link (sorry!) and came across this: I guess if they’re going to use a tragedy as an excuse to troll for Faceb

5 0.87849706 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”

Introduction: To the person who posted an apparently non-spam comment with a URL link to a “cheap cigarettes” website: In case you’re wondering, no, your comment didn’t get caught by the spam filter–I’m not sure why not, given that URL. I put it in the spam file manually. If you’d like to participate in blog discussion in the future, please refrain from including spam links. Thank you. Also, it’s “John Tukey,” not “John Turkey.”

6 0.85348612 839 andrew gelman stats-2011-08-04-To commenters who are trying to sell something

7 0.83370268 817 andrew gelman stats-2011-07-23-New blog home

8 0.68710929 876 andrew gelman stats-2011-08-28-Vaguely related to the coke-dumping story

9 0.66428268 1709 andrew gelman stats-2013-02-06-The fractal nature of scientific revolutions

10 0.64277208 9 andrew gelman stats-2010-04-28-But it all goes to pay for gas, car insurance, and tolls on the turnpike

11 0.63164705 1168 andrew gelman stats-2012-02-14-The tabloids strike again

12 0.6239388 2160 andrew gelman stats-2014-01-06-Spam names

13 0.6014114 27 andrew gelman stats-2010-05-11-Update on the spam email study

14 0.59689325 771 andrew gelman stats-2011-06-16-30 days of statistics

15 0.5828439 1791 andrew gelman stats-2013-04-07-Scatterplot charades!

16 0.55456865 545 andrew gelman stats-2011-01-30-New innovations in spam

17 0.52067482 220 andrew gelman stats-2010-08-20-Why I blog?

18 0.51401573 635 andrew gelman stats-2011-03-29-Bayesian spam!

19 0.49304944 790 andrew gelman stats-2011-07-08-Blog in motion

20 0.47611558 199 andrew gelman stats-2010-08-11-Note to semi-spammers


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(65, 0.153), (98, 0.196), (99, 0.399)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.96663308 523 andrew gelman stats-2011-01-18-Spam is out of control

Introduction: I just took a look at the spam folder . . . 600 messages in the past hour ! Seems pretty ridiculous to me.

2 0.95066178 396 andrew gelman stats-2010-11-05-Journalism in the age of data

Introduction: Journalism in the age of data is a video report including interviews with many visualization people. It’s also a great example of how citations, and further information appear alongside with the video – showing us the future of video content online.

3 0.93794048 208 andrew gelman stats-2010-08-15-When Does a Name Become Androgynous?

Introduction: Good stuff , as always, from Laura Wattenberg.

4 0.91109502 710 andrew gelman stats-2011-05-14-Missed Friday the 13th Zombie Plot Update

Introduction: The revised paper plot13.pdf Slightly improved figures figure13.pdf And just the history part from my thesis – that some find interesting. (And to provide a selfish wiki meta-analysis entry pointer) JustHistory.pdf I have had about a dozen friends read this or earlier versions – they split into finding it interesting (and pragmatic) versus incomprehensible. The reason for that may or may not point to ways to make it clearer. K?

5 0.90382439 742 andrew gelman stats-2011-06-02-Grouponomics, counterfactuals, and opportunity cost

Introduction: I keep encountering the word “Groupon”–I think it’s some sort of pets.com-style commercial endeavor where people can buy coupons? I don’t really care, and I’ve avoided googling the word out of a general animosity toward our society’s current glorification of get-rich-quick schemes. (As you can tell, I’m still bitter about that whole stock market thing.) Anyway, even without knowing what Groupon actually is, I enjoyed this blog by Kaiser Fung in which he tries to work out some of its economic consequences. He connects the statistical notion of counterfactuals to the concept of opportunity cost from economics. The comments are interesting too.

6 0.89991915 1 andrew gelman stats-2010-04-22-Political Belief Networks: Socio-cognitive Heterogeneity in American Public Opinion

7 0.89623511 1333 andrew gelman stats-2012-05-20-Question 10 of my final exam for Design and Analysis of Sample Surveys

8 0.88775963 325 andrew gelman stats-2010-10-07-Fitting discrete-data regression models in social science

9 0.88611543 196 andrew gelman stats-2010-08-10-The U.S. as welfare state

10 0.87737036 96 andrew gelman stats-2010-06-18-Course proposal: Bayesian and advanced likelihood statistical methods for zombies.

11 0.87605572 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”

12 0.87409747 26 andrew gelman stats-2010-05-11-Update on religious affiliations of Supreme Court justices

13 0.87275374 1806 andrew gelman stats-2013-04-16-My talk in Chicago this Thurs 6:30pm

14 0.86776704 955 andrew gelman stats-2011-10-12-Why it doesn’t make sense to chew people out for not reading the help page

15 0.86433917 619 andrew gelman stats-2011-03-19-If a comment is flagged as spam, it will disappear forever

16 0.86217856 1399 andrew gelman stats-2012-06-28-Life imitates blog

17 0.8617596 1361 andrew gelman stats-2012-06-02-Question 23 of my final exam for Design and Analysis of Sample Surveys

18 0.86174649 1334 andrew gelman stats-2012-05-21-Question 11 of my final exam for Design and Analysis of Sample Surveys

19 0.86029065 2333 andrew gelman stats-2014-05-13-Personally, I’d rather go with Teragram

20 0.85972154 1701 andrew gelman stats-2013-01-31-The name that fell off a cliff