andrew_gelman_stats andrew_gelman_stats-2011 andrew_gelman_stats-2011-523 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: I just took a look at the spam folder . . . 600 messages in the past hour ! Seems pretty ridiculous to me.
sentIndex sentText sentNum sentScore
wordName wordTfidf (topN-words)
[('folder', 0.578), ('spam', 0.375), ('messages', 0.372), ('ridiculous', 0.357), ('hour', 0.328), ('took', 0.229), ('past', 0.206), ('look', 0.149), ('pretty', 0.142), ('seems', 0.119)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 523 andrew gelman stats-2011-01-18-Spam is out of control
Introduction: I just took a look at the spam folder . . . 600 messages in the past hour ! Seems pretty ridiculous to me.
2 0.27868626 425 andrew gelman stats-2010-11-21-If your comment didn’t get through . . .
Introduction: It probably got caught in the spam filter. We get tons and tons of spam (including the annoying spam that I have to remove by hand). If your comment was accompanied by an ad or a spam link, then maybe I just deleted it.
3 0.21891959 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”
Introduction: To the person who posted an apparently non-spam comment with a URL link to a “cheap cigarettes” website: In case you’re wondering, no, your comment didn’t get caught by the spam filter–I’m not sure why not, given that URL. I put it in the spam file manually. If you’d like to participate in blog discussion in the future, please refrain from including spam links. Thank you. Also, it’s “John Tukey,” not “John Turkey.”
4 0.20560491 619 andrew gelman stats-2011-03-19-If a comment is flagged as spam, it will disappear forever
Introduction: A commenter wrote (by email): I’ve noticed that you’ve quit approving my comments on your blog. I hope I didn’t anger you in some way or write something you felt was inappropriate. My reply: I have not been unapproving any comments. If you have comments that have not appeared, they have probably been going into the spam filter. I get literally thousands of spam comments a day and so anything that hits the spam filter is gone forever. I think there is a way to register as a commenter; that could help.
5 0.19333923 1488 andrew gelman stats-2012-09-08-Annals of spam
Introduction: I have to go through the inbox to approve new comments. When I set to auto-approve, I get overwhelmed with spam. As is, I still get spam but it’s manageable. Usually the spam is uninteresting but this one caught my eye: At first this seemed reasonable enough: law firm is desperate for business, spams blogs to raise its Google ranking. But what’s with the writing in the actual comment? It’s incoherent but it doesn’t look computer-generated. My guess is that the law firm in Massachusetts hired a company that promised to raise their Google rankings, and that this company hired some non-English-speaking foreigners to search through the web and write some spam comments. If anyone actually reads the comments, they might get the impression that this law firm is staffed by illiterates . . . but, as we all know, nobody reads blog comments! P.S. I followed the link (sorry!) and came across this: I guess if they’re going to use a tragedy as an excuse to troll for Faceb
6 0.15540549 839 andrew gelman stats-2011-08-04-To commenters who are trying to sell something
7 0.15239604 771 andrew gelman stats-2011-06-16-30 days of statistics
8 0.14080882 27 andrew gelman stats-2010-05-11-Update on the spam email study
9 0.1306887 817 andrew gelman stats-2011-07-23-New blog home
10 0.1254798 220 andrew gelman stats-2010-08-20-Why I blog?
11 0.11527087 2160 andrew gelman stats-2014-01-06-Spam names
12 0.09363503 545 andrew gelman stats-2011-01-30-New innovations in spam
13 0.089449942 635 andrew gelman stats-2011-03-29-Bayesian spam!
14 0.085055582 1933 andrew gelman stats-2013-07-10-Please send all comments to -dev-ripley
15 0.080539592 1561 andrew gelman stats-2012-11-04-Someone is wrong on the internet
16 0.07469257 1050 andrew gelman stats-2011-12-10-Presenting at the econ seminar
17 0.069369525 2068 andrew gelman stats-2013-10-18-G+ hangout for Bayesian Data Analysis course now! (actually, in 5 minutes)
18 0.068227328 1709 andrew gelman stats-2013-02-06-The fractal nature of scientific revolutions
19 0.066907741 2276 andrew gelman stats-2014-03-31-On deck this week
20 0.06496302 2282 andrew gelman stats-2014-04-05-Bizarre academic spam
topicId topicWeight
[(0, 0.038), (1, -0.033), (2, -0.02), (3, 0.018), (4, 0.023), (5, 0.011), (6, 0.026), (7, -0.02), (8, 0.009), (9, -0.039), (10, 0.004), (11, 0.006), (12, 0.106), (13, 0.02), (14, -0.016), (15, 0.053), (16, 0.007), (17, -0.051), (18, -0.039), (19, 0.031), (20, 0.054), (21, -0.056), (22, -0.016), (23, -0.098), (24, 0.01), (25, -0.009), (26, 0.033), (27, 0.065), (28, -0.043), (29, -0.017), (30, -0.001), (31, 0.063), (32, -0.016), (33, -0.01), (34, -0.055), (35, 0.111), (36, -0.012), (37, 0.069), (38, -0.004), (39, -0.01), (40, -0.11), (41, 0.079), (42, -0.061), (43, 0.014), (44, -0.002), (45, -0.04), (46, 0.073), (47, 0.036), (48, 0.003), (49, -0.023)]
simIndex simValue blogId blogTitle
same-blog 1 0.98533314 523 andrew gelman stats-2011-01-18-Spam is out of control
Introduction: I just took a look at the spam folder . . . 600 messages in the past hour ! Seems pretty ridiculous to me.
2 0.95962977 425 andrew gelman stats-2010-11-21-If your comment didn’t get through . . .
Introduction: It probably got caught in the spam filter. We get tons and tons of spam (including the annoying spam that I have to remove by hand). If your comment was accompanied by an ad or a spam link, then maybe I just deleted it.
3 0.90580601 619 andrew gelman stats-2011-03-19-If a comment is flagged as spam, it will disappear forever
Introduction: A commenter wrote (by email): I’ve noticed that you’ve quit approving my comments on your blog. I hope I didn’t anger you in some way or write something you felt was inappropriate. My reply: I have not been unapproving any comments. If you have comments that have not appeared, they have probably been going into the spam filter. I get literally thousands of spam comments a day and so anything that hits the spam filter is gone forever. I think there is a way to register as a commenter; that could help.
4 0.88512337 1488 andrew gelman stats-2012-09-08-Annals of spam
Introduction: I have to go through the inbox to approve new comments. When I set to auto-approve, I get overwhelmed with spam. As is, I still get spam but it’s manageable. Usually the spam is uninteresting but this one caught my eye: At first this seemed reasonable enough: law firm is desperate for business, spams blogs to raise its Google ranking. But what’s with the writing in the actual comment? It’s incoherent but it doesn’t look computer-generated. My guess is that the law firm in Massachusetts hired a company that promised to raise their Google rankings, and that this company hired some non-English-speaking foreigners to search through the web and write some spam comments. If anyone actually reads the comments, they might get the impression that this law firm is staffed by illiterates . . . but, as we all know, nobody reads blog comments! P.S. I followed the link (sorry!) and came across this: I guess if they’re going to use a tragedy as an excuse to troll for Faceb
5 0.87849706 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”
Introduction: To the person who posted an apparently non-spam comment with a URL link to a “cheap cigarettes” website: In case you’re wondering, no, your comment didn’t get caught by the spam filter–I’m not sure why not, given that URL. I put it in the spam file manually. If you’d like to participate in blog discussion in the future, please refrain from including spam links. Thank you. Also, it’s “John Tukey,” not “John Turkey.”
6 0.85348612 839 andrew gelman stats-2011-08-04-To commenters who are trying to sell something
7 0.83370268 817 andrew gelman stats-2011-07-23-New blog home
8 0.68710929 876 andrew gelman stats-2011-08-28-Vaguely related to the coke-dumping story
9 0.66428268 1709 andrew gelman stats-2013-02-06-The fractal nature of scientific revolutions
10 0.64277208 9 andrew gelman stats-2010-04-28-But it all goes to pay for gas, car insurance, and tolls on the turnpike
11 0.63164705 1168 andrew gelman stats-2012-02-14-The tabloids strike again
12 0.6239388 2160 andrew gelman stats-2014-01-06-Spam names
13 0.6014114 27 andrew gelman stats-2010-05-11-Update on the spam email study
14 0.59689325 771 andrew gelman stats-2011-06-16-30 days of statistics
15 0.5828439 1791 andrew gelman stats-2013-04-07-Scatterplot charades!
16 0.55456865 545 andrew gelman stats-2011-01-30-New innovations in spam
17 0.52067482 220 andrew gelman stats-2010-08-20-Why I blog?
18 0.51401573 635 andrew gelman stats-2011-03-29-Bayesian spam!
19 0.49304944 790 andrew gelman stats-2011-07-08-Blog in motion
20 0.47611558 199 andrew gelman stats-2010-08-11-Note to semi-spammers
topicId topicWeight
[(65, 0.153), (98, 0.196), (99, 0.399)]
simIndex simValue blogId blogTitle
same-blog 1 0.96663308 523 andrew gelman stats-2011-01-18-Spam is out of control
Introduction: I just took a look at the spam folder . . . 600 messages in the past hour ! Seems pretty ridiculous to me.
2 0.95066178 396 andrew gelman stats-2010-11-05-Journalism in the age of data
Introduction: Journalism in the age of data is a video report including interviews with many visualization people. It’s also a great example of how citations, and further information appear alongside with the video – showing us the future of video content online.
3 0.93794048 208 andrew gelman stats-2010-08-15-When Does a Name Become Androgynous?
Introduction: Good stuff , as always, from Laura Wattenberg.
4 0.91109502 710 andrew gelman stats-2011-05-14-Missed Friday the 13th Zombie Plot Update
Introduction: The revised paper plot13.pdf Slightly improved figures figure13.pdf And just the history part from my thesis – that some find interesting. (And to provide a selfish wiki meta-analysis entry pointer) JustHistory.pdf I have had about a dozen friends read this or earlier versions – they split into finding it interesting (and pragmatic) versus incomprehensible. The reason for that may or may not point to ways to make it clearer. K?
5 0.90382439 742 andrew gelman stats-2011-06-02-Grouponomics, counterfactuals, and opportunity cost
Introduction: I keep encountering the word “Groupon”–I think it’s some sort of pets.com-style commercial endeavor where people can buy coupons? I don’t really care, and I’ve avoided googling the word out of a general animosity toward our society’s current glorification of get-rich-quick schemes. (As you can tell, I’m still bitter about that whole stock market thing.) Anyway, even without knowing what Groupon actually is, I enjoyed this blog by Kaiser Fung in which he tries to work out some of its economic consequences. He connects the statistical notion of counterfactuals to the concept of opportunity cost from economics. The comments are interesting too.
6 0.89991915 1 andrew gelman stats-2010-04-22-Political Belief Networks: Socio-cognitive Heterogeneity in American Public Opinion
7 0.89623511 1333 andrew gelman stats-2012-05-20-Question 10 of my final exam for Design and Analysis of Sample Surveys
8 0.88775963 325 andrew gelman stats-2010-10-07-Fitting discrete-data regression models in social science
9 0.88611543 196 andrew gelman stats-2010-08-10-The U.S. as welfare state
10 0.87737036 96 andrew gelman stats-2010-06-18-Course proposal: Bayesian and advanced likelihood statistical methods for zombies.
11 0.87605572 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”
12 0.87409747 26 andrew gelman stats-2010-05-11-Update on religious affiliations of Supreme Court justices
13 0.87275374 1806 andrew gelman stats-2013-04-16-My talk in Chicago this Thurs 6:30pm
14 0.86776704 955 andrew gelman stats-2011-10-12-Why it doesn’t make sense to chew people out for not reading the help page
15 0.86433917 619 andrew gelman stats-2011-03-19-If a comment is flagged as spam, it will disappear forever
16 0.86217856 1399 andrew gelman stats-2012-06-28-Life imitates blog
17 0.8617596 1361 andrew gelman stats-2012-06-02-Question 23 of my final exam for Design and Analysis of Sample Surveys
18 0.86174649 1334 andrew gelman stats-2012-05-21-Question 11 of my final exam for Design and Analysis of Sample Surveys
19 0.86029065 2333 andrew gelman stats-2014-05-13-Personally, I’d rather go with Teragram
20 0.85972154 1701 andrew gelman stats-2013-01-31-The name that fell off a cliff