1573 andrew gelman stats-2012-11-11-Incredibly strange spam


meta info for this blog

Source: html

Introduction: Unsolicited (of course) in the email the other day: Just wanted to touch base with you to see if you needed any quotes on Parking lot lighting or Garage Lighting? (Induction, LED, Canopy etc…) We help retrofit 1000′s of garages around the country. Let me know your specs and ill send you a quote in 24 hours. ** Owner Emergency Lights Co. Ill indeed. . . .


Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 Unsolicited (of course) in the email the other day: Just wanted to touch base with you to see if you needed any quotes on Parking lot lighting or Garage Lighting? [sent-1, score-1.374]

2 (Induction, LED, Canopy etc…) We help retrofit 1000′s of garages around the country. [sent-2, score-0.151]

3 Let me know your specs and ill send you a quote in 24 hours. [sent-3, score-0.978]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('lighting', 0.496), ('ill', 0.467), ('specs', 0.248), ('garage', 0.234), ('emergency', 0.209), ('lights', 0.204), ('owner', 0.199), ('touch', 0.195), ('parking', 0.185), ('unsolicited', 0.182), ('induction', 0.177), ('base', 0.142), ('quotes', 0.137), ('led', 0.123), ('needed', 0.118), ('send', 0.113), ('quote', 0.11), ('etc', 0.105), ('email', 0.102), ('wanted', 0.096), ('day', 0.082), ('help', 0.081), ('course', 0.07), ('around', 0.07), ('let', 0.07), ('lot', 0.055), ('know', 0.04), ('see', 0.033)]
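The page does not say how the word weights above were computed. As a rough sketch only, the function below assumes a common scheme: term frequency times log(N / document frequency), followed by L2 normalization. The function name tfidf_top_words and the toy documents are hypothetical. The last few lines check that the squared weights listed above sum to roughly 1, which is at least consistent with a unit-length vector.

```python
import math
from collections import Counter


def tfidf_top_words(docs, doc_index, top_n=5):
    """Return the top_n (word, weight) pairs for docs[doc_index].

    Assumed scheme: weight = tf * log(N / df), then L2-normalized.
    The page does not state its exact formula, so this is illustrative.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each word.
    df = Counter(word for tokens in tokenized for word in set(tokens))
    tf = Counter(tokenized[doc_index])
    raw = {w: c * math.log(n_docs / df[w]) for w, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in raw.values())) or 1.0
    weights = {w: v / norm for w, v in raw.items()}
    # Highest weight first; ties broken alphabetically for stable output.
    return sorted(weights.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


# Hypothetical toy corpus, not the real blog archive.
toy_docs = [
    "parking lot lighting and garage lighting quotes",
    "i never read email before four",
    "unsolicited email about guest article quotes",
]
top = tfidf_top_words(toy_docs, 0, top_n=3)

# The 28 weights listed on this page, in the order shown.
listed = [0.496, 0.467, 0.248, 0.234, 0.209, 0.204, 0.199, 0.195, 0.185,
          0.182, 0.177, 0.142, 0.137, 0.123, 0.118, 0.113, 0.11, 0.105,
          0.102, 0.096, 0.082, 0.081, 0.07, 0.07, 0.07, 0.055, 0.04, 0.033]
listed_sq_sum = sum(w * w for w in listed)  # close to 1 if unit length
```

In the toy corpus, "lighting" ranks first because it occurs twice in the first document and in no other document.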

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1573 andrew gelman stats-2012-11-11-Incredibly strange spam

2 0.15749201 1598 andrew gelman stats-2012-11-30-A graphics talk with no visuals!

Introduction: So, I’m at MIT, twenty minutes into my talk on tradeoffs in information graphics to the computer scientists, when the power goes out. They had some dim backup lighting so we weren’t all sitting there in the dark, but the projector wasn’t working. So I took questions for the remaining 40 minutes. It went well, perhaps better than the actual talk would’ve gone, even though they didn’t get to see most of my slides.

3 0.11457221 503 andrew gelman stats-2011-01-04-Clarity on my email policy

Introduction: I never read email before 4. That doesn’t mean I never send email before 4.

4 0.11231938 545 andrew gelman stats-2011-01-30-New innovations in spam

Introduction: I received the following (unsolicited) email today: Hello Andrew, I’m interested in whether you are accepting guest article submissions for your site Statistical Modeling, Causal Inference, and Social Science? I’m the owner of the recently created nonprofit site OnlineEngineeringDegree.org and am interested in writing / submitting an article for your consideration to be published on your site. Is that something you’d be willing to consider, and if so, what specs in terms of topics or length requirements would you be looking for? Thanks you for your time, and if you have any questions or are interested, I’d appreciate you letting me know. Sincerely, Samantha Rhodes Huh? P.S. My vote for most obnoxious spam remains this one, which does its best to dilute whatever remains of the reputation of Wolfram Research. Or maybe that particular bit of spam was written by a particularly awesome cellular automaton that Wolfram discovered? I guess in the world of big-time software

5 0.067723759 1164 andrew gelman stats-2012-02-13-Help with this problem, win valuable prizes

Introduction: (Corrected equation) This post is by Phil. In the comments to an earlier post, I mentioned a problem I am struggling with right now. Several people mentioned having (and solving!) similar problems in the past, so this seems like a great way for me and a bunch of other blog readers to learn something. I will describe the problem, one or more of you will tell me how to solve it, and you will win…wait for it….my thanks, and the approval and admiration of your fellow blog readers, and a big thank-you in any publication that includes results from fitting the model. You can’t ask fairer than that! Here’s the problem. The goal is to estimate six parameters that characterize the leakiness (or air-tightness) of a house with an attached garage. We are specifically interested in the parameters that describe the connection between the house and the garage; this is of interest because of the effect on the air quality in the house if there are toxic chemic

6 0.066380076 1741 andrew gelman stats-2013-02-27-Thin scientists say it’s unhealthy to be fat

7 0.058003724 343 andrew gelman stats-2010-10-15-?

8 0.057467293 27 andrew gelman stats-2010-05-11-Update on the spam email study

9 0.054295994 614 andrew gelman stats-2011-03-15-Induction within a model, deductive inference for model evaluation

10 0.049250342 1652 andrew gelman stats-2013-01-03-“The Case for Inductive Theory Building”

11 0.04824096 18 andrew gelman stats-2010-05-06-$63,000 worth of abusive research . . . or just a really stupid waste of time?

12 0.04770688 743 andrew gelman stats-2011-06-03-An argument that can’t possibly make sense

13 0.045262668 926 andrew gelman stats-2011-09-26-NYC

14 0.045247436 332 andrew gelman stats-2010-10-10-Proposed new section of the American Statistical Association on Imaging Sciences

15 0.044413984 240 andrew gelman stats-2010-08-29-ARM solutions

16 0.043421827 2118 andrew gelman stats-2013-11-30-???

17 0.043023095 1434 andrew gelman stats-2012-07-29-FindTheData.org

18 0.040177707 986 andrew gelman stats-2011-11-01-MacKay update: where 12 comes from

19 0.039990034 1670 andrew gelman stats-2013-01-13-More Bell Labs happy talk

20 0.039737578 880 andrew gelman stats-2011-08-30-Annals of spam
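The simValue column above is not defined on the page. The same-blog entry scoring exactly 1.0 is consistent with cosine similarity between tf-idf vectors, so the sketch below assumes that measure. The weights for post 1573 are a subset of those listed earlier; the other vectors, their ids, and the function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_similar(query_id, vectors, top_n=20):
    """Return (similarity, post_id) pairs, most similar first."""
    query = vectors[query_id]
    scored = [(cosine_similarity(query, vec), pid)
              for pid, vec in vectors.items()]
    scored.sort(key=lambda s: (-s[0], s[1]))
    return scored[:top_n]


post_vectors = {
    # Subset of the weights listed for this post.
    "1573": {"lighting": 0.496, "ill": 0.467, "specs": 0.248, "email": 0.102},
    # Hypothetical vectors, invented for illustration only.
    "hypothetical-A": {"lighting": 0.30, "talk": 0.60, "slides": 0.40},
    "hypothetical-B": {"email": 0.70, "send": 0.50},
    "hypothetical-C": {"regression": 0.90},
}
ranking = rank_similar("1573", post_vectors)
```

Under this assumption the query post always ranks first with a similarity of 1, and posts sharing no weighted words with it score 0.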


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.039), (1, -0.022), (2, -0.024), (3, 0.015), (4, 0.011), (5, 0.017), (6, 0.014), (7, -0.01), (8, 0.003), (9, -0.012), (10, 0.006), (11, -0.016), (12, 0.032), (13, -0.005), (14, -0.007), (15, 0.014), (16, 0.019), (17, -0.024), (18, 0.006), (19, 0.015), (20, 0.01), (21, -0.004), (22, 0.016), (23, -0.027), (24, -0.007), (25, 0.014), (26, 0.01), (27, 0.016), (28, 0.001), (29, 0.017), (30, -0.011), (31, 0.007), (32, -0.028), (33, -0.02), (34, 0.001), (35, -0.001), (36, -0.002), (37, -0.006), (38, 0.015), (39, 0.004), (40, 0.016), (41, -0.004), (42, 0.002), (43, -0.041), (44, -0.006), (45, -0.011), (46, 0.025), (47, -0.018), (48, 0.017), (49, -0.031)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.94939542 1573 andrew gelman stats-2012-11-11-Incredibly strange spam

2 0.73466283 503 andrew gelman stats-2011-01-04-Clarity on my email policy

Introduction: I never read email before 4. That doesn’t mean I never send email before 4.

3 0.73346281 1589 andrew gelman stats-2012-11-25-Life as a blogger: the emails just get weirder and weirder

Introduction: In the email the other day, subject line “Casting blogger, writer, journalist to host cable series”: Hi there Andrew, I’m casting a male journalist, writer, blogger, documentary filmmaker or comedian with a certain type personality for a television pilot along with production company, Pipeline39. See below: A certain type of character – no cockiness, no ego, a person who is smart, savvy, dry humor, but someone who isn’t imposing, who can infiltrate these organizations. This person will be hosting his own show and covering alternative lifestyles and secret societies around the world. If you’re interested in hearing more or would like to be considered for this project, please email me a photo and a bio of yourself, along with contact information. I’ll respond to you ASAP. I’m looking forward to hearing from you. *** Casting Producer (646) ***.**** ***@gmail.com I was with them until I got to the “no ego” part. . . . Also, I don’t think I could infiltrate any org

4 0.7212593 259 andrew gelman stats-2010-09-06-Inbox zero. Really.

Introduction: Just in time for the new semester: This time I’m sticking with the plan: 1. Don’t open a message until I’m ready to deal with it. 2. Don’t store anything–anything–in the inbox. 3. Put to-do items in the (physical) bookje rather than the (computer) “desktop.” 4. Never read email before 4pm. (This is the one rule I have been following.) 5. Only one email session per day. (I’ll have to see how this one works.)

5 0.70892763 343 andrew gelman stats-2010-10-15-?

Introduction: How am I supposed to handle this sort of thing? (See below.) I just stuck it in one of my email folders without responding, but then I wondered . . . what’s it all about? Is there some sort of Glengarry Glen Ross-like parallel world where down-on-their-luck Jack Lemmons of public relations world send out electronic cold calls? More than anything else, this sort of thing makes me glad I have a steady job. Here’s the (unsolicited) email, which came with the subject line “Please help a reporter do his job”: Dear Andrew, As an Editor for the Bulldog Reporter (www.bulldogreporter.com/dailydog), a media relations trade publication, my job is to help ensure that my readers have accurate info about you and send you the best quality pitches. By taking five minutes or less to answer my questions (pasted below), you’ll receive targeted PR pitches from our client base that will match your beat and interests. Any help or direction is appreciated. Here are my questions. We have you listed

6 0.70629787 332 andrew gelman stats-2010-10-10-Proposed new section of the American Statistical Association on Imaging Sciences

7 0.70573777 27 andrew gelman stats-2010-05-11-Update on the spam email study

8 0.70163107 980 andrew gelman stats-2011-10-29-When people meet this guy, can they resist the temptation to ask him what he’s doing for breakfast??

9 0.64907628 2148 andrew gelman stats-2013-12-25-Spam!

10 0.62712032 1434 andrew gelman stats-2012-07-29-FindTheData.org

11 0.62507075 1380 andrew gelman stats-2012-06-15-Coaching, teaching, and writing

12 0.62203991 1077 andrew gelman stats-2011-12-21-In which I compare “POLITICO’s chief political columnist” unfavorably to a cranky old dead guy and one of the funniest writers who’s ever lived

13 0.61654711 2338 andrew gelman stats-2014-05-19-My short career as a Freud expert

14 0.6129967 2213 andrew gelman stats-2014-02-16-There’s no need for you to read this one

15 0.6000554 18 andrew gelman stats-2010-05-06-$63,000 worth of abusive research . . . or just a really stupid waste of time?

16 0.59518903 605 andrew gelman stats-2011-03-09-Does it feel like cheating when I do this? Variation in ethical standards and expectations

17 0.59044766 530 andrew gelman stats-2011-01-22-MS-Bayes?

18 0.58122742 1871 andrew gelman stats-2013-05-27-Annals of spam

19 0.57896084 28 andrew gelman stats-2010-05-12-Alert: Incompetent colleague wastes time of hardworking Wolfram Research publicist

20 0.57844359 880 andrew gelman stats-2011-08-30-Annals of spam


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(16, 0.088), (24, 0.151), (27, 0.041), (84, 0.035), (94, 0.028), (97, 0.381), (99, 0.101)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.90821683 1573 andrew gelman stats-2012-11-11-Incredibly strange spam

2 0.7449981 882 andrew gelman stats-2011-08-31-Meanwhile, on the sister blog . . .

Introduction: NYT columnist Douthat asks: Should we be disturbed that a leading presidential candidate endorses a pro-slavery position? Who’s on the web? And where are they? Sowell, Carlson, Barone: fools, knaves, or simply victims of a cognitive illusion? Don’t blame the American public for the D.C. deadlock Calvin College update Help reform the Institutional Review Board (IRB) system! Powerful credit-rating agencies are a creation of the government . . . what does it mean when they bite the hand that feeds them? “Waiting for a landslide” A simple theory of why Obama didn’t come out fighting in 2009 A modest proposal Noooooooooooooooo!!!!!!!!!!!!!!! The Family Research Council and the Barnard Center for Research on Women Sleazy data miners Genetic essentialism is in our genes Wow, that was a lot! No wonder I don’t get any research done…

3 0.66433185 160 andrew gelman stats-2010-07-23-Unhappy with improvement by a factor of 10^29

Introduction: I have an optimization problem: I have a complicated physical model that predicts energy and thermal behavior of a building, given the values of a slew of parameters, such as insulation effectiveness, window transmissivity, etc. I’m trying to find the parameter set that best fits several weeks of thermal and energy use data from the real building that we modeled. (Of course I would rather explore parameter space and come up with probability distributions for the parameters, and maybe that will come later, but for now I’m just optimizing). To do the optimization, colleagues and I implemented a “particle swarm optimization” algorithm on a massively parallel machine. This involves giving each of about 120 “particles” an initial position in parameter space, then letting them move around, trying to move to better positions according to a specific algorithm. We gave each particle an initial position sampled from our prior distribution for each parameter. So far we’ve run about 140 itera

4 0.61925179 1001 andrew gelman stats-2011-11-10-Three hours in the life of a statistician

Introduction: Kaiser Fung tells what it’s really like. Here’s a sample: As soon as I [Kaiser] put the substring-concatenate expression together with two lines of code that generate data tables, it choked. Sorta like Dashiell Hammett without the broads and the heaters. And here’s another take, from a slightly different perspective.

5 0.61889464 996 andrew gelman stats-2011-11-07-Chi-square FAIL when many cells have small expected values

Introduction: William Perkins, Mark Tygert, and Rachel Ward write: If a discrete probability distribution in a model being tested for goodness-of-fit is not close to uniform, then forming the Pearson χ² statistic can involve division by nearly zero. This often leads to serious trouble in practice — even in the absence of round-off errors . . . The problem is not merely that the chi-squared statistic doesn’t have the advertised chi-squared distribution —a reference distribution can always be computed via simulation, either using the posterior predictive distribution or by conditioning on a point estimate of the cell expectations and then making a degrees-of-freedom sort of adjustment. Rather, the problem is that, when there are lots of cells with near-zero expectation, the chi-squared test is mostly noise. And this is not merely a theoretical problem. It comes up in real examples. Here’s one, taken from the classic 1992 genetics paper of Guo and Thompson: And here are the e

6 0.60865414 1651 andrew gelman stats-2013-01-03-Faculty Position in Visualization, Visual Analytics, Imaging, and Human Centered Computing

7 0.60030258 142 andrew gelman stats-2010-07-12-God, Guns, and Gaydar: The Laws of Probability Push You to Overestimate Small Groups

8 0.56487727 1694 andrew gelman stats-2013-01-26-Reflections on ethicsblogging

9 0.56262732 553 andrew gelman stats-2011-02-03-is it possible to “overstratify” when assigning a treatment in a randomized control trial?

10 0.54261976 13 andrew gelman stats-2010-04-30-Things I learned from the Mickey Kaus for Senate campaign

11 0.49655959 2118 andrew gelman stats-2013-11-30-???

12 0.48432699 526 andrew gelman stats-2011-01-19-“If it saves the life of a single child…” and other nonsense

13 0.48093814 820 andrew gelman stats-2011-07-25-Design of nonrandomized cluster sample study

14 0.47367078 1812 andrew gelman stats-2013-04-19-Chomsky chomsky chomsky chomsky furiously

15 0.46050569 1293 andrew gelman stats-2012-05-01-Huff the Magic Dragon

16 0.45714238 2121 andrew gelman stats-2013-12-02-Should personal genetic testing be regulated? Battle of the blogroll

17 0.44958752 2246 andrew gelman stats-2014-03-13-An Economist’s Guide to Visualizing Data

18 0.44493893 1258 andrew gelman stats-2012-04-10-Why display 6 years instead of 30?

19 0.44481036 1335 andrew gelman stats-2012-05-21-Responding to a bizarre anti-social-science screed

20 0.44320223 18 andrew gelman stats-2010-05-06-$63,000 worth of abusive research . . . or just a really stupid waste of time?