1434 andrew gelman stats-2012-07-29-FindTheData.org


meta infos for this blog

Source: html

Introduction: I received the following (unsolicited) email: Hi Andrew, I work on the business development team of FindTheData.org, an unbiased comparison engine founded by Kevin O’Connor (founder and former CEO of DoubleClick) and backed by Kleiner Perkins with ~10M unique visitors per month. We are working with large online publishers including Golf Digest, Huffington Post, Under30CEO, and offer a variety of options to integrate our highly engaging content with your site.  I believe our un-biased and reliable data resources would be of interest to you and your readers. I’d like to set up a quick call to discuss similar partnership ideas with you and would greatly appreciate 10 minutes of your time. Please suggest a couple times that work best for you or let me know if you would like me to send some more information before you make time for a call. Looking forward to hearing from you, Jonny – JONNY KINTZELE Business Development, FindThe Data mobile: 619-307-097


Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 I received the following (unsolicited) email: Hi Andrew, I work on the business development team of FindTheData. [sent-1, score-0.283]

2 org, an unbiased comparison engine founded by Kevin O’Connor (founder and former CEO of DoubleClick) and backed by Kleiner Perkins with ~10M unique visitors per month. [sent-2, score-0.685]

3 We are working with large online publishers including Golf Digest, Huffington Post, Under30CEO, and offer a variety of options to integrate our highly engaging content with your site. [sent-3, score-0.675]

4 I believe our un-biased and reliable data resources would be of interest to you and your readers. [sent-4, score-0.195]

5 I’d like to set up a quick call to discuss similar partnership ideas with you and would greatly appreciate 10 minutes of your time. [sent-5, score-0.436]

6 Looking forward to hearing from you, Jonny – JONNY KINTZELE Business Development, FindThe Data mobile: 619-307-0976 skype: jonny. [sent-7, score-0.167]

7 kintzele14 FindTheBest, a powerful tool for making quick and informed decisions. [sent-8, score-0.391]

8 It looked kinda spammy but I clicked on the link and it looked sort of interesting. [sent-11, score-0.483]

9 I can’t imagine how it could be integrated with my site but I thought I’d pass this on to all of you. [sent-12, score-0.414]

10 Of course, if they were to offer real money for sponsorship I’d consider it but I can’t imagine that would make sense to them. [sent-13, score-0.269]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('findthebest', 0.346), ('jonny', 0.346), ('visitors', 0.157), ('development', 0.15), ('perkins', 0.148), ('partnership', 0.148), ('offer', 0.145), ('founded', 0.142), ('skype', 0.142), ('huffington', 0.137), ('business', 0.133), ('founder', 0.13), ('golf', 0.13), ('ceo', 0.13), ('looked', 0.129), ('quick', 0.129), ('mobile', 0.127), ('digest', 0.127), ('integrate', 0.127), ('imagine', 0.124), ('integrated', 0.118), ('unsolicited', 0.116), ('kinda', 0.114), ('backed', 0.114), ('engaging', 0.114), ('hi', 0.113), ('publishers', 0.111), ('clicked', 0.111), ('kevin', 0.11), ('facebook', 0.11), ('reliable', 0.106), ('engine', 0.105), ('hiring', 0.105), ('unbiased', 0.102), ('greatly', 0.1), ('hearing', 0.094), ('options', 0.094), ('powerful', 0.091), ('informed', 0.09), ('pass', 0.09), ('resources', 0.089), ('variety', 0.084), ('unique', 0.084), ('former', 0.083), ('minutes', 0.083), ('site', 0.082), ('tool', 0.081), ('content', 0.08), ('appreciate', 0.076), ('forward', 0.073)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1434 andrew gelman stats-2012-07-29-FindTheData.org

2 0.10524111 2221 andrew gelman stats-2014-02-23-Postdoc with Huffpost Pollster to do Bayesian poll tracking

Introduction: Mark Blumenthal writes: HuffPost Pollster has an immediate opening for a social and data scientist to join us full time, preferably in our Washington D.C. bureau, to work on development and improvement of our poll tracking models and political forecasts. You are someone who has: * A passion for electoral politics, * Advanced training in statistics and dynamic Bayesian data analysis, * A Ph.D. in statistics, political science, economics or the social sciences or comparable high level training or experience, * A desire to make a lasting contribution in the way the news media cover polls and elections. We are: * The award-winning website formerly known as Pollster.com, which joined the Huffington Post in 2010 and remains the internet’s premier source for uniquely interactive polling charts and electorate forecasts and a running daily commentary that explains, demystifies and critiques political polling. * Home to the open source Pollster API, which provides academic

3 0.093620405 1012 andrew gelman stats-2011-11-16-Blog bribes!

Introduction: Nick Rizzo points to this amusing Gawker item (sorry!) from Hamilton Nolan about advertisers trying to sneak links into blog content. The Gawker blogger got the following email: Greetings, My name is Bryan Clark, and I’m a big fan of your writing. I contacted you because I think I have a mutually beneficial agreement that will allow you to make additional money for articles you are already writing online. Let me give you an idea of how we can help each other. We’re looking for writers that can help increase the profile of our clients by linking to them within the context of their articles. The clients are huge, and we generally have one that can fit naturally in the context of most article niches. In return, we pay generously for a single link for our clients. If you are interested, I’d love to talk about it more. Regards, Bryan Clark Hey, that looks kinda familiar! Here’s an email I received a couple weeks ago [actually, last month, as we're currently on a 1-month l

4 0.082529359 1240 andrew gelman stats-2012-04-02-Blogads update

Introduction: A few months ago I reported on someone who wanted to insert text links into the blog. I asked her how much they would pay and got no answer. Yesterday, though, I received this reply: Hello Andrew, I am sorry for the delay in getting back to you. I’d like to make a proposal for your site. Please refer below. We would like to place a simple text link ad on page http://andrewgelman.com/2011/07/super_sam_fuld/ to link to *** with the key phrase ***. We will incorporate the key phrase into a sentence so it would read well. Rest assured it won’t sound obnoxious or advertorial. We will then process the final text link code as soon as you agree to our proposal. We can offer you $200 for this with the assumption that you will keep the link “live” on that page for 12 months or longer if you prefer. Please get back to us with a quick reply on your thoughts on this and include your Paypal ID for payment process. Hoping for a positive response from you. I wrote back: Hi,

5 0.082497969 880 andrew gelman stats-2011-08-30-Annals of spam

Introduction: I received the following (unsolicited) email: Howdy Andrew, Hope you’re keeping well! I was wondering if you’re open to guest posts at Statistical Modeling, Causal Inference, and Social Science – if you are interested, I can offer an original 500-1000 word, very high quality article in fitting with the site. All research and writing will be carried out by a professional writer (namely, me) and once approved it will be entirely yours to place on the site as you see fit. I can choose a title for the article or you can suggest one and I’ll work around that. Normally I write for property and travel sites (particularly cruise and ski related), but I’m game for anything if you’re happy to entertain me. I can also include some copyright-free and high quality pictures related to the blog post. You’re probably wondering what’s in it for me, which is a fair question – in return, all I’d ask is a subtle link back in return. Other than that, the material itself would be non-commericial

6 0.074988842 1080 andrew gelman stats-2011-12-24-Latest in blog advertising

7 0.073859528 48 andrew gelman stats-2010-05-23-The bane of many causes

8 0.073339216 545 andrew gelman stats-2011-01-30-New innovations in spam

9 0.07277374 503 andrew gelman stats-2011-01-04-Clarity on my email policy

10 0.071839057 18 andrew gelman stats-2010-05-06-$63,000 worth of abusive research . . . or just a really stupid waste of time?

11 0.071732 1405 andrew gelman stats-2012-07-04-“Titanic Thompson: The Man Who Would Bet on Everything”

12 0.070494577 1871 andrew gelman stats-2013-05-27-Annals of spam

13 0.070487112 760 andrew gelman stats-2011-06-12-How To Party Your Way Into a Multi-Million Dollar Facebook Job

14 0.069971338 1421 andrew gelman stats-2012-07-19-Alexa, Maricel, and Marty: Three cellular automata who got on my nerves

15 0.069762059 1195 andrew gelman stats-2012-03-04-Multiple comparisons dispute in the tabloids

16 0.06830804 1763 andrew gelman stats-2013-03-14-Everyone’s trading bias for variance at some point, it’s just done at different places in the analyses

17 0.064176604 951 andrew gelman stats-2011-10-11-Data mining efforts for Obama’s campaign

18 0.064155653 1447 andrew gelman stats-2012-08-07-Reproducible science FAIL (so far): What’s stoppin people from sharin data and code?

19 0.064141266 1976 andrew gelman stats-2013-08-10-The birthday problem

20 0.064080104 199 andrew gelman stats-2010-08-11-Note to semi-spammers
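
The page does not say how the simValue column above was computed. One common approach is cosine similarity between TF-IDF vectors, and the following is only a minimal sketch under that assumption. The function names (tfidf_vectors, cosine, rank_similar), the smoothed idf formula, and the toy documents in the usage note are illustrative choices, not the pipeline's actual code.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document."""
    n = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed idf (an assumption) keeps every weight strictly positive.
        vec = {
            term: (count / len(doc)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        }
        vectors.append(vec)
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_similar(docs, query_index):
    """Return (doc_index, similarity) pairs, most similar first."""
    vectors = tfidf_vectors(docs)
    query = vectors[query_index]
    scores = [(i, cosine(query, vec)) for i, vec in enumerate(vectors)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)
```

Calling rank_similar([["spam", "email", "link"], ["spam", "email", "offer"], ["bayesian", "model"]], 0) ranks document 0 first, since a post is always most similar to itself (the "same-blog" row), then document 1, which shares two terms with it, and finally document 2, which shares none.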


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.121), (1, -0.036), (2, -0.041), (3, 0.017), (4, 0.042), (5, 0.037), (6, 0.004), (7, -0.034), (8, -0.032), (9, 0.002), (10, -0.021), (11, -0.041), (12, 0.059), (13, -0.005), (14, -0.033), (15, 0.068), (16, 0.027), (17, -0.046), (18, 0.008), (19, 0.021), (20, 0.015), (21, 0.021), (22, 0.053), (23, -0.022), (24, -0.013), (25, 0.016), (26, 0.038), (27, 0.0), (28, 0.028), (29, 0.024), (30, -0.02), (31, -0.039), (32, -0.009), (33, -0.004), (34, -0.017), (35, 0.011), (36, -0.006), (37, -0.003), (38, -0.02), (39, 0.019), (40, 0.051), (41, -0.027), (42, 0.001), (43, -0.01), (44, -0.055), (45, 0.05), (46, 0.003), (47, 0.005), (48, 0.01), (49, -0.054)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.9584136 1434 andrew gelman stats-2012-07-29-FindTheData.org

2 0.79706866 343 andrew gelman stats-2010-10-15-?

Introduction: How am I supposed to handle this sort of thing? (See below.) I just stuck it one of my email folders without responding, but then I wondered . . . what’s it all about? Is there some sort of Glengarry Glen Ross-like parallel world where down-on-their-luck Jack Lemmons of public relations world send out electronic cold calls? More than anything else, this sort of thing makes me glad I have a steady job. Here’s the (unsolicited) email, which came with the subject line “Please help a reporter do his job”: Dear Andrew, As an Editor for the Bulldog Reporter (www.bulldogreporter.com/dailydog), a media relations trade publication, my job is to help ensure that my readers have accurate info about you and send you the best quality pitches. By taking five minutes or less to answer my questions (pasted below), you’ll receive targeted PR pitches from our client base that will match your beat and interests. Any help or direction is appreciated. Here are my questions. We have you listed

3 0.77875733 1589 andrew gelman stats-2012-11-25-Life as a blogger: the emails just get weirder and weirder

Introduction: In the email the other day, subject line “Casting blogger, writer, journalist to host cable series”: Hi there Andrew, I’m casting a male journalist, writer, blogger, documentary filmmaker or comedian with a certain type personality for a television pilot along with production company, Pipeline39. See below: A certain type of character – no cockiness, no ego, a person who is smart, savvy, dry humor, but someone who isn’t imposing, who can infiltrate these organizations. This person will be hosting his own show and covering alternative lifestyles and secret societies around the world. If you’re interested in hearing more or would like to be considered for this project, please email me a photo and a bio of yourself, along with contact information. I’ll respond to you ASAP. I’m looking forward to hearing from you. *** Casting Producer (646) ***.**** ***@gmail.com I was with them until I got to the “no ego” part. . . . Also, I don’t think I could infiltrate any org

4 0.76967341 1192 andrew gelman stats-2012-03-02-These people totally don’t know what Chance magazine is all about

Introduction: I received the following unsolicited email, subject line “Chance Magazine – Comedy Showcase”: Hi Andrew, Hope you’re doing well. I’m writing to let you know that we will be putting on an industry showcase at the brand new Laughing Devil Comedy Club (4738 Vernon Blvd. Long Island City) on Thursday, February 9th at 8:00 PM. If you’re unfamiliar, it’s one stop on the 7 train from Grand Central. Following the showcase, the club will stay open for an industry mingle/happy hour with drink specials and all the business card exchanging you can hope for. This showcase will feature 9 of our best: Steve Hofstetter’s latest album hit #1 in the world. He’ll be hosting Collin Moulton (Showtime Half Hour Special), Tony Deyo (Aspen Comedy Festival), Tom Simmons (Winner of the SF International Comedy Festival), Marc Ryan (Host of Mudslingers), Mike Trainor (TruTV), Jessi Campbell (CMT), Danny Browning (Bob & Tom), and Joe Zimmerman (Sirius/XM). I would love for you (and anyone you’d like to

5 0.75702423 1871 andrew gelman stats-2013-05-27-Annals of spam

Introduction: I received the following email, subject line “Want to Buy Text Link from andrewgelman.com”: Dear, I am Mary Taylor. I have started a link building campaign for my growing websites. For this, I need your cooperation. The campaign is quite diverse and large scale and if you take some time to understand it – it will benefit us. First I want to clarify that I do not want “blogroll” ”footer” or any other type of “site wide links”. Secondly I want links from inner pages of site – with good page rank of course. Third links should be within text so that Google may not mark them as spam – not for you and not for me. Hence this link building will cause almost no harm to your site or me. Because content links are fine with Google. Now I should come to the requirements. I will accept links from Page Rank 3 to as high as you have got. Also kindly note that I can buy 1 to 50 links from one site – so you should understand the scale of the project. If you have multiple sites with co

6 0.75340748 1012 andrew gelman stats-2011-11-16-Blog bribes!

7 0.73699665 223 andrew gelman stats-2010-08-21-Statoverflow

8 0.72190392 1698 andrew gelman stats-2013-01-30-The spam just gets weirder and weirder

9 0.72117662 1618 andrew gelman stats-2012-12-11-The consulting biz

10 0.71230143 880 andrew gelman stats-2011-08-30-Annals of spam

11 0.70539594 2148 andrew gelman stats-2013-12-25-Spam!

12 0.69794577 951 andrew gelman stats-2011-10-11-Data mining efforts for Obama’s campaign

13 0.68198746 1175 andrew gelman stats-2012-02-19-Factual – a new place to find data

14 0.67857629 866 andrew gelman stats-2011-08-23-Participate in a research project on combining information for prediction

15 0.67592746 118 andrew gelman stats-2010-06-30-Question & Answer Communities

16 0.67255586 919 andrew gelman stats-2011-09-21-Least surprising headline of the year

17 0.66830361 1240 andrew gelman stats-2012-04-02-Blogads update

18 0.66722882 2118 andrew gelman stats-2013-11-30-???

19 0.66589278 2304 andrew gelman stats-2014-04-24-An open site for researchers to post and share papers

20 0.66080254 1211 andrew gelman stats-2012-03-13-A personal bit of spam, just for me!


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(16, 0.019), (24, 0.049), (30, 0.022), (53, 0.013), (97, 0.029), (99, 0.769)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.99894583 1434 andrew gelman stats-2012-07-29-FindTheData.org

2 0.99836886 772 andrew gelman stats-2011-06-17-Graphical tools for understanding multilevel models

Introduction: There are a few things I want to do: 1. Understand a fitted model using tools such as average predictive comparisons, R-squared, and partial pooling factors. In defining these concepts, Iain and I came up with some clever tricks, including (but not limited to): - Separating the inputs and averaging over all possible values of the input not being altered (for average predictive comparisons); - Defining partial pooling without referring to a raw-data or maximum-likelihood or no-pooling estimate (these don’t necessarily exist when you’re fitting logistic regression with sparse data); - Defining an R-squared for each level of a multilevel model. The methods get pretty complicated, though, and they have some loose ends–in particular, for average predictive comparisons with continuous input variables. So now we want to implement these in R and put them into arm along with bglmer etc. 2. Setting up coefplot so it works more generally (that is, so the graphics look nice

3 0.99833059 1315 andrew gelman stats-2012-05-12-Question 2 of my final exam for Design and Analysis of Sample Surveys

Introduction: 2. Which of the following are useful goals in a pilot study? (Indicate all that apply.) (a) You can search for statistical significance, then from that decide what to look for in a confirmatory analysis of your full dataset. (b) You can see if you find statistical significance in a pre-chosen comparison of interest. (c) You can examine the direction (positive or negative, even if not statistically significant) of comparisons of interest. (d) With a small sample size, you cannot hope to learn anything conclusive, but you can get a crude estimate of effect size and standard deviation which will be useful in a power analysis to help you decide how large your full study needs to be. (e) You can talk with survey respondents and get a sense of how they perceived your questions. (f) You get a chance to learn about practical difficulties with sampling, nonresponse, and question wording. (g) You can check if your sample is approximately representative of your population. Soluti

4 0.99829757 1813 andrew gelman stats-2013-04-19-Grad students: Participate in an online survey on statistics education

Introduction: Joan Garfield, a leading researcher in statistics education, is conducting a survey of graduate students who teach or assist with the teaching of statistics. She writes: We want to invite them to take a short survey that will enable us to collect some baseline data that we may use in a grant proposal we are developing. The project would provide summer workshops and ongoing support for graduate students who will be teaching or assisting with teaching introductory statistics classes. If the grant is funded, we would invite up to 40 students from around the country who are entering graduate programs in statistics to participate in a three-year training and support program. The goal of this program is to help these students become expert and flexible teachers of statistics, and to support them as they move through their teaching experiences as graduate students. Here’s the online survey. Garfield writes, “Your responses are completely voluntary and anonymous. Results w

5 0.99816567 1483 andrew gelman stats-2012-09-04-“Bestselling Author Caught Posting Positive Reviews of His Own Work on Amazon”

Introduction: I don’t have much sympathy for well-paid academic plagiarists who are too lazy to do their jobs, but I actually can feel for the author in this story who posted fake positive Amazon reviews of his own books and negative reviews of his competitors’. I mean, sure, this is despicable behavior, I won’t deny that, but it’s gotta be harder and harder to make money writing books. Even a so-called bestselling author must feel under a lot of pressure. I was recently reading a book by Jonathan Coe—he’s just great, and famous, and celebrated, but I doubt he’s getting rich from his books. Not that there’s any reason that he has to get rich, but if even Jonathan Coe isn’t living the high life, that’s not good for authors in general. It’s a far cry from the days in which Updike, Styron, etc., could swagger around like bigshots.

6 0.99800986 174 andrew gelman stats-2010-08-01-Literature and life

7 0.99768591 23 andrew gelman stats-2010-05-09-Popper’s great, but don’t bother with his theory of probability

8 0.99722028 589 andrew gelman stats-2011-02-24-On summarizing a noisy scatterplot with a single comparison of two points

9 0.99698877 1288 andrew gelman stats-2012-04-29-Clueless Americans think they’ll never get sick

10 0.99654251 756 andrew gelman stats-2011-06-10-Christakis-Fowler update

11 0.99637389 6 andrew gelman stats-2010-04-27-Jelte Wicherts lays down the stats on IQ

12 0.99637389 90 andrew gelman stats-2010-06-16-Oil spill and corn production

13 0.99637389 122 andrew gelman stats-2010-07-01-MCMC machine

14 0.99637389 299 andrew gelman stats-2010-09-27-what is = what “should be” ??

15 0.99637389 632 andrew gelman stats-2011-03-28-Wobegon on the Potomac

16 0.99637389 826 andrew gelman stats-2011-07-27-The Statistics Forum!

17 0.99637389 1298 andrew gelman stats-2012-05-03-News from the sister blog!

18 0.99637389 1464 andrew gelman stats-2012-08-20-Donald E. Westlake on George W. Bush

19 0.99637085 521 andrew gelman stats-2011-01-17-“the Tea Party’s ire, directed at Democrats and Republicans alike”

20 0.9960708 860 andrew gelman stats-2011-08-18-Trolls!