951 andrew gelman stats-2011-10-11-Data mining efforts for Obama’s campaign


meta info for this blog

Source: html

Introduction: From CNN: In July, KDNuggets.com, an online news site focused on data mining and analytics software, ran an unusual listing in its jobs section: “We are looking for Predictive Modeling/Data Mining Scientists and Analysts, at both the senior and junior level, to join our department through November 2012 at our Chicago Headquarters,” read the ad. “We are a multi-disciplinary team of statisticians, predictive modelers, data mining experts, mathematicians, software developers, general analysts and organizers – all striving for a single goal: re-electing President Obama.” Users of the Obama 2012 – Are You In? app are not only giving the campaign personal data like their name, gender, birthday, current city, religion and political views; they are sharing their list of friends and information those friends share, like their birthday, current city, religion and political views. As Facebook is now offering the geo-targeting of ads down to ZIP code, this kind of fine-grained informa


Summary: the most important sentences generated by the tfidf model

sentIndex sentText sentNum sentScore

1 “We are a multi-disciplinary team of statisticians, predictive modelers, data mining experts, mathematicians, software developers, general analysts and organizers – all striving for a single goal: re-electing President Obama. [sent-3, score-0.921]

2 Users of the Obama 2012 – Are You In? app are not only giving the campaign personal data like their name, gender, birthday, current city, religion and political views; they are sharing their list of friends and information those friends share, like their birthday, current city, religion and political views. [sent-5, score-1.084]

3 As Facebook is now offering the geo-targeting of ads down to ZIP code, this kind of fine-grained information is invaluable. [sent-6, score-0.286]

4 Inside the Obama operation, his staff members are using a powerful social networking tool called NationalField, which enables everyone to share what they are working on. [sent-7, score-0.667]

5 Modeled on Facebook, the tool connects all levels of staff to the information they are gathering as they work on tasks like signing up volunteers, knocking on doors, identifying likely voters and dealing with problems. [sent-8, score-1.102]

6 Managers can set goals for field organizers — number of calls made, number of doors knocked — and see, in real time, how people are doing against all kinds of metrics. [sent-9, score-0.572]
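The page does not state how sentScore is computed. One common approach, shown here only as a minimal sketch, sums the tfidf weights of the distinct words in each sentence and ranks sentences by that total. The names score_sentence and rank_sentences are hypothetical, the weights are a handful of values copied from the topN-words list on this page, and the result is not expected to reproduce the scores listed above.

```python
import re

# A few (word, weight) pairs copied from the topN-words list on this page.
WEIGHTS = {
    "facebook": 0.185,
    "zip": 0.125,
    "information": 0.105,
    "ads": 0.096,
    "offering": 0.085,
}


def score_sentence(sentence, weights):
    """Sum the tfidf weights of the distinct lowercase words in a sentence.

    Assumption: this is one plausible scoring rule, not the page's actual one.
    """
    tokens = set(re.findall(r"[a-z]+", sentence.lower()))
    return sum(weights.get(token, 0.0) for token in tokens)


def rank_sentences(sentences, weights):
    """Return sentences ordered from highest to lowest score."""
    return sorted(sentences, key=lambda s: score_sentence(s, weights), reverse=True)
```

Under this rule, longer sentences that contain many high-weight words rise to the top, which is at least consistent with the kind of sentences selected in the summary.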


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('mining', 0.271), ('doors', 0.239), ('birthday', 0.224), ('organizers', 0.218), ('analysts', 0.185), ('facebook', 0.185), ('staff', 0.174), ('religion', 0.162), ('city', 0.142), ('tool', 0.136), ('friends', 0.131), ('obama', 0.128), ('software', 0.127), ('zip', 0.125), ('enables', 0.125), ('headquarters', 0.125), ('predictive', 0.12), ('share', 0.12), ('signing', 0.115), ('app', 0.115), ('knocked', 0.115), ('modelers', 0.112), ('networking', 0.112), ('knocking', 0.109), ('cnn', 0.109), ('metrics', 0.107), ('information', 0.105), ('gathering', 0.104), ('managers', 0.102), ('operation', 0.101), ('mathematicians', 0.099), ('analytics', 0.099), ('current', 0.098), ('listing', 0.097), ('connects', 0.096), ('volunteers', 0.096), ('developers', 0.096), ('ads', 0.096), ('tasks', 0.096), ('junior', 0.091), ('july', 0.085), ('identifying', 0.085), ('offering', 0.085), ('modeled', 0.085), ('join', 0.084), ('november', 0.083), ('sharing', 0.082), ('dealing', 0.082), ('senior', 0.081), ('inside', 0.079)]
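As a rough illustration of where (word, weight) pairs like these can come from, the sketch below computes tfidf weights for a toy corpus. It assumes a plain term-frequency times log(N/df) weighting followed by L2 normalization; the page does not say which tfidf variant or library was used, and the function names tfidf and top_n are hypothetical.

```python
import math
from collections import Counter


def tfidf(docs):
    """Compute L2-normalized tfidf weights.

    docs: list of token lists, one per document.
    Returns one {word: weight} dict per document.
    Assumption: weight = tf * log(N / df); real pipelines often smooth this.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each word appears.
    df = Counter(word for doc in docs for word in set(doc))
    result = []
    for doc in docs:
        tf = Counter(doc)
        raw = {w: tf[w] * math.log(n_docs / df[w]) for w in tf}
        norm = math.sqrt(sum(v * v for v in raw.values())) or 1.0
        result.append({w: v / norm for w, v in raw.items()})
    return result


def top_n(weights, n=50):
    """Return the n highest-weighted (word, weight) pairs, largest first."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)[:n]
```

Words that appear in every document get a weight of zero under this variant, which is why corpus-wide common words would not show up in a top-N list like the one above.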

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 951 andrew gelman stats-2011-10-11-Data mining efforts for Obama’s campaign

2 0.132403 1902 andrew gelman stats-2013-06-17-Job opening at new “big data” consulting firm!

Introduction: David Shor sends along a job announcement for Civis Analytics, which he describes as “basically Obama’s Analytics team reconstituted as a company”: Data Scientist Position Overview Data Scientists are responsible for providing the fundamental data science that powers our work – including predictive analytics, data mining, experimental design and ad-hoc statistical analysis. As a Data Scientist, you will join our Chicago-based data science team, working closely and collaboratively with analysts and engineers to identify, quantify and solve big, meaningful problems. Data Scientists will have the opportunity to dive deeply into big problems and work in a variety of areas. Civis Analytics has opportunities for applicants who are seasoned professionals, brilliant newcomers, and anywhere in between. Qualifications · Master’s degree in statistics, machine learning, computer science with heavy quant focus, a related subject, or a Bachelor’s degree and significant work ex

3 0.12064236 635 andrew gelman stats-2011-03-29-Bayesian spam!

Introduction: Cool! I know Bayes has reached the big time when I receive spam like this: Bayesian networks are rapidly emerging as a new research paradigm . . . With this monthly newsletter, we’ll keep you up to date . . . Financial Analytics Webinar . . . will exhibit at this year’s INFORMS Analytics Conference in downtown Chicago. Please join us for our Bayesian networks technology workshop on April 10 . . . a powerful desktop application (Windows/Mac/Unix) for knowledge discovery, data mining, analytics, predictive modeling and simulation . . . the world’s only comprehensive software package for learning, editing and analyzing Bayesian networks . . . If you no longer wish to receive these emails, please reply to this message with “Unsubscribe” in the subject line . . . You know the saying, “It’s not real unless it’s on TV”? My saying is: It’s not real until it’s on spam.

4 0.10208385 2309 andrew gelman stats-2014-04-28-Crowdstorming a dataset

Introduction: Raphael Silberzahn writes: Brian Nosek, Eric Luis Uhlmann, Dan Martin, and I just launched a project through the Open Science Center we think you’ll find interesting. The basic idea is to “Crowdstorm a Dataset”. Multiple independent analysts are recruited to test the same hypothesis on the same data set in whatever manner they see as best. If everyone comes up with the same results, then scientists can speak with one voice. If not, the subjectivity and conditionality of results on analysis strategy is made transparent. For this first project, we are crowdstorming the question of whether soccer referees are more likely to give red cards to dark skin toned players than light skin toned players. The full project description is here . If you’re interested in being one of the crowdstormer analysts, you can register here . All analysts will receive an author credit on the final paper. We would love to have Bayesian analysts represented in the group. Also, please feel free to let

5 0.094526656 1447 andrew gelman stats-2012-08-07-Reproducible science FAIL (so far): What’s stoppin people from sharin data and code?

Introduction: David Karger writes: Your recent post on sharing data was of great interest to me, as my own research in computer science asks how to incentivize and lower barriers to data sharing. I was particularly curious about your highlighting of effort as the major dis-incentive to sharing. I would love to hear more, as this question of effort is one we specifically target in our development of tools for data authoring and publishing. As a straw man, let me point out that sharing data technically requires no more than posting an excel spreadsheet online. And that you likely already produced that spreadsheet during your own analytic work. So, in what way does such low-tech publishing fail to meet your data sharing objectives? Our own hypothesis has been that the effort is really quite low, with the problem being a lack of *immediate/tangible* benefits (as opposed to the long-term values you accurately describe). To attack this problem, we’re developing tools (and, since it appear

6 0.093090788 1279 andrew gelman stats-2012-04-24-ESPN is looking to hire a research analyst

7 0.090585694 465 andrew gelman stats-2010-12-13-$3M health care prediction challenge

8 0.088966653 1519 andrew gelman stats-2012-10-02-Job!

9 0.087058671 1777 andrew gelman stats-2013-03-26-Data Science for Social Good summer fellowship program

10 0.084854253 41 andrew gelman stats-2010-05-19-Updated R code and data for ARM

11 0.083488211 1297 andrew gelman stats-2012-05-03-New New York data research organizations

12 0.075672314 2255 andrew gelman stats-2014-03-19-How Americans vote

13 0.073534206 760 andrew gelman stats-2011-06-12-How To Party Your Way Into a Multi-Million Dollar Facebook Job

14 0.073455997 1569 andrew gelman stats-2012-11-08-30-30-40 Nation

15 0.072048686 231 andrew gelman stats-2010-08-24-Yet another Bayesian job opportunity

16 0.07175611 362 andrew gelman stats-2010-10-22-A redrawing of the Red-Blue map in November 2010?

17 0.071620554 1497 andrew gelman stats-2012-09-15-Our blog makes connections!

18 0.071563751 1650 andrew gelman stats-2013-01-03-Did Steven Levitt really believe in 2008 that Obama “would be the greatest president in history”?

19 0.071245499 978 andrew gelman stats-2011-10-28-Cool job opening with brilliant researchers at Yahoo

20 0.071066 1574 andrew gelman stats-2012-11-12-How to Lie With Statistics example number 12,498,122
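The simValue column is not defined on the page. A self-comparison score of 1.0 is what cosine similarity between tfidf vectors would give, so the sketch below assumes cosine similarity over sparse {word: weight} dicts. The function names cosine and most_similar and the corpus layout are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as {key: weight} dicts."""
    dot = sum(w * b[k] for k, w in a.items() if k in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, corpus, top=20):
    """Rank documents in corpus ({doc_id: vector}) by similarity to query.

    Returns (similarity, doc_id) pairs, most similar first.
    """
    scored = [(cosine(query, vec), doc_id) for doc_id, vec in corpus.items()]
    return sorted(scored, reverse=True)[:top]
```

With top=20 this yields a list shaped like the one above: the post itself first, followed by the posts that share the most heavily weighted words with it.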


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.11), (1, -0.033), (2, 0.018), (3, 0.05), (4, 0.006), (5, 0.044), (6, -0.059), (7, -0.037), (8, -0.065), (9, 0.035), (10, -0.032), (11, -0.023), (12, 0.012), (13, -0.027), (14, -0.055), (15, 0.035), (16, -0.003), (17, -0.034), (18, 0.013), (19, 0.017), (20, 0.016), (21, 0.052), (22, -0.001), (23, 0.018), (24, -0.027), (25, -0.021), (26, -0.001), (27, 0.018), (28, 0.052), (29, -0.004), (30, 0.012), (31, -0.017), (32, 0.015), (33, 0.005), (34, 0.002), (35, 0.022), (36, -0.043), (37, 0.031), (38, -0.013), (39, -0.0), (40, 0.025), (41, -0.028), (42, 0.016), (43, 0.027), (44, -0.035), (45, 0.031), (46, 0.013), (47, 0.03), (48, 0.0), (49, -0.037)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.95602912 951 andrew gelman stats-2011-10-11-Data mining efforts for Obama’s campaign

2 0.73374492 1777 andrew gelman stats-2013-03-26-Data Science for Social Good summer fellowship program

Introduction: Juan-Pablo Velez writes: I’m helping with a  Data Science for Social Good  summer fellowship program at the University of Chicago. The goal is to train data scientists that can tackle problems in education, healthcare, energy, transportation, and more. Working with full-time mentors from academia, industry, and the  Obama campaign , fellows will build high-impact analytics projects using statistics, machine learning, data mining, and big data technologies. For fellows, we’re looking for grad students, advanced undergrads, and professionals in computer science, machine learning, statistics, and the computational and quantitative sciences. For mentors, we’re looking for folks with practical data science experience. Fellows and mentors will be paid competitively and housed in Chicago for duration of the program, from early June to late August. Rayid Ghani , former Chief Scientist of the Obama 2012 campaign, is leading the program.  Eric Sch

3 0.70490664 1909 andrew gelman stats-2013-06-21-Job openings at conservative political analytics firm!

Introduction: After posting that announcement about Civis Analytics, I wrote, “If a reconstituted Romney Analytics team is hiring, let me know and I’ll post that ad too.” Adam Schaeffer obliged : Not sure about Romney’s team, but Evolving Strategies is looking for sharp folks who lean right: Evolving Strategies is a political communications research firm specializing in randomized controlled experiments in the “lab” and in the “field.” ES is bringing a scientific revolution to free-market/conservative politics. We are looking for people who are obsessive about getting things right and creative in their work. An ideal candidate will have a deep understanding of the academic literature in their field, highly developed skills, a commitment to academic rigor, but an intuitive understanding of practical political concerns and objectives as well. We’re looking for new talent to help with our fast-growing portfolio in these areas: High-level data processing, statistical analysis and modelin

4 0.69685864 1990 andrew gelman stats-2013-08-20-Job opening at an organization that promotes reproducible research!

Introduction: I was told about an organization called Reproducibility Initiative. They tell me they are trying to make what was described in our “50 shades of gray” post standard across all of science, particularly areas like cancer research. I don’t know anything else about them, but that sounds like a good start! Here’s the ad: Data Scientist: Science Exchange, Palo Alto, CA Science Exchange is an innovative start-up with a mission to improve the efficiency and quality of scientific research. This Data Science position is critical to our mission. Our ideal candidate has the ability to collect and normalize data from multiple sources. This information will be used to drive marketing and product decisions, as well as fuel many of the features of Science Exchange. Desired Skills & Experience Experience with text mining, entity extraction and natural language processing is essential Experience scripting with either Python or R Experience running complex statistical analyses on l

5 0.68965799 1923 andrew gelman stats-2013-07-03-Bayes pays!

Introduction: Jason Rosenfeld, who has the amazing title of “Manager of Basketball Analytics” at the Charlotte Bobcats, announces the following jobs : Basketball Operations: Statistics Basketball Operations Systems Developer – Charlotte Bobcats (Charlotte, NC) POSITION OVERVIEW The Basketball Operations System Developer will collect and import data to our database, check data, and field requests from the Basketball Operations staff.  This position will be instrumental in molding and improving our database to assist the staff in player personnel and coaching efforts. ESSENTIAL DUTIES AND RESPONSIBILITIES • Respond to data and database requests from the front office. • Build user-friendly software tools for use by the basketball operations staff. • Accumulate data from various sources to input and organize into our system to assist the basketball operations staff with decisions. • Check and clean data for accuracy and import to our database. • Provide ideas and play a key ro

6 0.68568903 1434 andrew gelman stats-2012-07-29-FindTheData.org

7 0.67776924 1902 andrew gelman stats-2013-06-17-Job opening at new “big data” consulting firm!

8 0.66830754 2221 andrew gelman stats-2014-02-23-Postdoc with Huffpost Pollster to do Bayesian poll tracking

9 0.6582222 1279 andrew gelman stats-2012-04-24-ESPN is looking to hire a research analyst

10 0.65558773 465 andrew gelman stats-2010-12-13-$3M health care prediction challenge

11 0.64547342 1297 andrew gelman stats-2012-05-03-New New York data research organizations

12 0.6337285 1525 andrew gelman stats-2012-10-08-Ethical standards in different data communities

13 0.6309787 714 andrew gelman stats-2011-05-16-NYT Labs releases Openpaths, a utility for saving your iphone data

14 0.62870395 275 andrew gelman stats-2010-09-14-Data visualization at the American Evaluation Association

15 0.62712038 2307 andrew gelman stats-2014-04-27-Big Data…Big Deal? Maybe, if Used with Caution.

16 0.61906344 1530 andrew gelman stats-2012-10-11-Migrating your blog from Movable Type to WordPress

17 0.61716115 192 andrew gelman stats-2010-08-08-Turning pages into data

18 0.61308187 2345 andrew gelman stats-2014-05-24-An interesting mosaic of a data programming course

19 0.61221284 1630 andrew gelman stats-2012-12-18-Postdoc positions at Microsoft Research – NYC

20 0.60835749 211 andrew gelman stats-2010-08-17-Deducer update


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(5, 0.212), (9, 0.014), (16, 0.076), (17, 0.022), (23, 0.036), (24, 0.049), (30, 0.03), (45, 0.019), (53, 0.023), (55, 0.019), (72, 0.052), (77, 0.023), (82, 0.029), (86, 0.024), (93, 0.017), (98, 0.011), (99, 0.254)]
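Unlike the lsi vector, the lda list is sparse: only topics with non-negligible weight are printed. The sketch below, offered under stated assumptions, parses such a (topicId, topicWeight) list and either expands it to a dense vector or picks out the dominant topics. The total of 100 topics is a guess based on the largest topicId shown (99), and the function names to_dense and dominant_topics are hypothetical.

```python
import ast


def to_dense(pairs_text, num_topics):
    """Expand a printed '[(topicId, weight), ...]' list into a dense vector.

    Topics missing from the list are filled with 0.0.
    """
    pairs = ast.literal_eval(pairs_text)
    dense = [0.0] * num_topics
    for topic_id, weight in pairs:
        dense[topic_id] = weight
    return dense


def dominant_topics(pairs_text, k=2):
    """Return the k highest-weighted (topicId, weight) pairs, largest first."""
    pairs = ast.literal_eval(pairs_text)
    return sorted(pairs, key=lambda p: p[1], reverse=True)[:k]
```

Applied to the list above, dominant_topics would pick out topics 99 and 5, which together carry nearly half of the total weight shown for this post.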

similar blogs list:

simIndex simValue blogId blogTitle

1 0.95284092 422 andrew gelman stats-2010-11-20-A Gapminder-like data visualization package

Introduction: Ossama Hamed writes in with a new dynamic graphing software: I have the pleasure to brief you on our Data Visualization software “Trend Compass”. TC is a new concept in viewing statistics and trends in an animated way by displaying in one chart 5 axis (X, Y, Time, Bubble size & Bubble color) instead of just the traditional X and Y axis. . . .

2 0.93388945 513 andrew gelman stats-2011-01-12-“Tied for Warmest Year On Record”

Introduction: The National Climatic Data Center has tentatively announced that 2010 is, get this, “tied” for warmest on record. Presumably they mean it’s tied to the precision that they quote (1.12 F above the 20th-century average). The uncertainty in the measurements, as well as some fuzziness about exactly what is being measured (how much of the atmosphere, and the oceans) makes these global-average things really suspect. For instance, if there’s more oceanic turnover one year, that can warm the deep ocean but cool the shallow ocean and atmosphere, so even though the heat content of the atmosphere-ocean system goes up, some of these “global-average” estimates can go down. The reverse can happen too. And of course there are various sources of natural variability that are not, these days, what most people are most interested in. So everybody who knows about the climate professes to hate the emphasis on climate records. And yet, they’re irresistible. I’m sure we’ll see the usual clamor of som

3 0.93269956 665 andrew gelman stats-2011-04-17-Yes, your wish shall be granted (in 25 years)

Introduction: This one was so beautiful I just had to repost it: From the New York Times, 9 Sept 1981: IF I COULD CHANGE PARK SLOPE If I could change Park Slope I would turn it into a palace with queens and kings and princesses to dance the night away at the ball. The trees would look like garden stalks. The lights would look like silver pearls and the dresses would look like soft silver silk. You should see the ball. It looks so luxurious to me. The Park Slope ball is great. Can you guess what street it’s on? “Yes. My street. That’s Carroll Street.” – Jennifer Chatmon, second grade, P.S. 321 This was a few years before my sister told me that she felt safer having a crack house down the block because the cops were surveilling it all the time.

4 0.92882603 1250 andrew gelman stats-2012-04-07-Hangman tips

Introduction: Jeff pointed me to this article by Nick Berry. It’s kind of fun but of course if you know your opponent will be following this strategy you can figure out how to outwit it. Also, Berry writes that ETAOIN SHRDLU CMFWYP VBGKQJ XZ is the “ordering of letter frequency in English language.” Indeed this is the conventional ordering but nobody thinks it’s right anymore. See here (with further discussion here ). I wonder what corpus he’s using. P.S. Klutz was my personal standby.

5 0.92846632 2005 andrew gelman stats-2013-09-02-“Il y a beaucoup de candidats démocrates, et leurs idéologies ne sont pas très différentes. Et la participation est imprévisible.”

Introduction: As I wrote a couple years ago: Even though statistical analysis has demonstrated that presidential elections are predictable given economic conditions and previous votes in the states . . . it certainly doesn’t mean that every election can be accurately predicted ahead of time. Presidential general election campaigns have several distinct features that distinguish them from most other elections: 1. Two major candidates; 2. The candidates clearly differ in their political ideologies and in their positions on economic issues; 3. The two sides have roughly equal financial and organizational resources; 4. The current election is the latest in a long series of similar contests (every four years); 5. A long campaign, giving candidates a long time to present their case and giving voters a long time to make up their minds. Other elections look different. . . . Or, as I said in reference to the current NYC mayoral election: Et selon Andrew Gelman, expert de l’universi

6 0.92288113 87 andrew gelman stats-2010-06-15-Statistical analysis and visualization of the drug war in Mexico

same-blog 7 0.92067492 951 andrew gelman stats-2011-10-11-Data mining efforts for Obama’s campaign

8 0.9134959 1286 andrew gelman stats-2012-04-28-Agreement Groups in US Senate and Dynamic Clustering

9 0.90855676 1103 andrew gelman stats-2012-01-06-Unconvincing defense of the recent Russian elections, and a problem when an official organ of an academic society has low standards for publication

10 0.90816021 1512 andrew gelman stats-2012-09-27-A Non-random Walk Down Campaign Street

11 0.90441352 364 andrew gelman stats-2010-10-22-Politics is not a random walk: Momentum and mean reversion in polling

12 0.89770621 1052 andrew gelman stats-2011-12-11-Rational Turbulence

13 0.88325298 224 andrew gelman stats-2010-08-22-Mister P gets married

14 0.87891066 228 andrew gelman stats-2010-08-24-A new efficient lossless compression algorithm

15 0.87425566 1606 andrew gelman stats-2012-12-05-The Grinch Comes Back

16 0.86227781 1894 andrew gelman stats-2013-06-12-How to best graph the Beveridge curve, relating the vacancy rate in jobs to the unemployment rate?

17 0.86098617 1986 andrew gelman stats-2013-08-17-Somebody’s looking for a book on time series analysis in the style of Angrist and Pischke, or Gelman and Hill

18 0.860264 1207 andrew gelman stats-2012-03-10-A quick suggestion

19 0.83913314 1914 andrew gelman stats-2013-06-25-Is there too much coauthorship in economics (and science more generally)? Or too little?

20 0.83817387 123 andrew gelman stats-2010-07-01-Truth in headlines