andrew_gelman_stats andrew_gelman_stats-2010 andrew_gelman_stats-2010-223 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Skirant Vadali writes: I am writing to seek your help in building a community driven Q&A; website tentatively called called ‘Statistics Analysis’. I am neither a founder of this website nor do I have any financial stake in its success. By way of background to this website, please see Stackoverflow (http://stackoverflow.com/) and Mathoverflow (http://mathoverflow.net/). Stackoverflow is a Q&A; website targeted at software developers and is designed to help them ask questions and get answers from other developers. Mathoverflow is a Q&A; website targeted at research mathematicians and is designed to help them ask and answer questions from other mathematicians across the world. The success of both these sites in helping their respective communities is a strong indicator that sites designed along these lines are very useful. The company that runs Stackoverflow (who also host Mathoverflow.net) has recently decided to develop other community driven websites for various other topic are
sentIndex sentText sentNum sentScore
1 Skirant Vadali writes: I am writing to seek your help in building a community driven Q&A; website tentatively called called ‘Statistics Analysis’. [sent-1, score-1.493]
2 I am neither a founder of this website nor do I have any financial stake in its success. [sent-2, score-0.736]
3 Stackoverflow is a Q&A; website targeted at software developers and is designed to help them ask questions and get answers from other developers. [sent-6, score-1.467]
4 Mathoverflow is a Q&A; website targeted at research mathematicians and is designed to help them ask and answer questions from other mathematicians across the world. [sent-7, score-1.623]
5 The success of both these sites in helping their respective communities is a strong indicator that sites designed along these lines are very useful. [sent-8, score-1.064]
6 The company that runs Stackoverflow (who also host Mathoverflow. [sent-9, score-0.224]
7 net) has recently decided to develop other community driven websites for various other topic areas including statistics. [sent-10, score-0.749]
8 Given the number of emails you get seeking help with statistics related questions and given the volume of help messages at various stats forums, I think a community driven statistics Q&A; website would be very helpful. [sent-11, score-2.117]
9 I have provided a link if you wish to explore the ‘Statistics Analysis’ Q&A; website which is currently in the process of being developed. [sent-12, score-0.677]
10 I haven’t had a chance to look at this; just passing it on to anyone who might be interested. [sent-14, score-0.085]
wordName wordTfidf (topN-words)
[('website', 0.419), ('stackoverflow', 0.374), ('mathoverflow', 0.273), ('driven', 0.247), ('designed', 0.22), ('help', 0.203), ('targeted', 0.189), ('community', 0.189), ('mathematicians', 0.186), ('sites', 0.178), ('questions', 0.125), ('respective', 0.117), ('tentatively', 0.117), ('http', 0.112), ('websites', 0.108), ('forums', 0.105), ('statistics', 0.104), ('founder', 0.102), ('ask', 0.095), ('stake', 0.093), ('communities', 0.092), ('developers', 0.09), ('host', 0.09), ('called', 0.089), ('indicator', 0.086), ('passing', 0.085), ('emails', 0.081), ('messages', 0.08), ('seek', 0.079), ('seeking', 0.079), ('various', 0.077), ('helping', 0.077), ('volume', 0.074), ('runs', 0.072), ('stats', 0.072), ('wish', 0.069), ('explore', 0.068), ('develop', 0.067), ('answers', 0.066), ('neither', 0.063), ('provided', 0.063), ('company', 0.062), ('decided', 0.061), ('building', 0.061), ('given', 0.06), ('software', 0.06), ('financial', 0.059), ('success', 0.059), ('currently', 0.058), ('lines', 0.057)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 223 andrew gelman stats-2010-08-21-Statoverflow
Introduction: Skirant Vadali writes: I am writing to seek your help in building a community driven Q&A; website tentatively called called ‘Statistics Analysis’. I am neither a founder of this website nor do I have any financial stake in its success. By way of background to this website, please see Stackoverflow (http://stackoverflow.com/) and Mathoverflow (http://mathoverflow.net/). Stackoverflow is a Q&A; website targeted at software developers and is designed to help them ask questions and get answers from other developers. Mathoverflow is a Q&A; website targeted at research mathematicians and is designed to help them ask and answer questions from other mathematicians across the world. The success of both these sites in helping their respective communities is a strong indicator that sites designed along these lines are very useful. The company that runs Stackoverflow (who also host Mathoverflow.net) has recently decided to develop other community driven websites for various other topic are
2 0.2367544 118 andrew gelman stats-2010-06-30-Question & Answer Communities
Introduction: StackOverflow has been a popular community where software developers would help one another. Recently they raised some VC funding , and to make profits they are selling job postings and expanding the model to other areas. Metaoptimize LLC has started a similar website, using the open-source OSQA framework for such as statistics and machine learning. Here’s a description: You and other data geeks can ask and answer questions on machine learning, natural language processing, artificial intelligence, text analysis, information retrieval, search, data mining, statistical modeling, and data visualization. Here you can ask and answer questions, comment and vote for the questions of others and their answers. Both questions and answers can be revised and improved. Questions can be tagged with the relevant keywords to simplify future access and organize the accumulated material. If you work very hard on your questions and answers, you will receive badges like “Guru”, “Studen
3 0.13295466 2239 andrew gelman stats-2014-03-09-Reviewing the peer review process?
Introduction: I received the following email: Dear Colleague, Recently we informed you about SciRev, our new website where researchers can share their experiences with the peer review process and select an efficient journal for submitting their work. Since our start, we already received over 500 reviews and many positive reactions, which reveal a great need for comparable information on duration and quality of the review process. All reviews are publicly available on our website, both at the pages of the journals and in an overview at www.scirev.sc/reviews To make this venture a success, many reviews are needed. We therefore would appreciate it very much if you could take a few minutes to visit our website www.SciRev.sc and share your recent review experiences with your colleagues. SciRev also offers you the possibility to create a free account where you can administer your manuscripts under review and create a personal journal list. Thanks on behalf of the research community, Jan
Introduction: Amanda Martinez, a writer for The Atlantic and others, advised attendees that her favorite writing “accorded me the basic human dignity of allowing me to draw my own conclusions.” I really like that way of putting it, and this is something we tried hard to do with Red State Blue State, to put the information and our reasoning right there in front of the reader, rather than hiding behind a bunch of statistically-significant regression coefficients. This is related to the idea of presenting research findings quantitatively (which, I think, lends itself to clearer statements of uncertainty and variation) rather than qualitatively (which seems to come out more deterministically, as “X causes Y” or “when A happens, B happens”). The above quote comes from a conference of students organized by Nathan Sanders, who writes: Thanks so much for posting an announcement about the Communicating Science workshop (ComSciCon) back in January! With the help of your blog, we received more than
5 0.12219165 1061 andrew gelman stats-2011-12-16-CrossValidated: A place to post your statistics questions
Introduction: Seth Rogers writes: I [Rogers] am a member of an online community of statisticians where I burn a great deal of time (and a recovering cog sci researcher). Our community website is a peer-reviewed Q and A spanning stats topics ranging from applications to mathematical theory. Our online community consists of mostly university faculty, grad students and technical consultants. The answer quality is very strong and the web design is intuitive. I think you and your readers are like-minded and would be really interested in some of the topics on the site, CrossValidated (you may know the sister site: stackoverflow.com ). The philosophy is purely to further knowledge for the sake of knowledge and take pride in learning. I took a quick look and the site seemed like it could be useful to people. The only thing I didn’t understand is, why doesn’t it have a search function? (Or maybe it was there somewhere and I couldn’t find it.) P.S. to all the commenters who wrote replies such
6 0.12111448 1080 andrew gelman stats-2011-12-24-Latest in blog advertising
7 0.11148881 2345 andrew gelman stats-2014-05-24-An interesting mosaic of a data programming course
8 0.094378725 199 andrew gelman stats-2010-08-11-Note to semi-spammers
9 0.089851797 1782 andrew gelman stats-2013-03-30-“Statistical Modeling: A Fresh Approach”
10 0.084785268 1489 andrew gelman stats-2012-09-09-Commercial Bayesian inference software is popping up all over
11 0.083802462 1754 andrew gelman stats-2013-03-08-Cool GSS training video! And cumulative file 1972-2012!
12 0.081302091 1948 andrew gelman stats-2013-07-21-Bayes related
13 0.080596909 132 andrew gelman stats-2010-07-07-Note to “Cigarettes”
14 0.079889588 231 andrew gelman stats-2010-08-24-Yet another Bayesian job opportunity
15 0.078295559 2016 andrew gelman stats-2013-09-11-Zipfian Academy, A School for Data Science
16 0.073950283 880 andrew gelman stats-2011-08-30-Annals of spam
17 0.073886484 1630 andrew gelman stats-2012-12-18-Postdoc positions at Microsoft Research – NYC
18 0.072406515 1976 andrew gelman stats-2013-08-10-The birthday problem
19 0.069926284 505 andrew gelman stats-2011-01-05-Wacky interview questions: An exploration into the nature of evidence on the internet
20 0.068795554 513 andrew gelman stats-2011-01-12-“Tied for Warmest Year On Record”
topicId topicWeight
[(0, 0.1), (1, -0.027), (2, -0.053), (3, 0.015), (4, 0.036), (5, 0.076), (6, -0.041), (7, -0.016), (8, -0.031), (9, 0.012), (10, -0.002), (11, -0.062), (12, 0.069), (13, -0.005), (14, -0.04), (15, 0.046), (16, 0.009), (17, -0.027), (18, -0.0), (19, 0.005), (20, 0.044), (21, 0.014), (22, 0.024), (23, -0.042), (24, 0.002), (25, 0.037), (26, 0.062), (27, -0.006), (28, 0.002), (29, -0.011), (30, 0.012), (31, -0.043), (32, 0.037), (33, 0.015), (34, -0.043), (35, 0.027), (36, -0.006), (37, 0.019), (38, 0.013), (39, -0.006), (40, 0.037), (41, -0.029), (42, -0.002), (43, 0.043), (44, -0.035), (45, 0.037), (46, -0.013), (47, 0.008), (48, 0.001), (49, 0.02)]
simIndex simValue blogId blogTitle
same-blog 1 0.95675242 223 andrew gelman stats-2010-08-21-Statoverflow
Introduction: Skirant Vadali writes: I am writing to seek your help in building a community driven Q&A; website tentatively called called ‘Statistics Analysis’. I am neither a founder of this website nor do I have any financial stake in its success. By way of background to this website, please see Stackoverflow (http://stackoverflow.com/) and Mathoverflow (http://mathoverflow.net/). Stackoverflow is a Q&A; website targeted at software developers and is designed to help them ask questions and get answers from other developers. Mathoverflow is a Q&A; website targeted at research mathematicians and is designed to help them ask and answer questions from other mathematicians across the world. The success of both these sites in helping their respective communities is a strong indicator that sites designed along these lines are very useful. The company that runs Stackoverflow (who also host Mathoverflow.net) has recently decided to develop other community driven websites for various other topic are
2 0.73498535 118 andrew gelman stats-2010-06-30-Question & Answer Communities
Introduction: StackOverflow has been a popular community where software developers would help one another. Recently they raised some VC funding , and to make profits they are selling job postings and expanding the model to other areas. Metaoptimize LLC has started a similar website, using the open-source OSQA framework for such as statistics and machine learning. Here’s a description: You and other data geeks can ask and answer questions on machine learning, natural language processing, artificial intelligence, text analysis, information retrieval, search, data mining, statistical modeling, and data visualization. Here you can ask and answer questions, comment and vote for the questions of others and their answers. Both questions and answers can be revised and improved. Questions can be tagged with the relevant keywords to simplify future access and organize the accumulated material. If you work very hard on your questions and answers, you will receive badges like “Guru”, “Studen
3 0.73288977 1279 andrew gelman stats-2012-04-24-ESPN is looking to hire a research analyst
Introduction: This is somebody’s dream job, I’m sure . . . ESPN is looking for a statistician to join the HR department as a Research Analyst . The job will consist of analytical research and producing statistics about the people that work at ESPN. Topics of interest will include productivity, efficiency, and retention of employees, among other items. In addition to data mining and producing reports, we also field surveys and analyze results. The position is located at the headquarters in Bristol, Connecticut, the same campus where nearly all ESPN shows are produced. ESPN is a Disney company, so discounts and free admission to Disney parks are available for employees. Flexible work arrangements are available, along with working in the New York City office part-time if desired. The role is a relatively new function and will have a high impact very quickly on helping the business function. Statistical software, text books, and any other resource needed to get the job done will be provided. T
4 0.72991365 1434 andrew gelman stats-2012-07-29-FindTheData.org
Introduction: I received the following (unsolicited) email: Hi Andrew, I work on the business development team of FindTheData.org, an unbiased comparison engine founded by Kevin O’Connor (founder and former CEO of DoubleClick) and backed by Kleiner Perkins with ~10M unique visitors per month. We are working with large online publishers including Golf Digest, Huffington Post, Under30CEO, and offer a variety of options to integrate our highly engaging content with your site. I believe our un-biased and reliable data resources would be of interest to you and your readers. I’d like to set up a quick call to discuss similar partnership ideas with you and would greatly appreciate 10 minutes of your time. Please suggest a couple times that work best for you or let me know if you would like me to send some more information before you make time for a call. Looking forward to hearing from you, Jonny – JONNY KINTZELE Business Development, FindThe Data mobile: 619-307-097
5 0.70236778 1909 andrew gelman stats-2013-06-21-Job openings at conservative political analytics firm!
Introduction: After posting that announcement about Civis Analytics, I wrote, “If a reconstituted Romney Analytics team is hiring, let me know and I’ll post that ad too.” Adam Schaeffer obliged : Not sure about Romney’s team, but Evolving Strategies is looking for sharp folks who lean right: Evolving Strategies is a political communications research firm specializing in randomized controlled experiments in the “lab” and in the “field.” ES is bringing a scientific revolution to free-market/conservative politics. We are looking for people who are obsessive about getting things right and creative in their work. A ideal candidate will have a deep understanding of the academic literature in their field, highly developed skills, a commitment to academic rigor, but an intuitive understanding of practical political concerns and objectives as well. We’re looking for new talent to help with our fast-growing portfolio in these areas: High-level data processing, statistical analysis and modelin
7 0.6477142 1061 andrew gelman stats-2011-12-16-CrossValidated: A place to post your statistics questions
8 0.64513135 1871 andrew gelman stats-2013-05-27-Annals of spam
9 0.64437407 880 andrew gelman stats-2011-08-30-Annals of spam
10 0.63967854 1923 andrew gelman stats-2013-07-03-Bayes pays!
12 0.63942993 199 andrew gelman stats-2010-08-11-Note to semi-spammers
13 0.63319319 412 andrew gelman stats-2010-11-13-Time to apply for the hackNY summer fellows program
14 0.63137448 1990 andrew gelman stats-2013-08-20-Job opening at an organization that promotes reproducible research!
15 0.62810773 1618 andrew gelman stats-2012-12-11-The consulting biz
16 0.62496442 2304 andrew gelman stats-2014-04-24-An open site for researchers to post and share papers
17 0.62376159 211 andrew gelman stats-2010-08-17-Deducer update
18 0.62374109 1872 andrew gelman stats-2013-05-27-More spam!
19 0.61735737 866 andrew gelman stats-2011-08-23-Participate in a research project on combining information for prediction
20 0.61576879 2309 andrew gelman stats-2014-04-28-Crowdstorming a dataset
topicId topicWeight
[(4, 0.015), (6, 0.027), (9, 0.015), (16, 0.082), (17, 0.025), (24, 0.142), (27, 0.028), (30, 0.014), (52, 0.249), (73, 0.013), (86, 0.024), (89, 0.05), (99, 0.197)]
simIndex simValue blogId blogTitle
1 0.89304078 485 andrew gelman stats-2010-12-25-Unlogging
Introduction: Catherine Bueker writes: I [Bueker] am analyzing the effect of various contextual factors on the voter turnout of naturalized Latino citizens. I have included the natural log of the number of Spanish Language ads run in each state during the election cycle to predict voter turnout. I now want to calculate the predicted probabilities of turnout for those in states with 0 ads, 500 ads, 1000 ads, etc. The problem is that I do not know how to handle the beta coefficient of the LN(Spanish language ads). Is there someway to “unlog” the coefficient? My reply: Calculate these probabilities for specific values of predictors, then graph the predictions of interest. Also, you can average over the other inputs in your model to get summaries. See this article with Pardoe for further discussion.
same-blog 2 0.88651466 223 andrew gelman stats-2010-08-21-Statoverflow
Introduction: Skirant Vadali writes: I am writing to seek your help in building a community driven Q&A; website tentatively called called ‘Statistics Analysis’. I am neither a founder of this website nor do I have any financial stake in its success. By way of background to this website, please see Stackoverflow (http://stackoverflow.com/) and Mathoverflow (http://mathoverflow.net/). Stackoverflow is a Q&A; website targeted at software developers and is designed to help them ask questions and get answers from other developers. Mathoverflow is a Q&A; website targeted at research mathematicians and is designed to help them ask and answer questions from other mathematicians across the world. The success of both these sites in helping their respective communities is a strong indicator that sites designed along these lines are very useful. The company that runs Stackoverflow (who also host Mathoverflow.net) has recently decided to develop other community driven websites for various other topic are
3 0.88179964 1246 andrew gelman stats-2012-04-04-Data visualization panel at the New York Public Library this evening!
Introduction: I’ll be participating in a panel (along with Kaiser Fung, Mark Hansen, Tahir Hemphill, and Manuel Lima), “What Makes Good Data Visualization?”, at the 42nd St. library this evening. The event is organized by Isabel Walcott Draves and is part of the Leaders in Software and Art series. This article with Antony Unwin should be relevant (although I won’t be “presenting”; I’ll be part of a panel and we’ll be having a wide-ranging conversation).
4 0.86721236 546 andrew gelman stats-2011-01-31-Infovis vs. statistical graphics: My talk tomorrow (Tues) 1pm at Columbia
Introduction: Infovis vs. statistical graphics . Tues 1 Feb 2011 1pm, Avery Hall room 114. It’s for the Lectures in Planning Series at the School of Architecture, Planning, and Preservation. Background on the talk (joint with Antony Unwin) is here . And here are more of my thoughts on statistical graphics.
5 0.86003101 1686 andrew gelman stats-2013-01-21-Finite-population Anova calculations for models with interactions
Introduction: Jim Thomson writes: I wonder if you could provide some clarification on the correct way to calculate the finite-population standard deviations for interaction terms in your Bayesian approach to ANOVA (as explained in your 2005 paper, and Gelman and Hill 2007). I understand that it is the SD of the constrained batch coefficients that is of interest, but in most WinBUGS examples I have seen, the SDs are all calculated directly as sd.fin<-sd(beta.main[]) for main effects and sd(beta.int[,]) for interaction effects, where beta.main and beta.int are the unconstrained coefficients, e.g. beta.int[i,j]~dnorm(0,tau). For main effects, I can see that it makes no difference, since the constrained value is calculated by subtracting the mean, and sd(B[]) = sd(B[]-mean(B[])). But the conventional sum-to-zero constraint for interaction terms in linear models is more complicated than subtracting the mean (there are only (n1-1)*(n2-1) free coefficients for an interaction b/w factors with n1 a
6 0.85580873 1256 andrew gelman stats-2012-04-10-Our data visualization panel at the New York Public Library
7 0.85416317 914 andrew gelman stats-2011-09-16-meta-infographic
9 0.82824045 1301 andrew gelman stats-2012-05-05-Related to z-statistics
10 0.80984277 1531 andrew gelman stats-2012-10-12-Elderpedia
11 0.79603481 889 andrew gelman stats-2011-09-04-The acupuncture paradox
12 0.79571462 1020 andrew gelman stats-2011-11-20-No no no no no
13 0.79377425 104 andrew gelman stats-2010-06-22-Seeking balance
14 0.7670213 948 andrew gelman stats-2011-10-10-Combining data from many sources
15 0.76478535 1369 andrew gelman stats-2012-06-06-Your conclusion is only as good as your data
16 0.7574439 786 andrew gelman stats-2011-07-04-Questions about quantum computing
18 0.72652948 107 andrew gelman stats-2010-06-24-PPS in Georgia
19 0.72266102 2096 andrew gelman stats-2013-11-10-Schiminovich is on The Simpsons
20 0.72088754 918 andrew gelman stats-2011-09-21-Avoiding boundary estimates in linear mixed models