andrew_gelman_stats andrew_gelman_stats-2010 andrew_gelman_stats-2010-211 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: A year ago we blogged about Ian Fellows’s R Gui called Deducer (oops, my bad, I meant to link to this ). Fellows sends in this update: Since version 0.1, I [Fellows] have added: 1. A nice plug-in interface, so that people can extend Deducer’s capability without leaving the comfort of R. (see: http://www.deducer.org/pmwiki/pmwiki.php?n=Main.Development ) 2. Several new dialogs. 3. A one-step installer for windows. 4. A plug-in package (DeducerExtras) which extends the scope of analyses covered. 5. A plotting GUI that can create anything from simple histograms to complex custom graphics. Deducer is designed to be a free easy to use alternative to proprietary data analysis software such as SPSS, JMP, and Minitab. It has a menu system to do common data manipulation and analysis tasks, and an excel-like spreadsheet in which to view and edit data frames. The goal of the project is two fold. Provide an intuitive interface so that non-technical users can learn and p
sentIndex sentText sentNum sentScore
1 A year ago we blogged about Ian Fellows’s R Gui called Deducer (oops, my bad, I meant to link to this ). [sent-1, score-0.144]
2 Fellows sends in this update: Since version 0. [sent-2, score-0.063]
3 A nice plug-in interface, so that people can extend Deducer’s capability without leaving the comfort of R. [sent-4, score-0.496]
4 A plug-in package (DeducerExtras) which extends the scope of analyses covered. [sent-15, score-0.355]
5 A plotting GUI that can create anything from simple histograms to complex custom graphics. [sent-17, score-0.358]
6 Deducer is designed to be a free easy to use alternative to proprietary data analysis software such as SPSS, JMP, and Minitab. [sent-18, score-0.305]
7 It has a menu system to do common data manipulation and analysis tasks, and an excel-like spreadsheet in which to view and edit data frames. [sent-19, score-0.484]
8 Provide an intuitive interface so that non-technical users can learn and perform analyses without programming getting in their way. [sent-21, score-0.702]
9 Increase the efficiency of expert R users when performing common tasks by replacing hundreds of keystrokes with a few mouse clicks. [sent-22, score-0.886]
10 Deducer is designed to be used with the Java based R console JGR, though it supports a number of other R environments (e. [sent-24, score-0.424]
wordName wordTfidf (topN-words)
[('deducer', 0.535), ('gui', 0.318), ('fellows', 0.301), ('interface', 0.185), ('tasks', 0.177), ('designed', 0.143), ('users', 0.131), ('oops', 0.122), ('installer', 0.122), ('java', 0.122), ('menu', 0.122), ('mouse', 0.122), ('console', 0.11), ('capability', 0.11), ('custom', 0.106), ('histograms', 0.106), ('spss', 0.106), ('analyses', 0.105), ('proprietary', 0.103), ('extends', 0.1), ('comfort', 0.096), ('edit', 0.096), ('common', 0.095), ('ian', 0.092), ('environments', 0.088), ('plotting', 0.087), ('replacing', 0.087), ('intuitive', 0.087), ('scope', 0.086), ('extend', 0.086), ('spreadsheet', 0.086), ('windows', 0.085), ('manipulation', 0.085), ('leaving', 0.083), ('supports', 0.083), ('blogged', 0.081), ('efficiency', 0.074), ('performing', 0.073), ('hundreds', 0.069), ('programming', 0.067), ('update', 0.066), ('package', 0.064), ('without', 0.064), ('perform', 0.063), ('sends', 0.063), ('meant', 0.063), ('create', 0.059), ('software', 0.059), ('expert', 0.058), ('nice', 0.057)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 211 andrew gelman stats-2010-08-17-Deducer update
Introduction: A year ago we blogged about Ian Fellows’s R Gui called Deducer (oops, my bad, I meant to link to this ). Fellows sends in this update: Since version 0.1, I [Fellows] have added: 1. A nice plug-in interface, so that people can extend Deducer’s capability without leaving the comfort of R. (see: http://www.deducer.org/pmwiki/pmwiki.php?n=Main.Development ) 2. Several new dialogs. 3. A one-step installer for windows. 4. A plug-in package (DeducerExtras) which extends the scope of analyses covered. 5. A plotting GUI that can create anything from simple histograms to complex custom graphics. Deducer is designed to be a free easy to use alternative to proprietary data analysis software such as SPSS, JMP, and Minitab. It has a menu system to do common data manipulation and analysis tasks, and an excel-like spreadsheet in which to view and edit data frames. The goal of the project is two fold. Provide an intuitive interface so that non-technical users can learn and p
2 0.13920106 1777 andrew gelman stats-2013-03-26-Data Science for Social Good summer fellowship program
Introduction: Juan-Pablo Velez writes: I’m helping with a Data Science for Social Good summer fellowship program at the University of Chicago. The goal is to train data scientists that can tackle problems in education, healthcare, energy, transportation, and more. Working with full-time mentors from academia, industry, and the Obama campaign , fellows will build high-impact analytics projects using statistics, machine learning, data mining, and big data technologies. For fellows, we’re looking for grad students, advanced undergrads, and professionals in computer science, machine learning, statistics, and the computational and quantitative sciences. For mentors, we’re looking for folks with practical data science experience. Fellows and mentors will be paid competitively and housed in Chicago for duration of the program, from early June to late August. Rayid Ghani , former Chief Scientist of the Obama 2012 campaign, is leading the program. Eric Sch
3 0.13436493 894 andrew gelman stats-2011-09-07-Hipmunk FAIL: Graphics without content is not enough
Introduction: I love a good GUI but not if it doesn’t give me the information I need. I again tried Hipmunk and it again failed (this time for a trip to Baltimore where it gave only a useless subset of the available Amtrak trains). I don’t know anything about the internet biz. What I’m guessing is that they set up this cool website that is pretty much functional, with the goal of selling it for a few million dollars to Travelocity or Expedia or Kayak. What I’m wondering is, why haven’t they made the deal already? Hipmunk’s GUI is great. The site is useless because it’s missing so many flights, but if you put it in an actual travel site such as Expedia, it would be great. It’s enough to make me want to hit someone with an i-phone . . .
4 0.10041074 412 andrew gelman stats-2010-11-13-Time to apply for the hackNY summer fellows program
Introduction: Chris Wiggins writes of an interesting-looking summer program that undergraduate or graduate students can apply to: The hackNY Fellows program is an initiative to mentor the next generation of technology innovators in New York, focusing on tech startups. Last summer’s class of fellows was paired with NYC startups which demonstrated they could provide a mentoring environment (a clear project, a person who could work with the Fellow, and sufficient stability to commit to 10 weeks of compensation for the Fellow). hackNY, with the support of the Kauffman foundation and the Internet Society of New York, provided shared housing in NYU dorms in Union Square, and organized a series of pedagogical lectures. hackNY was founded by Hilary Mason, chief scientist at bit.ly, Evan Korth, professor of CS at NYU, and Chris Wiggins, professor of applied mathematics at Columbia. Each of us has spent thousands of student-hours teaching and mentoring, and is committed to help build a strong communi
5 0.082231522 2069 andrew gelman stats-2013-10-19-R package for effect size calculations for psychology researchers
Introduction: Dan Gerlanc writes: I read your post the other day [now the other month, as our blog is on a bit of a delay] on helping psychologists do research and thought you might be interested in our R package, “bootES”, for robust effect size calculation and confidence interval estimation using resampling techniques. The package provides one function, ‘bootES’, that makes a variety of effect size calculations fairly straightforward for researchers with limited programming experience. The majority of the implemented are not available in R or SPSS without custom coding. Kris Kirby (Williams College) and I have published a paper in Behavioral Research Methods describing the methods and providing a tutorial on use of the package: http://bit.ly/YIM6VD. We hope that it’s useful to psychologists and other social science researchers! I haven’t tried this out but it might be of interest for some of you.
6 0.073601931 2124 andrew gelman stats-2013-12-05-Stan (quietly) passes 512 people on the users list
7 0.07314907 1948 andrew gelman stats-2013-07-21-Bayes related
8 0.072473943 537 andrew gelman stats-2011-01-25-Postdoc Position #1: Missing-Data Imputation, Diagnostics, and Applications
9 0.068511046 290 andrew gelman stats-2010-09-22-Data Thief
10 0.067199722 1134 andrew gelman stats-2012-01-21-Lessons learned from a recent R package submission
11 0.066684246 1296 andrew gelman stats-2012-05-03-Google Translate for code, and an R help-list bot
12 0.065428138 597 andrew gelman stats-2011-03-02-RStudio – new cross-platform IDE for R
13 0.0626541 2345 andrew gelman stats-2014-05-24-An interesting mosaic of a data programming course
14 0.059482973 220 andrew gelman stats-2010-08-20-Why I blog?
15 0.057014618 1175 andrew gelman stats-2012-02-19-Factual – a new place to find data
16 0.056397136 1605 andrew gelman stats-2012-12-04-Write This Book
17 0.054496758 223 andrew gelman stats-2010-08-21-Statoverflow
18 0.054346375 1920 andrew gelman stats-2013-06-30-“Non-statistical” statistics tools
19 0.053907055 2190 andrew gelman stats-2014-01-29-Stupid R Tricks: Random Scope
20 0.052578114 2203 andrew gelman stats-2014-02-08-“Guys who do more housework get less sex”
topicId topicWeight
[(0, 0.088), (1, -0.001), (2, -0.023), (3, 0.014), (4, 0.054), (5, 0.028), (6, -0.02), (7, -0.033), (8, -0.015), (9, -0.007), (10, -0.018), (11, -0.023), (12, 0.005), (13, -0.021), (14, -0.014), (15, 0.045), (16, 0.024), (17, -0.017), (18, -0.0), (19, -0.007), (20, 0.004), (21, 0.037), (22, -0.001), (23, -0.003), (24, -0.025), (25, -0.001), (26, 0.027), (27, -0.003), (28, 0.042), (29, -0.015), (30, -0.006), (31, -0.028), (32, 0.072), (33, -0.016), (34, -0.025), (35, 0.039), (36, -0.029), (37, -0.008), (38, 0.015), (39, 0.006), (40, -0.011), (41, -0.028), (42, -0.003), (43, 0.003), (44, 0.007), (45, 0.007), (46, -0.032), (47, -0.018), (48, 0.03), (49, 0.014)]
simIndex simValue blogId blogTitle
same-blog 1 0.92414582 211 andrew gelman stats-2010-08-17-Deducer update
Introduction: A year ago we blogged about Ian Fellows’s R Gui called Deducer (oops, my bad, I meant to link to this ). Fellows sends in this update: Since version 0.1, I [Fellows] have added: 1. A nice plug-in interface, so that people can extend Deducer’s capability without leaving the comfort of R. (see: http://www.deducer.org/pmwiki/pmwiki.php?n=Main.Development ) 2. Several new dialogs. 3. A one-step installer for windows. 4. A plug-in package (DeducerExtras) which extends the scope of analyses covered. 5. A plotting GUI that can create anything from simple histograms to complex custom graphics. Deducer is designed to be a free easy to use alternative to proprietary data analysis software such as SPSS, JMP, and Minitab. It has a menu system to do common data manipulation and analysis tasks, and an excel-like spreadsheet in which to view and edit data frames. The goal of the project is two fold. Provide an intuitive interface so that non-technical users can learn and p
2 0.69073802 1907 andrew gelman stats-2013-06-20-Amazing retro gnu graphics!
Introduction: Bill Harris writes: Speaking of strange graphics, http://makingsense.facilitatedsystems.com/2007/03/making-musical-sense-by-email-part-2.html shows an example of text (gnuplot’s dumb terminal) graphics of data from MCSim (code and other material available from http://makingsense.facilitatedsystems.com/2007/03/making-musical-sense-by-email-table-of.html). At another extreme, slide 20 of https://docs.google.com/viewer?a=v&pid;=sites&srcid;=ZGVmYXVsdGRvbWFpbnx3c2hhcnJpczEzfGd4OjZkNGFjZWZhOTAyYTFkMDg shows a stereogram of more MCSim output (I was a bit more naive back then). I included the stereogram as a bit of humor just to show what could be done with J graphics. Surprisingly, one person in the audience focused intently on that slide and, after a moment, said “Got it!” We spoke afterwards, and it turned out that he was on the board or at least a volunteer at the Portland (OR) 3D Center of Art & Photography (http://www.3dcenterusa.com/index.html). Regarding mcsim, the
3 0.68591827 1175 andrew gelman stats-2012-02-19-Factual – a new place to find data
Introduction: Factual collects data on a variety of topics, organizes them, and allows easy access. If you ever wanted to do a histogram of calorie content in Starbucks coffees or plot warnings with a live feed of earthquake data – your life should be a bit simpler now. Also see DataMarket , InfoChimps , and a few older links in The Future of Data Analysis . If you access the data through the API, you can build live visualizations like this: Of course, you could just go to the source. Roy Mendelssohn writes (with minor edits): Since you are both interested in data access, please look at our service ERDDAP: http://coastwatch.pfel.noaa.gov/erddap/index.html http://upwell.pfeg.noaa.gov/erddap/index.html Please do not be fooled by the web pages. Everything is a service (including search and graphics) and the URL completely defines the request, and response formats are easily changed just by changing the “file extension”. The web pages are just html and javascript that u
4 0.66448736 714 andrew gelman stats-2011-05-16-NYT Labs releases Openpaths, a utility for saving your iphone data
Introduction: Jake Porway writes: We launched Openpaths the other week. It’s a site where people can privately upload and view their iPhone location data (at least until an Apple update wipes it out) and also download their data for their own use. More than just giving people a neat tool to view their data with, however, we’re also creating an option for them to donate their data to research projects at varying levels of anonymity. We’re still working out the terms for that, but we’d love any input and to get in touch with anyone who might want to use the data. I don’t have any use for this personally but maybe it will interest some of you. From the webpage: Openpaths is an anonymous, user-contributed database for the personal location data files recorded by iOS devices. Users securely store, explore, and manage their personal location data, and grant researchers access to portions of that data as they choose. All location data stored in openpaths is kept separate from user profi
5 0.66350341 802 andrew gelman stats-2011-07-13-Super Sam Fuld Needs Your Help (with Foul Ball stats)
Introduction: I was pleasantly surprised to have my recreational reading about baseball in the New Yorker interrupted by a digression on statistics. Sam Fuld of the Tampa Bay Rays, was the subjet of a Ben McGrath profile in the 4 July 2011 issue of the New Yorker , in an article titled Super Sam . After quoting a minor-league trainer who described Fuld as “a bit of a geek” (who isn’t these days?), McGrath gets into that lovely New Yorker detail: One could have pointed out the more persuasive and telling examples, such as the fact that in 2005, after his first pro season, with the Class-A Peoria Chiefs, Fuld applied for a fall internship with Stats, Inc., the research firm that supplies broadcasters with much of the data anad analysis that you hear in sports telecasts. After a description of what they had him doing, reviewing footage of games and cataloguing, he said “I thought, They have a stat for everything, but they don’t have any stats regarding foul balls.” Fuld’s
6 0.65740311 275 andrew gelman stats-2010-09-14-Data visualization at the American Evaluation Association
7 0.64151824 1923 andrew gelman stats-2013-07-03-Bayes pays!
8 0.63086933 1853 andrew gelman stats-2013-05-12-OpenData Latinoamerica
9 0.62514901 1990 andrew gelman stats-2013-08-20-Job opening at an organization that promotes reproducible research!
10 0.62412006 951 andrew gelman stats-2011-10-11-Data mining efforts for Obama’s campaign
11 0.61657351 223 andrew gelman stats-2010-08-21-Statoverflow
12 0.61580604 597 andrew gelman stats-2011-03-02-RStudio – new cross-platform IDE for R
13 0.61569715 1014 andrew gelman stats-2011-11-16-Visualizations of NYPD stop-and-frisk data
14 0.61270344 1434 andrew gelman stats-2012-07-29-FindTheData.org
15 0.60816067 59 andrew gelman stats-2010-05-30-Extended Binary Format Support for Mac OS X
16 0.60654503 2016 andrew gelman stats-2013-09-11-Zipfian Academy, A School for Data Science
17 0.60247213 422 andrew gelman stats-2010-11-20-A Gapminder-like data visualization package
18 0.59505486 1689 andrew gelman stats-2013-01-23-MLB Hall of Fame Voting Trajectories
19 0.59386933 290 andrew gelman stats-2010-09-22-Data Thief
20 0.59182495 907 andrew gelman stats-2011-09-14-Reproducibility in Practice
topicId topicWeight
[(0, 0.012), (11, 0.018), (12, 0.329), (15, 0.029), (16, 0.036), (24, 0.137), (36, 0.013), (45, 0.049), (73, 0.017), (82, 0.024), (86, 0.041), (90, 0.014), (99, 0.167)]
simIndex simValue blogId blogTitle
1 0.86182094 677 andrew gelman stats-2011-04-24-My NOAA story
Introduction: I recently learned we have some readers at the National Oceanic and Atmospheric Administration so I thought I’d share an old story. About 35 years ago my brother worked briefly as a clerk at NOAA in their D.C. (or maybe it was D.C.-area) office. His job was to enter the weather numbers that came in. He had a boss who was very orderly. At one point there was a hurricane that wiped out some weather station in the Caribbean, and his boss told him to put in the numbers anyway. My brother protested that they didn’t have the data, to which his boss replied: “I know what the numbers are.” Nowadays we call this sort of thing “imputation” and we like it. But not in the raw data! I bet nowadays they have an NA code.
same-blog 2 0.81360328 211 andrew gelman stats-2010-08-17-Deducer update
Introduction: A year ago we blogged about Ian Fellows’s R Gui called Deducer (oops, my bad, I meant to link to this ). Fellows sends in this update: Since version 0.1, I [Fellows] have added: 1. A nice plug-in interface, so that people can extend Deducer’s capability without leaving the comfort of R. (see: http://www.deducer.org/pmwiki/pmwiki.php?n=Main.Development ) 2. Several new dialogs. 3. A one-step installer for windows. 4. A plug-in package (DeducerExtras) which extends the scope of analyses covered. 5. A plotting GUI that can create anything from simple histograms to complex custom graphics. Deducer is designed to be a free easy to use alternative to proprietary data analysis software such as SPSS, JMP, and Minitab. It has a menu system to do common data manipulation and analysis tasks, and an excel-like spreadsheet in which to view and edit data frames. The goal of the project is two fold. Provide an intuitive interface so that non-technical users can learn and p
3 0.73928547 1119 andrew gelman stats-2012-01-15-Excellence in Statistical Reporting Award
Introduction: The American Statistical Association is seeking nominations for its annual Excellence in Statistical Reporting Award . The award was created in 2004 to encourage and recognize members of the communications media who have best displayed an informed interest in the science of statistics and its role in public life. The award can be given for a single statistical article or for a body of work. Former winners of the award include: Felix Salmon , financial blogger, 2010; Sharon Begley , Newsweek, 2009; Mark Buchanan, New York Times, 2008; John Berry, Bloomberg News, 2005; and Gina Kolata, New York Times, 2004. If anyone has any suggestions for the 2012 award, feel free to post in the comments or email me.
4 0.69196594 189 andrew gelman stats-2010-08-06-Proposal for a moratorium on the use of the words “fashionable” and “trendy”
Introduction: Tyler Cowen links to an interesting article by Terry Teachout on David Mamet’s political conservatism. I don’t think of playwrights as gurus, but I do find it interesting to consider the political orientations of authors and celebrities . I have only one problem with Teachout’s thought-provoking article. He writes: As early as 2002 . . . Arguing that “the Western press [had] embraced antisemitism as the new black,” Mamet drew a sharp contrast between that trendy distaste for Jews and the harsh realities of daily life in Israel . . . In 2006, Mamet published a collection of essays called The Wicked Son: Anti-Semitism, Jewish Self-Hatred and the Jews that made the point even more bluntly. “The Jewish State,” he wrote, “has offered the Arab world peace since 1948; it has received war, and slaughter, and the rhetoric of annihilation.” He went on to argue that secularized Jews who “reject their birthright of ‘connection to the Divine’” succumb in time to a self-hatred tha
5 0.66350621 434 andrew gelman stats-2010-11-28-When Small Numbers Lead to Big Errors
Introduction: My column in Scientific American . Check out the comments. I have to remember never ever to write about guns.
6 0.65582883 1871 andrew gelman stats-2013-05-27-Annals of spam
7 0.65278292 1858 andrew gelman stats-2013-05-15-Reputations changeable, situations tolerable
8 0.65252006 372 andrew gelman stats-2010-10-27-A use for tables (really)
9 0.63987732 1282 andrew gelman stats-2012-04-26-Bad news about (some) statisticians
10 0.6234321 1660 andrew gelman stats-2013-01-08-Bayesian, Permutable Symmetries
11 0.60466248 1597 andrew gelman stats-2012-11-29-What is expected of a consultant
12 0.60290694 840 andrew gelman stats-2011-08-05-An example of Bayesian model averaging
13 0.60247725 1777 andrew gelman stats-2013-03-26-Data Science for Social Good summer fellowship program
14 0.59839684 239 andrew gelman stats-2010-08-28-The mathematics of democracy
15 0.59053051 752 andrew gelman stats-2011-06-08-Traffic Prediction
16 0.57161474 244 andrew gelman stats-2010-08-30-Useful models, model checking, and external validation: a mini-discussion
17 0.56270468 1742 andrew gelman stats-2013-02-27-What is “explanation”?
18 0.56073046 1348 andrew gelman stats-2012-05-27-Question 17 of my final exam for Design and Analysis of Sample Surveys
19 0.56050676 1266 andrew gelman stats-2012-04-16-Another day, another plagiarist
20 0.55930829 2010 andrew gelman stats-2013-09-06-Would today’s captains of industry be happier in a 1950s-style world?