andrew_gelman_stats andrew_gelman_stats-2013 andrew_gelman_stats-2013-1736 knowledge-graph by maker-knowledge-mining

1736 andrew gelman stats-2013-02-24-Rcpp class in Sat 9 Mar in NYC


meta infos for this blog

Source: html

Introduction: Join Dirk Eddelbuettel for six hours of detailed and hands-on instructions and discussions around Rcpp, RInside, RcppArmadillo, RcppGSL and other packages . . . Rcpp has become the most widely-used language extension for R. Currently deployed by 103 CRAN packages and a further 10 BioConductor packages, it permits users and developers to pass “whole R objects” with ease between R and C++ . . . Morning session: “A Hands-on Introduction to R and C++” . . . Afternoon session: “Advanced R and C++ Topics” . . .


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Join Dirk Eddelbuettel for six hours of detailed and hands-on instructions and discussions around Rcpp, RInside, RcppArmadillo, RcppGSL and other packages . [sent-1, score-1.131]

2 Rcpp has become the most widely-used language extension for R. [sent-4, score-0.366]

3 Currently deployed by 103 CRAN packages and a further 10 BioConductor packages, it permits users and developers to pass “whole R objects” with ease between R and C++ . [sent-5, score-1.24]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('rcpp', 0.432), ('packages', 0.425), ('session', 0.316), ('dirk', 0.229), ('cran', 0.207), ('ease', 0.2), ('permits', 0.194), ('afternoon', 0.166), ('developers', 0.166), ('extension', 0.164), ('instructions', 0.164), ('objects', 0.155), ('introduction', 0.15), ('join', 0.145), ('morning', 0.145), ('advanced', 0.137), ('detailed', 0.133), ('pass', 0.131), ('users', 0.124), ('six', 0.12), ('hours', 0.116), ('language', 0.109), ('discussions', 0.108), ('currently', 0.107), ('topics', 0.106), ('become', 0.093), ('whole', 0.084), ('around', 0.065)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0000001 1736 andrew gelman stats-2013-02-24-Rcpp class in Sat 9 Mar in NYC

Introduction: Join Dirk Eddelbuettel for six hours of detailed and hands-on instructions and discussions around Rcpp, RInside, RcppArmadillo, RcppGSL and other packages . . . Rcpp has become the most widely-used language extension for R. Currently deployed by 103 CRAN packages and a further 10 BioConductor packages, it permits users and developers to pass “whole R objects” with ease between R and C++ . . . Morning session: “A Hands-on Introduction to R and C++” . . . Afternoon session: “Advanced R and C++ Topics” . . .

2 0.16461527 1134 andrew gelman stats-2012-01-21-Lessons learned from a recent R package submission

Introduction: R has zillions of packages, and people are submitting new ones each day . The volunteers who keep R going are doing an incredibly useful service to the profession, and they’re busy . A colleague sends in some suugestions based on a recent experience with a package update: 1. Always use the R dev version to write a package. Not the current stable release. The R people use the R dev version to check your package anyway. If you don’t use the R dev version, there is chance that your package won’t pass the check. In my own experience, every time R has a major change, it tends to have new standards and find new errors in your package with these new standards. So better use the dev version to find out the potential errors in advance. 2. After submission, write an email to claim it. I used to submit the package to the CRAN without writing an email. This was standard operating procedure, but it has changed. Writing an email to claim about the submission is now a requir

3 0.14386663 705 andrew gelman stats-2011-05-10-Some interesting unpublished ideas on survey weighting

Introduction: A couple years ago we had an amazing all-star session at the Joint Statistical Meetings. The topic was new approaches to survey weighting (which is a mess , as I’m sure you’ve heard). Xiao-Li Meng recommended shrinking weights by taking them to a fractional power (such as square root) instead of trimming the extremes. Rod Little combined design-based and model-based survey inference. Michael Elliott used mixture models for complex survey design. And here’s my introduction to the session.

4 0.13288286 324 andrew gelman stats-2010-10-07-Contest for developing an R package recommendation system

Introduction: After I spoke tonight at the NYC R meetup, John Myles White and Drew Conway told me about this competition they’re administering for developing a recommendation system for R packages. They seem to have already done some work laying out the network of R packages–which packages refer to which others, and so forth. I just hope they set up their system so that my own packages (“R2WinBUGS”, “r2jags”, “arm”, and “mi”) get recommended automatically. I really hate to think that there are people out there running regressions in R and not using display() and coefplot() to look at the output. P.S. Ajay Shah asks what I mean by that last sentence. My quick answer is that it’s good to be able to visualize the coefficients and the uncertainty about them. The default options of print(), summary(), and plot() in R don’t do that: - print() doesn’t give enough information - summary() gives everything to a zillion decimal places and gives useless things like p-values - plot() gives a bunch

5 0.12308019 1655 andrew gelman stats-2013-01-05-The statistics software signal

Introduction: Tyler Cowen links to a post by Sean Taylor, who writes the following about users of R: You are willing to invest in learning something difficult. You do not care about aesthetics, only availability of packages and getting results quickly. To me, R is easy and Sas is difficult. I once worked with some students who were running Sas and the output was unreadable! Pages and pages of numbers that made no sense. When it comes to ease or difficulty of use, I think it depends on what you’re used to! And I really don’t understand the bit about aesthetics. What about this ? One reason I use R is to make pretty graphs. That said, if I’d never learned R, I’d just be making pretty graphs in Fortran or whatever. My guess is, the way I program, R is actually hindering rather than helping my ability to make attractive graphs. Half the time I’m scrambling around, writing custom code to get around R’s defaults.

6 0.10313597 1903 andrew gelman stats-2013-06-17-Weak identification provides partial information

7 0.088937208 259 andrew gelman stats-2010-09-06-Inbox zero. Really.

8 0.081450023 1009 andrew gelman stats-2011-11-14-Wickham R short course

9 0.07970117 123 andrew gelman stats-2010-07-01-Truth in headlines

10 0.077115849 2103 andrew gelman stats-2013-11-16-Objects of the class “Objects of the class”

11 0.071228139 535 andrew gelman stats-2011-01-24-Bleg: Automatic Differentiation for Log Prob Gradients?

12 0.06832809 830 andrew gelman stats-2011-07-29-Introductory overview lectures at the Joint Statistical Meetings in Miami this coming week

13 0.068309978 2178 andrew gelman stats-2014-01-20-Mailing List Degree-of-Difficulty Difficulty

14 0.067667529 919 andrew gelman stats-2011-09-21-Least surprising headline of the year

15 0.063497342 347 andrew gelman stats-2010-10-17-Getting arm and lme4 running on the Mac

16 0.057287797 272 andrew gelman stats-2010-09-13-Ross Ihaka to R: Drop Dead

17 0.0569258 2209 andrew gelman stats-2014-02-13-CmdStan, RStan, PyStan v2.2.0

18 0.056661796 1216 andrew gelman stats-2012-03-17-Modeling group-level predictors in a multilevel regression

19 0.054843344 2325 andrew gelman stats-2014-05-07-Stan users meetup next week

20 0.053968996 266 andrew gelman stats-2010-09-09-The future of R


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.03), (1, -0.009), (2, -0.016), (3, 0.019), (4, 0.035), (5, 0.034), (6, -0.003), (7, -0.03), (8, -0.015), (9, -0.018), (10, -0.023), (11, -0.016), (12, -0.003), (13, -0.018), (14, -0.001), (15, -0.008), (16, 0.008), (17, -0.014), (18, -0.008), (19, 0.003), (20, 0.003), (21, 0.014), (22, -0.021), (23, 0.016), (24, -0.029), (25, 0.024), (26, 0.006), (27, 0.022), (28, 0.021), (29, 0.007), (30, -0.001), (31, -0.025), (32, -0.002), (33, -0.008), (34, 0.01), (35, -0.034), (36, -0.017), (37, 0.001), (38, 0.022), (39, -0.01), (40, 0.011), (41, 0.002), (42, 0.0), (43, -0.015), (44, -0.006), (45, 0.005), (46, -0.043), (47, -0.012), (48, 0.014), (49, -0.02)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.9940666 1736 andrew gelman stats-2013-02-24-Rcpp class in Sat 9 Mar in NYC

Introduction: Join Dirk Eddelbuettel for six hours of detailed and hands-on instructions and discussions around Rcpp, RInside, RcppArmadillo, RcppGSL and other packages . . . Rcpp has become the most widely-used language extension for R. Currently deployed by 103 CRAN packages and a further 10 BioConductor packages, it permits users and developers to pass “whole R objects” with ease between R and C++ . . . Morning session: “A Hands-on Introduction to R and C++” . . . Afternoon session: “Advanced R and C++ Topics” . . .

2 0.64336419 1134 andrew gelman stats-2012-01-21-Lessons learned from a recent R package submission

Introduction: R has zillions of packages, and people are submitting new ones each day . The volunteers who keep R going are doing an incredibly useful service to the profession, and they’re busy . A colleague sends in some suugestions based on a recent experience with a package update: 1. Always use the R dev version to write a package. Not the current stable release. The R people use the R dev version to check your package anyway. If you don’t use the R dev version, there is chance that your package won’t pass the check. In my own experience, every time R has a major change, it tends to have new standards and find new errors in your package with these new standards. So better use the dev version to find out the potential errors in advance. 2. After submission, write an email to claim it. I used to submit the package to the CRAN without writing an email. This was standard operating procedure, but it has changed. Writing an email to claim about the submission is now a requir

3 0.62114066 1009 andrew gelman stats-2011-11-14-Wickham R short course

Introduction: Hadley writes: I [Hadley] am going to be teaching an R development master class in New York City on Dec 12-13. The basic idea of the class is to help you write better code, focused on the mantra of “do not repeat yourself”. In day one you will learn powerful new tools of abstraction, allowing you to solve a wider range of problems with fewer lines of code. Day two will teach you how to make packages, the fundamental unit of code distribution in R, allowing others to save time by allowing them to use your code. To get the most out of this course, you should have some experience programming in R already: you should be familiar with writing functions, and the basic data structures of R: vectors, matrices, arrays, lists and data frames. You will find the course particularly useful if you’re an experienced R user looking to take the next step, or if you’re moving to R from other programming languages and you want to quickly get up to speed with R’s unique features. A coupl

4 0.59958833 535 andrew gelman stats-2011-01-24-Bleg: Automatic Differentiation for Log Prob Gradients?

Introduction: We need help picking out an automatic differentiation package for Hamiltonian Monte Carlo sampling from the posterior of a generalized linear model with deep interactions. Specifically, we need to compute gradients for log probability functions with thousands of parameters that involve matrix (determinants, eigenvalues, inverses), stats (distributions), and math (log gamma) functions. Any suggestions? The Application: Hybrid Monte Carlo for Posteriors We’re getting serious about implementing posterior sampling using Hamiltonian Monte Carlo. HMC speeds up mixing by including gradient information to help guide the Metropolis proposals toward areas high probability. In practice, the algorithm requires a handful or of gradient calculations per sample, but there are many dimensions and the functions are hairy enough we don’t want to compute derivaties by hand. Auto Diff: Perhaps not What you Think It may not have been clear to readers of this blog that automatic diffe

5 0.59709781 2089 andrew gelman stats-2013-11-04-Shlemiel the Software Developer and Unknown Unknowns

Introduction: The Stan meeting today reminded me of Joel Spolsky’s recasting of the Yiddish joke about Shlemiel the Painter. Joel retold it on his blog, Joel on Software , in the post Back to Basics : Shlemiel gets a job as a street painter, painting the dotted lines down the middle of the road. On the first day he takes a can of paint out to the road and finishes 300 yards of the road. “That’s pretty good!” says his boss, “you’re a fast worker!” and pays him a kopeck. The next day Shlemiel only gets 150 yards done. “Well, that’s not nearly as good as yesterday, but you’re still a fast worker. 150 yards is respectable,” and pays him a kopeck. The next day Shlemiel paints 30 yards of the road. “Only 30!” shouts his boss. “That’s unacceptable! On the first day you did ten times that much work! What’s going on?” “I can’t help it,” says Shlemiel. “Every day I get farther and farther away from the paint can!” Joel used it as an example of the kind of string processing naive programmers ar

6 0.56881642 266 andrew gelman stats-2010-09-09-The future of R

7 0.53957283 1753 andrew gelman stats-2013-03-06-Stan 1.2.0 and RStan 1.2.0

8 0.53869963 1716 andrew gelman stats-2013-02-09-iPython Notebook

9 0.5332247 2011 andrew gelman stats-2013-09-07-Here’s what happened when I finished my PhD thesis

10 0.53204286 347 andrew gelman stats-2010-10-17-Getting arm and lme4 running on the Mac

11 0.53160918 1036 andrew gelman stats-2011-11-30-Stan uses Nuts!

12 0.51742429 1339 andrew gelman stats-2012-05-23-Learning Differential Geometry for Hamiltonian Monte Carlo

13 0.51245153 1799 andrew gelman stats-2013-04-12-Stan 1.3.0 and RStan 1.3.0 Ready for Action

14 0.50505823 1296 andrew gelman stats-2012-05-03-Google Translate for code, and an R help-list bot

15 0.48553553 1655 andrew gelman stats-2013-01-05-The statistics software signal

16 0.48472309 597 andrew gelman stats-2011-03-02-RStudio – new cross-platform IDE for R

17 0.48114532 354 andrew gelman stats-2010-10-19-There’s only one Amtrak

18 0.47340873 1710 andrew gelman stats-2013-02-06-The new Stan 1.1.1, featuring Gaussian processes!

19 0.46948382 2052 andrew gelman stats-2013-10-05-Give me a ticket for an aeroplane

20 0.46704587 667 andrew gelman stats-2011-04-19-Free $5 gift certificate!


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(0, 0.341), (24, 0.086), (43, 0.031), (73, 0.053), (86, 0.096), (90, 0.066), (95, 0.023), (96, 0.054), (99, 0.091)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.92313176 1736 andrew gelman stats-2013-02-24-Rcpp class in Sat 9 Mar in NYC

Introduction: Join Dirk Eddelbuettel for six hours of detailed and hands-on instructions and discussions around Rcpp, RInside, RcppArmadillo, RcppGSL and other packages . . . Rcpp has become the most widely-used language extension for R. Currently deployed by 103 CRAN packages and a further 10 BioConductor packages, it permits users and developers to pass “whole R objects” with ease between R and C++ . . . Morning session: “A Hands-on Introduction to R and C++” . . . Afternoon session: “Advanced R and C++ Topics” . . .

2 0.78090453 565 andrew gelman stats-2011-02-09-Dennis the dentist, debunked?

Introduction: Devah Pager points me to this article by Uri Simonsohn, which begins: Three articles published [by Brett Pelham et al.] have shown that a disproportionate share of people choose spouses, places to live, and occupations with names similar to their own. These findings, interpreted as evidence of implicit egotism, are included in most modern social psychology textbooks and many university courses. The current article successfully replicates the original findings but shows that they are most likely caused by a combination of cohort, geographic, and ethnic confounds as well as reverse causality. From Simonsohn’s article, here’s a handy summary of the claims and the evidence (click on it to enlarge): The Pelham et al. articles have come up several times on the blog, starting with this discussion and this estimate and then more recently here . I’m curious what Pelham and his collaborators think of Simonsohn’s claims.

3 0.75815213 1765 andrew gelman stats-2013-03-16-Recently in the sister blog

Introduction: 1. New Italian production of Life on Mars . 2. Psychological essentialism in everyday thought .

4 0.65604186 2166 andrew gelman stats-2014-01-10-3 years out of date on the whole Dennis the dentist thing!

Introduction: Paging Uri Simonsohn . . . January 2014: Alice Robb writes , completely uncritically: “If Your Name is Dennis, You’re More Likely to Become a Dentist The strange science of how names shape careers.” But look what you can learn from a quick google: Hmmmm, maybe worth following up on that second link . . . More details here , from 2011: Devah Pager points me to this article by Uri Simonsohn, which begins: Three articles published [by Brett Pelham et al.] have shown that a disproportionate share of people choose spouses, places to live, and occupations with names similar to their own. These findings, interpreted as evidence of implicit egotism, are included in most modern social psychology textbooks and many university courses. The current article successfully replicates the original findings but shows that they are most likely caused by a combination of cohort, geographic, and ethnic confounds as well as reverse causality. From Simonsohn’s article, here’s a han

5 0.61151397 1540 andrew gelman stats-2012-10-18-“Intrade to the 57th power”

Introduction: David Pennock writes: http://PredictWiseQ.com is our (beta) prediction contest which aims to estimate not just the marginal probabilities of election outcomes this November, but millions of correlations among outcomes as well, like the chance Obama will win both Ohio and Florida, or the chance Romney will win if the September jobs numbers are negative. It’s a working example of a combinatorial prediction market design we published this summer in the conference ACM EC’12. And here’s Pennock’s blog, which supplies more background.

6 0.60724831 1068 andrew gelman stats-2011-12-18-Faculty who don’t like teaching and hate working with students

7 0.56981599 1828 andrew gelman stats-2013-04-27-Time-Sharing Experiments for the Social Sciences

8 0.52723539 365 andrew gelman stats-2010-10-24-Erving Goffman archives

9 0.5229671 381 andrew gelman stats-2010-10-30-Sorry, Senator DeMint: Most Americans Don’t Want to Ban Gays from the Classroom

10 0.5209859 4 andrew gelman stats-2010-04-26-Prolefeed

11 0.51609701 40 andrew gelman stats-2010-05-18-What visualization is best?

12 0.47093982 1075 andrew gelman stats-2011-12-20-This guy has a regular column at Reuters

13 0.46915972 1781 andrew gelman stats-2013-03-29-Another Feller theory

14 0.46366745 557 andrew gelman stats-2011-02-05-Call for book proposals

15 0.4426204 1177 andrew gelman stats-2012-02-20-Joshua Clover update

16 0.42480171 862 andrew gelman stats-2011-08-20-An illustrated calculus textbook

17 0.41222841 2190 andrew gelman stats-2014-01-29-Stupid R Tricks: Random Scope

18 0.40352559 436 andrew gelman stats-2010-11-29-Quality control problems at the New York Times

19 0.39974838 1416 andrew gelman stats-2012-07-14-Ripping off a ripoff

20 0.3982091 1755 andrew gelman stats-2013-03-09-Plaig