andrew_gelman_stats andrew_gelman_stats-2013 andrew_gelman_stats-2013-1716 knowledge-graph by maker-knowledge-mining

1716 andrew gelman stats-2013-02-09-iPython Notebook


meta infos for this blog

Source: html

Introduction: Burak Bayramli writes: I wanted to inform you on iPython Notebook technology – allowing markup, Python code to reside in one document. Someone ported one of your examples from ARM . iPynb file is actually a live document, can be downloaded and reran locally, hence change of code on document means change of images, results. Graphs (as well as text output) which are generated by the code, are placed inside the document automatically. No more referencing image files seperately. For now running notebooks locally require a notebook server, but that part can live “on the cloud” as part of an educational software. Viewers, such as nbviewer.ipython.org, do not even need that much, since all recent results of a notebook are embedded in the notebook itself. A lot of people are excited about this; Also out of nowhere, Alfred P. Sloan Foundation dropped a $1.15 million grant on the developers of ipython which provided some extra energy on the project. Cool. We’ll have to do that ex


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Burak Bayramli writes: I wanted to inform you on iPython Notebook technology – allowing markup, Python code to reside in one document. [sent-1, score-0.445]

2 iPynb file is actually a live document, can be downloaded and reran locally, hence change of code on document means change of images, results. [sent-3, score-0.944]

3 Graphs (as well as text output) which are generated by the code, are placed inside the document automatically. [sent-4, score-0.588]

4 For now running notebooks locally require a notebook server, but that part can live “on the cloud” as part of an educational software. [sent-6, score-1.296]

5 org, do not even need that much, since all recent results of a notebook are embedded in the notebook itself. [sent-9, score-1.092]

6 A lot of people are excited about this; Also out of nowhere, Alfred P. [sent-10, score-0.093]

7 15 million grant on the developers of ipython which provided some extra energy on the project. [sent-12, score-0.701]

8 It shouldn’t be too hard if we just discretize the arsenic and distance variables into something like 50 categories each. [sent-15, score-0.423]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('notebook', 0.49), ('ipython', 0.298), ('document', 0.256), ('locally', 0.245), ('code', 0.178), ('markup', 0.136), ('notebooks', 0.136), ('ported', 0.136), ('bayramli', 0.136), ('burak', 0.136), ('arsenic', 0.136), ('referencing', 0.136), ('discretize', 0.128), ('live', 0.127), ('cloud', 0.123), ('server', 0.123), ('downloaded', 0.112), ('embedded', 0.112), ('alfred', 0.109), ('python', 0.107), ('nowhere', 0.105), ('viewers', 0.103), ('inform', 0.101), ('developers', 0.098), ('dropped', 0.096), ('change', 0.095), ('files', 0.094), ('excited', 0.093), ('generated', 0.092), ('placed', 0.089), ('images', 0.086), ('grant', 0.085), ('allowing', 0.085), ('output', 0.085), ('gaussian', 0.083), ('inside', 0.081), ('file', 0.081), ('technology', 0.081), ('educational', 0.081), ('distance', 0.081), ('arm', 0.079), ('categories', 0.078), ('foundation', 0.077), ('image', 0.077), ('energy', 0.076), ('extra', 0.075), ('part', 0.074), ('text', 0.07), ('require', 0.069), ('provided', 0.069)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1716 andrew gelman stats-2013-02-09-iPython Notebook

Introduction: Burak Bayramli writes: I wanted to inform you on iPython Notebook technology – allowing markup, Python code to reside in one document. Someone ported one of your examples from ARM . iPynb file is actually a live document, can be downloaded and reran locally, hence change of code on document means change of images, results. Graphs (as well as text output) which are generated by the code, are placed inside the document automatically. No more referencing image files seperately. For now running notebooks locally require a notebook server, but that part can live “on the cloud” as part of an educational software. Viewers, such as nbviewer.ipython.org, do not even need that much, since all recent results of a notebook are embedded in the notebook itself. A lot of people are excited about this; Also out of nowhere, Alfred P. Sloan Foundation dropped a $1.15 million grant on the developers of ipython which provided some extra energy on the project. Cool. We’ll have to do that ex

2 0.32948834 1235 andrew gelman stats-2012-03-29-I’m looking for a quadrille notebook with faint lines

Introduction: I want a graph-paper-style notebook, ideally something lightweight—I’m looking to make notes, not art drawings—and not too large. I’m currently using a 17 x 22 cm notebook, which is a fine size. It also has pretty small squares, which I like. My problem with the notebook I have now is that the ink is too heavy—that is, the lines are too dark. I want very faint lines, just visible enough to be used as guides but not so heavy that to be overwhelming. The notebooks I see in the stores all have pretty dark lines. Any suggestions?

3 0.13304022 2291 andrew gelman stats-2014-04-14-Transitioning to Stan

Introduction: Kevin Cartier writes: I’ve been happily using R for a number of years now and recently came across Stan. Looks big and powerful, so I’d like to pick an appropriate project and try it out. I wondered if you could point me to a link or document that goes into the motivation for this tool (aside from the Stan user doc)? What I’d like to understand is, at what point might you look at an emergent R project and advise, “You know, that thing you’re trying to do would be a whole lot easier/simpler/more straightforward to implement with Stan.” (or words to that effect). My reply: For my collaborators in political science, Stan has been most useful for models where the data set is not huge (e.g., we might have 10,000 data points or 50,000 data points but not 10 million) but where the model is somewhat complex (for example, a model with latent time series structure). The point is that the model has enough parameters and uncertainty that you’ll want to do full Bayes (rather than some sort

4 0.12407445 2229 andrew gelman stats-2014-02-28-God-leaf-tree

Introduction: Govind Manian writes: I wanted to pass along a fragment from Lichtenberg’s Waste Books — which I am finding to be great stone soup — that reminded me of God is in Every Leaf : To the wise man nothing is great and nothing small…I believe he could write treatises on keyholes that sounded as weighty as a jus naturae and would be just as instructive. As the few adepts in such things well know, universal morality is to be found in little everyday penny-events just as much as in great ones. There is so much goodness and ingenuity in a raindrop that an apothecary wouldn’t let it go for less than half-a-crown… (Notebook B, 33)

5 0.1079682 41 andrew gelman stats-2010-05-19-Updated R code and data for ARM

Introduction: Patricia and I have cleaned up some of the R and Bugs code and collected the data for almost all the examples in ARM. See here for links to zip files with the code and data.

6 0.10632994 1748 andrew gelman stats-2013-03-04-PyStan!

7 0.07949286 2342 andrew gelman stats-2014-05-21-Models with constraints

8 0.077367358 1447 andrew gelman stats-2012-08-07-Reproducible science FAIL (so far): What’s stoppin people from sharin data and code?

9 0.073187493 907 andrew gelman stats-2011-09-14-Reproducibility in Practice

10 0.070874736 1919 andrew gelman stats-2013-06-29-R sucks

11 0.068424322 154 andrew gelman stats-2010-07-18-Predictive checks for hierarchical models

12 0.067595176 1423 andrew gelman stats-2012-07-21-Optimizing software in C++

13 0.066316612 2161 andrew gelman stats-2014-01-07-My recent debugging experience

14 0.063301086 1009 andrew gelman stats-2011-11-14-Wickham R short course

15 0.062586926 1443 andrew gelman stats-2012-08-04-Bayesian Learning via Stochastic Gradient Langevin Dynamics

16 0.062457103 2364 andrew gelman stats-2014-06-08-Regression and causality and variable ordering

17 0.059413459 2093 andrew gelman stats-2013-11-07-I’m negative on the expression “false positives”

18 0.059223246 852 andrew gelman stats-2011-08-13-Checking your model using fake data

19 0.058926366 42 andrew gelman stats-2010-05-19-Updated solutions to Bayesian Data Analysis homeworks

20 0.057761658 1807 andrew gelman stats-2013-04-17-Data problems, coding errors…what can be done?


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.075), (1, -0.005), (2, -0.008), (3, 0.035), (4, 0.07), (5, 0.006), (6, 0.002), (7, -0.069), (8, -0.008), (9, -0.027), (10, -0.047), (11, -0.02), (12, -0.03), (13, -0.023), (14, 0.021), (15, 0.013), (16, 0.014), (17, -0.001), (18, 0.01), (19, -0.008), (20, 0.002), (21, 0.019), (22, -0.029), (23, -0.008), (24, -0.026), (25, 0.003), (26, 0.03), (27, -0.03), (28, 0.013), (29, 0.01), (30, 0.011), (31, -0.017), (32, -0.026), (33, 0.006), (34, 0.02), (35, -0.008), (36, -0.011), (37, 0.03), (38, 0.013), (39, -0.003), (40, -0.004), (41, -0.012), (42, 0.019), (43, 0.007), (44, 0.013), (45, 0.032), (46, -0.018), (47, 0.052), (48, 0.015), (49, 0.017)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.95863563 1716 andrew gelman stats-2013-02-09-iPython Notebook

Introduction: Burak Bayramli writes: I wanted to inform you on iPython Notebook technology – allowing markup, Python code to reside in one document. Someone ported one of your examples from ARM . iPynb file is actually a live document, can be downloaded and reran locally, hence change of code on document means change of images, results. Graphs (as well as text output) which are generated by the code, are placed inside the document automatically. No more referencing image files seperately. For now running notebooks locally require a notebook server, but that part can live “on the cloud” as part of an educational software. Viewers, such as nbviewer.ipython.org, do not even need that much, since all recent results of a notebook are embedded in the notebook itself. A lot of people are excited about this; Also out of nowhere, Alfred P. Sloan Foundation dropped a $1.15 million grant on the developers of ipython which provided some extra energy on the project. Cool. We’ll have to do that ex

2 0.71842337 1472 andrew gelman stats-2012-08-28-Migrating from dot to underscore

Introduction: My C-oriented Stan collaborators have convinced me to use underscore (_) rather than dot (.) as much as possible in expressions in R. For example, I can name a variable n_years rather than n.years. This is fine. But I’m getting annoyed because I need to press the shift key every time I type the underscore. What do people do about this? I know that it’s easy enough to reassign keys (I could, for example, assign underscore to backslash, which I never use). I’m just wondering what C programmers actually do. Do they reassign the key or do they just get used to pressing Shift? P.S. In comments, Ben Hyde points to Google’s R style guide, which recommends that variable names use dots, not underscore or camel case, for variable names (for example, “avg.clicks” rather than “avg_Clicks” or “avgClicks”). I think they’re recommending this to be consistent with R coding conventions . I am switching to underscores in R variable names to be consistent with C. Otherwise we were run

3 0.71371365 1627 andrew gelman stats-2012-12-17-Stan and RStan 1.1.0

Introduction: We’re happy to announce the availability of Stan and RStan versions 1.1.0, which are general tools for performing model-based Bayesian inference using the no-U-turn sampler, an adaptive form of Hamiltonian Monte Carlo. Information on downloading and installing and using them is available as always from Stan Home Page: http://mc-stan.org/ Let us know if you have any problems on the mailing lists or at the e-mails linked on the home page (please don’t use this web page). The full release notes follow. (R)Stan Version 1.1.0 Release Notes =================================== -- Backward Compatibility Issue * Categorical distribution recoded to match documentation; it now has support {1,...,K} rather than {0,...,K-1}. * (RStan) change default value of permuted flag from FALSE to TRUE for Stan fit S4 extract() method -- New Features * Conditional (if-then-else) statements * While statements -- New Functions * generalized multiply_lower_tri

4 0.69075042 2161 andrew gelman stats-2014-01-07-My recent debugging experience

Introduction: OK, so this sort of thing happens sometimes. I was working on a new idea (still working on it; if it ultimately works out—or if it doesn’t—I’ll let you know) and as part of it I was fitting little models in Stan, in a loop. I thought it would make sense to start with linear regression with normal priors and known data variance, because then the exact solution is Gaussian and I can also work with the problem analytically. So I programmed up the algorithm and, no surprise, it didn’t work. I went through my R code, put in print statements here and there, and cleared out bug after bug until at least it stopped crashing. But the algorithm still wasn’t doing what it was supposed to do. So I decided to do something simpler, and just check that the Stan linear regression gave the same answer as the analytic posterior distribution: I ran Stan for tons of iterations, then computed the sample mean and variance of the simulations. It was an example with two coefficients—I’d originally cho

5 0.68693978 1580 andrew gelman stats-2012-11-16-Stantastic!

Introduction: Richard McElreath writes: I’ve been translating a few ongoing data analysis projects into Stan code, mostly with success. The most important for me right now has been a hierarchical zero-inflated gamma problem. This a “hurdle” model, in which a bernoulli GLM produces zeros/nonzeros, and then a gamma GLM produces the nonzero values, using varying effects correlated with those in the bernoulli process. The data are 20 years of human foraging returns from a subsistence hunting population in Paraguay (the Ache), comprising about 15k hunts in total (Hill & Kintigh. 2009. Current Anthropology 50:369-377). Observed values are kilograms of meat returned to camp. The more complex models contain a 147-by-9 matrix of varying effects (147 unique hunters), as well as imputation of missing values. Originally, I had written the sampler myself in raw R code. It was very slow, but I knew what it was doing at least. Just before Stan version 1.0 was released, I had managed to get JAGS to do it a

6 0.67449772 2150 andrew gelman stats-2013-12-27-(R-Py-Cmd)Stan 2.1.0

7 0.66984493 2291 andrew gelman stats-2014-04-14-Transitioning to Stan

8 0.66409677 1919 andrew gelman stats-2013-06-29-R sucks

9 0.66314542 2089 andrew gelman stats-2013-11-04-Shlemiel the Software Developer and Unknown Unknowns

10 0.65889913 1711 andrew gelman stats-2013-02-07-How Open Should Academic Papers Be?

11 0.65152365 266 andrew gelman stats-2010-09-09-The future of R

12 0.65146571 2190 andrew gelman stats-2014-01-29-Stupid R Tricks: Random Scope

13 0.64840388 1807 andrew gelman stats-2013-04-17-Data problems, coding errors…what can be done?

14 0.64328492 1799 andrew gelman stats-2013-04-12-Stan 1.3.0 and RStan 1.3.0 Ready for Action

15 0.64098465 1753 andrew gelman stats-2013-03-06-Stan 1.2.0 and RStan 1.2.0

16 0.63960117 198 andrew gelman stats-2010-08-11-Multilevel modeling in R on a Mac

17 0.63752198 2020 andrew gelman stats-2013-09-12-Samplers for Big Science: emcee and BAT

18 0.63536835 1235 andrew gelman stats-2012-03-29-I’m looking for a quadrille notebook with faint lines

19 0.63032949 1296 andrew gelman stats-2012-05-03-Google Translate for code, and an R help-list bot

20 0.6292277 1748 andrew gelman stats-2013-03-04-PyStan!


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(2, 0.018), (15, 0.012), (16, 0.073), (24, 0.119), (35, 0.028), (39, 0.015), (44, 0.011), (48, 0.011), (55, 0.031), (59, 0.315), (61, 0.011), (73, 0.035), (74, 0.015), (85, 0.033), (86, 0.022), (89, 0.014), (99, 0.12)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.92129153 580 andrew gelman stats-2011-02-19-Weather visualization with WeatherSpark

Introduction: WeatherSpark : prediction and observation quantiles, historic data, multiple predictors, zoomable, draggable, colorful, wonderful: Via Jure Cuhalev .

2 0.91277528 403 andrew gelman stats-2010-11-09-Society for Industrial and Applied Mathematics startup-math meetup

Introduction: Chris Wiggins sends along this . It’s a meetup at Davis Auditorium, CEPSR Bldg, Columbia University, on Wed 10 Nov (that’s tomorrow! or maybe today! depending on when you’re reading this), 6-8pm.

same-blog 3 0.86439222 1716 andrew gelman stats-2013-02-09-iPython Notebook

Introduction: Burak Bayramli writes: I wanted to inform you on iPython Notebook technology – allowing markup, Python code to reside in one document. Someone ported one of your examples from ARM . iPynb file is actually a live document, can be downloaded and reran locally, hence change of code on document means change of images, results. Graphs (as well as text output) which are generated by the code, are placed inside the document automatically. No more referencing image files seperately. For now running notebooks locally require a notebook server, but that part can live “on the cloud” as part of an educational software. Viewers, such as nbviewer.ipython.org, do not even need that much, since all recent results of a notebook are embedded in the notebook itself. A lot of people are excited about this; Also out of nowhere, Alfred P. Sloan Foundation dropped a $1.15 million grant on the developers of ipython which provided some extra energy on the project. Cool. We’ll have to do that ex

4 0.78233135 214 andrew gelman stats-2010-08-17-Probability-processing hardware

Introduction: Lyric Semiconductor posted: For over 60 years, computers have been based on digital computing principles. Data is represented as bits (0s and 1s). Boolean logic gates perform operations on these bits. A processor steps through many of these operations serially in order to perform a function. However, today’s most interesting problems are not at all suited to this approach. Here at Lyric Semiconductor, we are redesigning information processing circuits from the ground up to natively process probabilities: from the gate circuits to the processor architecture to the programming language. As a result, many applications that today require a thousand conventional processors will soon run in just one Lyric processor, providing 1,000x efficiencies in cost, power, and size. Om Malik has some more information, also relating to the team and the business. The fundamental idea is that computing architectures work deterministically, even though the world is fundamentally stochastic.

5 0.69115901 763 andrew gelman stats-2011-06-13-Inventor of Connect Four dies at 91

Introduction: Obit here . I think I have a cousin with the same last name as this guy, so maybe we’re related by marriage in some way. (By that standard we’re also related to Marge Simpson and, I seem to recall, the guy who wrote the scripts for Dark Shadows.)

6 0.68174523 853 andrew gelman stats-2011-08-14-Preferential admissions for children of elite colleges

7 0.67434704 965 andrew gelman stats-2011-10-19-Web-friendly visualizations in R

8 0.65220797 1599 andrew gelman stats-2012-11-30-“The scientific literature must be cleansed of everything that is fraudulent, especially if it involves the work of a leading academic”

9 0.63953692 34 andrew gelman stats-2010-05-14-Non-academic writings on literature

10 0.60422397 1000 andrew gelman stats-2011-11-10-Forecasting 2012: How much does ideology matter?

11 0.60093164 1764 andrew gelman stats-2013-03-15-How do I make my graphs?

12 0.60002589 1380 andrew gelman stats-2012-06-15-Coaching, teaching, and writing

13 0.5931741 229 andrew gelman stats-2010-08-24-Bizarre twisty argument about medical diagnostic tests

14 0.58809817 517 andrew gelman stats-2011-01-14-Bayes in China update

15 0.57467699 1408 andrew gelman stats-2012-07-07-Not much difference between communicating to self and communicating to others

16 0.56807429 1185 andrew gelman stats-2012-02-26-A statistician’s rants and raves

17 0.55637497 1377 andrew gelman stats-2012-06-13-A question about AIC

18 0.5527308 199 andrew gelman stats-2010-08-11-Note to semi-spammers

19 0.55216455 766 andrew gelman stats-2011-06-14-Last Wegman post (for now)

20 0.55078614 2095 andrew gelman stats-2013-11-09-Typo in Ghitza and Gelman MRP paper