andrew_gelman_stats andrew_gelman_stats-2013 andrew_gelman_stats-2013-1802 knowledge-graph by maker-knowledge-mining

1802 andrew gelman stats-2013-04-14-Detecting predictability in complex ecosystems


meta infos for this blog

Source: html

Introduction: A couple people pointed me to a recent article , “Detecting Causality in Complex Ecosystems,” by fisheries researchers George Sugihara, Robert May, Hao Ye, Chih-hao Hsieh, Ethan Deyle, Michael Fogarty, and Stephan Munch. I don’t know anything about ecology research but I could imagine this method being useful in that field. I can’t see the approach doing much in political science, where I think their stated goal of “identifying causal networks” is typically irrelevant. That said, if you replace the word “causality” by “predictability” everywhere in the paper, it starts to make a lot more sense. As they write, they are working within “a framework that uses predictability as opposed to correlation to identify causation between time-series variables.” Setting causation aside, predictability is an important topic in itself. The search for patterns of predictability in complex structures may motivate causal hypotheses that can be studied more directly, using more traditional statis


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 A couple people pointed me to a recent article , “Detecting Causality in Complex Ecosystems,” by fisheries researchers George Sugihara, Robert May, Hao Ye, Chih-hao Hsieh, Ethan Deyle, Michael Fogarty, and Stephan Munch. [sent-1, score-0.307]

2 I don’t know anything about ecology research but I could imagine this method being useful in that field. [sent-2, score-0.264]

3 I can’t see the approach doing much in political science, where I think their stated goal of “identifying causal networks” is typically irrelevant. [sent-3, score-0.441]

4 That said, if you replace the word “causality” by “predictability” everywhere in the paper, it starts to make a lot more sense. [sent-4, score-0.415]

5 As they write, they are working within “a framework that uses predictability as opposed to correlation to identify causation between time-series variables. [sent-5, score-1.357]

6 ” Setting causation aside, predictability is an important topic in itself. [sent-6, score-0.908]

7 The search for patterns of predictability in complex structures may motivate causal hypotheses that can be studied more directly, using more traditional statistical designs such as experiments and observational studies. [sent-7, score-1.95]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('predictability', 0.592), ('causation', 0.258), ('causality', 0.223), ('fisheries', 0.175), ('fogarty', 0.175), ('stephan', 0.165), ('ethan', 0.165), ('complex', 0.163), ('causal', 0.14), ('ecology', 0.131), ('detecting', 0.129), ('structures', 0.121), ('everywhere', 0.121), ('identifying', 0.113), ('motivate', 0.112), ('designs', 0.111), ('starts', 0.11), ('stated', 0.109), ('replace', 0.104), ('opposed', 0.103), ('networks', 0.102), ('hypotheses', 0.1), ('observational', 0.099), ('studied', 0.095), ('may', 0.093), ('framework', 0.089), ('traditional', 0.089), ('george', 0.088), ('identify', 0.088), ('robert', 0.085), ('uses', 0.083), ('correlation', 0.083), ('aside', 0.082), ('word', 0.08), ('patterns', 0.079), ('search', 0.078), ('michael', 0.078), ('experiments', 0.078), ('setting', 0.076), ('pointed', 0.072), ('imagine', 0.069), ('goal', 0.069), ('typically', 0.067), ('directly', 0.066), ('method', 0.064), ('within', 0.061), ('couple', 0.06), ('studies', 0.06), ('topic', 0.058), ('approach', 0.056)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.99999982 1802 andrew gelman stats-2013-04-14-Detecting predictability in complex ecosystems

Introduction: A couple people pointed me to a recent article , “Detecting Causality in Complex Ecosystems,” by fisheries researchers George Sugihara, Robert May, Hao Ye, Chih-hao Hsieh, Ethan Deyle, Michael Fogarty, and Stephan Munch. I don’t know anything about ecology research but I could imagine this method being useful in that field. I can’t see the approach doing much in political science, where I think their stated goal of “identifying causal networks” is typically irrelevant. That said, if you replace the word “causality” by “predictability” everywhere in the paper, it starts to make a lot more sense. As they write, they are working within “a framework that uses predictability as opposed to correlation to identify causation between time-series variables.” Setting causation aside, predictability is an important topic in itself. The search for patterns of predictability in complex structures may motivate causal hypotheses that can be studied more directly, using more traditional statis

2 0.14114991 1709 andrew gelman stats-2013-02-06-The fractal nature of scientific revolutions

Introduction: Phil Earnhardt writes: I stumbled across your blog entry after googling on those terms. If I could comment on the closed entry [We had to shut off comments on old blog entries for reasons of spam --- ed.], I’d note: scientific revolutions are fractal; they’re also chaotic in their dynamics. Predictability when a particular scientific revolution will take hold—or be rejected—is problematic. I find myself wishing that Chaos Theory had been established when Kuhn wrote his essay.

3 0.1266963 1778 andrew gelman stats-2013-03-27-My talk at the University of Michigan today 4pm

Introduction: Causality and Statistical Learning Andrew Gelman, Statistics and Political Science, Columbia University Wed 27 Mar, 4pm, Betty Ford Auditorium, Ford School of Public Policy Causal inference is central to the social and biomedical sciences. There are unresolved debates about the meaning of causality and the methods that should be used to measure it. As a statistician, I am trained to say that randomized experiments are a gold standard, yet I have spent almost all my applied career analyzing observational data. In this talk we shall consider various approaches to causal reasoning from the perspective of an applied statistician who recognizes the importance of causal identification yet must learn from available information. Two relevant papers are here and here .

4 0.12308913 950 andrew gelman stats-2011-10-10-“Causality is almost always in doubt”

Introduction: Dave Backus writes: We macroeconomists are thrilled with the Nobel prize for Sargent and Sims. But on causality: they spent more time showing how hard it was to identify causality than showing how to do it. And that’s a fair assessment of our field [economics]: causality is almost always in doubt. More here . If I were in a snarky mood, I’d say something like, Causality is always in doubt in economics . . . unless you’re talking about abortion and crime, in which case you can be absolutely certain. But I’m in a good mood right now so I won’t say that. Instead I’ll just remark that, as a statistician, I’m positively thrilled that somebody named “Sims” received a major award.

5 0.12152365 1418 andrew gelman stats-2012-07-16-Long discussion about causal inference and the use of hierarchical models to bridge between different inferential settings

Introduction: Elias Bareinboim asked what I thought about his comment on selection bias in which he referred to a paper by himself and Judea Pearl, “Controlling Selection Bias in Causal Inference.” I replied that I have no problem with what he wrote, but that from my perspective I find it easier to conceptualize such problems in terms of multilevel models. I elaborated on that point in a recent post , “Hierarchical modeling as a framework for extrapolation,” which I think was read by only a few people (I say this because it received only two comments). I don’t think Bareinboim objected to anything I wrote, but like me he is comfortable working within his own framework. He wrote the following to me: In some sense, “not ad hoc” could mean logically consistent. In other words, if one agrees with the assumptions encoded in the model, one must also agree with the conclusions entailed by these assumptions. I am not aware of any other way of doing mathematics. As it turns out, to get causa

6 0.11524034 1939 andrew gelman stats-2013-07-15-Forward causal reasoning statements are about estimation; reverse causal questions are about model checking and hypothesis generation

7 0.10974113 212 andrew gelman stats-2010-08-17-Futures contracts, Granger causality, and my preference for estimation to testing

8 0.10420804 2097 andrew gelman stats-2013-11-11-Why ask why? Forward causal inference and reverse causal questions

9 0.10113104 1624 andrew gelman stats-2012-12-15-New prize on causality in statstistics education

10 0.10073925 2268 andrew gelman stats-2014-03-26-New research journal on observational studies

11 0.097444423 2207 andrew gelman stats-2014-02-11-My talks in Bristol this Wed and London this Thurs

12 0.096680015 1996 andrew gelman stats-2013-08-24-All inference is about generalizing from sample to population

13 0.095955655 1675 andrew gelman stats-2013-01-15-“10 Things You Need to Know About Causal Effects”

14 0.095090158 1512 andrew gelman stats-2012-09-27-A Non-random Walk Down Campaign Street

15 0.083636425 1336 andrew gelman stats-2012-05-22-Battle of the Repo Man quotes: Reid Hastie’s turn

16 0.08326111 2037 andrew gelman stats-2013-09-25-Classical probability does not apply to quantum systems (causal inference edition)

17 0.080452032 1666 andrew gelman stats-2013-01-10-They’d rather be rigorous than right

18 0.078755811 2254 andrew gelman stats-2014-03-18-Those wacky anti-Bayesians used to be intimidating, but now they’re just pathetic

19 0.078407876 879 andrew gelman stats-2011-08-29-New journal on causal inference

20 0.078174137 2263 andrew gelman stats-2014-03-24-Empirical implications of Empirical Implications of Theoretical Models


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.111), (1, -0.005), (2, -0.015), (3, -0.048), (4, -0.033), (5, 0.002), (6, -0.056), (7, -0.017), (8, 0.023), (9, 0.043), (10, -0.032), (11, 0.005), (12, 0.013), (13, 0.0), (14, -0.001), (15, 0.017), (16, -0.015), (17, -0.002), (18, -0.049), (19, 0.023), (20, -0.013), (21, -0.072), (22, 0.031), (23, 0.041), (24, 0.065), (25, 0.076), (26, 0.036), (27, -0.009), (28, 0.004), (29, 0.032), (30, 0.026), (31, -0.022), (32, 0.022), (33, -0.033), (34, -0.053), (35, 0.009), (36, 0.029), (37, -0.021), (38, 0.016), (39, 0.003), (40, -0.01), (41, 0.021), (42, 0.035), (43, 0.043), (44, -0.039), (45, -0.029), (46, 0.019), (47, 0.022), (48, 0.014), (49, -0.059)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.95808446 1802 andrew gelman stats-2013-04-14-Detecting predictability in complex ecosystems

Introduction: A couple people pointed me to a recent article , “Detecting Causality in Complex Ecosystems,” by fisheries researchers George Sugihara, Robert May, Hao Ye, Chih-hao Hsieh, Ethan Deyle, Michael Fogarty, and Stephan Munch. I don’t know anything about ecology research but I could imagine this method being useful in that field. I can’t see the approach doing much in political science, where I think their stated goal of “identifying causal networks” is typically irrelevant. That said, if you replace the word “causality” by “predictability” everywhere in the paper, it starts to make a lot more sense. As they write, they are working within “a framework that uses predictability as opposed to correlation to identify causation between time-series variables.” Setting causation aside, predictability is an important topic in itself. The search for patterns of predictability in complex structures may motivate causal hypotheses that can be studied more directly, using more traditional statis

2 0.79530412 340 andrew gelman stats-2010-10-13-Randomized experiments, non-randomized experiments, and observational studies

Introduction: In the spirit of Dehejia and Wahba: Three Conditions under Which Experiments and Observational Studies Produce Comparable Causal Estimates: New Findings from Within-Study Comparisons , by Cook, Shadish, and Wong. Can Nonrandomized Experiments Yield Accurate Answers? A Randomized Experiment Comparing Random and Nonrandom Assignments, by Shadish, Clark, and Steiner. I just talk about causal inference. These people do it. The second link above is particularly interesting because it includes discussions by some causal inference heavyweights. WWJD and all that.

3 0.78510165 785 andrew gelman stats-2011-07-02-Experimental reasoning in social science

Introduction: As a statistician, I was trained to think of randomized experimentation as representing the gold standard of knowledge in the social sciences, and, despite having seen occasional arguments to the contrary, I still hold that view, expressed pithily by Box, Hunter, and Hunter (1978) that “To find out what happens when you change something, it is necessary to change it.” At the same time, in my capacity as a social scientist, I’ve published many applied research papers, almost none of which have used experimental data. In the present article, I’ll address the following questions: 1. Why do I agree with the consensus characterization of randomized experimentation as a gold standard? 2. Given point 1 above, why does almost all my research use observational data? In confronting these issues, we must consider some general issues in the strategy of social science research. We also take from the psychology methods literature a more nuanced perspective that considers several differen

4 0.77102745 1996 andrew gelman stats-2013-08-24-All inference is about generalizing from sample to population

Introduction: Jeff Walker writes: Your blog has skirted around the value of observational studies and chided folks for using causal language when they only have associations but I sense that you ultimately find value in these associations. I would love for you to expand this thought in a blog. Specifically: Does a measured association “suggest” a causal relationship? Are measured associations a good and efficient way to narrow the field of things that should be studied? Of all the things we should pursue, should we start with the stuff that has some largish measured association? Certainly many associations are not directly causal but due to joint association. Similarly, there must be many variables that are directly causally associated ( A -> B) but the effect, measured as an association, is masked by confounders. So if we took the “measured associations are worthwhile” approach, we’d never or rarely find the masked effects. But I’d also like to know if one is more likely to find a large causal

5 0.75353783 2286 andrew gelman stats-2014-04-08-Understanding Simpson’s paradox using a graph

Introduction: Joshua Vogelstein pointed me to this post by Michael Nielsen on how to teach Simpson’s paradox. I don’t know if Nielsen (and others) are aware that people have developed some snappy graphical methods for displaying Simpson’s paradox (and, more generally, aggregation issues). We do some this in our Red State Blue State book, but before that was the BK plot, named by Howard Wainer after a 2001 paper by Stuart Baker and Barnett Kramer, although in apparently appeared earlier in a 1987 paper by Jeon, Chung, and Bae, and doubtless was made by various other people before then. Here’s Wainer’s graphical explication from 2002 (adapted from Baker and Kramer’s 2001 paper): Here’s the version from our 2007 article (with Boris Shor, Joe Bafumi, and David Park): But I recommend Wainer’s article (linked to above) as the first thing to read on the topic of presenting aggregation paradoxes in a clear and grabby way. P.S. Robert Long writes in: I noticed your post ab

6 0.75345665 1778 andrew gelman stats-2013-03-27-My talk at the University of Michigan today 4pm

7 0.73000968 879 andrew gelman stats-2011-08-29-New journal on causal inference

8 0.72865212 1675 andrew gelman stats-2013-01-15-“10 Things You Need to Know About Causal Effects”

9 0.70975137 550 andrew gelman stats-2011-02-02-An IV won’t save your life if the line is tangled

10 0.70857245 1888 andrew gelman stats-2013-06-08-New Judea Pearl journal of causal inference

11 0.69885463 1939 andrew gelman stats-2013-07-15-Forward causal reasoning statements are about estimation; reverse causal questions are about model checking and hypothesis generation

12 0.69551122 1418 andrew gelman stats-2012-07-16-Long discussion about causal inference and the use of hierarchical models to bridge between different inferential settings

13 0.68402153 1492 andrew gelman stats-2012-09-11-Using the “instrumental variables” or “potential outcomes” approach to clarify causal thinking

14 0.68075776 2097 andrew gelman stats-2013-11-11-Why ask why? Forward causal inference and reverse causal questions

15 0.67459142 1136 andrew gelman stats-2012-01-23-Fight! (also a bit of reminiscence at the end)

16 0.66983271 1624 andrew gelman stats-2012-12-15-New prize on causality in statstistics education

17 0.65306681 1336 andrew gelman stats-2012-05-22-Battle of the Repo Man quotes: Reid Hastie’s turn

18 0.65227461 2207 andrew gelman stats-2014-02-11-My talks in Bristol this Wed and London this Thurs

19 0.64433867 1645 andrew gelman stats-2012-12-31-Statistical modeling, causal inference, and social science

20 0.63030338 32 andrew gelman stats-2010-05-14-Causal inference in economics


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(15, 0.015), (16, 0.035), (21, 0.037), (24, 0.096), (30, 0.017), (53, 0.357), (99, 0.319)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.96577412 1589 andrew gelman stats-2012-11-25-Life as a blogger: the emails just get weirder and weirder

Introduction: In the email the other day, subject line “Casting blogger, writer, journalist to host cable series”: Hi there Andrew, I’m casting a male journalist, writer, blogger, documentary filmmaker or comedian with a certain type personality for a television pilot along with production company, Pipeline39. See below: A certain type of character – no cockiness, no ego, a person who is smart, savvy, dry humor, but someone who isn’t imposing, who can infiltrate these organizations. This person will be hosting his own show and covering alternative lifestyles and secret societies around the world. If you’re interested in hearing more or would like to be considered for this project, please email me a photo and a bio of yourself, along with contact information. I’ll respond to you ASAP. I’m looking forward to hearing from you. *** Casting Producer (646) ***.**** ***@gmail.com I was with them until I got to the “no ego” part. . . . Also, I don’t think I could infiltrate any org

2 0.96224684 298 andrew gelman stats-2010-09-27-Who is that masked person: The use of face masks on Mexico City public transportation during the Influenza A (H1N1) outbreak

Introduction: Tapen Sinha writes: Living in Mexico, I have been witness to many strange (and beautiful) things. Perhaps the strangest happened during the first outbreak of A(H1N1) in Mexico City. We had our university closed, football (soccer) was played in empty stadiums (or should it be stadia) because the government feared a spread of the virus. The Metro was operating and so were the private/public buses and taxis. Since the university was closed, we took the opportunity to collect data on facemask use in the public transport systems. It was a simple (but potentially deadly!) exercise in first hand statistical data collection that we teach our students (Although I must admit that I did not dare sending my research assistant to collect data – what if she contracted the virus?). I believe it was a unique experiment never to be repeated. The paper appeared in the journal Health Policy. From the abstract: At the height of the influenza epidemic in Mexico City in the spring of 2009, the f

3 0.94219518 413 andrew gelman stats-2010-11-14-Statistics of food consumption

Introduction: Visual Economics shows statistics on average food consumption in America: My brief feedback is that water is confounded with these results. They should have subtracted water content from the weight of all dietary items, as it inflates the proportion of milk, vegetable and fruit items that contain more water. They did that for soda (which is represented as sugar/corn syrup), amplifying the inconsistency. Time Magazine had a beautiful gallery that visualizes diets around the world in a more appealing way.

4 0.93688983 1856 andrew gelman stats-2013-05-14-GPstuff: Bayesian Modeling with Gaussian Processes

Introduction: I think it’s part of my duty as a blogger to intersperse, along with the steady flow of jokes, rants, and literary criticism, some material that will actually be useful to you. So here goes. Jarno Vanhatalo, Jaakko Riihimäki, Jouni Hartikainen, Pasi Jylänki, Ville Tolvanen, and Aki Vehtari write : The GPstuff toolbox is a versatile collection of Gaussian process models and computational tools required for Bayesian inference. The tools include, among others, various inference methods, sparse approximations and model assessment methods. We can actually now fit Gaussian processes in Stan . But for big problems (or even moderately-sized problems), full Bayes can be slow. GPstuff uses EP, which is faster. At some point we’d like to implement EP in Stan. (Right now we’re working with Dave Blei to implement VB.) GPstuff really works. I saw Aki use it to fit a nonparametric version of the Bangladesh well-switching example in ARM. He was sitting in his office and just whip

5 0.93599498 1677 andrew gelman stats-2013-01-16-Greenland is one tough town

Introduction: Americans (including me) don’t know much about other countries. Jeff Lax sent me to this blog post by Myrddin pointing out that Belgium has a higher murder rate than the rest of Western Europe. I have no particular take on this, but it’s a good reminder that other countries differ from each other. Here in the U.S., we tend to think all western European countries are the same, all eastern European countries are the same, etc. In reality, Sweden is not Finland . P.S. According to the Wiki , Greenland is one tough town. I guess there’s nothing much to do out there but watch satellite TV, chew the blubber, and kill people.

same-blog 6 0.9324221 1802 andrew gelman stats-2013-04-14-Detecting predictability in complex ecosystems

7 0.91611195 46 andrew gelman stats-2010-05-21-Careers, one-hit wonders, and an offer of a free book

8 0.91201323 1468 andrew gelman stats-2012-08-24-Multilevel modeling and instrumental variables

9 0.89413774 1905 andrew gelman stats-2013-06-18-There are no fat sprinters

10 0.89120173 991 andrew gelman stats-2011-11-04-Insecure researchers aren’t sharing their data

11 0.88690192 1902 andrew gelman stats-2013-06-17-Job opening at new “big data” consulting firm!

12 0.88507575 733 andrew gelman stats-2011-05-27-Another silly graph

13 0.87365466 2022 andrew gelman stats-2013-09-13-You heard it here first: Intense exercise can suppress appetite

14 0.86690086 1555 andrew gelman stats-2012-10-31-Social scientists who use medical analogies to explain causal inference are, I think, implicitly trying to borrow some of the scientific and cultural authority of that field for our own purposes

15 0.86089933 2067 andrew gelman stats-2013-10-18-EP and ABC

16 0.85899997 880 andrew gelman stats-2011-08-30-Annals of spam

17 0.85275257 495 andrew gelman stats-2010-12-31-“Threshold earners” and economic inequality

18 0.84767348 1047 andrew gelman stats-2011-12-08-I Am Too Absolutely Heteroskedastic for This Probit Model

19 0.82460171 354 andrew gelman stats-2010-10-19-There’s only one Amtrak

20 0.82422113 1861 andrew gelman stats-2013-05-17-Where do theories come from?