378 andrew gelman stats-2010-10-28-World Economic Forum Data Visualization Challenge


meta information for this blog

Source: html

Introduction: Jaidev Deshpande writes: The World Economic Forum recently posed a data visualization problem. The dataset is a survey of experts from the so called “Agenda Councils” of the WEF. Here are the details. The dataset primarily contains the experts’ opinions on which global / regional / industrial agenda council of the WEF they would benefit from by interacting with the most. It occurs to me that this dataset can be thought of as an instance of a social networking dynamics, in that it represents the preferences of individuals towards belonging or not belonging to a particular group within the network. It is these ‘groups’ that must be identified to solve the problem. Under what conditions would this hypothesis be valid? I have a hunch that dimensionality reduction will not necessarily help me visualize this data satisfactorily. They also need to be complemented by the way social networks detect cliques amongst their members. The prize is $3000 plus bragging rights, and submiss


Summary: the most important sentences generated by the tf-idf model

sentIndex sentText sentNum sentScore

1 Jaidev Deshpande writes: The World Economic Forum recently posed a data visualization problem. [sent-1, score-0.239]

2 The dataset is a survey of experts from the so called “Agenda Councils” of the WEF. [sent-2, score-0.533]

3 The dataset primarily contains the experts’ opinions on which global / regional / industrial agenda council of the WEF they would benefit from by interacting with the most. [sent-4, score-1.666]

4 It occurs to me that this dataset can be thought of as an instance of a social networking dynamics, in that it represents the preferences of individuals towards belonging or not belonging to a particular group within the network. [sent-5, score-1.994]

5 It is these ‘groups’ that must be identified to solve the problem. [sent-6, score-0.19]

6 I have a hunch that dimensionality reduction will not necessarily help me visualize this data satisfactorily. [sent-8, score-0.669]

7 They also need to be complemented by the way social networks detect cliques amongst their members. [sent-9, score-0.483]

8 The prize is $3000 plus bragging rights, and submissions are due 15 Nov. [sent-10, score-0.618]
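The page does not say how sentScore is computed. The listed scores are, however, consistent with a simple rule: split each sentence on whitespace and add up the tf-idf weight of every token that exactly matches a word in the "tfidf for this blog" list (so "rights," or "‘groups’" with attached punctuation contribute nothing). The sketch below is an assumption based on that observation, not the site's actual code; `sentence_score` and `TFIDF_WEIGHTS` are hypothetical names, and the weights are copied from this page.

```python
# Assumed scoring rule (inferred from the numbers on this page, not documented):
# sentence score = sum of tf-idf weights of whitespace-separated tokens.
TFIDF_WEIGHTS = {
    "dataset": 0.288,
    "experts": 0.179,
    "called": 0.066,
    "identified": 0.1,
    "solve": 0.09,
    "groups": 0.08,
}


def sentence_score(sentence, weights):
    """Sum the weight of every whitespace-separated token found in `weights`.

    Tokens are matched exactly, so case and attached punctuation matter.
    """
    return sum(weights.get(token, 0.0) for token in sentence.split())


s2 = "The dataset is a survey of experts from the so called “Agenda Councils” of the WEF."
s5 = "It is these ‘groups’ that must be identified to solve the problem."

print(round(sentence_score(s2, TFIDF_WEIGHTS), 3))  # 0.533, as listed for sentence 2
print(round(sentence_score(s5, TFIDF_WEIGHTS), 3))  # 0.19, as listed for sentence 5
```

Under this rule, long sentences with many distinctive words score highest, which fits sentence 4 topping the list: it contains "belonging" twice.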


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('belonging', 0.369), ('agenda', 0.291), ('dataset', 0.288), ('bragging', 0.185), ('experts', 0.179), ('dimensionality', 0.161), ('interacting', 0.161), ('networking', 0.156), ('hunch', 0.152), ('amongst', 0.148), ('visualize', 0.145), ('posed', 0.143), ('nov', 0.143), ('industrial', 0.14), ('council', 0.136), ('dynamics', 0.134), ('primarily', 0.13), ('submissions', 0.129), ('regional', 0.127), ('rights', 0.125), ('reduction', 0.125), ('detect', 0.122), ('prize', 0.121), ('occurs', 0.12), ('towards', 0.119), ('forum', 0.115), ('preferences', 0.112), ('networks', 0.107), ('contains', 0.106), ('social', 0.106), ('instance', 0.103), ('identified', 0.1), ('valid', 0.1), ('global', 0.1), ('opinions', 0.099), ('represents', 0.097), ('plus', 0.097), ('visualization', 0.096), ('conditions', 0.091), ('solve', 0.09), ('benefit', 0.088), ('individuals', 0.088), ('necessarily', 0.086), ('due', 0.086), ('groups', 0.08), ('hypothesis', 0.075), ('details', 0.072), ('economic', 0.068), ('group', 0.067), ('called', 0.066)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 378 andrew gelman stats-2010-10-28-World Economic Forum Data Visualization Challenge

2 0.14947188 1032 andrew gelman stats-2011-11-28-Does Avastin work on breast cancer? Should Medicare be paying for it?

Introduction: Discussion by a panel of experts at the Statistics Forum.

3 0.13151693 978 andrew gelman stats-2011-10-28-Cool job opening with brilliant researchers at Yahoo

Introduction: Duncan Watts writes: The Human Social Dynamics Group in Yahoo Research is seeking highly qualified candidates for a post-doctoral research scientist position. The Human and Social Dynamics group is devoted to understanding the interplay between individual-level behavior (e.g. how people make decisions about what music they like, which dates to go on, or which groups to join) and the social environment in which individual behavior necessarily plays itself out. In particular, we are interested in: * Structure and evolution of social groups and networks * Decision making, social influence, diffusion, and collective decisions * Networking and collaborative problem solving. The intrinsically multi-disciplinary and cross-cutting nature of the subject demands an eclectic range of researchers, both in terms of domain-expertise (e.g. decision sciences, social psychology, sociology) and technical skills (e.g. statistical analysis, mathematical modeling, computer simulations, design o

4 0.098253541 717 andrew gelman stats-2011-05-17-Statistics plagiarism scandal

Introduction: See more at the Statistics Forum (of course).

5 0.077657253 1695 andrew gelman stats-2013-01-28-Economists argue about Bayes

Introduction: Robert Bell pointed me to this post by Brad De Long on Bayesian statistics, and then I also noticed this from Noah Smith, who wrote: My impression is that although the Bayesian/Frequentist debate is interesting and intellectually fun, there’s really not much “there” there… despite being so-hip-right-now, Bayesian is not the Statistical Jesus. I’m happy to see the discussion going in this direction. Twenty-five years ago or so, when I got into this biz, there were some serious anti-Bayesian attitudes floating around in mainstream statistics. Discussions in the journals sometimes devolved into debates of the form, “Bayesians: knaves or fools?”. You’d get all sorts of free-floating skepticism about any prior distribution at all, even while people were accepting without question (and doing theory on) logistic regressions, proportional hazards models, and all sorts of strong strong models. (In the subfield of survey sampling, various prominent researchers would refuse to mode

6 0.077218562 756 andrew gelman stats-2011-06-10-Christakis-Fowler update

7 0.076975964 703 andrew gelman stats-2011-05-10-Bringing Causal Models Into the Mainstream

8 0.072326593 676 andrew gelman stats-2011-04-23-The payoff: $650. The odds: 1 in 500,000.

9 0.07061398 1015 andrew gelman stats-2011-11-17-Good examples of lurking variables?

10 0.067920968 648 andrew gelman stats-2011-04-04-The Case for More False Positives in Anti-doping Testing

11 0.065554298 1888 andrew gelman stats-2013-06-08-New Judea Pearl journal of causal inference

12 0.064469099 544 andrew gelman stats-2011-01-29-Splitting the data

13 0.063883662 1051 andrew gelman stats-2011-12-10-Towards a Theory of Trust in Networks of Humans and Computers

14 0.060942419 1414 andrew gelman stats-2012-07-12-Steven Pinker’s unconvincing debunking of group selection

15 0.059003837 154 andrew gelman stats-2010-07-18-Predictive checks for hierarchical models

16 0.058827471 747 andrew gelman stats-2011-06-06-Research Directions for Machine Learning and Algorithms

17 0.057874482 1811 andrew gelman stats-2013-04-18-Psychology experiments to understand what’s going on with data graphics?

18 0.055740282 799 andrew gelman stats-2011-07-13-Hypothesis testing with multiple imputations

19 0.054123387 1678 andrew gelman stats-2013-01-17-Wanted: 365 stories of statistics

20 0.053649522 982 andrew gelman stats-2011-10-30-“There’s at least as much as an 80 percent chance . . .”
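The simValue column above is not defined on the page. A common way to compare tf-idf vectors is cosine similarity, which would also explain why the post scores 1.0 against itself. The sketch below assumes that measure; `cosine` is a hypothetical helper, `post_a` uses a few weights from this page, and `post_b` is made up purely for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as {term: weight} dicts."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention for an empty vector (an assumption)
    return dot / (norm_u * norm_v)


# A few weights copied from the "tfidf for this blog" list.
post_a = {"belonging": 0.369, "agenda": 0.291, "dataset": 0.288, "experts": 0.179}
# Made-up weights for a second post; not taken from the site.
post_b = {"experts": 0.5, "forum": 0.4, "panel": 0.3}

print(cosine(post_a, post_a))  # a vector compared with itself
print(cosine(post_a, post_b))  # two posts sharing only the term "experts"
```

Because only shared terms contribute to the dot product, two posts that overlap on a single word get a small but nonzero similarity, much like the low values further down the list.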


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.085), (1, -0.011), (2, 0.014), (3, -0.018), (4, 0.005), (5, 0.015), (6, -0.059), (7, 0.008), (8, -0.036), (9, 0.039), (10, -0.026), (11, -0.023), (12, 0.021), (13, 0.019), (14, -0.036), (15, 0.0), (16, -0.014), (17, 0.025), (18, 0.028), (19, -0.029), (20, 0.01), (21, 0.023), (22, -0.018), (23, -0.027), (24, -0.025), (25, 0.03), (26, 0.026), (27, -0.01), (28, 0.015), (29, -0.002), (30, 0.014), (31, -0.01), (32, 0.02), (33, -0.001), (34, -0.027), (35, 0.024), (36, 0.016), (37, 0.023), (38, 0.045), (39, 0.072), (40, -0.02), (41, -0.006), (42, 0.004), (43, -0.015), (44, -0.031), (45, 0.006), (46, 0.053), (47, -0.052), (48, -0.001), (49, 0.006)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.94773155 378 andrew gelman stats-2010-10-28-World Economic Forum Data Visualization Challenge

2 0.66223431 275 andrew gelman stats-2010-09-14-Data visualization at the American Evaluation Association

Introduction: Stephanie Evergreen writes: Media, web design, and marketing have all created an environment where stakeholders – clients, program participants, funders – all expect high quality graphics and reporting that effectively conveys the valuable insights from evaluation work. Some in statistics and mathematics have used data visualization strategies to support more useful reporting of complex ideas. Global growing interest in improving communications has begun to take root in the evaluation field as well. But as anyone who has sat through a day’s worth of a conference or had to endure a dissertation-worthy evaluation report knows, evaluators still have a long way to go. To support the development of researchers and evaluators, some members of the American Evaluation Association are proposing a new TIG (Topical Interest Group) on Data Visualization and Reporting. If you are a member of AEA (or want to be) and you are interested in joining this TIG, contact Stephanie Evergreen.

3 0.65652454 1837 andrew gelman stats-2013-05-03-NYC Data Skeptics Meetup

Introduction: Rachel Schutt writes: The hype surrounding Big Data and Data Science is at a fever pitch with promises to solve the world’s business and social problems, large and small. How accurate or misleading is this message? How is it helping or damaging people, and which people? What opportunities exist for data nerds and entrepreneurs that examine the larger issues with a skeptical view? This Meetup focuses on mathematical, ethical, and business aspects of data from a skeptical perspective. Guest speakers will discuss the misuse of and best practices with data, common mistakes people make with data and ways to avoid them, how to deal with intentional gaming and politics surrounding mathematical modeling, and taking into account the feedback loops and wider consequences of modeling. We will take deep dives into models in the fields of Data Science, statistics, financial engineering, and economics. This is an independent forum and open to anyone sharing an interest in the larger use of

4 0.63385969 1541 andrew gelman stats-2012-10-19-Statistical discrimination again

Introduction: Mark Johnstone writes: I’ve recently been investigating a new European Court of Justice ruling on insurance calculations (on behalf of MoneySuperMarket) and I found something related to statistics that caught my attention. . . . The ruling (which comes into effect in December 2012) states that insurers in Europe can no longer provide different premiums based on gender. Despite the fact that women are statistically safer drivers, unless it’s biologically proven there is a causal relationship between being female and being a safer driver, this is now seen as an act of discrimination (more on this from the Wall Street Journal). However, where do you stop with this? What about age? What about other factors? And what does this mean for the application of statistics in general? Is it inherently unjust in this context? One proposal has been to fit ‘black boxes’ into cars so more individual data can be collected, as opposed to relying heavily on aggregates. For fans of data and s

5 0.61361712 2221 andrew gelman stats-2014-02-23-Postdoc with Huffpost Pollster to do Bayesian poll tracking

Introduction: Mark Blumenthal writes: HuffPost Pollster has an immediate opening for a social and data scientist to join us full time, preferably in our Washington D.C. bureau, to work on development and improvement of our poll tracking models and political forecasts. You are someone who has: * A passion for electoral politics, * Advanced training in statistics and dynamic Bayesian data analysis, * A Ph.D. in statistics, political science, economics or the social sciences or comparable high level training or experience, * A desire to make a lasting contribution in the way the news media cover polls and elections. We are: * The award-winning website formerly known as Pollster.com, which joined the Huffington Post in 2010 and remains the internet’s premier source for uniquely interactive polling charts and electorate forecasts and a running daily commentary that explains, demystifies and critiques political polling. * Home to the open source Pollster API, which provides academic

6 0.59643918 1828 andrew gelman stats-2013-04-27-Time-Sharing Experiments for the Social Sciences

7 0.58963102 527 andrew gelman stats-2011-01-20-Cars vs. trucks

8 0.57546014 1212 andrew gelman stats-2012-03-14-Controversy about a ranking of philosophy departments, or How should we think about statistical results when we can’t see the raw data?

9 0.56396723 192 andrew gelman stats-2010-08-08-Turning pages into data

10 0.55997473 1853 andrew gelman stats-2013-05-12-OpenData Latinoamerica

11 0.55974776 2167 andrew gelman stats-2014-01-10-Do you believe that “humans and other living things have evolved over time”?

12 0.55551803 978 andrew gelman stats-2011-10-28-Cool job opening with brilliant researchers at Yahoo

13 0.55418229 714 andrew gelman stats-2011-05-16-NYT Labs releases Openpaths, a utility for saving your iphone data

14 0.55143493 1276 andrew gelman stats-2012-04-22-“Gross misuse of statistics” can be a good thing, if it indicates the acceptance of the importance of statistical reasoning

15 0.54902345 2307 andrew gelman stats-2014-04-27-Big Data…Big Deal? Maybe, if Used with Caution.

16 0.54552782 298 andrew gelman stats-2010-09-27-Who is that masked person: The use of face masks on Mexico City public transportation during the Influenza A (H1N1) outbreak

17 0.54360557 2043 andrew gelman stats-2013-09-29-The difficulties of measuring just about anything

18 0.54049939 830 andrew gelman stats-2011-07-29-Introductory overview lectures at the Joint Statistical Meetings in Miami this coming week

19 0.53657842 1920 andrew gelman stats-2013-06-30-“Non-statistical” statistics tools

20 0.53291464 1525 andrew gelman stats-2012-10-08-Ethical standards in different data communities


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(9, 0.026), (11, 0.103), (13, 0.019), (15, 0.014), (16, 0.082), (21, 0.019), (24, 0.107), (28, 0.02), (54, 0.02), (55, 0.029), (57, 0.019), (68, 0.017), (76, 0.019), (77, 0.081), (84, 0.014), (95, 0.042), (99, 0.265)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.96544874 378 andrew gelman stats-2010-10-28-World Economic Forum Data Visualization Challenge

2 0.93567783 1386 andrew gelman stats-2012-06-21-Belief in hell is associated with lower crime rates

Introduction: I remember attending a talk a few years ago by my political science colleague John Huber in which he discussed cross-national comparisons of religious attitudes. One thing I remember is that the U.S. is highly religious, another thing I remembered is that lots more Americans believe in heaven than believe in hell. Some of this went into Red State Blue State—not the heaven/hell thing, but the graph of religiosity vs. GDP: and the corresponding graph of religious attendance vs. GDP for U.S. states: Also we learned that, at the individual level, the correlation of religious attendance with income is zero (according to survey reports, rich Americans are neither more nor less likely than poor Americans to go to church regularly): while the correlation of prayer with income is strongly negative (poor Americans are much more likely than rich Americans to regularly pray): Anyway, with all this, I was primed to be interested in a recent study by psychologist

3 0.93145829 458 andrew gelman stats-2010-12-08-Blogging: Is it “fair use”?

Introduction: Dave Kane writes: I [Kane] am involved in a dispute relating to whether or not a blog can be considered part of one’s academic writing. Williams College restricts the use of undergraduate theses as follows: Non-commercial, academic use within the scope of “Fair Use” standards is acceptable. Otherwise, you may not copy or distribute any content without the permission of the copyright holder. Seems obvious enough. Yet some folks think that my use of thesis material in a blog post fails this test because it is not “academic.” See this post for the gory details. Parenthetically, your readers might be interested in the substantive discovery here, the details of the Williams admissions process (which is probably very similar to Columbia’s). Williams places students into academic rating (AR) categories as follows: verbal math composite SAT II ACT AP AR 1: 770-800 750-800 1520-1600 750-800 35-36 mostly 5s AR 2: 730-770 720-750 1450-1520 720-770 33-34 4s an

4 0.92250133 598 andrew gelman stats-2011-03-03-Is Harvard hurting poor kids by cutting tuition for the upper middle class?

Introduction: Timothy Noah reports: At the end of 2007, Harvard announced that it would limit tuition to no more than 10 percent of family income for families earning up to $180,000. (It also eliminated all loans, following a trail blazed by Princeton, and stopped including home equity in its calculations of family wealth.) Yale saw and raised to $200,000, and other wealthy colleges weighed in with variations. Noah argues that this is a bad thing because it encourages other colleges to give tuition breaks to families with six-figure incomes, thus sucking up money that could otherwise go to reduce tuition for lower-income students. For example: Roger Lehecka, a former dean of students at Columbia, and Andrew Delbanco, director of American studies there, wrote in the New York Times that Harvard’s initiative was “good news for students at Harvard or Yale” but “bad news” for everyone else. “The problem,” they explained, “is that most colleges will feel compelled to follow Harvard and Yale’s

5 0.92020792 978 andrew gelman stats-2011-10-28-Cool job opening with brilliant researchers at Yahoo

6 0.91624796 382 andrew gelman stats-2010-10-30-“Presidential Election Outcomes Directly Influence Suicide Rates”

7 0.9153887 1620 andrew gelman stats-2012-12-12-“Teaching effectiveness” as another dimension in cognitive ability

8 0.91395152 1684 andrew gelman stats-2013-01-20-Ugly ugly ugly

9 0.91364282 93 andrew gelman stats-2010-06-17-My proposal for making college admissions fairer

10 0.91337979 562 andrew gelman stats-2011-02-06-Statistician cracks Toronto lottery

11 0.91165787 1387 andrew gelman stats-2012-06-21-Will Tiger Woods catch Jack Nicklaus? And a discussion of the virtues of using continuous data even if your goal is discrete prediction

12 0.91117734 1784 andrew gelman stats-2013-04-01-Wolfram on Mandelbrot

13 0.90829253 1466 andrew gelman stats-2012-08-22-The scaled inverse Wishart prior distribution for a covariance matrix in a hierarchical model

14 0.90787464 2007 andrew gelman stats-2013-09-03-Popper and Jaynes

15 0.90748763 1604 andrew gelman stats-2012-12-04-An epithet I can live with

16 0.90737575 1462 andrew gelman stats-2012-08-18-Standardizing regression inputs

17 0.90711403 2054 andrew gelman stats-2013-10-07-Bing is preferred to Google by people who aren’t like me

18 0.90707463 428 andrew gelman stats-2010-11-24-Flawed visualization of U.S. voting maybe has some good features

19 0.90631688 1373 andrew gelman stats-2012-06-09-Cognitive psychology research helps us understand confusion of Jonathan Haidt and others about working-class voters

20 0.90629405 216 andrew gelman stats-2010-08-18-More forecasting competitions