hunch_net hunch_net-2005 hunch_net-2005-121 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s
sentIndex sentText sentNum sentScore
1 IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. [sent-2, score-1.432]
2 This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. [sent-3, score-0.95]
3 ) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. [sent-5, score-0.264]
4 Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. [sent-6, score-0.141]
5 ) Relational databases and SQL were invented and then languished. [sent-8, score-0.206]
6 An early lead in IP networking hardware did not result in IBM growing a Cisco division . [sent-11, score-0.27]
7 And remember … IBM research is a stark success story compared to it’s competitors. [sent-13, score-0.402]
8 Why is there such a pervasive failure to recognize the really big things when they come along in a research lab? [sent-14, score-0.337]
9 This is a huge by capitolization standards: several ideas created in IBM labs have resulted in companies with a stock market valuation greater than IBM. [sent-15, score-0.354]
10 This is also of fundamental concern to researchers everywhere, because that failure is much of the reason why research is chronically underfunded. [sent-16, score-0.592]
11 A reasonable argument is “it’s much harder to predict the big ones in advance than in hindsight”. [sent-17, score-0.188]
12 Too many ideas have succeeded because someone else outside of the orignal company recognized the value of the idea and exploited it. [sent-19, score-0.547]
13 There is no fundamental reason why VCs should have an inherent advantage over the company running a research lab at recognizing good ideas within it’s own research lab. [sent-20, score-1.548]
14 In particular, people who invent something within a research lab have little personal incentive in seeing it’s potential realized so they fail to pursue it as vigorously as they might in a startup setting. [sent-22, score-1.022]
15 This is a very reasonable argument: incentives for success at a research lab are typically a low percentage of base salary while startup founders have been known to become billionaires. [sent-23, score-1.151]
16 A third argument is that researchers at a research lab are likely to find new and better ways of doing things that the company already does. [sent-24, score-1.179]
17 This creates a problem because a large fraction of the company is specifically invested in the current-to-become-old way of doing things. [sent-25, score-0.238]
18 When a debate happens, it is always easy to find the faults and drawbacks of the new method while ignoring the accepted faults of the old (this is common to new directions in research as well). [sent-26, score-0.641]
19 None of these three reasons are fundamentally unremovable, and it seems plausible that the first industrial research lab to remove these obstacles will reap huge benefits. [sent-27, score-1.027]
20 My current candidates for ‘most likely to remove barriers’ are google and NICTA . [sent-28, score-0.228]
wordName wordTfidf (topN-words)
[('lab', 0.422), ('ibm', 0.419), ('research', 0.264), ('company', 0.238), ('faults', 0.159), ('invented', 0.141), ('division', 0.141), ('startup', 0.141), ('relational', 0.131), ('argument', 0.119), ('incentives', 0.117), ('labs', 0.109), ('ideas', 0.109), ('industrial', 0.105), ('remove', 0.1), ('failure', 0.073), ('within', 0.071), ('huge', 0.071), ('chronically', 0.07), ('stark', 0.07), ('exploited', 0.07), ('anniversary', 0.07), ('networking', 0.07), ('parent', 0.07), ('salary', 0.07), ('sql', 0.07), ('reasonable', 0.069), ('likely', 0.069), ('success', 0.068), ('researchers', 0.067), ('recognized', 0.065), ('resulted', 0.065), ('nicta', 0.065), ('succeeded', 0.065), ('databases', 0.065), ('obstacles', 0.065), ('unremovable', 0.065), ('intel', 0.062), ('recognizing', 0.062), ('barriers', 0.062), ('invent', 0.062), ('ip', 0.062), ('muthu', 0.062), ('pursue', 0.062), ('fundamental', 0.06), ('ignoring', 0.059), ('everywhere', 0.059), ('candidates', 0.059), ('hardware', 0.059), ('reason', 0.058)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999964 121 hunch net-2005-10-12-The unrealized potential of the research lab
Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s
2 0.25735661 132 hunch net-2005-11-26-The Design of an Optimal Research Environment
Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese
3 0.18906242 344 hunch net-2009-02-22-Effective Research Funding
Introduction: With a worldwide recession on, my impression is that the carnage in research has not been as severe as might be feared, at least in the United States. I know of two notable negative impacts: It’s quite difficult to get a job this year, as many companies and universities simply aren’t hiring. This is particularly tough on graduating students. Perhaps 10% of IBM research was fired. In contrast, around the time of the dot com bust, ATnT Research and Lucent had one or several 50% size firings wiping out much of the remainder of Bell Labs , triggering a notable diaspora for the respected machine learning group there. As the recession progresses, we may easily see more firings as companies in particular reach a point where they can no longer support research. There are a couple positives to the recession as well. Both the implosion of Wall Street (which siphoned off smart people) and the general difficulty of getting a job coming out of an undergraduate education s
4 0.17582314 464 hunch net-2012-05-03-Microsoft Research, New York City
Introduction: Yahoo! laid off people . Unlike every previous time there have been layoffs, this is serious for Yahoo! Research . We had advanced warning from Prabhakar through the simple act of leaving . Yahoo! Research was a world class organization that Prabhakar recruited much of personally, so it is deeply implausible that he would spontaneously decide to leave. My first thought when I saw the news was “Uhoh, Rob said that he knew it was serious when the head of ATnT Research left.” In this case it was even more significant, because Prabhakar recruited me on the premise that Y!R was an experiment in how research should be done: via a combination of high quality people and high engagement with the company. Prabhakar’s departure is a clear end to that experiment. The result is ambiguous from a business perspective. Y!R clearly was not capable of saving the company from its illnesses. I’m not privy to the internal accounting of impact and this is the kind of subject where there c
5 0.17295082 36 hunch net-2005-03-05-Funding Research
Introduction: The funding of research (and machine learning research) is an issue which seems to have become more significant in the United States over the last decade. The word “research” is applied broadly here to science, mathematics, and engineering. There are two essential difficulties with funding research: Longshot Paying a researcher is often a big gamble. Most research projects don’t pan out, but a few big payoffs can make it all worthwhile. Information Only Much of research is about finding the right way to think about or do something. The Longshot difficulty means that there is high variance in payoffs. This can be compensated for by funding many different research projects, reducing variance. The Information-Only difficulty means that it’s hard to extract a profit directly from many types of research, so companies have difficulty justifying basic research. (Patents are a mechanism for doing this. They are often extraordinarily clumsy or simply not applicable.) T
6 0.11319859 110 hunch net-2005-09-10-“Failure” is an option
7 0.112422 51 hunch net-2005-04-01-The Producer-Consumer Model of Research
8 0.10342235 175 hunch net-2006-04-30-John Langford –> Yahoo Research, NY
9 0.1002062 449 hunch net-2011-11-26-Giving Thanks
10 0.093490355 421 hunch net-2011-01-03-Herman Goldstine 2011
11 0.09262637 22 hunch net-2005-02-18-What it means to do research.
12 0.088091105 208 hunch net-2006-09-18-What is missing for online collaborative research?
13 0.084234893 134 hunch net-2005-12-01-The Webscience Future
14 0.08320336 194 hunch net-2006-07-11-New Models
15 0.082033023 229 hunch net-2007-01-26-Parallel Machine Learning Problems
16 0.081337787 76 hunch net-2005-05-29-Bad ideas
17 0.08025831 151 hunch net-2006-01-25-1 year
18 0.077717893 370 hunch net-2009-09-18-Necessary and Sufficient Research
19 0.07671243 96 hunch net-2005-07-21-Six Months
20 0.0741143 255 hunch net-2007-07-13-The View From China
topicId topicWeight
[(0, 0.161), (1, -0.043), (2, -0.105), (3, 0.151), (4, -0.147), (5, -0.033), (6, 0.047), (7, 0.082), (8, -0.045), (9, 0.063), (10, 0.038), (11, 0.027), (12, -0.036), (13, -0.051), (14, -0.005), (15, 0.069), (16, -0.032), (17, 0.046), (18, -0.045), (19, -0.085), (20, -0.014), (21, -0.014), (22, 0.008), (23, 0.047), (24, -0.025), (25, 0.027), (26, -0.004), (27, 0.004), (28, -0.087), (29, -0.125), (30, 0.091), (31, -0.055), (32, 0.012), (33, 0.064), (34, -0.031), (35, -0.032), (36, 0.036), (37, 0.061), (38, -0.076), (39, -0.028), (40, 0.036), (41, -0.022), (42, -0.034), (43, -0.003), (44, -0.065), (45, -0.019), (46, 0.011), (47, 0.058), (48, -0.012), (49, 0.064)]
simIndex simValue blogId blogTitle
same-blog 1 0.97799349 121 hunch net-2005-10-12-The unrealized potential of the research lab
Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s
2 0.82540506 344 hunch net-2009-02-22-Effective Research Funding
Introduction: With a worldwide recession on, my impression is that the carnage in research has not been as severe as might be feared, at least in the United States. I know of two notable negative impacts: It’s quite difficult to get a job this year, as many companies and universities simply aren’t hiring. This is particularly tough on graduating students. Perhaps 10% of IBM research was fired. In contrast, around the time of the dot com bust, ATnT Research and Lucent had one or several 50% size firings wiping out much of the remainder of Bell Labs , triggering a notable diaspora for the respected machine learning group there. As the recession progresses, we may easily see more firings as companies in particular reach a point where they can no longer support research. There are a couple positives to the recession as well. Both the implosion of Wall Street (which siphoned off smart people) and the general difficulty of getting a job coming out of an undergraduate education s
3 0.81461698 132 hunch net-2005-11-26-The Design of an Optimal Research Environment
Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese
4 0.81272531 36 hunch net-2005-03-05-Funding Research
Introduction: The funding of research (and machine learning research) is an issue which seems to have become more significant in the United States over the last decade. The word “research” is applied broadly here to science, mathematics, and engineering. There are two essential difficulties with funding research: Longshot Paying a researcher is often a big gamble. Most research projects don’t pan out, but a few big payoffs can make it all worthwhile. Information Only Much of research is about finding the right way to think about or do something. The Longshot difficulty means that there is high variance in payoffs. This can be compensated for by funding many different research projects, reducing variance. The Information-Only difficulty means that it’s hard to extract a profit directly from many types of research, so companies have difficulty justifying basic research. (Patents are a mechanism for doing this. They are often extraordinarily clumsy or simply not applicable.) T
5 0.78329623 51 hunch net-2005-04-01-The Producer-Consumer Model of Research
Introduction: In the quest to understand what good reviewing is, perhaps it’s worthwhile to think about what good research is. One way to think about good research is in terms of a producer/consumer model. In the producer/consumer model of research, for any element of research there are producers (authors and coauthors of papers, for example) and consumers (people who use the papers to make new papers or code solving problems). An produced bit of research is judged as “good” if it is used by many consumers. There are two basic questions which immediately arise: Is this a good model of research? Are there alternatives? The producer/consumer model has some difficulties which can be (partially) addressed. Disconnect. A group of people doing research on some subject may become disconnected from the rest of the world. Each person uses the research of other people in the group so it appears good research is being done, but the group has no impact on the rest of the world. One way
6 0.7584154 464 hunch net-2012-05-03-Microsoft Research, New York City
7 0.75793719 76 hunch net-2005-05-29-Bad ideas
8 0.72756398 255 hunch net-2007-07-13-The View From China
9 0.724204 175 hunch net-2006-04-30-John Langford –> Yahoo Research, NY
10 0.66766763 241 hunch net-2007-04-28-The Coming Patent Apocalypse
11 0.66641229 154 hunch net-2006-02-04-Research Budget Changes
12 0.6568464 110 hunch net-2005-09-10-“Failure” is an option
13 0.64372909 449 hunch net-2011-11-26-Giving Thanks
14 0.62629837 142 hunch net-2005-12-22-Yes , I am applying
15 0.60823864 48 hunch net-2005-03-29-Academic Mechanism Design
16 0.59871912 73 hunch net-2005-05-17-A Short Guide to PhD Graduate Study
17 0.59046257 208 hunch net-2006-09-18-What is missing for online collaborative research?
18 0.58797312 22 hunch net-2005-02-18-What it means to do research.
19 0.57324874 193 hunch net-2006-07-09-The Stock Prediction Machine Learning Problem
20 0.56386805 358 hunch net-2009-06-01-Multitask Poisoning
topicId topicWeight
[(3, 0.021), (10, 0.048), (27, 0.139), (37, 0.017), (38, 0.036), (53, 0.058), (55, 0.081), (68, 0.013), (77, 0.011), (84, 0.361), (94, 0.061), (95, 0.073)]
simIndex simValue blogId blogTitle
1 0.96052974 467 hunch net-2012-06-15-Normal Deviate and the UCSC Machine Learning Summer School
Introduction: Larry Wasserman has started the Normal Deviate blog which I added to the blogroll on the right. Manfred Warmuth points out the UCSC machine learning summer school running July 9-20 which may be of particular interest to those in silicon valley.
same-blog 2 0.89218462 121 hunch net-2005-10-12-The unrealized potential of the research lab
Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s
3 0.87678474 383 hunch net-2009-12-09-Inherent Uncertainty
Introduction: I’d like to point out Inherent Uncertainty , which I’ve added to the ML blog post scanner on the right. My understanding from Jake is that the intention is to have a multiauthor blog which is more specialized towards learning theory/game theory than this one. Nevertheless, several of the posts seem to be of wider interest.
4 0.82364446 200 hunch net-2006-08-03-AOL’s data drop
Introduction: AOL has released several large search engine related datasets. This looks like a pretty impressive data release, and it is a big opportunity for people everywhere to worry about search engine related learning problems, if they want.
5 0.81066573 411 hunch net-2010-09-21-Regretting the dead
Introduction: Nikos pointed out this new york times article about poor clinical design killing people . For those of us who study learning from exploration information this is a reminder that low regret algorithms are particularly important, as regret in clinical trials is measured by patient deaths. Two obvious improvements on the experimental design are: With reasonable record keeping of existing outcomes for the standard treatments, there is no need to explicitly assign people to a control group with the standard treatment, as that approach is effectively explored with great certainty. Asserting otherwise would imply that the nature of effective treatments for cancer has changed between now and a year ago, which denies the value of any clinical trial. An optimal experimental design will smoothly phase between exploration and exploitation as evidence for a new treatment shows that it can be effective. This is old tech, for example in the EXP3.P algorithm (page 12 aka 59) although
6 0.76737344 142 hunch net-2005-12-22-Yes , I am applying
7 0.7159341 219 hunch net-2006-11-22-Explicit Randomization in Learning algorithms
8 0.58783776 337 hunch net-2009-01-21-Nearly all natural problems require nonlinearity
9 0.54546571 464 hunch net-2012-05-03-Microsoft Research, New York City
10 0.51403069 132 hunch net-2005-11-26-The Design of an Optimal Research Environment
11 0.50225276 36 hunch net-2005-03-05-Funding Research
12 0.49076137 329 hunch net-2008-11-28-A Bumper Crop of Machine Learning Graduates
13 0.49053761 314 hunch net-2008-08-24-Mass Customized Medicine in the Future?
14 0.46720701 454 hunch net-2012-01-30-ICML Posters and Scope
15 0.46640134 343 hunch net-2009-02-18-Decision by Vetocracy
16 0.46611017 281 hunch net-2007-12-21-Vowpal Wabbit Code Release
17 0.46526092 437 hunch net-2011-07-10-ICML 2011 and the future
18 0.46446458 260 hunch net-2007-08-25-The Privacy Problem
19 0.46324539 466 hunch net-2012-06-05-ICML acceptance statistics
20 0.46151245 207 hunch net-2006-09-12-Incentive Compatible Reviewing