hunch_net hunch_net-2005 hunch_net-2005-132 knowledge-graph by maker-knowledge-mining

132 hunch net-2005-11-26-The Design of an Optimal Research Environment


meta infos for this blog

Source: html

Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. [sent-11, score-0.886]

2 One significant part of stability is financial stability of the parent organization. [sent-16, score-0.501]

3 The nature of research implies that what makes ‘funding sense’ depends on the size of the set of people who can benefit from it. [sent-17, score-0.509]

4 University-based research is relatively terrible about this while industrial labs vary widely. [sent-20, score-0.82]

5 Running the university requires very significant time and energy. [sent-22, score-0.424]

6 Writing grant proposals for funding requires very significant time and energy (success in grant writing is typically a necessity to achieve tenure at US research universities). [sent-23, score-0.731]

7 Advising students well requires significant time and energy. [sent-24, score-0.415]

8 In industrial labs (think of “IBM research” as an example), there is an tendency to shift research towards short term rewards. [sent-26, score-0.754]

9 This is partly becaue the research labs have not proved very capable of using the big successes and partly because even large businesses have ups and downs. [sent-27, score-0.641]

10 During a ‘down’ time, it is very tempting to use the reserve of very capable people in a research lab for short duration projects. [sent-28, score-0.481]

11 One thing to understand about research is that it is not the case that a person with half as much free time can produce half as much work. [sent-29, score-0.627]

12 Problem exposure The set of research problems which people could work on is much larger than the set of research problems which are plausibly useful to the rest of the world (*). [sent-31, score-0.98]

13 This is true, but there are several important caveats: The “hit rate” (in terms of big impact on everyones lives) for unmotivated research is much lower. [sent-37, score-0.468]

14 University-based research is may do this best via giving the researchers the ability to form their own companies with partial ownership. [sent-47, score-0.453]

15 In the context of machine learning, spam filtering is of obvious and significant common use, yet a researcher with a better spam filter can not easily create a company around the method. [sent-50, score-0.685]

16 The benefit structure in industrial research labs is (perhaps) often even worse. [sent-59, score-0.929]

17 University style research partially achieves this (a notable complaint of students is that they do not have enough freedom). [sent-78, score-0.443]

18 Research students are of course much more common on university campuses, although there are sometimes government funded research labs such as MPI in Germany and NICTA in Australia. [sent-88, score-0.909]

19 The effort of explaining a new topic of research often aids research via simplification and consideration in new contexts. [sent-92, score-0.695]

20 The internet has greatly aided concentration because some significant benefit can be derived even from people who are not physically near to each other. [sent-99, score-0.621]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('research', 0.316), ('labs', 0.254), ('stability', 0.19), ('spam', 0.184), ('industrial', 0.184), ('concentration', 0.137), ('researchers', 0.137), ('university', 0.136), ('students', 0.127), ('significant', 0.121), ('exposure', 0.114), ('filtering', 0.113), ('development', 0.106), ('benefit', 0.104), ('barrier', 0.103), ('greatly', 0.099), ('appointment', 0.092), ('devolution', 0.092), ('people', 0.089), ('universities', 0.087), ('requires', 0.084), ('company', 0.083), ('time', 0.083), ('salary', 0.082), ('demands', 0.082), ('success', 0.079), ('duration', 0.076), ('manager', 0.076), ('unmotivated', 0.076), ('free', 0.076), ('much', 0.076), ('kind', 0.074), ('even', 0.071), ('student', 0.071), ('google', 0.07), ('power', 0.069), ('plausibly', 0.069), ('fire', 0.068), ('delayed', 0.068), ('anyone', 0.068), ('system', 0.066), ('relatively', 0.066), ('tenure', 0.066), ('freedom', 0.066), ('linux', 0.066), ('professor', 0.066), ('wikipedia', 0.066), ('topic', 0.063), ('necessity', 0.061), ('overhead', 0.061)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0000013 132 hunch net-2005-11-26-The Design of an Optimal Research Environment

Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese

2 0.35648766 344 hunch net-2009-02-22-Effective Research Funding

Introduction: With a worldwide recession on, my impression is that the carnage in research has not been as severe as might be feared, at least in the United States. I know of two notable negative impacts: It’s quite difficult to get a job this year, as many companies and universities simply aren’t hiring. This is particularly tough on graduating students. Perhaps 10% of IBM research was fired. In contrast, around the time of the dot com bust, ATnT Research and Lucent had one or several 50% size firings wiping out much of the remainder of Bell Labs , triggering a notable diaspora for the respected machine learning group there. As the recession progresses, we may easily see more firings as companies in particular reach a point where they can no longer support research. There are a couple positives to the recession as well. Both the implosion of Wall Street (which siphoned off smart people) and the general difficulty of getting a job coming out of an undergraduate education s

3 0.25794673 36 hunch net-2005-03-05-Funding Research

Introduction: The funding of research (and machine learning research) is an issue which seems to have become more significant in the United States over the last decade. The word “research” is applied broadly here to science, mathematics, and engineering. There are two essential difficulties with funding research: Longshot Paying a researcher is often a big gamble. Most research projects don’t pan out, but a few big payoffs can make it all worthwhile. Information Only Much of research is about finding the right way to think about or do something. The Longshot difficulty means that there is high variance in payoffs. This can be compensated for by funding many different research projects, reducing variance. The Information-Only difficulty means that it’s hard to extract a profit directly from many types of research, so companies have difficulty justifying basic research. (Patents are a mechanism for doing this. They are often extraordinarily clumsy or simply not applicable.) T

4 0.25735661 121 hunch net-2005-10-12-The unrealized potential of the research lab

Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s

5 0.20718105 22 hunch net-2005-02-18-What it means to do research.

Introduction: I want to try to describe what doing research means, especially from the point of view of an undergraduate. The shift from a class-taking mentality to a research mentality is very significant and not easy. Problem Posing Posing the right problem is often as important as solving them. Many people can get by in research by solving problems others have posed, but that’s not sufficient for really inspiring research. For learning in particular, there is a strong feeling that we just haven’t figured out which questions are the right ones to ask. You can see this, because the answers we have do not seem convincing. Gambling your life When you do research, you think very hard about new ways of solving problems, new problems, and new solutions. Many conversations are of the form “I wonder what would happen if…” These processes can be short (days or weeks) or years-long endeavours. The worst part is that you’ll only know if you were succesful at the end of the process (and some

6 0.18899602 223 hunch net-2006-12-06-The Spam Problem

7 0.18836804 110 hunch net-2005-09-10-“Failure” is an option

8 0.1733193 464 hunch net-2012-05-03-Microsoft Research, New York City

9 0.17263129 51 hunch net-2005-04-01-The Producer-Consumer Model of Research

10 0.16904825 358 hunch net-2009-06-01-Multitask Poisoning

11 0.16601463 48 hunch net-2005-03-29-Academic Mechanism Design

12 0.16575162 435 hunch net-2011-05-16-Research Directions for Machine Learning and Algorithms

13 0.16494179 378 hunch net-2009-11-15-The Other Online Learning

14 0.16364135 445 hunch net-2011-09-28-Somebody’s Eating Your Lunch

15 0.16308658 208 hunch net-2006-09-18-What is missing for online collaborative research?

16 0.16115201 370 hunch net-2009-09-18-Necessary and Sufficient Research

17 0.15925848 134 hunch net-2005-12-01-The Webscience Future

18 0.15401709 154 hunch net-2006-02-04-Research Budget Changes

19 0.15168729 297 hunch net-2008-04-22-Taking the next step

20 0.14414674 73 hunch net-2005-05-17-A Short Guide to PhD Graduate Study


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.349), (1, -0.065), (2, -0.177), (3, 0.244), (4, -0.217), (5, -0.037), (6, 0.073), (7, 0.116), (8, -0.052), (9, 0.112), (10, 0.036), (11, 0.009), (12, -0.066), (13, -0.046), (14, 0.042), (15, 0.053), (16, -0.065), (17, 0.028), (18, -0.077), (19, 0.012), (20, -0.075), (21, -0.019), (22, 0.004), (23, 0.029), (24, 0.028), (25, 0.011), (26, -0.035), (27, 0.033), (28, -0.048), (29, -0.074), (30, 0.025), (31, -0.087), (32, -0.017), (33, 0.04), (34, -0.006), (35, 0.018), (36, 0.029), (37, -0.024), (38, 0.024), (39, -0.008), (40, 0.032), (41, -0.056), (42, -0.044), (43, -0.092), (44, -0.015), (45, -0.071), (46, -0.034), (47, 0.025), (48, -0.084), (49, 0.012)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.9815836 132 hunch net-2005-11-26-The Design of an Optimal Research Environment

Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese

2 0.93296081 344 hunch net-2009-02-22-Effective Research Funding

Introduction: With a worldwide recession on, my impression is that the carnage in research has not been as severe as might be feared, at least in the United States. I know of two notable negative impacts: It’s quite difficult to get a job this year, as many companies and universities simply aren’t hiring. This is particularly tough on graduating students. Perhaps 10% of IBM research was fired. In contrast, around the time of the dot com bust, ATnT Research and Lucent had one or several 50% size firings wiping out much of the remainder of Bell Labs , triggering a notable diaspora for the respected machine learning group there. As the recession progresses, we may easily see more firings as companies in particular reach a point where they can no longer support research. There are a couple positives to the recession as well. Both the implosion of Wall Street (which siphoned off smart people) and the general difficulty of getting a job coming out of an undergraduate education s

3 0.87082595 36 hunch net-2005-03-05-Funding Research

Introduction: The funding of research (and machine learning research) is an issue which seems to have become more significant in the United States over the last decade. The word “research” is applied broadly here to science, mathematics, and engineering. There are two essential difficulties with funding research: Longshot Paying a researcher is often a big gamble. Most research projects don’t pan out, but a few big payoffs can make it all worthwhile. Information Only Much of research is about finding the right way to think about or do something. The Longshot difficulty means that there is high variance in payoffs. This can be compensated for by funding many different research projects, reducing variance. The Information-Only difficulty means that it’s hard to extract a profit directly from many types of research, so companies have difficulty justifying basic research. (Patents are a mechanism for doing this. They are often extraordinarily clumsy or simply not applicable.) T

4 0.85997033 121 hunch net-2005-10-12-The unrealized potential of the research lab

Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s

5 0.8169257 110 hunch net-2005-09-10-“Failure” is an option

Introduction: This is about the hard choices that graduate students must make. The cultural definition of success in academic research is to: Produce good research which many other people appreciate. Produce many students who go on to do the same. There are fundamental reasons why this is success in the local culture. Good research appreciated by others means access to jobs. Many students succesful in the same way implies that there are a number of people who think in a similar way and appreciate your work. In order to graduate, a phd student must live in an academic culture for a period of several years. It is common to adopt the culture’s definition of success during this time. It’s also common for many phd students discover they are not suited to an academic research lifestyle. This collision of values and abilities naturally results in depression. The most fundamental advice when this happens is: change something. Pick a new advisor. Pick a new research topic. Or leave th

6 0.81443971 76 hunch net-2005-05-29-Bad ideas

7 0.79779822 51 hunch net-2005-04-01-The Producer-Consumer Model of Research

8 0.7902993 449 hunch net-2011-11-26-Giving Thanks

9 0.7896955 255 hunch net-2007-07-13-The View From China

10 0.77934378 73 hunch net-2005-05-17-A Short Guide to PhD Graduate Study

11 0.77725774 48 hunch net-2005-03-29-Academic Mechanism Design

12 0.77500355 358 hunch net-2009-06-01-Multitask Poisoning

13 0.76281071 464 hunch net-2012-05-03-Microsoft Research, New York City

14 0.75375247 142 hunch net-2005-12-22-Yes , I am applying

15 0.75355583 241 hunch net-2007-04-28-The Coming Patent Apocalypse

16 0.73820114 22 hunch net-2005-02-18-What it means to do research.

17 0.72334146 154 hunch net-2006-02-04-Research Budget Changes

18 0.70554423 295 hunch net-2008-04-12-It Doesn’t Stop

19 0.69884533 106 hunch net-2005-09-04-Science in the Government

20 0.69716215 208 hunch net-2006-09-18-What is missing for online collaborative research?


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(10, 0.021), (27, 0.197), (30, 0.147), (38, 0.033), (49, 0.029), (51, 0.022), (53, 0.068), (55, 0.109), (64, 0.011), (68, 0.018), (84, 0.034), (94, 0.097), (95, 0.097)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.95982009 455 hunch net-2012-02-20-Berkeley Streaming Data Workshop

Introduction: The From Data to Knowledge workshop May 7-11 at Berkeley should be of interest to the many people encountering streaming data in different disciplines. It’s run by a group of astronomers who encounter streaming data all the time. I met Josh Bloom recently and he is broadly interested in a workshop covering all aspects of Machine Learning on streaming data. The hope here is that techniques developed in one area turn out useful in another which seems quite plausible. Particularly if you are in the bay area, consider checking it out.

2 0.95324665 444 hunch net-2011-09-07-KDD and MUCMD 2011

Introduction: At KDD I enjoyed Stephen Boyd ‘s invited talk about optimization quite a bit. However, the most interesting talk for me was David Haussler ‘s. His talk started out with a formidable load of biological complexity. About half-way through you start wondering, “can this be used to help with cancer?” And at the end he connects it directly to use with a call to arms for the audience: cure cancer. The core thesis here is that cancer is a complex set of diseases which can be distentangled via genetic assays, allowing attacking the specific signature of individual cancers. However, the data quantity and complex dependencies within the data require systematic and relatively automatic prediction and analysis algorithms of the kind that we are best familiar with. Some of the papers which interested me are: Kai-Wei Chang and Dan Roth , Selective Block Minimization for Faster Convergence of Limited Memory Large-Scale Linear Models , which is about effectively using a hard-example

3 0.94670707 364 hunch net-2009-07-11-Interesting papers at KDD

Introduction: I attended KDD this year. The conference has always had a strong grounding in what works based on the KDDcup , but it has developed a halo of workshops on various subjects. It seems that KDD has become a place where the economy meets machine learning in a stronger sense than many other conferences. There were several papers that other people might like to take a look at. Yehuda Koren Collaborative Filtering with Temporal Dynamics . This paper describes how to incorporate temporal dynamics into a couple of collaborative filtering approaches. This was also a best paper award. D. Sculley , Robert Malkin, Sugato Basu , Roberto J. Bayardo , Predicting Bounce Rates in Sponsored Search Advertisements . The basic claim of this paper is that the probability people immediately leave (“bounce”) after clicking on an advertisement is predictable. Frank McSherry and Ilya Mironov Differentially Private Recommender Systems: Building Privacy into the Netflix Prize Contende

4 0.93438512 154 hunch net-2006-02-04-Research Budget Changes

Introduction: The announcement of an increase in funding for basic research in the US is encouraging. There is some discussion of this at the Computing Research Policy blog. One part of this discussion has a graph of NSF funding over time, presumably in dollar budgets. I don’t believe that dollar budgets are the right way to judge the impact of funding changes on researchers. A better way to judge seems to be in terms of dollar budget divided by GDP which provides a measure of the relative emphasis on research. This graph was assembled by dividing the NSF budget by the US GDP . For 2005 GDP, I used the current estimate and for 2006 and 2007 assumed an increase by a factor of 1.04 per year. The 2007 number also uses the requested 2007 budget which is certain to change. This graph makes it clear why researchers were upset: research funding emphasis has fallen for 3 years in a row. The reality has been significantly more severe due to DARPA decreasing funding and industrial

same-blog 5 0.92518854 132 hunch net-2005-11-26-The Design of an Optimal Research Environment

Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese

6 0.90320915 189 hunch net-2006-07-05-more icml papers

7 0.8807556 85 hunch net-2005-06-28-A COLT paper

8 0.84898788 464 hunch net-2012-05-03-Microsoft Research, New York City

9 0.83996511 466 hunch net-2012-06-05-ICML acceptance statistics

10 0.83928537 343 hunch net-2009-02-18-Decision by Vetocracy

11 0.83913219 301 hunch net-2008-05-23-Three levels of addressing the Netflix Prize

12 0.83618838 36 hunch net-2005-03-05-Funding Research

13 0.83378309 360 hunch net-2009-06-15-In Active Learning, the question changes

14 0.83085525 225 hunch net-2007-01-02-Retrospective

15 0.83048594 105 hunch net-2005-08-23-(Dis)similarities between academia and open source programmers

16 0.83030212 286 hunch net-2008-01-25-Turing’s Club for Machine Learning

17 0.8301875 297 hunch net-2008-04-22-Taking the next step

18 0.82875746 344 hunch net-2009-02-22-Effective Research Funding

19 0.82854503 437 hunch net-2011-07-10-ICML 2011 and the future

20 0.82839155 359 hunch net-2009-06-03-Functionally defined Nonlinear Dynamic Models