hunch_net hunch_net-2012 hunch_net-2012-464 knowledge-graph by maker-knowledge-mining

464 hunch net-2012-05-03-Microsoft Research, New York City


meta infos for this blog

Source: html

Introduction: Yahoo! laid off people . Unlike every previous time there have been layoffs, this is serious for Yahoo! Research . We had advanced warning from Prabhakar through the simple act of leaving . Yahoo! Research was a world class organization that Prabhakar recruited much of personally, so it is deeply implausible that he would spontaneously decide to leave. My first thought when I saw the news was “Uhoh, Rob said that he knew it was serious when the head of ATnT Research left.” In this case it was even more significant, because Prabhakar recruited me on the premise that Y!R was an experiment in how research should be done: via a combination of high quality people and high engagement with the company. Prabhakar’s departure is a clear end to that experiment. The result is ambiguous from a business perspective. Y!R clearly was not capable of saving the company from its illnesses. I’m not privy to the internal accounting of impact and this is the kind of subject where there c


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Research was a world class organization that Prabhakar recruited much of personally, so it is deeply implausible that he would spontaneously decide to leave. [sent-7, score-0.41]

2 R was an experiment in how research should be done: via a combination of high quality people and high engagement with the company. [sent-10, score-0.333]

3 R clearly was not capable of saving the company from its illnesses. [sent-14, score-0.212]

4 It turns out that talking to the rest of the organization doing consulting, architecting, and prototyping on a minority basis helps research by sharpening the questions you ask more than it hinders by taking up time. [sent-22, score-0.429]

5 Consequently, the abrupt departure of Prabhakar and an apparent lack of appreciation by the new CEO created a crisis of confidence. [sent-29, score-0.26]

6 Many people who were sitting on strong offers quickly left, and everyone else started looking around. [sent-30, score-0.326]

7 In this situation, my first concern was for colleagues, both in Machine Learning across the company and the Yahoo! [sent-31, score-0.212]

8 Every company and government in the world is drowning in data, and Machine Learning is the prime tool for actually using it to do interesting things. [sent-34, score-0.234]

9 More generally, the demand for high quality seasoned machine learning researchers across startups, mature companies, government labs, and academia has been astonishing, and I expect the outcome to reflect that. [sent-35, score-0.443]

10 This is remarkably different from the cuts that hit ATnT research in late 2001 and early 2002 where the famous machine learning group there took many months to disperse to new positions. [sent-36, score-0.443]

11 In the New York office, we investigated many possibilities hard enough that it became a news story . [sent-37, score-0.254]

12 While that article is wrong in specifics (we ended up not fired for example, although it is difficult to discern cause and effect), we certainly shook the job tree very hard to see what would fall out. [sent-38, score-0.248]

13 My belief is that the new Microsoft Research New York City lab will become an even greater techhouse than Y! [sent-41, score-0.265]

14 In light of this, I would encourage people in academia to consider Yahoo! [sent-45, score-0.294]

15 There are and will be some serious hard feelings about the outcome as various top researchers elsewhere in the organization feel compelled to look for jobs and leave. [sent-47, score-0.345]

16 took a real gamble supporting a research organization about 7 years ago, and many positive things have come of this gamble from all perspectives. [sent-49, score-0.716]

17 I considered all possibilities in accepting the job and was prepared to simply put aside a job search for some time if necessary, but the timing was surreally perfect. [sent-55, score-0.415]

18 Deepak was sitting on an offer at Linkedin and simply took it, so the disruption there was even more minimal. [sent-59, score-0.3]

19 Amongst other things, VW is the ultrascale learning algorithm , not the kind of thing that you would want to put aside lightly. [sent-62, score-0.279]

20 This surprised me greatly—Microsoft has made serious commitments to supporting open source in various ways and that commitment is what sealed the deal for me. [sent-64, score-0.204]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('prabhakar', 0.38), ('yahoo', 0.262), ('departure', 0.19), ('microsoft', 0.163), ('organization', 0.159), ('company', 0.152), ('research', 0.136), ('linkedin', 0.127), ('lab', 0.126), ('light', 0.123), ('took', 0.119), ('investigated', 0.112), ('sitting', 0.112), ('recruited', 0.112), ('serious', 0.11), ('gamble', 0.104), ('academia', 0.101), ('atnt', 0.098), ('job', 0.098), ('supporting', 0.094), ('york', 0.093), ('offers', 0.09), ('surprise', 0.087), ('leaving', 0.084), ('government', 0.082), ('ended', 0.08), ('possibilities', 0.078), ('aside', 0.078), ('basis', 0.078), ('outcome', 0.076), ('companies', 0.071), ('would', 0.07), ('icml', 0.07), ('new', 0.07), ('even', 0.069), ('deeply', 0.069), ('kind', 0.068), ('high', 0.067), ('else', 0.066), ('added', 0.065), ('news', 0.064), ('experiment', 0.063), ('put', 0.063), ('early', 0.061), ('across', 0.06), ('clearly', 0.06), ('looking', 0.058), ('kdd', 0.057), ('machine', 0.057), ('turns', 0.056)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0000001 464 hunch net-2012-05-03-Microsoft Research, New York City

Introduction: Yahoo! laid off people . Unlike every previous time there have been layoffs, this is serious for Yahoo! Research . We had advanced warning from Prabhakar through the simple act of leaving . Yahoo! Research was a world class organization that Prabhakar recruited much of personally, so it is deeply implausible that he would spontaneously decide to leave. My first thought when I saw the news was “Uhoh, Rob said that he knew it was serious when the head of ATnT Research left.” In this case it was even more significant, because Prabhakar recruited me on the premise that Y!R was an experiment in how research should be done: via a combination of high quality people and high engagement with the company. Prabhakar’s departure is a clear end to that experiment. The result is ambiguous from a business perspective. Y!R clearly was not capable of saving the company from its illnesses. I’m not privy to the internal accounting of impact and this is the kind of subject where there c

2 0.26928869 175 hunch net-2006-04-30-John Langford –> Yahoo Research, NY

Introduction: I will join Yahoo Research (in New York) after my contract ends at TTI-Chicago . The deciding reasons are: Yahoo is running into many hard learning problems. This is precisely the situation where basic research might hope to have the greatest impact. Yahoo Research understands research including publishing, conferences, etc… Yahoo Research is growing, so there is a chance I can help it grow well. Yahoo understands the internet, including (but not at all limited to) experimenting with research blogs. In the end, Yahoo Research seems like the place where I might have a chance to make the greatest difference. Yahoo (as a company) has made a strong bet on Yahoo Research. We-the-researchers all hope that bet will pay off, and this seems plausible. I’ll certainly have fun trying.

3 0.17582314 121 hunch net-2005-10-12-The unrealized potential of the research lab

Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s

4 0.1733193 132 hunch net-2005-11-26-The Design of an Optimal Research Environment

Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese

5 0.16628957 156 hunch net-2006-02-11-Yahoo’s Learning Problems.

Introduction: I just visited Yahoo Research which has several fundamental learning problems near to (or beyond) the set of problems we know how to solve well. Here are 3 of them. Ranking This is the canonical problem of all search engines. It is made extra difficult for several reasons. There is relatively little “good” supervised learning data and a great deal of data with some signal (such as click through rates). The learning must occur in a partially adversarial environment. Many people very actively attempt to place themselves at the top of rankings. It is not even quite clear whether the problem should be posed as ‘ranking’ or as ‘regression’ which is then used to produce a ranking. Collaborative filtering Yahoo has a large number of recommendation systems for music, movies, etc… In these sorts of systems, users specify how they liked a set of things, and then the system can (hopefully) find some more examples of things they might like by reasoning across multiple

6 0.15540901 344 hunch net-2009-02-22-Effective Research Funding

7 0.13278544 178 hunch net-2006-05-08-Big machine learning

8 0.12613472 36 hunch net-2005-03-05-Funding Research

9 0.12082399 437 hunch net-2011-07-10-ICML 2011 and the future

10 0.1200337 343 hunch net-2009-02-18-Decision by Vetocracy

11 0.11585154 22 hunch net-2005-02-18-What it means to do research.

12 0.11403244 105 hunch net-2005-08-23-(Dis)similarities between academia and open source programmers

13 0.11270136 454 hunch net-2012-01-30-ICML Posters and Scope

14 0.11110725 423 hunch net-2011-02-02-User preferences for search engines

15 0.11094696 452 hunch net-2012-01-04-Why ICML? and the summer conferences

16 0.11031416 461 hunch net-2012-04-09-ICML author feedback is open

17 0.10966561 134 hunch net-2005-12-01-The Webscience Future

18 0.098612711 318 hunch net-2008-09-26-The SODA Program Committee

19 0.097874947 478 hunch net-2013-01-07-NYU Large Scale Machine Learning Class

20 0.09721081 51 hunch net-2005-04-01-The Producer-Consumer Model of Research


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.242), (1, -0.115), (2, -0.122), (3, 0.121), (4, -0.12), (5, -0.014), (6, -0.002), (7, 0.057), (8, -0.123), (9, -0.018), (10, 0.045), (11, 0.015), (12, -0.014), (13, 0.004), (14, -0.009), (15, 0.126), (16, -0.104), (17, -0.019), (18, -0.034), (19, -0.128), (20, 0.124), (21, 0.032), (22, 0.058), (23, 0.009), (24, -0.043), (25, 0.11), (26, -0.01), (27, -0.024), (28, -0.043), (29, -0.012), (30, 0.052), (31, -0.076), (32, -0.063), (33, 0.034), (34, -0.018), (35, 0.043), (36, 0.017), (37, 0.011), (38, -0.056), (39, -0.006), (40, 0.05), (41, -0.022), (42, -0.024), (43, -0.027), (44, -0.031), (45, -0.015), (46, 0.061), (47, 0.084), (48, -0.037), (49, 0.095)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.95394307 464 hunch net-2012-05-03-Microsoft Research, New York City

Introduction: Yahoo! laid off people . Unlike every previous time there have been layoffs, this is serious for Yahoo! Research . We had advanced warning from Prabhakar through the simple act of leaving . Yahoo! Research was a world class organization that Prabhakar recruited much of personally, so it is deeply implausible that he would spontaneously decide to leave. My first thought when I saw the news was “Uhoh, Rob said that he knew it was serious when the head of ATnT Research left.” In this case it was even more significant, because Prabhakar recruited me on the premise that Y!R was an experiment in how research should be done: via a combination of high quality people and high engagement with the company. Prabhakar’s departure is a clear end to that experiment. The result is ambiguous from a business perspective. Y!R clearly was not capable of saving the company from its illnesses. I’m not privy to the internal accounting of impact and this is the kind of subject where there c

2 0.87216437 175 hunch net-2006-04-30-John Langford –> Yahoo Research, NY

Introduction: I will join Yahoo Research (in New York) after my contract ends at TTI-Chicago . The deciding reasons are: Yahoo is running into many hard learning problems. This is precisely the situation where basic research might hope to have the greatest impact. Yahoo Research understands research including publishing, conferences, etc… Yahoo Research is growing, so there is a chance I can help it grow well. Yahoo understands the internet, including (but not at all limited to) experimenting with research blogs. In the end, Yahoo Research seems like the place where I might have a chance to make the greatest difference. Yahoo (as a company) has made a strong bet on Yahoo Research. We-the-researchers all hope that bet will pay off, and this seems plausible. I’ll certainly have fun trying.

3 0.74067861 121 hunch net-2005-10-12-The unrealized potential of the research lab

Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s

4 0.72617245 156 hunch net-2006-02-11-Yahoo’s Learning Problems.

Introduction: I just visited Yahoo Research which has several fundamental learning problems near to (or beyond) the set of problems we know how to solve well. Here are 3 of them. Ranking This is the canonical problem of all search engines. It is made extra difficult for several reasons. There is relatively little “good” supervised learning data and a great deal of data with some signal (such as click through rates). The learning must occur in a partially adversarial environment. Many people very actively attempt to place themselves at the top of rankings. It is not even quite clear whether the problem should be posed as ‘ranking’ or as ‘regression’ which is then used to produce a ranking. Collaborative filtering Yahoo has a large number of recommendation systems for music, movies, etc… In these sorts of systems, users specify how they liked a set of things, and then the system can (hopefully) find some more examples of things they might like by reasoning across multiple

5 0.68768072 178 hunch net-2006-05-08-Big machine learning

Introduction: According to the New York Times , Yahoo is releasing Project Panama shortly . Project Panama is about better predicting which advertisements are relevant to a search, implying a higher click through rate, implying larger income for Yahoo . There are two things that seem interesting here: A significant portion of that improved accuracy is almost certainly machine learning at work. The quantitative effect is huge—the estimate in the article is $600*10 6 . Google already has such improvements and Microsoft Search is surely working on them, which suggest this is (perhaps) a $10 9 per year machine learning problem. The exact methodology under use is unlikely to be publicly discussed in the near future because of the competitive enivironment. Hopefully we’ll have some public “war stories” at some point in the future when this information becomes less sensitive. For now, it’s reassuring to simply note that machine learning is having a big impact.

6 0.68511635 132 hunch net-2005-11-26-The Design of an Optimal Research Environment

7 0.63328356 344 hunch net-2009-02-22-Effective Research Funding

8 0.62351328 142 hunch net-2005-12-22-Yes , I am applying

9 0.61189032 449 hunch net-2011-11-26-Giving Thanks

10 0.59681159 36 hunch net-2005-03-05-Funding Research

11 0.59629756 110 hunch net-2005-09-10-“Failure” is an option

12 0.59310508 255 hunch net-2007-07-13-The View From China

13 0.58234584 333 hunch net-2008-12-27-Adversarial Academia

14 0.57778621 76 hunch net-2005-05-29-Bad ideas

15 0.5730136 51 hunch net-2005-04-01-The Producer-Consumer Model of Research

16 0.56779879 339 hunch net-2009-01-27-Key Scientific Challenges

17 0.54283792 193 hunch net-2006-07-09-The Stock Prediction Machine Learning Problem

18 0.53888363 423 hunch net-2011-02-02-User preferences for search engines

19 0.53863901 241 hunch net-2007-04-28-The Coming Patent Apocalypse

20 0.52961588 73 hunch net-2005-05-17-A Short Guide to PhD Graduate Study


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(10, 0.031), (16, 0.017), (20, 0.143), (22, 0.013), (27, 0.158), (38, 0.042), (48, 0.03), (53, 0.05), (55, 0.124), (56, 0.016), (57, 0.013), (68, 0.018), (70, 0.014), (84, 0.047), (89, 0.02), (92, 0.012), (94, 0.055), (95, 0.107)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.92034632 208 hunch net-2006-09-18-What is missing for online collaborative research?

Introduction: The internet has recently made the research process much smoother: papers are easy to obtain, citations are easy to follow, and unpublished “tutorials” are often available. Yet, new research fields can look very complicated to outsiders or newcomers. Every paper is like a small piece of an unfinished jigsaw puzzle: to understand just one publication, a researcher without experience in the field will typically have to follow several layers of citations, and many of the papers he encounters have a great deal of repeated information. Furthermore, from one publication to the next, notation and terminology may not be consistent which can further confuse the reader. But the internet is now proving to be an extremely useful medium for collaboration and knowledge aggregation. Online forums allow users to ask and answer questions and to share ideas. The recent phenomenon of Wikipedia provides a proof-of-concept for the “anyone can edit” system. Can such models be used to facilitate research a

same-blog 2 0.91859102 464 hunch net-2012-05-03-Microsoft Research, New York City

Introduction: Yahoo! laid off people . Unlike every previous time there have been layoffs, this is serious for Yahoo! Research . We had advanced warning from Prabhakar through the simple act of leaving . Yahoo! Research was a world class organization that Prabhakar recruited much of personally, so it is deeply implausible that he would spontaneously decide to leave. My first thought when I saw the news was “Uhoh, Rob said that he knew it was serious when the head of ATnT Research left.” In this case it was even more significant, because Prabhakar recruited me on the premise that Y!R was an experiment in how research should be done: via a combination of high quality people and high engagement with the company. Prabhakar’s departure is a clear end to that experiment. The result is ambiguous from a business perspective. Y!R clearly was not capable of saving the company from its illnesses. I’m not privy to the internal accounting of impact and this is the kind of subject where there c

3 0.88427997 7 hunch net-2005-01-31-Watchword: Assumption

Introduction: “Assumption” is another word to be careful with in machine learning because it is used in several ways. Assumption = Bias There are several ways to see that some form of ‘bias’ (= preferring of one solution over another) is necessary. This is obvious in an adversarial setting. A good bit of work has been expended explaining this in other settings with “ no free lunch ” theorems. This is a usage specialized to learning which is particularly common when talking about priors for Bayesian Learning. Assumption = “if” of a theorem The assumptions are the ‘if’ part of the ‘if-then’ in a theorem. This is a fairly common usage. Assumption = Axiom The assumptions are the things that we assume are true, but which we cannot verify. Examples are “the IID assumption” or “my problem is a DNF on a small number of bits”. This is the usage which I prefer. One difficulty with any use of the word “assumption” is that you often encounter “if assumption then conclusion so if no

4 0.81896871 116 hunch net-2005-09-30-Research in conferences

Introduction: Conferences exist as part of the process of doing research. They provide many roles including “announcing research”, “meeting people”, and “point of reference”. Not all conferences are alike so a basic question is: “to what extent do individual conferences attempt to aid research?” This question is very difficult to answer in any satisfying way. What we can do is compare details of the process across multiple conferences. Comments The average quality of comments across conferences can vary dramatically. At one extreme, the tradition in CS theory conferences is to provide essentially zero feedback. At the other extreme, some conferences have a strong tradition of providing detailed constructive feedback. Detailed feedback can give authors significant guidance about how to improve research. This is the most subjective entry. Blind Virtually all conferences offer single blind review where authors do not know reviewers. Some also provide double blind review where rev

5 0.81814742 105 hunch net-2005-08-23-(Dis)similarities between academia and open source programmers

Introduction: Martin Pool and I recently discussed the similarities and differences between academia and open source programming. Similarities: Cost profile Research and programming share approximately the same cost profile: A large upfront effort is required to produce something useful, and then “anyone” can use it. (The “anyone” is not quite right for either group because only sufficiently technical people could use it.) Wealth profile A “wealthy” academic or open source programmer is someone who has contributed a lot to other people in research or programs. Much of academia is a “gift culture”: whoever gives the most is most respected. Problems Both academia and open source programming suffer from similar problems. Whether or not (and which) open source program is used are perhaps too-often personality driven rather than driven by capability or usefulness. Similar phenomena can happen in academia with respect to directions of research. Funding is often a problem for

6 0.81098688 466 hunch net-2012-06-05-ICML acceptance statistics

7 0.79230165 36 hunch net-2005-03-05-Funding Research

8 0.79050469 344 hunch net-2009-02-22-Effective Research Funding

9 0.78815496 132 hunch net-2005-11-26-The Design of an Optimal Research Environment

10 0.78677112 437 hunch net-2011-07-10-ICML 2011 and the future

11 0.78369886 343 hunch net-2009-02-18-Decision by Vetocracy

12 0.78301591 454 hunch net-2012-01-30-ICML Posters and Scope

13 0.78050756 225 hunch net-2007-01-02-Retrospective

14 0.77615303 351 hunch net-2009-05-02-Wielding a New Abstraction

15 0.77596253 445 hunch net-2011-09-28-Somebody’s Eating Your Lunch

16 0.77555722 190 hunch net-2006-07-06-Branch Prediction Competition

17 0.77116722 452 hunch net-2012-01-04-Why ICML? and the summer conferences

18 0.77075857 89 hunch net-2005-07-04-The Health of COLT

19 0.76799661 403 hunch net-2010-07-18-ICML & COLT 2010

20 0.7678535 204 hunch net-2006-08-28-Learning Theory standards for NIPS 2006