hunch_net hunch_net-2007 hunch_net-2007-255 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: I’m visiting Beijing for the Pao-Lu Hsu Statistics Conference on Machine Learning. I had several discussions about the state of Chinese research. Given the large population and economy, you might expect substantial research—more than has been observed at international conferences. The fundamental problem seems to be the Cultural Revolution which lobotimized higher education, and the research associated with it. There has been a process of slow recovery since then, which has begun to be felt in the research world via increased participation in international conferences and (now) conferences in China. The amount of effort going into construction in Beijing is very impressive—people are literally building a skyscraper at night outside the window of the hotel I’m staying at (and this is not unusual). If a small fraction of this effort is later focused onto supporting research, the effect could be very substantial. General growth in China’s research portfolio should be expecte
sentIndex sentText sentNum sentScore
1 I’m visiting Beijing for the Pao-Lu Hsu Statistics Conference on Machine Learning. [sent-1, score-0.149]
2 I had several discussions about the state of Chinese research. [sent-2, score-0.198]
3 Given the large population and economy, you might expect substantial research—more than has been observed at international conferences. [sent-3, score-0.628]
4 The fundamental problem seems to be the Cultural Revolution which lobotimized higher education, and the research associated with it. [sent-4, score-0.472]
5 There has been a process of slow recovery since then, which has begun to be felt in the research world via increased participation in international conferences and (now) conferences in China. [sent-5, score-1.219]
6 The amount of effort going into construction in Beijing is very impressive—people are literally building a skyscraper at night outside the window of the hotel I’m staying at (and this is not unusual). [sent-6, score-1.513]
7 If a small fraction of this effort is later focused onto supporting research, the effect could be very substantial. [sent-7, score-0.928]
8 General growth in China’s research portfolio should be expected. [sent-8, score-0.495]
wordName wordTfidf (topN-words)
[('beijing', 0.33), ('international', 0.24), ('research', 0.196), ('window', 0.178), ('unusual', 0.178), ('literally', 0.178), ('revolution', 0.178), ('economy', 0.178), ('staying', 0.165), ('population', 0.165), ('chinese', 0.165), ('hotel', 0.165), ('effort', 0.163), ('onto', 0.156), ('portfolio', 0.156), ('supporting', 0.149), ('china', 0.149), ('visiting', 0.149), ('night', 0.149), ('cultural', 0.143), ('growth', 0.143), ('participation', 0.138), ('conferences', 0.136), ('construction', 0.133), ('education', 0.126), ('increased', 0.123), ('hsu', 0.123), ('impressive', 0.12), ('discussions', 0.115), ('slow', 0.115), ('outside', 0.113), ('building', 0.113), ('focused', 0.105), ('later', 0.105), ('higher', 0.102), ('fraction', 0.102), ('associated', 0.098), ('statistics', 0.095), ('observed', 0.092), ('effect', 0.087), ('state', 0.083), ('expected', 0.083), ('amount', 0.079), ('going', 0.077), ('fundamental', 0.076), ('expect', 0.068), ('via', 0.068), ('world', 0.067), ('substantial', 0.063), ('small', 0.061)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 255 hunch net-2007-07-13-The View From China
Introduction: I’m visiting Beijing for the Pao-Lu Hsu Statistics Conference on Machine Learning. I had several discussions about the state of Chinese research. Given the large population and economy, you might expect substantial research—more than has been observed at international conferences. The fundamental problem seems to be the Cultural Revolution which lobotimized higher education, and the research associated with it. There has been a process of slow recovery since then, which has begun to be felt in the research world via increased participation in international conferences and (now) conferences in China. The amount of effort going into construction in Beijing is very impressive—people are literally building a skyscraper at night outside the window of the hotel I’m staying at (and this is not unusual). If a small fraction of this effort is later focused onto supporting research, the effect could be very substantial. General growth in China’s research portfolio should be expecte
2 0.14193331 357 hunch net-2009-05-30-Many ways to Learn this summer
Introduction: There are at least 3 summer schools related to machine learning this summer. The first is at University of Chicago June 1-11 organized by Misha Belkin , Partha Niyogi , and Steve Smale . Registration is closed for this one, meaning they met their capacity limit. The format is essentially an extended Tutorial/Workshop. I was particularly interested to see Valiant amongst the speakers. I’m also presenting Saturday June 6, on logarithmic time prediction. Praveen Srinivasan points out the second at Peking University in Beijing, China, July 20-27. This one differs substantially, as it is about vision, machine learning, and their intersection. The deadline for applications is June 10 or 15. This is also another example of the growth of research in China, with active support from NSF . The third one is at Cambridge , England, August 29-September 10. It’s in the MLSS series . Compared to the Chicago one, this one is more about the Bayesian side of ML, alth
3 0.098264277 132 hunch net-2005-11-26-The Design of an Optimal Research Environment
Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese
4 0.095167153 51 hunch net-2005-04-01-The Producer-Consumer Model of Research
Introduction: In the quest to understand what good reviewing is, perhaps it’s worthwhile to think about what good research is. One way to think about good research is in terms of a producer/consumer model. In the producer/consumer model of research, for any element of research there are producers (authors and coauthors of papers, for example) and consumers (people who use the papers to make new papers or code solving problems). An produced bit of research is judged as “good” if it is used by many consumers. There are two basic questions which immediately arise: Is this a good model of research? Are there alternatives? The producer/consumer model has some difficulties which can be (partially) addressed. Disconnect. A group of people doing research on some subject may become disconnected from the rest of the world. Each person uses the research of other people in the group so it appears good research is being done, but the group has no impact on the rest of the world. One way
5 0.083790608 344 hunch net-2009-02-22-Effective Research Funding
Introduction: With a worldwide recession on, my impression is that the carnage in research has not been as severe as might be feared, at least in the United States. I know of two notable negative impacts: It’s quite difficult to get a job this year, as many companies and universities simply aren’t hiring. This is particularly tough on graduating students. Perhaps 10% of IBM research was fired. In contrast, around the time of the dot com bust, ATnT Research and Lucent had one or several 50% size firings wiping out much of the remainder of Bell Labs , triggering a notable diaspora for the respected machine learning group there. As the recession progresses, we may easily see more firings as companies in particular reach a point where they can no longer support research. There are a couple positives to the recession as well. Both the implosion of Wall Street (which siphoned off smart people) and the general difficulty of getting a job coming out of an undergraduate education s
6 0.08147341 36 hunch net-2005-03-05-Funding Research
7 0.074906118 142 hunch net-2005-12-22-Yes , I am applying
8 0.0741143 121 hunch net-2005-10-12-The unrealized potential of the research lab
9 0.070728771 46 hunch net-2005-03-24-The Role of Workshops
10 0.069912434 378 hunch net-2009-11-15-The Other Online Learning
11 0.069156267 358 hunch net-2009-06-01-Multitask Poisoning
12 0.068644419 452 hunch net-2012-01-04-Why ICML? and the summer conferences
13 0.067591265 21 hunch net-2005-02-17-Learning Research Programs
14 0.067538723 228 hunch net-2007-01-15-The Machine Learning Department
15 0.067190528 169 hunch net-2006-04-05-What is state?
16 0.066963777 174 hunch net-2006-04-27-Conferences, Workshops, and Tutorials
17 0.066245027 146 hunch net-2006-01-06-MLTV
18 0.066005543 284 hunch net-2008-01-18-Datasets
19 0.065828405 295 hunch net-2008-04-12-It Doesn’t Stop
20 0.063611172 415 hunch net-2010-10-28-NY ML Symposium 2010
topicId topicWeight
[(0, 0.126), (1, -0.062), (2, -0.078), (3, 0.065), (4, -0.086), (5, -0.02), (6, 0.04), (7, 0.032), (8, -0.004), (9, 0.069), (10, 0.021), (11, -0.002), (12, -0.007), (13, -0.006), (14, 0.017), (15, -0.001), (16, 0.011), (17, 0.067), (18, 0.0), (19, -0.022), (20, -0.004), (21, -0.059), (22, 0.039), (23, 0.036), (24, -0.016), (25, -0.008), (26, -0.052), (27, 0.011), (28, -0.085), (29, -0.008), (30, 0.045), (31, 0.017), (32, -0.033), (33, 0.013), (34, 0.075), (35, 0.025), (36, -0.002), (37, -0.008), (38, 0.0), (39, 0.045), (40, 0.038), (41, 0.019), (42, 0.063), (43, 0.046), (44, -0.069), (45, -0.068), (46, -0.003), (47, 0.014), (48, -0.086), (49, 0.001)]
simIndex simValue blogId blogTitle
same-blog 1 0.96240252 255 hunch net-2007-07-13-The View From China
Introduction: I’m visiting Beijing for the Pao-Lu Hsu Statistics Conference on Machine Learning. I had several discussions about the state of Chinese research. Given the large population and economy, you might expect substantial research—more than has been observed at international conferences. The fundamental problem seems to be the Cultural Revolution which lobotimized higher education, and the research associated with it. There has been a process of slow recovery since then, which has begun to be felt in the research world via increased participation in international conferences and (now) conferences in China. The amount of effort going into construction in Beijing is very impressive—people are literally building a skyscraper at night outside the window of the hotel I’m staying at (and this is not unusual). If a small fraction of this effort is later focused onto supporting research, the effect could be very substantial. General growth in China’s research portfolio should be expecte
2 0.6795221 132 hunch net-2005-11-26-The Design of an Optimal Research Environment
Introduction: How do you create an optimal environment for research? Here are some essential ingredients that I see. Stability . University-based research is relatively good at this. On any particular day, researchers face choices in what they will work on. A very common tradeoff is between: easy small difficult big For researchers without stability, the ‘easy small’ option wins. This is often “ok”—a series of incremental improvements on the state of the art can add up to something very beneficial. However, it misses one of the big potentials of research: finding entirely new and better ways of doing things. Stability comes in many forms. The prototypical example is tenure at a university—a tenured professor is almost imposssible to fire which means that the professor has the freedom to consider far horizon activities. An iron-clad guarantee of a paycheck is not necessary—industrial research labs have succeeded well with research positions of indefinite duration. Atnt rese
3 0.6647675 121 hunch net-2005-10-12-The unrealized potential of the research lab
Introduction: I attended the IBM research 60th anniversary . IBM research is, by any reasonable account, the industrial research lab which has managed to bring the most value to it’s parent company over the long term. This can be seen by simply counting the survivors: IBM research is the only older research lab which has not gone through a period of massive firing. (Note that there are also new research labs .) Despite this impressive record, IBM research has failed, by far, to achieve it’s potential. Examples which came up in this meeting include: It took about a decade to produce DRAM after it was invented in the lab. (In fact, Intel produced it first.) Relational databases and SQL were invented and then languished. It was only under external competition that IBM released it’s own relational database. Why didn’t IBM grow an Oracle division ? An early lead in IP networking hardware did not result in IBM growing a Cisco division . Why not? And remember … IBM research is a s
4 0.64853597 36 hunch net-2005-03-05-Funding Research
Introduction: The funding of research (and machine learning research) is an issue which seems to have become more significant in the United States over the last decade. The word “research” is applied broadly here to science, mathematics, and engineering. There are two essential difficulties with funding research: Longshot Paying a researcher is often a big gamble. Most research projects don’t pan out, but a few big payoffs can make it all worthwhile. Information Only Much of research is about finding the right way to think about or do something. The Longshot difficulty means that there is high variance in payoffs. This can be compensated for by funding many different research projects, reducing variance. The Information-Only difficulty means that it’s hard to extract a profit directly from many types of research, so companies have difficulty justifying basic research. (Patents are a mechanism for doing this. They are often extraordinarily clumsy or simply not applicable.) T
5 0.64299434 344 hunch net-2009-02-22-Effective Research Funding
Introduction: With a worldwide recession on, my impression is that the carnage in research has not been as severe as might be feared, at least in the United States. I know of two notable negative impacts: It’s quite difficult to get a job this year, as many companies and universities simply aren’t hiring. This is particularly tough on graduating students. Perhaps 10% of IBM research was fired. In contrast, around the time of the dot com bust, ATnT Research and Lucent had one or several 50% size firings wiping out much of the remainder of Bell Labs , triggering a notable diaspora for the respected machine learning group there. As the recession progresses, we may easily see more firings as companies in particular reach a point where they can no longer support research. There are a couple positives to the recession as well. Both the implosion of Wall Street (which siphoned off smart people) and the general difficulty of getting a job coming out of an undergraduate education s
6 0.63296276 76 hunch net-2005-05-29-Bad ideas
7 0.60943973 366 hunch net-2009-08-03-Carbon in Computer Science Research
8 0.60230023 93 hunch net-2005-07-13-“Sister Conference” presentations
9 0.59719372 51 hunch net-2005-04-01-The Producer-Consumer Model of Research
10 0.59694999 110 hunch net-2005-09-10-“Failure” is an option
11 0.59046972 146 hunch net-2006-01-06-MLTV
12 0.58040315 449 hunch net-2011-11-26-Giving Thanks
13 0.57174128 73 hunch net-2005-05-17-A Short Guide to PhD Graduate Study
14 0.5705176 306 hunch net-2008-07-02-Proprietary Data in Academic Research?
15 0.56574678 464 hunch net-2012-05-03-Microsoft Research, New York City
16 0.56574285 1 hunch net-2005-01-19-Why I decided to run a weblog.
17 0.54886299 142 hunch net-2005-12-22-Yes , I am applying
18 0.54830021 358 hunch net-2009-06-01-Multitask Poisoning
19 0.53778094 22 hunch net-2005-02-18-What it means to do research.
20 0.5358519 106 hunch net-2005-09-04-Science in the Government
topicId topicWeight
[(10, 0.019), (27, 0.18), (42, 0.545), (55, 0.055), (94, 0.041), (95, 0.044)]
simIndex simValue blogId blogTitle
same-blog 1 0.91218632 255 hunch net-2007-07-13-The View From China
Introduction: I’m visiting Beijing for the Pao-Lu Hsu Statistics Conference on Machine Learning. I had several discussions about the state of Chinese research. Given the large population and economy, you might expect substantial research—more than has been observed at international conferences. The fundamental problem seems to be the Cultural Revolution which lobotimized higher education, and the research associated with it. There has been a process of slow recovery since then, which has begun to be felt in the research world via increased participation in international conferences and (now) conferences in China. The amount of effort going into construction in Beijing is very impressive—people are literally building a skyscraper at night outside the window of the hotel I’m staying at (and this is not unusual). If a small fraction of this effort is later focused onto supporting research, the effect could be very substantial. General growth in China’s research portfolio should be expecte
2 0.80922765 242 hunch net-2007-04-30-COLT 2007
Introduction: Registration for COLT 2007 is now open. The conference will take place on 13-15 June, 2007, in San Diego, California, as part of the 2007 Federated Computing Research Conference (FCRC), which includes STOC, Complexity, and EC. The website for COLT: http://www.learningtheory.org/colt2007/index.html The early registration deadline is May 11, and the cutoff date for discounted hotel rates is May 9. Before registering, take note that the fees are substantially lower for members of ACM and/or SIGACT than for nonmembers. If you’ve been contemplating joining either of these two societies (annual dues: $99 for ACM, $18 for SIGACT), now would be a good time!
3 0.68179137 485 hunch net-2013-06-29-The Benefits of Double-Blind Review
Introduction: This post is a (near) transcript of a talk that I gave at the ICML 2013 Workshop on Peer Review and Publishing Models . Although there’s a PDF available on my website , I’ve chosen to post a slightly modified version here as well in order to better facilitate discussion. Disclaimers and Context I want to start with a couple of disclaimers and some context. First, I want to point out that although I’ve read a lot about double-blind review, this isn’t my research area and the research discussed in this post is not my own. As a result, I probably can’t answer super detailed questions about these studies. I also want to note that I’m not opposed to open peer review — I was a free and open source software developer for over ten years and I care a great deal about openness and transparency. Rather, my motivation in writing this post is simply to create awareness of and to initiate discussion about the benefits of double-blind review. Lastly, and most importantly, I think it’s e
4 0.62875813 282 hunch net-2008-01-06-Research Political Issues
Introduction: I’ve avoided discussing politics here, although not for lack of interest. The problem with discussing politics is that it’s customary for people to say much based upon little information. Nevertheless, politics can have a substantial impact on science (and we might hope for the vice-versa). It’s primary election time in the United States, so the topic is timely, although the issues are not. There are several policy decisions which substantially effect development of science and technology in the US. Education The US has great contrasts in education. The top universities are very good places, yet the grade school education system produces mediocre results. For me, the contrast between a public education and Caltech was bracing. For many others attending Caltech, it clearly was not. Upgrading the k-12 education system in the US is a long-standing chronic problem which I know relatively little about. My own experience is that a basic attitude of “no child unrealized” i
5 0.62215662 216 hunch net-2006-11-02-2006 NIPS workshops
Introduction: I expect the NIPS 2006 workshops to be quite interesting, and recommend going for anyone interested in machine learning research. (Most or all of the workshops webpages can be found two links deep.)
6 0.53568608 89 hunch net-2005-07-04-The Health of COLT
7 0.52965927 22 hunch net-2005-02-18-What it means to do research.
8 0.36762178 406 hunch net-2010-08-22-KDD 2010
9 0.35089228 275 hunch net-2007-11-29-The Netflix Crack
10 0.34995967 36 hunch net-2005-03-05-Funding Research
11 0.34465402 146 hunch net-2006-01-06-MLTV
12 0.34391463 40 hunch net-2005-03-13-Avoiding Bad Reviewing
13 0.3409217 228 hunch net-2007-01-15-The Machine Learning Department
14 0.33516699 106 hunch net-2005-09-04-Science in the Government
15 0.33461976 333 hunch net-2008-12-27-Adversarial Academia
16 0.33330095 304 hunch net-2008-06-27-Reviewing Horror Stories
17 0.33317068 360 hunch net-2009-06-15-In Active Learning, the question changes
18 0.33192495 220 hunch net-2006-11-27-Continuizing Solutions
19 0.33044606 452 hunch net-2012-01-04-Why ICML? and the summer conferences
20 0.33023944 378 hunch net-2009-11-15-The Other Online Learning