hunch_net hunch_net-2007 hunch_net-2007-228 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Carnegie Mellon School of Computer Science has the first academic Machine Learning department . This department already existed as the Center for Automated Learning and Discovery , but recently changed it’s name. The reason for changing the name is obvious: very few people think of themselves as “Automated Learner and Discoverers”, but there are number of people who think of themselves as “Machine Learners”. Machine learning is both more succinct and recognizable—good properties for a name. A more interesting question is “Should there be a Machine Learning Department?”. Tom Mitchell has a relevant whitepaper claiming that machine learning is answering a different question than other fields or departments. The fundamental debate here is “Is machine learning different from statistics?” At a cultural level, there is no real debate: they are different. Machine learning is characterized by several very active large peer reviewed conferences, operating in a computer
sentIndex sentText sentNum sentScore
1 Carnegie Mellon School of Computer Science has the first academic Machine Learning department . [sent-1, score-0.325]
2 This department already existed as the Center for Automated Learning and Discovery , but recently changed it’s name. [sent-2, score-0.409]
3 The reason for changing the name is obvious: very few people think of themselves as “Automated Learner and Discoverers”, but there are number of people who think of themselves as “Machine Learners”. [sent-3, score-0.103]
4 Machine learning is both more succinct and recognizable—good properties for a name. [sent-4, score-0.153]
5 A more interesting question is “Should there be a Machine Learning Department? [sent-5, score-0.079]
6 Tom Mitchell has a relevant whitepaper claiming that machine learning is answering a different question than other fields or departments. [sent-7, score-0.491]
7 The fundamental debate here is “Is machine learning different from statistics? [sent-8, score-0.38]
8 ” At a cultural level, there is no real debate: they are different. [sent-9, score-0.067]
9 Machine learning is characterized by several very active large peer reviewed conferences, operating in a computer science mode. [sent-10, score-0.627]
10 Statistics tends to function with a greater emphasis on journals and a lesser emphasis on conferences which often implies a much longer publishing cycle. [sent-11, score-0.316]
11 It is true that the core problems of statistics in the past have typically differed from the core problems of machine learning today. [sent-13, score-0.812]
12 Yet, there has been some substantial overlap, and there are a number of statisticians nowadays that are actively doing machine learning. [sent-14, score-0.183]
13 It’s reasonably plausible that in the long term statistics departments will adopt the core problems of machine learning, removing the reasons for a separate machine learning department. [sent-15, score-1.015]
14 The parallel question for computer science comes up less often perhaps because computer science is a notoriously broad field. [sent-16, score-1.134]
15 The practical implication of a new department is the ability to create a more specific curricula, admit more specific students, and hire faculty based upon more specific interests. [sent-17, score-1.055]
16 An alternative solution like “learn everything from computer science and statistics” is personally appealing to me, and I have benefitted from and recommend a broad education. [sent-19, score-0.738]
17 In my experience, a machine learning skill set is an effective specialization with which people can do important things in the world. [sent-21, score-0.499]
18 Given this, having a department with a machine learning centered curricula seems like a good idea. [sent-22, score-0.842]
19 In the future and elsewhere it may have a different name, but the value of the machine learning skill set should grow with research, improving computers, and improving data sources. [sent-24, score-0.618]
wordName wordTfidf (topN-words)
[('department', 0.325), ('statistics', 0.314), ('computer', 0.261), ('science', 0.213), ('curricula', 0.189), ('machine', 0.183), ('skill', 0.168), ('carnegie', 0.168), ('mellon', 0.168), ('classes', 0.16), ('dropped', 0.147), ('specific', 0.138), ('debate', 0.122), ('core', 0.12), ('emphasis', 0.119), ('favor', 0.116), ('broad', 0.107), ('name', 0.103), ('automated', 0.099), ('improving', 0.096), ('programming', 0.092), ('compared', 0.092), ('benefitted', 0.084), ('existed', 0.084), ('claiming', 0.084), ('discovery', 0.084), ('hire', 0.084), ('practical', 0.081), ('question', 0.079), ('admit', 0.078), ('characterized', 0.078), ('succinct', 0.078), ('graphics', 0.078), ('lesser', 0.078), ('mitchell', 0.078), ('learning', 0.075), ('appealing', 0.073), ('specialization', 0.073), ('center', 0.073), ('faculty', 0.073), ('removing', 0.07), ('centered', 0.07), ('adopt', 0.07), ('answering', 0.07), ('tom', 0.07), ('learner', 0.067), ('architecture', 0.067), ('cultural', 0.067), ('learners', 0.067), ('languages', 0.065)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 228 hunch net-2007-01-15-The Machine Learning Department
Introduction: Carnegie Mellon School of Computer Science has the first academic Machine Learning department . This department already existed as the Center for Automated Learning and Discovery , but recently changed it’s name. The reason for changing the name is obvious: very few people think of themselves as “Automated Learner and Discoverers”, but there are number of people who think of themselves as “Machine Learners”. Machine learning is both more succinct and recognizable—good properties for a name. A more interesting question is “Should there be a Machine Learning Department?”. Tom Mitchell has a relevant whitepaper claiming that machine learning is answering a different question than other fields or departments. The fundamental debate here is “Is machine learning different from statistics?” At a cultural level, there is no real debate: they are different. Machine learning is characterized by several very active large peer reviewed conferences, operating in a computer
2 0.14190316 290 hunch net-2008-02-27-The Stats Handicap
Introduction: Graduating students in Statistics appear to be at a substantial handicap compared to graduating students in Machine Learning, despite being in substantially overlapping subjects. The problem seems to be cultural. Statistics comes from a mathematics background which emphasizes large publications slowly published under review at journals. Machine Learning comes from a Computer Science background which emphasizes quick publishing at reviewed conferences. This has a number of implications: Graduating statistics PhDs often have 0-2 publications while graduating machine learning PhDs might have 5-15. Graduating ML students have had a chance for others to build on their work. Stats students have had no such chance. Graduating ML students have attended a number of conferences and presented their work, giving them a chance to meet people. Stats students have had fewer chances of this sort. In short, Stats students have had relatively few chances to distinguish themselves and
3 0.13603322 313 hunch net-2008-08-18-Radford Neal starts a blog
Introduction: here on statistics, ML, CS, and other things he knows well.
4 0.13283254 414 hunch net-2010-10-17-Partha Niyogi has died
Introduction: from brain cancer. I asked Misha who worked with him to write about it. Partha Niyogi, Louis Block Professor in Computer Science and Statistics at the University of Chicago passed away on October 1, 2010, aged 43. I first met Partha Niyogi almost exactly ten years ago when I was a graduate student in math and he had just started as a faculty in Computer Science and Statistics at the University of Chicago. Strangely, we first talked at length due to a somewhat convoluted mathematical argument in a paper on pattern recognition. I asked him some questions about the paper, and, even though the topic was new to him, he had put serious thought into it and we started regular meetings. We made significant progress and developed a line of research stemming initially just from trying to understand that one paper and to simplify one derivation. I think this was typical of Partha, showing both his intellectual curiosity and his intuition for the serendipitous; having a sense and focus fo
5 0.13158281 445 hunch net-2011-09-28-Somebody’s Eating Your Lunch
Introduction: Since we last discussed the other online learning , Stanford has very visibly started pushing mass teaching in AI , Machine Learning , and Databases . In retrospect, it’s not too surprising that the next step up in serious online teaching experiments are occurring at the computer science department of a university embedded in the land of startups. Numbers on the order of 100000 are quite significant—similar in scale to the number of computer science undergraduate students/year in the US. Although these populations surely differ, the fact that they could overlap is worth considering for the future. It’s too soon to say how successful these classes will be and there are many easy criticisms to make: Registration != Learning … but if only 1/10th complete these classes, the scale of teaching still surpasses the scale of any traditional process. 1st year excitement != nth year routine … but if only 1/10th take future classes, the scale of teaching still surpass
6 0.12644678 64 hunch net-2005-04-28-Science Fiction and Research
7 0.11846869 50 hunch net-2005-04-01-Basic computer science research takes a hit
8 0.11124834 372 hunch net-2009-09-29-Machine Learning Protests at the G20
9 0.10669318 366 hunch net-2009-08-03-Carbon in Computer Science Research
10 0.10638119 68 hunch net-2005-05-10-Learning Reductions are Reductionist
11 0.10480262 112 hunch net-2005-09-14-The Predictionist Viewpoint
12 0.10320868 345 hunch net-2009-03-08-Prediction Science
13 0.09910877 215 hunch net-2006-10-22-Exemplar programming
14 0.098673306 452 hunch net-2012-01-04-Why ICML? and the summer conferences
15 0.096790791 454 hunch net-2012-01-30-ICML Posters and Scope
16 0.096447691 194 hunch net-2006-07-11-New Models
17 0.095001839 373 hunch net-2009-10-03-Static vs. Dynamic multiclass prediction
18 0.094701849 389 hunch net-2010-02-26-Yahoo! ML events
19 0.094097644 10 hunch net-2005-02-02-Kolmogorov Complexity and Googling
20 0.093739428 158 hunch net-2006-02-24-A Fundamentalist Organization of Machine Learning
topicId topicWeight
[(0, 0.21), (1, -0.004), (2, -0.113), (3, 0.087), (4, -0.037), (5, -0.048), (6, 0.021), (7, 0.08), (8, -0.017), (9, -0.046), (10, 0.043), (11, -0.035), (12, 0.011), (13, -0.027), (14, -0.035), (15, -0.075), (16, 0.103), (17, 0.016), (18, 0.034), (19, 0.002), (20, 0.131), (21, -0.014), (22, -0.041), (23, 0.069), (24, -0.019), (25, -0.17), (26, 0.119), (27, 0.059), (28, -0.028), (29, 0.128), (30, -0.045), (31, 0.178), (32, 0.055), (33, 0.014), (34, 0.013), (35, 0.09), (36, -0.041), (37, -0.038), (38, 0.061), (39, -0.024), (40, 0.027), (41, 0.027), (42, 0.122), (43, 0.122), (44, 0.019), (45, -0.074), (46, 0.086), (47, -0.074), (48, 0.051), (49, 0.018)]
simIndex simValue blogId blogTitle
same-blog 1 0.94068342 228 hunch net-2007-01-15-The Machine Learning Department
Introduction: Carnegie Mellon School of Computer Science has the first academic Machine Learning department . This department already existed as the Center for Automated Learning and Discovery , but recently changed it’s name. The reason for changing the name is obvious: very few people think of themselves as “Automated Learner and Discoverers”, but there are number of people who think of themselves as “Machine Learners”. Machine learning is both more succinct and recognizable—good properties for a name. A more interesting question is “Should there be a Machine Learning Department?”. Tom Mitchell has a relevant whitepaper claiming that machine learning is answering a different question than other fields or departments. The fundamental debate here is “Is machine learning different from statistics?” At a cultural level, there is no real debate: they are different. Machine learning is characterized by several very active large peer reviewed conferences, operating in a computer
2 0.6983285 290 hunch net-2008-02-27-The Stats Handicap
Introduction: Graduating students in Statistics appear to be at a substantial handicap compared to graduating students in Machine Learning, despite being in substantially overlapping subjects. The problem seems to be cultural. Statistics comes from a mathematics background which emphasizes large publications slowly published under review at journals. Machine Learning comes from a Computer Science background which emphasizes quick publishing at reviewed conferences. This has a number of implications: Graduating statistics PhDs often have 0-2 publications while graduating machine learning PhDs might have 5-15. Graduating ML students have had a chance for others to build on their work. Stats students have had no such chance. Graduating ML students have attended a number of conferences and presented their work, giving them a chance to meet people. Stats students have had fewer chances of this sort. In short, Stats students have had relatively few chances to distinguish themselves and
3 0.69785452 64 hunch net-2005-04-28-Science Fiction and Research
Introduction: A big part of doing research is imagining how things could be different, and then trying to figure out how to get there. A big part of science fiction is imagining how things could be different, and then working through the implications. Because of the similarity here, reading science fiction can sometimes be helpful in understanding and doing research. (And, hey, it’s fun.) Here’s some list of science fiction books I enjoyed which seem particularly relevant to computer science and (sometimes) learning systems: Vernor Vinge, “True Names”, “A Fire Upon the Deep” Marc Stiegler, “David’s Sling”, “Earthweb” Charles Stross, “Singularity Sky” Greg Egan, “Diaspora” Joe Haldeman, “Forever Peace” (There are surely many others.) Incidentally, the nature of science fiction itself has changed. Decades ago, science fiction projected great increases in the power humans control (example: E.E. Smith Lensman series). That didn’t really happen in the last 50 years. Inste
4 0.66958147 414 hunch net-2010-10-17-Partha Niyogi has died
Introduction: from brain cancer. I asked Misha who worked with him to write about it. Partha Niyogi, Louis Block Professor in Computer Science and Statistics at the University of Chicago passed away on October 1, 2010, aged 43. I first met Partha Niyogi almost exactly ten years ago when I was a graduate student in math and he had just started as a faculty in Computer Science and Statistics at the University of Chicago. Strangely, we first talked at length due to a somewhat convoluted mathematical argument in a paper on pattern recognition. I asked him some questions about the paper, and, even though the topic was new to him, he had put serious thought into it and we started regular meetings. We made significant progress and developed a line of research stemming initially just from trying to understand that one paper and to simplify one derivation. I think this was typical of Partha, showing both his intellectual curiosity and his intuition for the serendipitous; having a sense and focus fo
5 0.6334098 366 hunch net-2009-08-03-Carbon in Computer Science Research
Introduction: Al Gore ‘s film and gradually more assertive and thorough science has managed to mostly shift the debate on climate change from “Is it happening?” to “What should be done?” In that context, it’s worthwhile to think a bit about what can be done within computer science research. There are two things we can think about: Doing Research At a cartoon level, computer science research consists of some combination of commuting to&from; work, writing programs, running them on computers, writing papers, and presenting them at conferences. A typical computer has a power usage on the order of 100 Watts, which works out to 2.4 kiloWatt-hours/day. Looking up David MacKay ‘s reference on power usage per person , it becomes clear that this is a relatively minor part of the lifestyle, although it could become substantial if many more computers are required. Much larger costs are associated with commuting (which is in common with many people) and attending conferences. Since local commuti
6 0.62767512 372 hunch net-2009-09-29-Machine Learning Protests at the G20
7 0.62732154 112 hunch net-2005-09-14-The Predictionist Viewpoint
8 0.61812484 13 hunch net-2005-02-04-JMLG
9 0.58873308 168 hunch net-2006-04-02-Mad (Neuro)science
10 0.581815 335 hunch net-2009-01-08-Predictive Analytics World
11 0.56621677 313 hunch net-2008-08-18-Radford Neal starts a blog
12 0.54744947 262 hunch net-2007-09-16-Optimizing Machine Learning Programs
13 0.54210925 282 hunch net-2008-01-06-Research Political Issues
14 0.53125256 215 hunch net-2006-10-22-Exemplar programming
15 0.52744448 106 hunch net-2005-09-04-Science in the Government
16 0.52549773 10 hunch net-2005-02-02-Kolmogorov Complexity and Googling
17 0.51270604 314 hunch net-2008-08-24-Mass Customized Medicine in the Future?
18 0.50584298 172 hunch net-2006-04-14-JMLR is a success
19 0.50328118 158 hunch net-2006-02-24-A Fundamentalist Organization of Machine Learning
20 0.49380431 68 hunch net-2005-05-10-Learning Reductions are Reductionist
topicId topicWeight
[(27, 0.205), (42, 0.019), (53, 0.071), (55, 0.07), (56, 0.023), (83, 0.335), (94, 0.131), (95, 0.061)]
simIndex simValue blogId blogTitle
1 0.89386499 261 hunch net-2007-08-28-Live ML Class
Introduction: Davor and Chunnan point out that MLSS 2007 in Tuebingen has live video for the majority of the world that is not there (heh).
2 0.89294583 52 hunch net-2005-04-04-Grounds for Rejection
Introduction: It’s reviewing season right now, so I thought I would list (at a high level) the sorts of problems which I see in papers. Hopefully, this will help us all write better papers. The following flaws are fatal to any paper: Incorrect theorem or lemma statements A typo might be “ok”, if it can be understood. Any theorem or lemma which indicates an incorrect understanding of reality must be rejected. Not doing so would severely harm the integrity of the conference. A paper rejected for this reason must be fixed. Lack of Understanding If a paper is understood by none of the (typically 3) reviewers then it must be rejected for the same reason. This is more controversial than it sounds because there are some people who maximize paper complexity in the hope of impressing the reviewer. The tactic sometimes succeeds with some reviewers (but not with me). As a reviewer, I sometimes get lost for stupid reasons. This is why an anonymized communication channel with the author can
3 0.883847 321 hunch net-2008-10-19-NIPS 2008 workshop on Kernel Learning
Introduction: We’d like to invite hunch.net readers to participate in the NIPS 2008 workshop on kernel learning. While the main focus is on automatically learning kernels from data, we are also also looking at the broader questions of feature selection, multi-task learning and multi-view learning. There are no restrictions on the learning problem being addressed (regression, classification, etc), and both theoretical and applied work will be considered. The deadline for submissions is October 24 . More detail can be found here . Corinna Cortes, Arthur Gretton, Gert Lanckriet, Mehryar Mohri, Afshin Rostamizadeh
4 0.87414414 135 hunch net-2005-12-04-Watchword: model
Introduction: In everyday use a model is a system which explains the behavior of some system, hopefully at the level where some alteration of the model predicts some alteration of the real-world system. In machine learning “model” has several variant definitions. Everyday . The common definition is sometimes used. Parameterized . Sometimes model is a short-hand for “parameterized model”. Here, it refers to a model with unspecified free parameters. In the Bayesian learning approach, you typically have a prior over (everyday) models. Predictive . Even further from everyday use is the predictive model. Examples of this are “my model is a decision tree” or “my model is a support vector machine”. Here, there is no real sense in which an SVM explains the underlying process. For example, an SVM tells us nothing in particular about how alterations to the real-world system would create a change. Which definition is being used at any particular time is important information. For examp
same-blog 5 0.83980948 228 hunch net-2007-01-15-The Machine Learning Department
Introduction: Carnegie Mellon School of Computer Science has the first academic Machine Learning department . This department already existed as the Center for Automated Learning and Discovery , but recently changed it’s name. The reason for changing the name is obvious: very few people think of themselves as “Automated Learner and Discoverers”, but there are number of people who think of themselves as “Machine Learners”. Machine learning is both more succinct and recognizable—good properties for a name. A more interesting question is “Should there be a Machine Learning Department?”. Tom Mitchell has a relevant whitepaper claiming that machine learning is answering a different question than other fields or departments. The fundamental debate here is “Is machine learning different from statistics?” At a cultural level, there is no real debate: they are different. Machine learning is characterized by several very active large peer reviewed conferences, operating in a computer
6 0.75967127 98 hunch net-2005-07-27-Not goal metrics
7 0.61545813 345 hunch net-2009-03-08-Prediction Science
8 0.61476022 286 hunch net-2008-01-25-Turing’s Club for Machine Learning
9 0.60561621 158 hunch net-2006-02-24-A Fundamentalist Organization of Machine Learning
10 0.60178769 132 hunch net-2005-11-26-The Design of an Optimal Research Environment
11 0.60001129 371 hunch net-2009-09-21-Netflix finishes (and starts)
12 0.59972364 22 hunch net-2005-02-18-What it means to do research.
13 0.59917629 229 hunch net-2007-01-26-Parallel Machine Learning Problems
14 0.59913641 95 hunch net-2005-07-14-What Learning Theory might do
15 0.59884268 57 hunch net-2005-04-16-Which Assumptions are Reasonable?
16 0.59845948 435 hunch net-2011-05-16-Research Directions for Machine Learning and Algorithms
17 0.59845501 359 hunch net-2009-06-03-Functionally defined Nonlinear Dynamic Models
18 0.59590322 360 hunch net-2009-06-15-In Active Learning, the question changes
19 0.59589088 256 hunch net-2007-07-20-Motivation should be the Responsibility of the Reviewer
20 0.59571457 225 hunch net-2007-01-02-Retrospective