
225 hunch net-2007-01-02-Retrospective

meta information for this blog

Source: html

Introduction: It’s been almost two years since this blog began. In that time, I’ve learned enough to shift my expectations in several ways. Initially, the idea was for a general purpose ML blog where different people could contribute posts. What has actually happened is most posts come from me, with a few guest posts that I greatly value. There are a few reasons I see for this. Overload. A couple years ago, I had not fully appreciated just how busy life gets for a researcher. Making a post is not simply a matter of getting to it, but rather of prioritizing between {writing a grant, finishing an overdue review, writing a paper, teaching a class, writing a program, etc…}. This is a substantial transition away from what life as a graduate student is like. At some point the question is not “when will I get to it?” but rather “will I get to it?” and the answer starts to become “no” most of the time. Feedback failure. This blog currently receives about 3K unique visitors per day from

Summary: the most important sentences generated by tfidf model

sentIndex sentText sentNum sentScore

1 It’s been almost two years since this blog began. [sent-1, score-0.591]

2 Initially, the idea was for a general purpose ML blog where different people could contribute posts. [sent-3, score-0.404]

3 What has actually happened is most posts come from me, with a few guest posts that I greatly value. [sent-4, score-0.797]

4 A couple years ago, I had not fully appreciated just how busy life gets for a researcher. [sent-7, score-0.285]

5 Making a post is not simply a matter of getting to it, but rather of prioritizing between {writing a grant, finishing an overdue review, writing a paper, teaching a class, writing a program, etc…}. [sent-8, score-0.665]

6 This blog currently receives about 3K unique visitors per day from about 13K unique sites per month. [sent-14, score-1.027]

7 This number of visitors is large enough that it scares me somewhat—having several thousand people read a post is more attention than almost all papers published in academia get. [sent-15, score-0.626]

8 The internet has a huge untapped capacity to support content, so one of the traditional reasons for editorial control (limited space) simply no longer exists. [sent-19, score-0.391]

9 Nevertheless, the time of readers is important and there is a focus-of-attention issue since one blog with all posts on all topics would be virtually useless. [sent-20, score-0.991]

10 In an ideal world, the need for explicit content control would disappear and be replaced by a massive cooperative collaborative filtering process. [sent-21, score-0.556]

11 This shift is already well underway since anyone can start their own blog and read anything they choose. [sent-22, score-0.658]

12 Expending the effort to write clearly about them in a post is not too different from expending the effort to write clearly about them in a paper, which is the traditional mechanism of publishing. [sent-28, score-0.985]

13 There is no simple way around this problem, although changing people’s expectations may be helpful. [sent-29, score-0.299]

14 For the record, I’m always happy to consider posts by others. [sent-32, score-0.295]

15 If you are considering your own blog, trying a guest post or two is a great way to experiment. [sent-33, score-0.495]

16 Many people don’t have the time or inclination to run their own blog, so guest posts are essential. [sent-34, score-0.571]

17 A blog strongly encourages otherwise since the backgrounds of readers are very diverse. [sent-39, score-0.677]

18 As an example, posts of the form “interesting papers at” tend to get very few comments, but they are some of the most viewed. [sent-43, score-0.362]

19 The most obvious way to use a blog is as a mechanism for posting finished research. [sent-45, score-0.563]

20 It’s ok for this, but the most interesting ways of using the blog are for topics which could not be stated as a research paper. [sent-46, score-0.674]

similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('blog', 0.404), ('posts', 0.295), ('guest', 0.207), ('post', 0.194), ('visitors', 0.149), ('expending', 0.149), ('control', 0.139), ('academia', 0.134), ('content', 0.126), ('expectations', 0.124), ('writing', 0.123), ('clearly', 0.12), ('collaborative', 0.119), ('readers', 0.119), ('comments', 0.117), ('blogs', 0.108), ('filtering', 0.103), ('years', 0.102), ('unique', 0.101), ('life', 0.098), ('traditional', 0.096), ('shift', 0.095), ('expectation', 0.095), ('way', 0.094), ('topics', 0.088), ('interesting', 0.088), ('couple', 0.085), ('write', 0.085), ('since', 0.085), ('simply', 0.081), ('thousand', 0.075), ('convoluted', 0.075), ('editorial', 0.075), ('prioritizing', 0.075), ('read', 0.074), ('lack', 0.073), ('per', 0.069), ('backgrounds', 0.069), ('inclination', 0.069), ('akin', 0.069), ('cooperative', 0.069), ('finishing', 0.069), ('receives', 0.069), ('aren', 0.069), ('effort', 0.068), ('get', 0.067), ('review', 0.065), ('sites', 0.065), ('barely', 0.065), ('posting', 0.065)]
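The page does not say how the word weights above were produced. As a rough illustration only, the sketch below computes L2-normalized tf-idf weights for one document in a small corpus using the standard library. The function name `tfidf_top_words`, the whitespace tokenizer, and the smoothed idf formula are all assumptions, not the pipeline that generated this listing.

```python
import math
from collections import Counter


def tfidf_top_words(docs, doc_index, top_n=5):
    """Return the top_n (word, weight) pairs for docs[doc_index].

    Hypothetical helper: whitespace tokenization and a smoothed idf,
    idf(w) = ln((1 + N) / (1 + df(w))) + 1, are assumptions.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each word appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Raw term counts for the target document, scaled by idf.
    tf = Counter(tokenized[doc_index])
    weights = {
        word: count * (math.log((1 + n_docs) / (1 + df[word])) + 1.0)
        for word, count in tf.items()
    }

    # L2-normalize so weights are comparable across documents.
    norm = math.sqrt(sum(v * v for v in weights.values())) or 1.0
    ranked = sorted(
        ((word, value / norm) for word, value in weights.items()),
        key=lambda pair: (-pair[1], pair[0]),
    )
    return ranked[:top_n]


if __name__ == "__main__":
    corpus = ["blog posts blog guest", "paper review paper", "blog paper"]
    print(tfidf_top_words(corpus, 0, top_n=3))
```

A word scores highly when it is frequent in this post but rare across the other posts, which is why terms such as 'guest' and 'expending' can rank close to very common ones such as 'post'.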

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0000002 225 hunch net-2007-01-02-Retrospective

2 0.26563826 25 hunch net-2005-02-20-At One Month

Introduction: This is near the one month point, so it seems appropriate to consider meta-issues for the moment. The number of posts is a bit over 20. The number of people speaking up in discussions is about 10. The number of people viewing the site is somewhat more than 100. I am (naturally) dissatisfied with many things. Many of the potential uses haven’t been realized. This is partly a matter of opportunity (no conferences in the last month), partly a matter of will (no open problems because it’s hard to give them up), and partly a matter of tradition. In academia, there is a strong tradition of trying to get everything perfectly right before presentation. This is somewhat contradictory to the nature of making many posts, and it’s definitely contradictory to the idea of doing “public research”. If that sort of idea is to pay off, it must be significantly more successful than previous methods. In an effort to continue experimenting, I’m going to use the next week as “open problems we

3 0.2645365 383 hunch net-2009-12-09-Inherent Uncertainty

Introduction: I’d like to point out Inherent Uncertainty, which I’ve added to the ML blog post scanner on the right. My understanding from Jake is that the intention is to have a multiauthor blog which is more specialized towards learning theory/game theory than this one. Nevertheless, several of the posts seem to be of wider interest.

4 0.2639541 151 hunch net-2006-01-25-1 year

Introduction: At the one year (+5 days) anniversary, the natural question is: “Was it helpful for research?” Answer: Yes, and so it shall continue. Some evidence is provided by noticing that I am about a factor of 2 more overloaded with paper ideas than I’ve ever previously been. It is always hard to estimate counterfactual worlds, but I expect that this is also a factor of 2 more than “What if I had not started the blog?” As for “Why?”, there seem to be two primary effects. A blog is a mechanism for connecting with people who either think like you or are interested in the same problems. This allows for concentration of thinking which is very helpful in solving problems. The process of stating things you don’t understand publicly is very helpful in understanding them. Sometimes you are simply forced to express them in a way which aids understanding. Sometimes someone else says something which helps. And sometimes you discover that someone else has already solved the problem. The

5 0.25384584 96 hunch net-2005-07-21-Six Months

Introduction: This is the 6 month point in the “run a research blog” experiment, so it seems like a good point to take stock and assess. One fundamental question is: “Is it worth it?” The idea of running a research blog will never become widely popular and useful unless it actually aids research. On the negative side, composing ideas for a post and maintaining a blog takes a significant amount of time. On the positive side, the process might yield better research because there is an opportunity for better, faster feedback implying better, faster thinking. My answer at the moment is a provisional “yes”. Running the blog has been incidentally helpful in several ways: It is sometimes educational. More often, the process of composing thoughts well enough to post simply aids thinking. This has resulted in a couple solutions to problems of interest (and perhaps more over time). If you really want to solve a problem, letting the world know is helpful. This isn’t necessarily

6 0.2341162 214 hunch net-2006-10-13-David Pennock starts Oddhead

7 0.19520602 486 hunch net-2013-07-10-Thoughts on Artificial Intelligence

8 0.17824109 166 hunch net-2006-03-24-NLPers

9 0.1757143 59 hunch net-2005-04-22-New Blog: [Lowerbounds,Upperbounds]

10 0.16933005 296 hunch net-2008-04-21-The Science 2.0 article

11 0.15459135 92 hunch net-2005-07-11-AAAI blog

12 0.13869309 350 hunch net-2009-04-23-Jonathan Chang at Slycoder

13 0.1355152 480 hunch net-2013-03-22-I’m a bandit

14 0.12959132 30 hunch net-2005-02-25-Why Papers?

15 0.12753141 137 hunch net-2005-12-09-Machine Learning Thoughts

16 0.12585205 343 hunch net-2009-02-18-Decision by Vetocracy

17 0.12446942 297 hunch net-2008-04-22-Taking the next step

18 0.12298258 182 hunch net-2006-06-05-Server Shift, Site Tweaks, Suggestions?

19 0.11797123 246 hunch net-2007-06-13-Not Posting

20 0.11232368 22 hunch net-2005-02-18-What it means to do research.
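The page does not state which metric produces the simValue column. The same-blog score of roughly 1.0 is consistent with cosine similarity between weight vectors, so the sketch below ranks documents that way. The metric itself, the names `cosine` and `rank_similar`, and the toy vectors are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_similar(query_id, vectors, top_n=20):
    """Return (similarity, doc_id) pairs, most similar first.

    The query document is left in the ranking, mirroring the
    'same-blog' row that heads each list on this page.
    """
    query = vectors[query_id]
    scored = [(cosine(query, vec), doc_id) for doc_id, vec in vectors.items()]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return scored[:top_n]


if __name__ == "__main__":
    # Toy vectors keyed by blog id; the weights are made up.
    toy = {
        225: {"blog": 0.4, "posts": 0.3},
        25: {"blog": 0.2, "posts": 0.1, "month": 0.5},
        30: {"papers": 1.0},
    }
    for sim, doc_id in rank_similar(225, toy):
        print(doc_id, round(sim, 4))
```

Floating-point rounding in the normalization step can push a self-similarity slightly above or below 1, which would explain a value such as 1.0000002 in the first row.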

similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.253), (1, -0.099), (2, -0.076), (3, 0.178), (4, -0.143), (5, 0.009), (6, 0.069), (7, -0.447), (8, 0.185), (9, -0.018), (10, 0.008), (11, 0.015), (12, 0.009), (13, -0.042), (14, -0.032), (15, -0.005), (16, -0.047), (17, -0.039), (18, -0.021), (19, 0.04), (20, -0.048), (21, 0.058), (22, -0.034), (23, -0.03), (24, -0.023), (25, 0.032), (26, 0.029), (27, -0.009), (28, 0.069), (29, -0.023), (30, -0.07), (31, 0.017), (32, -0.003), (33, -0.036), (34, 0.044), (35, 0.021), (36, 0.075), (37, 0.019), (38, -0.012), (39, 0.015), (40, 0.028), (41, 0.013), (42, -0.003), (43, -0.046), (44, 0.005), (45, 0.03), (46, 0.016), (47, -0.047), (48, 0.006), (49, 0.063)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.97825611 225 hunch net-2007-01-02-Retrospective

2 0.84630883 96 hunch net-2005-07-21-Six Months

3 0.82497311 383 hunch net-2009-12-09-Inherent Uncertainty

4 0.77459764 151 hunch net-2006-01-25-1 year

5 0.69716644 486 hunch net-2013-07-10-Thoughts on Artificial Intelligence

Introduction: David McAllester starts a blog.

6 0.68641281 214 hunch net-2006-10-13-David Pennock starts Oddhead

7 0.67072922 350 hunch net-2009-04-23-Jonathan Chang at Slycoder

8 0.66315573 166 hunch net-2006-03-24-NLPers

9 0.64567935 296 hunch net-2008-04-21-The Science 2.0 article

10 0.63996017 25 hunch net-2005-02-20-At One Month

11 0.6175791 182 hunch net-2006-06-05-Server Shift, Site Tweaks, Suggestions?

12 0.6100347 59 hunch net-2005-04-22-New Blog: [Lowerbounds,Upperbounds]

13 0.59470367 480 hunch net-2013-03-22-I’m a bandit

14 0.53991216 402 hunch net-2010-07-02-MetaOptimize

15 0.51699257 92 hunch net-2005-07-11-AAAI blog

16 0.50009489 246 hunch net-2007-06-13-Not Posting

17 0.48327598 467 hunch net-2012-06-15-Normal Deviate and the UCSC Machine Learning Summer School

18 0.46732673 137 hunch net-2005-12-09-Machine Learning Thoughts

19 0.45429513 297 hunch net-2008-04-22-Taking the next step

20 0.43254444 354 hunch net-2009-05-17-Server Update

similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(10, 0.029), (27, 0.264), (38, 0.037), (44, 0.171), (53, 0.082), (55, 0.159), (68, 0.038), (77, 0.014), (94, 0.054), (95, 0.079)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.930457 225 hunch net-2007-01-02-Retrospective

2 0.86518651 343 hunch net-2009-02-18-Decision by Vetocracy

Introduction: Few would mistake the process of academic paper review for a fair process, but sometimes the unfairness seems particularly striking. This is most easily seen by comparison: Paper Banditron Offset Tree Notes Problem Scope Multiclass problems where only the loss of one choice can be probed. Strictly greater: Cost sensitive multiclass problems where only the loss of one choice can be probed. Often generalizations don’t matter. That’s not the case here, since every plausible application I’ve thought of involves loss functions substantially different from 0/1. What’s new Analysis and Experiments Algorithm, Analysis, and Experiments As far as I know, the essence of the more general problem was first stated and analyzed with the EXP4 algorithm (page 16) (1998). It’s also the time horizon 1 simplification of the Reinforcement Learning setting for the random trajectory method (page 15) (2002). The Banditron algorithm itself is functionally identi

3 0.86330533 466 hunch net-2012-06-05-ICML acceptance statistics

Introduction: People are naturally interested in slicing the ICML acceptance statistics in various ways. Here’s a rundown for the top categories. 18/66 = 0.27 in (0.18, 0.36) Reinforcement Learning 10/52 = 0.19 in (0.17, 0.37) Supervised Learning 9/51 = 0.18 not in (0.18, 0.37) Clustering 12/46 = 0.26 in (0.17, 0.37) Kernel Methods 11/40 = 0.28 in (0.15, 0.4) Optimization Algorithms 8/33 = 0.24 in (0.15, 0.39) Learning Theory 14/33 = 0.42 not in (0.15, 0.39) Graphical Models 10/32 = 0.31 in (0.15, 0.41) Applications (+5 invited) 8/29 = 0.28 in (0.14, 0.41) Probabilistic Models 13/29 = 0.45 not in (0.14, 0.41) NN & Deep Learning 8/26 = 0.31 in (0.12, 0.42) Transfer and Multi-Task Learning 13/25 = 0.52 not in (0.12, 0.44) Online Learning 5/25 = 0.20 in (0.12, 0.44) Active Learning 6/22 = 0.27 in (0.14, 0.41) Semi-Superv

4 0.86258495 437 hunch net-2011-07-10-ICML 2011 and the future

Introduction: Unfortunately, I ended up sick for much of this ICML. I did manage to catch one interesting paper: Richard Socher, Cliff Lin, Andrew Y. Ng, and Christopher D. Manning, Parsing Natural Scenes and Natural Language with Recursive Neural Networks. I invited Richard to share his list of interesting papers, so hopefully we’ll hear from him soon. In the meantime, Paul and Hal have posted some lists. the future Joelle and I are program chairs for ICML 2012 in Edinburgh, which I previously enjoyed visiting in 2005. This is a huge responsibility that we hope to accomplish well. A part of this (perhaps the most fun part) is imagining how we can make ICML better. A key and critical constraint is choosing things that can be accomplished. So far we have: Colocation. The first thing we looked into was potential colocations. We quickly discovered that many other conferences precommitted their location. For the future, getting a colocation with ACL or SIGI

5 0.86181998 464 hunch net-2012-05-03-Microsoft Research, New York City

Introduction: Yahoo! laid off people. Unlike every previous time there have been layoffs, this is serious for Yahoo! Research. We had advance warning from Prabhakar through the simple act of leaving. Yahoo! Research was a world class organization that Prabhakar recruited much of personally, so it is deeply implausible that he would spontaneously decide to leave. My first thought when I saw the news was “Uhoh, Rob said that he knew it was serious when the head of ATnT Research left.” In this case it was even more significant, because Prabhakar recruited me on the premise that Y!R was an experiment in how research should be done: via a combination of high quality people and high engagement with the company. Prabhakar’s departure is a clear end to that experiment. The result is ambiguous from a business perspective. Y!R clearly was not capable of saving the company from its illnesses. I’m not privy to the internal accounting of impact and this is the kind of subject where there c

6 0.85744792 266 hunch net-2007-10-15-NIPS workshops extended to 3 days

7 0.85319471 194 hunch net-2006-07-11-New Models

8 0.85276127 132 hunch net-2005-11-26-The Design of an Optimal Research Environment

9 0.85234755 89 hunch net-2005-07-04-The Health of COLT

10 0.85154343 454 hunch net-2012-01-30-ICML Posters and Scope

11 0.85142857 51 hunch net-2005-04-01-The Producer-Consumer Model of Research

12 0.84681684 406 hunch net-2010-08-22-KDD 2010

13 0.84590685 207 hunch net-2006-09-12-Incentive Compatible Reviewing

14 0.84366626 452 hunch net-2012-01-04-Why ICML? and the summer conferences

15 0.84363264 320 hunch net-2008-10-14-Who is Responsible for a Bad Review?

16 0.84329206 360 hunch net-2009-06-15-In Active Learning, the question changes

17 0.8429656 403 hunch net-2010-07-18-ICML & COLT 2010

18 0.8425765 220 hunch net-2006-11-27-Continuizing Solutions

19 0.84256095 134 hunch net-2005-12-01-The Webscience Future

20 0.84181744 279 hunch net-2007-12-19-Cool and interesting things seen at NIPS