hilary_mason_data hilary_mason_data-2010 hilary_mason_data-2010-43 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: E-mail automation, questions and answers Posted: May 27, 2010 | Author: hilary | Filed under: blog , projects | Tags: email , ignitenyc | 66 Comments » Welcome! I’ve gotten several hundred e-mails about my e-mail management code. I do want to share it as soon as possible. Here are the answers to the most common questions. Why separate scripts? My philosophy is based on the Unix command-line tool model: each script should be simple and useful alone, but when combined they become extremely powerful. Why don’t we have the code yet?! I had no idea the talk would be shared beyond the couple hundred people in the audience or that it would be so popular! I started my position at bit.ly the same day I gave that IgniteNYC presentation, and I also have some other awesome projects that are competing for time. I have to admit that the trained classifiers are all based on my personal data and were also trained mostly through tweaking in ipython.
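The post does not publish its classifiers, so the following is only a minimal sketch of what a trainable e-mail filter might look like, assuming a bag-of-words naive Bayes model with add-one smoothing. The class name, labels, and training messages are all made up for illustration and do not come from the original code.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


class NaiveBayesFilter:
    """Hypothetical bag-of-words classifier with add-one smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.label_counts = Counter()            # label -> number of messages
        self.vocabulary = set()

    def train(self, text, label):
        """Record the words of one labelled message."""
        words = tokenize(text)
        self.word_counts[label].update(words)
        self.label_counts[label] += 1
        self.vocabulary.update(words)

    def classify(self, text):
        """Return the label with the highest log-probability."""
        words = tokenize(text)
        total_messages = sum(self.label_counts.values())
        best_label, best_score = None, float("-inf")
        for label, count in self.label_counts.items():
            score = math.log(count / total_messages)
            total_words = sum(self.word_counts[label].values())
            denominator = total_words + len(self.vocabulary)
            for word in words:
                score += math.log((self.word_counts[label][word] + 1) / denominator)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Made-up training data, purely for illustration.
mail_filter = NaiveBayesFilter()
mail_filter.train("Can we schedule a meeting next week?", "scheduling")
mail_filter.train("Are you free for a call on Tuesday?", "scheduling")
mail_filter.train("Please send me the slides from your talk", "request")
mail_filter.train("Could you share the code from the presentation?", "request")

predicted = mail_filter.classify("Are you free to schedule a call?")
```

A real filter would need far more training data than four messages, which is presumably part of why a generic training framework has to come before publication.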
sentIndex sentText sentNum sentScore
1 E-mail automation, questions and answers Posted: May 27, 2010 | Author: hilary | Filed under: blog , projects | Tags: email , ignitenyc | 66 Comments » Welcome! [sent-1, score-0.259]
2 I’ve gotten several hundred e-mails about my e-mail management code. [sent-2, score-0.15]
3 My philosophy is based on the Unix command-line tool model: each script should be simple and useful alone, but when combined they become extremely powerful. [sent-6, score-0.274]
4 I had no idea the talk would be shared beyond the couple hundred people in the audience or that it would be so popular! [sent-9, score-0.477]
5 I have to admit that the trained classifiers are all based on my personal data and were also trained mostly through tweaking in ipython. [sent-12, score-0.648]
6 I need to finish a generic framework for people to train their own filters before I can publish that piece of the system. [sent-13, score-0.225]
7 Are you going to commercialize your scripts / can I invest? [sent-16, score-0.348]
8 I have certainly thought about commercializing the application, but I’m uncomfortable asking people to give me access to their personal e-mail data (even if there are very interesting things to be learned by aggregate analysis). [sent-17, score-0.363]
9 Just imagine how much more creative, interesting work could be done if we could partially free the world from the e-mail workload… that alone is worth making the code open. [sent-18, score-0.784]
10 The scripts run on my gmail account through IMAP (and should work with any IMAP interface, though I’m sure there is debugging to be done). [sent-21, score-0.953]
11 They live on a Linode VPS and run individually via cron jobs. [sent-22, score-0.169]
12 I primarily use the gmail web interface (though I’ve flipflopped between Mail. [sent-25, score-0.349]
13 app and Thunderbird for a while), and the only cost is that I have to manually reload the page to see new labels and new drafts appear. [sent-26, score-0.15]
14 Do your scripts go mad with power and e-mail inappropriately? [sent-27, score-0.565]
15 I have all of the scripts deposit suggested responses in the draft folder, and then I use the gmail “multiple inboxes” feature to keep the draft folder up in the UI. [sent-29, score-1.32]
16 It’s very easy to go through and modify or delete responses before they are sent. [sent-30, score-0.344]
17 Of course, I only thought of that after one of the scripts DID go a bit mad. [sent-31, score-0.342]
18 I’m not a robot, though of course I would say that anyway! [sent-33, score-0.365]
19 The point of the automation is to remove the stupid parts of e-mail and leave me free to personally address the interesting messages. [sent-34, score-0.381]
20 If you’ve read this far, there are a few things I would love your feedback on: What’s a kickass name for this project? [sent-35, score-0.205]
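The sentences above describe scripts that talk to the mailbox over IMAP and leave suggested replies in the drafts folder rather than sending them. A minimal sketch of that idea, using Python's standard `imaplib` and `email` modules, might look like the following. The function names, the `[Gmail]/Drafts` mailbox name, and the addresses are assumptions for illustration, not the author's actual code.

```python
import imaplib
import time
from email.message import EmailMessage

# Assumption: Gmail's default English name for the drafts mailbox.
DRAFTS_MAILBOX = "[Gmail]/Drafts"


def build_draft(sender, recipient, subject, body):
    """Build a plain-text message to be reviewed by a human later."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content(body)
    return msg


def save_draft(connection, message, mailbox=DRAFTS_MAILBOX):
    """Append the message to the drafts mailbox instead of sending it."""
    return connection.append(
        mailbox,
        "\\Draft",
        imaplib.Time2Internaldate(time.time()),
        message.as_bytes(),
    )


def connect(host, user, password):
    """Open an authenticated IMAP-over-SSL connection (hypothetical helper)."""
    connection = imaplib.IMAP4_SSL(host)
    connection.login(user, password)
    return connection
```

Because nothing is sent, every suggested reply can still be edited or deleted in the mail client before it goes out. A script like this could be run on a schedule by cron, as the post describes, with `connect(...)` supplying the `connection` argument.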
wordName wordTfidf (topN-words)
[('scripts', 0.348), ('gmail', 0.225), ('automation', 0.175), ('draft', 0.175), ('folder', 0.175), ('nagging', 0.175), ('alone', 0.15), ('trained', 0.15), ('imap', 0.15), ('hundred', 0.15), ('robot', 0.15), ('go', 0.142), ('though', 0.141), ('ignitenyc', 0.135), ('responses', 0.135), ('would', 0.13), ('answers', 0.124), ('interface', 0.124), ('script', 0.113), ('free', 0.109), ('personal', 0.104), ('done', 0.099), ('interesting', 0.097), ('based', 0.094), ('run', 0.094), ('course', 0.094), ('keep', 0.087), ('could', 0.087), ('thought', 0.087), ('code', 0.085), ('reload', 0.075), ('generic', 0.075), ('labels', 0.075), ('debugging', 0.075), ('cron', 0.075), ('nltk', 0.075), ('classifiers', 0.075), ('feedback', 0.075), ('admit', 0.075), ('invest', 0.075), ('multiple', 0.075), ('power', 0.075), ('publish', 0.075), ('separate', 0.075), ('train', 0.075), ('uncomfortable', 0.075), ('work', 0.07), ('modify', 0.067), ('unix', 0.067), ('beyond', 0.067)]
simIndex simValue blogId blogTitle
same-blog 1 1.0000002 43 hilary mason data-2010-05-27-E-mail automation, questions and answers
2 0.22606529 37 hilary mason data-2009-11-25-IgniteNYC: How to Replace Yourself with a Very Small Shell Script
Introduction: IgniteNYC: How to Replace Yourself with a Very Small Shell Script Posted: November 25, 2009 | Author: hilary | Filed under: blog , Presentations | Tags: email , ignitenyc , presentations , scripts | 15 Comments » I recently gave a talk at IgniteNYC on How to Replace Yourself with a Very Small Shell Script . The Ignite events are a fun blend of performance, technology, and speaking skill. Each presenter gives a five minute talk with twenty slides that auto-advance after 15 seconds. The title of my talk is a classic geek reference (you can get the t-shirt ). I’m very interested in developing automated techniques for handling the massive and growing amounts of information that we all have to deal with. I started with e-mail and twitter, both of which are easy to access programmatically (via IMAP and the Twitter API ). In the talk, I went through several of the simple and successful e-mail management scripts that I’ve developed. I decided to
3 0.14391182 47 hilary mason data-2010-08-23-New York Times: Reinventing E-mail, One Message at a Time
Introduction: New York Times: Reinventing E-mail, One Message at a Time Posted: August 23, 2010 | Author: Hilary Mason | Filed under: Media | Tags: code , email , hacking , newyorktimes | Leave a comment » Nick Bilton did a writeup of my homegrown e-mail scripts in the New York Times!
4 0.13071211 38 hilary mason data-2009-12-24-IgniteNYC: The video!
Introduction: IgniteNYC: The video! Posted: December 24, 2009 | Author: hilary | Filed under: academics , blog | Tags: presentation , python | 15 Comments » The video of my IgniteNYC presentation is up, and has gotten a great response! I’m working on removing the me-specific bits from the code and I’ll be posting it as open-source very soon!
5 0.12233259 71 hilary mason data-2012-01-26-Identity Slippage, and what’s the weirdest thing you’ve been e-mailed by accident?
Introduction: Identity Slippage, and what’s the weirdest thing you’ve been e-mailed by accident? Posted: January 26, 2012 | Author: Hilary Mason | Filed under: blog | 31 Comments » I have an old, short, and concise gmail address (my first initial and last name at gmail.com). There are many other hmasons in the world who have since signed up for gmail, with variations on the “hmason” theme. Every so often, they mistype the address, or someone mishears it. I now receive between four and ten pieces of e-mail per week meant for other hmasons . This was pretty amusing until someone opened an amazon account on that address (which I had to shut down). Poor Holly has never seen a single Citibank credit card statement (and Citibank won’t remove the e-mail address from the account when I call, since I’m not the account holder). Heidi hasn’t linked her Paypal account to her bank account, but I’m waiting for someone to send her money. This sort of unwitting misattribution results in an
6 0.11578619 10 hilary mason data-2007-09-02-Autoscript Creates LSL Scripts Without Code
7 0.11150922 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics
8 0.10927892 23 hilary mason data-2009-01-01-Twitter: A greasemonkey script to show who follows you
9 0.10595492 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
10 0.095707066 85 hilary mason data-2013-01-19-Startups: How to Share Data with Academics
11 0.091776714 46 hilary mason data-2010-08-15-Should you attend Hadoop World? Yes.
12 0.091567412 80 hilary mason data-2012-12-28-Getting Started with Data Science
13 0.081731044 105 hilary mason data-2013-07-05-Speaking: Spend at least 1-3 of the time practicing the talk
14 0.080266006 76 hilary mason data-2012-08-28-How do you prioritize research?
15 0.078419246 17 hilary mason data-2008-03-17-LSL: Notecard Selector
16 0.078378946 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
17 0.076685101 82 hilary mason data-2013-01-08-Bitly Social Data APIs
18 0.07629019 31 hilary mason data-2009-08-12-My NYC Python Meetup Presentation: Practical Data Analysis in Python
19 0.074959278 110 hilary mason data-2013-10-06-What Mugshots Mean For Public Data
20 0.070642963 49 hilary mason data-2010-11-10-Machine Learning: A Love Story
topicId topicWeight
[(0, -0.295), (1, -0.014), (2, -0.111), (3, 0.101), (4, -0.101), (5, 0.025), (6, -0.155), (7, -0.042), (8, -0.157), (9, 0.244), (10, -0.049), (11, -0.101), (12, 0.201), (13, -0.173), (14, 0.018), (15, 0.143), (16, -0.037), (17, -0.168), (18, 0.017), (19, 0.053), (20, 0.101), (21, -0.001), (22, -0.165), (23, 0.033), (24, -0.038), (25, -0.028), (26, 0.024), (27, -0.19), (28, -0.108), (29, 0.041), (30, -0.045), (31, 0.016), (32, -0.015), (33, 0.085), (34, -0.021), (35, 0.097), (36, -0.134), (37, 0.014), (38, -0.039), (39, -0.027), (40, 0.035), (41, -0.143), (42, -0.01), (43, -0.056), (44, -0.019), (45, -0.064), (46, 0.094), (47, 0.105), (48, 0.029), (49, -0.015)]
simIndex simValue blogId blogTitle
same-blog 1 0.9801687 43 hilary mason data-2010-05-27-E-mail automation, questions and answers
2 0.60711777 37 hilary mason data-2009-11-25-IgniteNYC: How to Replace Yourself with a Very Small Shell Script
Introduction: IgniteNYC: How to Replace Yourself with a Very Small Shell Script Posted: November 25, 2009 | Author: hilary | Filed under: blog , Presentations | Tags: email , ignitenyc , presentations , scripts | 15 Comments » I recently gave a talk at IgniteNYC on How to Replace Yourself with a Very Small Shell Script . The Ignite events are a fun blend of performance, technology, and speaking skill. Each presenter gives a five minute talk with twenty slides that auto-advance after 15 seconds. The title of my talk is a classic geek reference (you can get the t-shirt ). I’m very interested in developing automated techniques for handling the massive and growing amounts of information that we all have to deal with. I started with e-mail and twitter, both of which are easy to access programmatically (via IMAP and the Twitter API ). In the talk, I went through several of the simple and successful e-mail management scripts that I’ve developed. I decided to
3 0.42025203 10 hilary mason data-2007-09-02-Autoscript Creates LSL Scripts Without Code
Introduction: Autoscript Creates LSL Scripts Without Code Posted: September 2, 2007 | Author: hilary | Filed under: blog | Tags: second life | 188 Comments » This week, I created autoscript , a simple LSL script generator for Second Life . The concept is to translate the way you think about interactive design – what should happen, and when – into functional code. Scripting is one of the most creative parts of designing immersive spaces, and I hope this will make it accessible to more people. Please feel free to use this code however you like! Writing a program like this requires a balance between simplicity and flexibility, and I’m erring on the side of the former. I do plan to add more functionality to it and would love your suggestions and ideas! Also, if you encounter anything that doesn’t work, please let me know in the comments here. The link: autoscript
4 0.40790331 38 hilary mason data-2009-12-24-IgniteNYC: The video!
Introduction: IgniteNYC: The video! Posted: December 24, 2009 | Author: hilary | Filed under: academics , blog | Tags: presentation , python | 15 Comments » The video of my IgniteNYC presentation is up, and has gotten a great response! I’m working on removing the me-specific bits from the code and I’ll be posting it as open-source very soon!
5 0.36934343 47 hilary mason data-2010-08-23-New York Times: Reinventing E-mail, One Message at a Time
Introduction: New York Times: Reinventing E-mail, One Message at a Time Posted: August 23, 2010 | Author: Hilary Mason | Filed under: Media | Tags: code , email , hacking , newyorktimes | Leave a comment » Nick Bilton did a writeup of my homegrown e-mail scripts in the New York Times!
6 0.36543351 71 hilary mason data-2012-01-26-Identity Slippage, and what’s the weirdest thing you’ve been e-mailed by accident?
7 0.30734986 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
8 0.30157322 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics
9 0.29565164 23 hilary mason data-2009-01-01-Twitter: A greasemonkey script to show who follows you
10 0.28054938 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
11 0.26237985 39 hilary mason data-2010-01-03-SMS to e-mail gateway: The SMS doorbell
12 0.25907269 85 hilary mason data-2013-01-19-Startups: How to Share Data with Academics
13 0.25712609 17 hilary mason data-2008-03-17-LSL: Notecard Selector
14 0.25694224 86 hilary mason data-2013-01-22-Introbot: A Script to Ease the Process of Writing Introductory E-mails
15 0.25590003 110 hilary mason data-2013-10-06-What Mugshots Mean For Public Data
16 0.2519035 80 hilary mason data-2012-12-28-Getting Started with Data Science
17 0.243696 63 hilary mason data-2011-09-26-Hacking the Food System: The Ultimate Chocolate Chip Cookie
18 0.2379542 34 hilary mason data-2009-10-16-Data: first and last names from the US Census
19 0.22276281 82 hilary mason data-2013-01-08-Bitly Social Data APIs
20 0.20935091 88 hilary mason data-2013-01-29-I’m a Dead Celebrity!
topicId topicWeight
[(2, 0.105), (15, 0.54), (31, 0.014), (54, 0.017), (56, 0.146), (60, 0.019), (63, 0.035), (87, 0.036)]
simIndex simValue blogId blogTitle
same-blog 1 0.93742114 43 hilary mason data-2010-05-27-E-mail automation, questions and answers
2 0.82158589 85 hilary mason data-2013-01-19-Startups: How to Share Data with Academics
Introduction: Startups: How to Share Data with Academics Posted: January 19, 2013 | Author: Hilary Mason | Filed under: blog | Tags: academics , data , research | 8 Comments » This post assumes that you want to share data. If you’re not convinced, don’t worry — that’s next on my list. You and your academic colleagues will benefit from having at least a quick chat about the research questions they want to address. I’ve read every paper I’ve been able to find that uses bitly data and all of the ones that acquired the data without our assistance had serious flaws, generally based on incorrect assumptions about the data they had acquired (this, unfortunately, makes me question the validity of most research done on commercial social data without cooperation from the subject company). The easiest way to share data is through your own API . Set generous rate limits where possible. Most projects are not realtime and they can gather the data (or, more likely, have a grad
3 0.31570375 105 hilary mason data-2013-07-05-Speaking: Spend at least 1-3 of the time practicing the talk
Introduction: Speaking: Spend at least 1/3 of the time practicing the talk Posted: July 5, 2013 | Author: Hilary Mason | Filed under: speaking | 3 Comments » This week we welcome a guest contribution. Matthew Trentacoste is a recovering academic and a computer scientist at Adobe, where he writes software to make pretty pictures. He’s constantly curious, often about data, and cooks a lot. You can follow his exploits at @mattttrent . In Hilary’s last post, she made the point that your slides != your talk . In a well-crafted talk, your message — in the form of the words you say — needs to dominate while the slides need to play a supporting role. Speak the important parts, and use your slides as a backdrop for what you’re saying. Hilary has provided a valuable strategy in her post, but how should someone approach crafting such a clearly-organized presentation? If you’re just getting started speaking, it can be a real challenge to make a coherent talk and along with slid
4 0.3029139 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
Introduction: Using Twitter’s Lead-Gen Card to Recruit Beta Testers Posted: December 18, 2013 | Author: Hilary Mason | Filed under: blog | Tags: email , hack , twitter | 12 Comments » It turns out that it’s pretty easy to co-opt Twitter’s Lead Generation card for anything where you want to gather a bunch of e-mail addresses from your Twitter community. I was looking for people willing to alpha test a little side project of mine, and it worked great and didn’t cost anything. The tweet itself: Love tech discussion but looking for a better community? Help me beta test a side project! https://t.co/H3DYjbCy19 — Hilary Mason (@hmason) December 12, 2013 I created it pretty easily: First, go to ads.twitter.com , log in, and go to “creatives”, then “cards”. Click “Create Lead Generation Card”. It’s a big blue button. You can include a title and a short description. Curiously, you can also include a 600px by 150px image. This seems like an opportunity to
5 0.28945065 109 hilary mason data-2013-09-30-Need actual random numbers? Meet the NIST randomness beacon.
Introduction: Need actual random numbers? Meet the NIST randomness beacon. Posted: September 30, 2013 | Author: Hilary Mason | Filed under: projects | Tags: beacon , python , random , randomness , randomnumbers | 5 Comments » I wrote a python module that wraps the NIST Randomness Beacon , making it simple to get truly random numbers in python. It’s easy to use: b = Beacon() print b.last_record() print b.previous_record() #and so on There’s also a handy generator for getting a set of n random numbers. (One of the best gifts I ever got was a copy of 1,000,000 Random Numbers , and I’ve been intrigued ever since.) Please note that the randomness beacon is not intended to be a source of cryptographic keys — indeed, it’s a public set of numbers, so I wouldn’t recommend doing anything that could be compromised by someone else having access to the exact same set of numbers . Rather, this is interesting precisely for the scientific opportunities that
6 0.28930983 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics
7 0.2879889 82 hilary mason data-2013-01-08-Bitly Social Data APIs
8 0.27864403 58 hilary mason data-2011-06-22-My Head is Open Source!
9 0.26619262 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
10 0.26593077 7 hilary mason data-2007-07-30-Tip: How to Search Google for Ideas
11 0.24833746 80 hilary mason data-2012-12-28-Getting Started with Data Science
12 0.2428221 40 hilary mason data-2010-02-16-Conference: Search and Social Media 2010
13 0.23647766 46 hilary mason data-2010-08-15-Should you attend Hadoop World? Yes.
14 0.23570904 34 hilary mason data-2009-10-16-Data: first and last names from the US Census
15 0.23565203 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
16 0.23352519 76 hilary mason data-2012-08-28-How do you prioritize research?
17 0.22684768 92 hilary mason data-2013-02-25-A (short) List of Data Science Blogs
18 0.22595164 83 hilary mason data-2013-01-10-Book Book — Goose!
19 0.22047745 91 hilary mason data-2013-02-22-Why YOU (an introverted nerd) Should Try Public Speaking
20 0.22026762 33 hilary mason data-2009-10-03-Hadoop World NYC