hilary_mason_data hilary_mason_data-2013 hilary_mason_data-2013-83 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Book Book — Goose! Posted: January 10, 2013 | Author: Hilary Mason | Filed under: projects | Tags: absurdity , hack | 25 Comments » I like to read. I love bookstores, I like to wander, and to find things that I didn’t know existed. But bookstores don’t have every book that exists. Amazon has most books, but search is a terrible way to discover new things. Amazon’s recommendations most likely maximize purchases, but are a terrible way to find something you didn’t know you were looking for (look at a book like Effective JavaScript , for example, and you get recommendations for Async JavaScript , Building Node Applications with MongoDB and Backbone , JavaScript Enlightenment ). Similarly, top 100 lists are great at showing you popular things that you’re probably more likely to buy, but not very good at helping you find a book with a story or idea that’s unlike anything you’ve read lately. There must be a better way to explore books. A random way to explor
sentIndex sentText sentNum sentScore
1 Posted: January 10, 2013 | Author: Hilary Mason | Filed under: projects | Tags: absurdity , hack | 25 Comments » I like to read. [sent-2, score-0.136]
2 I love bookstores, I like to wander, and to find things that I didn’t know existed. [sent-3, score-0.457]
3 Amazon has most books, but search is a terrible way to discover new things. [sent-5, score-0.405]
4 Similarly, top 100 lists are great at showing you popular things that you’re probably more likely to buy, but not very good at helping you find a book with a story or idea that’s unlike anything you’ve read lately. [sent-7, score-1.746]
5 A random way to explore books would be a good way to start. [sent-9, score-0.995]
6 Update: you can now find @bookbookgoose on Twitter, sharing one random book per hour. [sent-16, score-1.083]
7 Update: Dustin Kurtz at Melville House had an eloquent writeup of the beauty in this random literature . [sent-17, score-0.384]
wordName wordTfidf (topN-words)
[('book', 0.545), ('javascript', 0.273), ('bookbookgoose', 0.236), ('bookstores', 0.236), ('books', 0.234), ('random', 0.199), ('find', 0.191), ('terrible', 0.182), ('way', 0.172), ('amazon', 0.167), ('recommendations', 0.167), ('didn', 0.14), ('explore', 0.133), ('update', 0.122), ('likely', 0.122), ('unlike', 0.101), ('house', 0.101), ('literature', 0.101), ('lists', 0.091), ('effective', 0.091), ('good', 0.085), ('key', 0.084), ('showing', 0.084), ('helping', 0.084), ('writeup', 0.084), ('sharing', 0.084), ('like', 0.083), ('things', 0.08), ('applications', 0.078), ('enjoy', 0.078), ('building', 0.074), ('forward', 0.074), ('top', 0.07), ('must', 0.066), ('popular', 0.066), ('per', 0.064), ('probably', 0.061), ('better', 0.059), ('story', 0.059), ('know', 0.058), ('idea', 0.054), ('every', 0.053), ('anything', 0.053), ('hack', 0.053), ('search', 0.051), ('go', 0.048), ('something', 0.048), ('looking', 0.046), ('love', 0.045), ('january', 0.044)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 83 hilary mason data-2013-01-10-Book Book — Goose!
Introduction: Book Book — Goose! Posted: January 10, 2013 | Author: Hilary Mason | Filed under: projects | Tags: absurdity , hack | 25 Comments » I like to read. I love bookstores, I like to wander, and to find things that I didn’t know existed. But bookstores don’t have every book that exists. Amazon has most books, but search is a terrible way to discover new things. Amazon’s recommendations most likely maximize purchases, but are a terrible way to find something you didn’t know you were looking for (look at a book like Effective JavaScript , for example, and you get recommendations for Async JavaScript , Building Node Applications with MongoDB and Backbone , JavaScript Enlightenment ). Similarly, top 100 lists are great at showing you popular things that you’re probably more likely to buy, but not very good at helping you find a book with a story or idea that’s unlike anything you’ve read lately. There must be a better way to explore books. A random way to explor
2 0.26992726 112 hilary mason data-2013-11-01-Books Recommendations for Programming Excellence
Introduction: Books Recommendations for Programming Excellence Posted: November 1, 2013 | Author: Hilary Mason | Filed under: blog | 15 Comments » Yesterday I asked people on twitter for recommendations for things to read to improve as a programmer. I’m looking mainly for things on the philosophy side of software engineering. I do realize that practice is the most important thing, but sometimes you run into a design question and it’s always helpful to realize that very smart people have, indeed, thought about these things before. I assembled the book recommendations into a bitly bundle . I’ve only read a few of these (generally the older books) and so I can’t recommend specifics, but if you’d care to take a look here they are ! If you see something that you think should be included, please do let me know in the comments and I’ll add it to the list.
3 0.17957638 90 hilary mason data-2013-02-18-One Random Tweet, please.
Introduction: One Random Tweet, please. Posted: February 18, 2013 | Author: Hilary Mason | Filed under: projects | 4 Comments » One random tweet. It’s easy to believe that other people use social networks in the same way that you do. Your friends largely do use them the same way, which gives us an even more biased perspective. Unfortunately, most networks don’t provide a way to explore representative communications that you’re not connected to. Well, now you can! One random tweet , please. Update: There were some slight technical difficulties due to hitting Twitter’s oembed rate limit. They should be repaired now. (Note: between this and bookbookgoose.com I’m on a bit of a random kick lately. There’s a method to this madness!)
4 0.15767713 25 hilary mason data-2009-02-10-JWU Guest Lecture: Introduction to JavaScript and AJAX
Introduction: JWU Guest Lecture: Introduction to JavaScript and AJAX Posted: February 10, 2009 | Author: hilary | Filed under: blog | Tags: ajax , javascript , jwu , presentations | Leave a comment » JWU Guest Talk: JavaScript and AJAX View more presentations from Hilary Mason . (tags: javascript jwu ) I was invited to give a talk at JWU for an audience of graphic design students on how to enhance their XHTML/CSS skills with JavaScript and AJAX. Enjoy the slides, and I’m looking forward to part 2!
5 0.14044033 60 hilary mason data-2011-08-21-What do you read that changes the way you think?
Introduction: What do you read that changes the way you think? Posted: August 21, 2011 | Author: Hilary Mason | Filed under: blog | Tags: books , philosophy , reading | 24 Comments » A friend asked me which of three startup business books she should read. Obama’s reading list since entering office has nothing surprising on it. The most valuable books I read this year have been stories of things very different from what I spend most of my time thinking about. One of my favorites was China Meiville’s The City & The City , which I loved for the ambition and artistry, and another was Simon Winchester’s The Meaning of Everything: The Story of the Oxford English Dictionary , which I loved for the descriptions of creating an analog, scalable information system. What have you read recently that was really great? Edit: Thanks for the recommendations! There are also a bunch over on Google Plus .
6 0.086586863 56 hilary mason data-2011-05-02-How to get a random line from a file in bash.
7 0.083166666 1 hilary mason data-2006-02-20-JavaScript Rotating Images Tutorial
8 0.078289114 109 hilary mason data-2013-09-30-Need actual random numbers? Meet the NIST randomness beacon.
9 0.077766851 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics
10 0.070127659 80 hilary mason data-2012-12-28-Getting Started with Data Science
11 0.069421023 91 hilary mason data-2013-02-22-Why YOU (an introverted nerd) Should Try Public Speaking
12 0.069226719 48 hilary mason data-2010-09-04-Twitter Succeeds Because it Fails
13 0.066775814 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
14 0.065935507 94 hilary mason data-2013-03-08-Speaking: Title Slides + Twitter = You Win
15 0.064414881 77 hilary mason data-2012-09-18-Hey Yahoo, You’re Optimizing the Wrong Thing
16 0.06217701 95 hilary mason data-2013-03-17-Speaking: Entertain, Don’t Teach
17 0.062036756 98 hilary mason data-2013-03-29-Speaking: Use the Narrative Arc
18 0.060958676 105 hilary mason data-2013-07-05-Speaking: Spend at least 1-3 of the time practicing the talk
19 0.060922928 96 hilary mason data-2013-03-23-Speaking: 15 Minutes Or Less Per Idea
20 0.056348275 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
topicId topicWeight
[(0, -0.21), (1, 0.096), (2, -0.122), (3, 0.029), (4, 0.049), (5, 0.214), (6, -0.001), (7, 0.333), (8, -0.023), (9, 0.025), (10, 0.316), (11, 0.174), (12, -0.048), (13, 0.039), (14, 0.14), (15, -0.184), (16, 0.039), (17, -0.223), (18, -0.076), (19, -0.029), (20, 0.026), (21, 0.117), (22, 0.053), (23, 0.006), (24, 0.077), (25, -0.05), (26, 0.151), (27, 0.009), (28, -0.167), (29, 0.029), (30, -0.071), (31, -0.012), (32, -0.015), (33, 0.047), (34, -0.083), (35, 0.033), (36, 0.059), (37, 0.04), (38, -0.033), (39, 0.055), (40, 0.038), (41, 0.092), (42, 0.105), (43, -0.008), (44, 0.086), (45, -0.086), (46, -0.006), (47, 0.027), (48, -0.084), (49, -0.011)]
simIndex simValue blogId blogTitle
same-blog 1 0.98632729 83 hilary mason data-2013-01-10-Book Book — Goose!
Introduction: Book Book — Goose! Posted: January 10, 2013 | Author: Hilary Mason | Filed under: projects | Tags: absurdity , hack | 25 Comments » I like to read. I love bookstores, I like to wander, and to find things that I didn’t know existed. But bookstores don’t have every book that exists. Amazon has most books, but search is a terrible way to discover new things. Amazon’s recommendations most likely maximize purchases, but are a terrible way to find something you didn’t know you were looking for (look at a book like Effective JavaScript , for example, and you get recommendations for Async JavaScript , Building Node Applications with MongoDB and Backbone , JavaScript Enlightenment ). Similarly, top 100 lists are great at showing you popular things that you’re probably more likely to buy, but not very good at helping you find a book with a story or idea that’s unlike anything you’ve read lately. There must be a better way to explore books. A random way to explor
2 0.63381827 112 hilary mason data-2013-11-01-Books Recommendations for Programming Excellence
Introduction: Books Recommendations for Programming Excellence Posted: November 1, 2013 | Author: Hilary Mason | Filed under: blog | 15 Comments » Yesterday I asked people on twitter for recommendations for things to read to improve as a programmer. I’m looking mainly for things on the philosophy side of software engineering. I do realize that practice is the most important thing, but sometimes you run into a design question and it’s always helpful to realize that very smart people have, indeed, thought about these things before. I assembled the book recommendations into a bitly bundle . I’ve only read a few of these (generally the older books) and so I can’t recommend specifics, but if you’d care to take a look here they are ! If you see something that you think should be included, please do let me know in the comments and I’ll add it to the list.
3 0.51763874 25 hilary mason data-2009-02-10-JWU Guest Lecture: Introduction to JavaScript and AJAX
Introduction: JWU Guest Lecture: Introduction to JavaScript and AJAX Posted: February 10, 2009 | Author: hilary | Filed under: blog | Tags: ajax , javascript , jwu , presentations | Leave a comment » JWU Guest Talk: JavaScript and AJAX View more presentations from Hilary Mason . (tags: javascript jwu ) I was invited to give a talk at JWU for an audience of graphic design students on how to enhance their XHTML/CSS skills with JavaScript and AJAX. Enjoy the slides, and I’m looking forward to part 2!
4 0.4816896 60 hilary mason data-2011-08-21-What do you read that changes the way you think?
Introduction: What do you read that changes the way you think? Posted: August 21, 2011 | Author: Hilary Mason | Filed under: blog | Tags: books , philosophy , reading | 24 Comments » A friend asked me which of three startup business books she should read. Obama’s reading list since entering office has nothing surprising on it. The most valuable books I read this year have been stories of things very different from what I spend most of my time thinking about. One of my favorites was China Meiville’s The City & The City , which I loved for the ambition and artistry, and another was Simon Winchester’s The Meaning of Everything: The Story of the Oxford English Dictionary , which I loved for the descriptions of creating an analog, scalable information system. What have you read recently that was really great? Edit: Thanks for the recommendations! There are also a bunch over on Google Plus .
5 0.47066236 90 hilary mason data-2013-02-18-One Random Tweet, please.
Introduction: One Random Tweet, please. Posted: February 18, 2013 | Author: Hilary Mason | Filed under: projects | 4 Comments » One random tweet. It’s easy to believe that other people use social networks in the same way that you do. Your friends largely do use them the same way, which gives us an even more biased perspective. Unfortunately, most networks don’t provide a way to explore representative communications that you’re not connected to. Well, now you can! One random tweet , please. Update: There were some slight technical difficulties due to hitting Twitter’s oembed rate limit. They should be repaired now. (Note: between this and bookbookgoose.com I’m on a bit of a random kick lately. There’s a method to this madness!)
6 0.30747229 1 hilary mason data-2006-02-20-JavaScript Rotating Images Tutorial
7 0.2636508 109 hilary mason data-2013-09-30-Need actual random numbers? Meet the NIST randomness beacon.
8 0.22969255 56 hilary mason data-2011-05-02-How to get a random line from a file in bash.
9 0.22249936 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics
10 0.22248383 100 hilary mason data-2013-04-05-Speaking: 1 Kitten per Equation
11 0.20575482 96 hilary mason data-2013-03-23-Speaking: 15 Minutes Or Less Per Idea
12 0.17299461 77 hilary mason data-2012-09-18-Hey Yahoo, You’re Optimizing the Wrong Thing
13 0.17146391 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
14 0.16554753 48 hilary mason data-2010-09-04-Twitter Succeeds Because it Fails
15 0.16149265 91 hilary mason data-2013-02-22-Why YOU (an introverted nerd) Should Try Public Speaking
16 0.16121659 98 hilary mason data-2013-03-29-Speaking: Use the Narrative Arc
17 0.16064394 3 hilary mason data-2007-06-08-The Best Time to Search for Academic Jobs
18 0.15180327 80 hilary mason data-2012-12-28-Getting Started with Data Science
19 0.14562708 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
20 0.14195353 36 hilary mason data-2009-11-10-My code is on TV (and so am I)!
topicId topicWeight
[(2, 0.218), (56, 0.15), (58, 0.494)]
simIndex simValue blogId blogTitle
1 0.98463511 56 hilary mason data-2011-05-02-How to get a random line from a file in bash.
Introduction: How to get a random line from a file in bash. Posted: May 2, 2011 | Author: Hilary Mason | Filed under: blog | Tags: bash , code , one-liner , remember , tips | 25 Comments » I work with a lot of data, and while I’d like to pretend it’s all in upside-down quasi-indexed b-tree rocket ships or some other advanced database, the truth is that much of it is in text files. I often find myself wanting to see a random line from one of these files, just to get a sense of what the data looks like. I thought there must be an easy bash way to do this, but I couldn’t find it (‘shuf’ isn’t installed on my server), so I turned to twitter, and now I’m pleased to present more methods for finding a random line than you ever expected! sort -R | head -n 1 If you can use this, do so! If it isn’t available, consider one of the following commands: @andrewgilmartin suggests using awk: awk 'BEGIN { srand() } rand() >= 0.5 { print; exit }' @devinteske offer
2 0.97475249 102 hilary mason data-2013-05-03-Speaking: Explaining Technical Information to a Mixed Audience
Introduction: Speaking: Explaining Technical Information to a Mixed Audience Posted: May 3, 2013 | Author: Hilary Mason | Filed under: speaking | Tags: puppies , speaking | 11 Comments » It’s a challenge to present deeply technical material to a room of people with varying expertise levels. If you leave it out, you’re abandoning the substance of your presentation. If you focus on it exclusively, you will lose most of the room. Instead, include the material, but plan to repeat it two (or even three!) times. The first time you explain it, explain it for the expert audience. The second time you explain it, walk through an example of what the system enables. If you’re audience is on Twitter, throw in a third version — the concise and tweetable one! Let’s say we were giving a talk about a machine learning system to classify puppies. Slide one would have a technical diagram of the architecture of the system, and you might explain it as: “We use a naive bayesian classi
same-blog 3 0.93246806 83 hilary mason data-2013-01-10-Book Book — Goose!
Introduction: Book Book — Goose! Posted: January 10, 2013 | Author: Hilary Mason | Filed under: projects | Tags: absurdity , hack | 25 Comments » I like to read. I love bookstores, I like to wander, and to find things that I didn’t know existed. But bookstores don’t have every book that exists. Amazon has most books, but search is a terrible way to discover new things. Amazon’s recommendations most likely maximize purchases, but are a terrible way to find something you didn’t know you were looking for (look at a book like Effective JavaScript , for example, and you get recommendations for Async JavaScript , Building Node Applications with MongoDB and Backbone , JavaScript Enlightenment ). Similarly, top 100 lists are great at showing you popular things that you’re probably more likely to buy, but not very good at helping you find a book with a story or idea that’s unlike anything you’ve read lately. There must be a better way to explore books. A random way to explor
4 0.51016915 105 hilary mason data-2013-07-05-Speaking: Spend at least 1-3 of the time practicing the talk
Introduction: Speaking: Spend at least 1/3 of the time practicing the talk Posted: July 5, 2013 | Author: Hilary Mason | Filed under: speaking | 3 Comments » This week we welcome a guest contribution. Matthew Trentacoste is a recovering academic and a computer scientist at Adobe, where he writes software to make pretty pictures. He’s constantly curious, often about data, and cooks a lot. You can follow his exploits at @mattttrent . In Hilary’s last post, she made the point that your slides != your talk . In a well-crafted talk, your message — in the form of the words you say — needs to dominate while the slides need to play a supporting role. Speak the important parts, and use your slides as a backdrop for what you’re saying. Hilary has provided a valuable strategy in her post, but how should someone approach crafting such a clearly-organized presentation? If you’re just getting started speaking, it can be a real challenge to make a coherent talk and along with slid
5 0.43419456 34 hilary mason data-2009-10-16-Data: first and last names from the US Census
Introduction: Data: first and last names from the US Census Posted: October 16, 2009 | Author: hilary | Filed under: blog | Tags: data , dataset , mysql , sql | 1 Comment » I’ve found myself in need of a name distribution for a few projects recently, so I thought I would post it here so I won’t have to go looking for it again. The data is available from the US Census Bureau (from 1990 census) here , and I have it here in a friendly MySQL *.sql format (it will create the tables and insert the data). There are three tables: male first names, female first names, and surnames. I’ve noted several issues in the data that are likely the result of typos, so make sure to do your own validation if your application requires it. The format is simple: the name frequency (percentage of people in the sampled population with that name) cumulative frequency (as you read down the list, the percentage of total population covered) rank If you want to use this to generate
6 0.41338006 90 hilary mason data-2013-02-18-One Random Tweet, please.
7 0.40836629 60 hilary mason data-2011-08-21-What do you read that changes the way you think?
8 0.40666181 109 hilary mason data-2013-09-30-Need actual random numbers? Meet the NIST randomness beacon.
9 0.40575561 91 hilary mason data-2013-02-22-Why YOU (an introverted nerd) Should Try Public Speaking
10 0.39461997 94 hilary mason data-2013-03-08-Speaking: Title Slides + Twitter = You Win
11 0.39257297 80 hilary mason data-2012-12-28-Getting Started with Data Science
12 0.38810143 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
13 0.38688469 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
14 0.38568351 113 hilary mason data-2013-11-22-Speaking: Two Questions to Ask Before You Give a Talk
15 0.38334766 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
16 0.37582162 85 hilary mason data-2013-01-19-Startups: How to Share Data with Academics
17 0.3653653 86 hilary mason data-2013-01-22-Introbot: A Script to Ease the Process of Writing Introductory E-mails
18 0.35702267 100 hilary mason data-2013-04-05-Speaking: 1 Kitten per Equation
19 0.35091037 82 hilary mason data-2013-01-08-Bitly Social Data APIs
20 0.34810281 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics