hilary_mason_data hilary_mason_data-2013 hilary_mason_data-2013-90 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: One Random Tweet, please. Posted: February 18, 2013 | Author: Hilary Mason | Filed under: projects | 4 Comments » One random tweet. It’s easy to believe that other people use social networks in the same way that you do. Your friends largely do use them the same way, which gives us an even more biased perspective. Unfortunately, most networks don’t provide a way to explore representative communications that you’re not connected to. Well, now you can! One random tweet , please. Update: There were some slight technical difficulties due to hitting Twitter’s oembed rate limit. They should be repaired now. (Note: between this and bookbookgoose.com I’m on a bit of a random kick lately. There’s a method to this madness!)
sentIndex sentText sentNum sentScore
1 Posted: February 18, 2013 | Author: Hilary Mason | Filed under: projects | 4 Comments » One random tweet. [sent-2, score-0.58]
2 It’s easy to believe that other people use social networks in the same way that you do. [sent-3, score-1.068]
3 Your friends largely do use them the same way, which gives us an even more biased perspective. [sent-4, score-0.731]
4 Unfortunately, most networks don’t provide a way to explore representative communications that you’re not connected to. [sent-5, score-1.232]
5 Update: There were some slight technical difficulties due to hitting Twitter’s oembed rate limit. [sent-8, score-0.429]
wordName wordTfidf (topN-words)
[('random', 0.502), ('networks', 0.381), ('tweet', 0.295), ('way', 0.195), ('connected', 0.191), ('largely', 0.191), ('due', 0.191), ('communications', 0.191), ('kick', 0.191), ('well', 0.158), ('provide', 0.148), ('believe', 0.148), ('friends', 0.139), ('rate', 0.139), ('unfortunately', 0.132), ('gives', 0.132), ('explore', 0.126), ('one', 0.121), ('note', 0.12), ('update', 0.115), ('bit', 0.111), ('use', 0.105), ('social', 0.103), ('technical', 0.099), ('february', 0.096), ('easy', 0.096), ('even', 0.09), ('twitter', 0.078), ('projects', 0.078), ('us', 0.074), ('re', 0.072), ('people', 0.04), ('mason', 0.022), ('comments', 0.015)]
simIndex simValue blogId blogTitle
same-blog 1 1.0000001 90 hilary mason data-2013-02-18-One Random Tweet, please.
Introduction: One Random Tweet, please. Posted: February 18, 2013 | Author: Hilary Mason | Filed under: projects | 4 Comments » One random tweet. It’s easy to believe that other people use social networks in the same way that you do. Your friends largely do use them the same way, which gives us an even more biased perspective. Unfortunately, most networks don’t provide a way to explore representative communications that you’re not connected to. Well, now you can! One random tweet , please. Update: There were some slight technical difficulties due to hitting Twitter’s oembed rate limit. They should be repaired now. (Note: between this and bookbookgoose.com I’m on a bit of a random kick lately. There’s a method to this madness!)
2 0.19897094 109 hilary mason data-2013-09-30-Need actual random numbers? Meet the NIST randomness beacon.
Introduction: Need actual random numbers? Meet the NIST randomness beacon. Posted: September 30, 2013 | Author: Hilary Mason | Filed under: projects | Tags: beacon , python , random , randomness , randomnumbers | 5 Comments » I wrote a python module that wraps that NIST Randomness Beacon , making it simple to get truly random numbers in python. It’s easy to use: b = Beacon() print b.last_record() print b.previous_record() #and so on There’s also a handy generator for getting a set of n random numbers. (One of the best gifts I ever got was a copy of 1,000,000 Random Numbers , and I’ve been intrigued ever since.) Please note that this the randomness beacon is not intended to be a source of cryptographic keys — indeed, it’s a public set of numbers, so I wouldn’t recommend doing anything that could be compromised by someone else having the access to the exact same set of numbers . Rather, this is interesting precisely for the scientific opportunities that
3 0.17957638 83 hilary mason data-2013-01-10-Book Book — Goose!
Introduction: Book Book — Goose! Posted: January 10, 2013 | Author: Hilary Mason | Filed under: projects | Tags: absurdity , hack | 25 Comments » I like to read. I love bookstores, I like to wander, and to find things that I didn’t know existed. But bookstores don’t have every book that exists. Amazon has most books, but search is a terrible way to discover new things. Amazon’s recommendations most likely maximize purchases, but are a terrible way to find something you didn’t know you were looking for (look at a book like Effective JavaScript , for example, and you get recommendations for Async JavaScript , Building Node Applications with MongoDB and Backbone , JavaScript Enlightenment ). Similarly, top 100 lists are great at showing you popular things that you’re probably more likely to buy, but not very good at helping you find a book with a story or idea that’s unlike anything you’ve read lately. There must be a better way to explore books. A random way to explor
4 0.17407215 56 hilary mason data-2011-05-02-How to get a random line from a file in bash.
Introduction: How to get a random line from a file in bash. Posted: May 2, 2011 | Author: Hilary Mason | Filed under: blog | Tags: bash , code , one-liner , remember , tips | 25 Comments » I work with a lot of data, and while I’d like to pretend it’s all in upside-down quasi-indexed b-tree rocket ships or some other advanced database, the truth is that much of it is in text files. I often find myself wanting to see a random line from one of these files, just to get a sense of what the data looks like. I thought there must be an easy bash way to do this, but I couldn’t find it (‘shuf’ isn’t installed on my server), so I turned to twitter, and now I’m pleased to present more methods for finding a random line than you ever expected! sort -R | head -n 1 If you can use this, do so! If it isn’t available, consider one of the following commands: @andrewgilmartin suggests using awk: awk 'BEGIN { srand() } rand() >= 0.5 { print; exit }' @devinteske offer
5 0.10007437 1 hilary mason data-2006-02-20-JavaScript Rotating Images Tutorial
Introduction: JavaScript Rotating Images Tutorial Posted: February 20, 2006 | Author: hilary | Filed under: articles , blog | Tags: javascript , web dev | 1 Comment » Introduction Swapping new images into an established layout is one trick that can keep a site design fresh. Some examples: gettyimages benstraw There are several different techniques for accomplishing this effect. A server-side program is the best approach, because you can automatically pull files from a whole directory (eliminating the need to modify code), and it will work even if a user has JavaScript disabled. Unfortunately, not everyone has access to a host that supports server-side scripting. That’s okay, because the same thing can be accomplished with client-side code in JavaScript. I’ll show you how, with three JavaScript functions of varying levels of ease and functionality. First, gather two or more images that are the same size. Then read on… Setup Click here to see t
6 0.098303944 13 hilary mason data-2008-01-22-Create a group Twitter account
7 0.088624433 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
8 0.087502494 48 hilary mason data-2010-09-04-Twitter Succeeds Because it Fails
9 0.080655091 94 hilary mason data-2013-03-08-Speaking: Title Slides + Twitter = You Win
10 0.07979022 36 hilary mason data-2009-11-10-My code is on TV (and so am I)!
11 0.078723013 85 hilary mason data-2013-01-19-Startups: How to Share Data with Academics
12 0.065130085 100 hilary mason data-2013-04-05-Speaking: 1 Kitten per Equation
13 0.064477973 11 hilary mason data-2007-10-07-An Experience with Using a Wiki for a Collaborative Classroom Documentation Project
14 0.0616589 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics
15 0.055604126 40 hilary mason data-2010-02-16-Conference: Search and Social Media 2010
16 0.05365932 102 hilary mason data-2013-05-03-Speaking: Explaining Technical Information to a Mixed Audience
17 0.052254044 34 hilary mason data-2009-10-16-Data: first and last names from the US Census
18 0.05026684 82 hilary mason data-2013-01-08-Bitly Social Data APIs
19 0.048536316 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
20 0.047612172 4 hilary mason data-2007-06-11-Teaching Search Techniques with Google Games
topicId topicWeight
[(0, -0.173), (1, 0.057), (2, -0.132), (3, 0.006), (4, 0.005), (5, 0.199), (6, -0.098), (7, 0.294), (8, 0.083), (9, -0.228), (10, 0.225), (11, -0.24), (12, 0.119), (13, 0.001), (14, 0.072), (15, -0.191), (16, -0.06), (17, -0.009), (18, 0.093), (19, 0.079), (20, -0.005), (21, 0.137), (22, 0.009), (23, 0.031), (24, 0.107), (25, 0.108), (26, 0.019), (27, 0.086), (28, -0.003), (29, 0.046), (30, 0.083), (31, 0.064), (32, 0.024), (33, 0.019), (34, 0.142), (35, 0.076), (36, 0.034), (37, -0.021), (38, -0.063), (39, -0.016), (40, -0.171), (41, -0.057), (42, -0.017), (43, 0.041), (44, 0.032), (45, 0.002), (46, -0.076), (47, 0.084), (48, -0.056), (49, -0.049)]
simIndex simValue blogId blogTitle
same-blog 1 0.99206913 90 hilary mason data-2013-02-18-One Random Tweet, please.
Introduction: One Random Tweet, please. Posted: February 18, 2013 | Author: Hilary Mason | Filed under: projects | 4 Comments » One random tweet. It’s easy to believe that other people use social networks in the same way that you do. Your friends largely do use them the same way, which gives us an even more biased perspective. Unfortunately, most networks don’t provide a way to explore representative communications that you’re not connected to. Well, now you can! One random tweet , please. Update: There were some slight technical difficulties due to hitting Twitter’s oembed rate limit. They should be repaired now. (Note: between this and bookbookgoose.com I’m on a bit of a random kick lately. There’s a method to this madness!)
2 0.69738436 109 hilary mason data-2013-09-30-Need actual random numbers? Meet the NIST randomness beacon.
Introduction: Need actual random numbers? Meet the NIST randomness beacon. Posted: September 30, 2013 | Author: Hilary Mason | Filed under: projects | Tags: beacon , python , random , randomness , randomnumbers | 5 Comments » I wrote a python module that wraps that NIST Randomness Beacon , making it simple to get truly random numbers in python. It’s easy to use: b = Beacon() print b.last_record() print b.previous_record() #and so on There’s also a handy generator for getting a set of n random numbers. (One of the best gifts I ever got was a copy of 1,000,000 Random Numbers , and I’ve been intrigued ever since.) Please note that this the randomness beacon is not intended to be a source of cryptographic keys — indeed, it’s a public set of numbers, so I wouldn’t recommend doing anything that could be compromised by someone else having the access to the exact same set of numbers . Rather, this is interesting precisely for the scientific opportunities that
3 0.6176337 56 hilary mason data-2011-05-02-How to get a random line from a file in bash.
Introduction: How to get a random line from a file in bash. Posted: May 2, 2011 | Author: Hilary Mason | Filed under: blog | Tags: bash , code , one-liner , remember , tips | 25 Comments » I work with a lot of data, and while I’d like to pretend it’s all in upside-down quasi-indexed b-tree rocket ships or some other advanced database, the truth is that much of it is in text files. I often find myself wanting to see a random line from one of these files, just to get a sense of what the data looks like. I thought there must be an easy bash way to do this, but I couldn’t find it (‘shuf’ isn’t installed on my server), so I turned to twitter, and now I’m pleased to present more methods for finding a random line than you ever expected! sort -R | head -n 1 If you can use this, do so! If it isn’t available, consider one of the following commands: @andrewgilmartin suggests using awk: awk 'BEGIN { srand() } rand() >= 0.5 { print; exit }' @devinteske offer
4 0.49378291 83 hilary mason data-2013-01-10-Book Book — Goose!
Introduction: Book Book — Goose! Posted: January 10, 2013 | Author: Hilary Mason | Filed under: projects | Tags: absurdity , hack | 25 Comments » I like to read. I love bookstores, I like to wander, and to find things that I didn’t know existed. But bookstores don’t have every book that exists. Amazon has most books, but search is a terrible way to discover new things. Amazon’s recommendations most likely maximize purchases, but are a terrible way to find something you didn’t know you were looking for (look at a book like Effective JavaScript , for example, and you get recommendations for Async JavaScript , Building Node Applications with MongoDB and Backbone , JavaScript Enlightenment ). Similarly, top 100 lists are great at showing you popular things that you’re probably more likely to buy, but not very good at helping you find a book with a story or idea that’s unlike anything you’ve read lately. There must be a better way to explore books. A random way to explor
5 0.35911155 1 hilary mason data-2006-02-20-JavaScript Rotating Images Tutorial
Introduction: JavaScript Rotating Images Tutorial Posted: February 20, 2006 | Author: hilary | Filed under: articles , blog | Tags: javascript , web dev | 1 Comment » Introduction Swapping new images into an established layout is one trick that can keep a site design fresh. Some examples: gettyimages benstraw There are several different techniques for accomplishing this effect. A server-side program is the best approach, because you can automatically pull files from a whole directory (eliminating the need to modify code), and it will work even if a user has JavaScript disabled. Unfortunately, not everyone has access to a host that supports server-side scripting. That’s okay, because the same thing can be accomplished with client-side code in JavaScript. I’ll show you how, with three JavaScript functions of varying levels of ease and functionality. First, gather two or more images that are the same size. Then read on… Setup Click here to see t
6 0.34429207 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
7 0.2961067 48 hilary mason data-2010-09-04-Twitter Succeeds Because it Fails
8 0.26636413 36 hilary mason data-2009-11-10-My code is on TV (and so am I)!
9 0.26198959 100 hilary mason data-2013-04-05-Speaking: 1 Kitten per Equation
10 0.25708187 13 hilary mason data-2008-01-22-Create a group Twitter account
11 0.23636444 94 hilary mason data-2013-03-08-Speaking: Title Slides + Twitter = You Win
12 0.21319702 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics
13 0.19366191 11 hilary mason data-2007-10-07-An Experience with Using a Wiki for a Collaborative Classroom Documentation Project
14 0.18433374 85 hilary mason data-2013-01-19-Startups: How to Share Data with Academics
15 0.16642208 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
16 0.14888375 82 hilary mason data-2013-01-08-Bitly Social Data APIs
17 0.1488065 4 hilary mason data-2007-06-11-Teaching Search Techniques with Google Games
18 0.14717531 40 hilary mason data-2010-02-16-Conference: Search and Social Media 2010
19 0.1416661 102 hilary mason data-2013-05-03-Speaking: Explaining Technical Information to a Mixed Audience
20 0.12901691 34 hilary mason data-2009-10-16-Data: first and last names from the US Census
topicId topicWeight
[(2, 0.775), (56, 0.058)]
simIndex simValue blogId blogTitle
1 0.9999994 60 hilary mason data-2011-08-21-What do you read that changes the way you think?
Introduction: What do you read that changes the way you think? Posted: August 21, 2011 | Author: Hilary Mason | Filed under: blog | Tags: books , philosophy , reading | 24 Comments » A friend asked me which of three startup business books she should read. Obama’s reading list since entering office has nothing surprising on it. The most valuable books I read this year have been stories of things very different from what I spend most of my time thinking about. One of my favorites was China Meiville’s The City & The City , which I loved for the ambition and artistry, and another was Simon Winchester’s The Meaning of Everything: The Story of the Oxford English Dictionary , which I loved for the descriptions of creating an analog, scalable information system. What have you read recently that was really great? Edit: Thanks for the recommendations! There are also a bunch over on Google Plus .
same-blog 2 0.99974614 90 hilary mason data-2013-02-18-One Random Tweet, please.
Introduction: One Random Tweet, please. Posted: February 18, 2013 | Author: Hilary Mason | Filed under: projects | 4 Comments » One random tweet. It’s easy to believe that other people use social networks in the same way that you do. Your friends largely do use them the same way, which gives us an even more biased perspective. Unfortunately, most networks don’t provide a way to explore representative communications that you’re not connected to. Well, now you can! One random tweet , please. Update: There were some slight technical difficulties due to hitting Twitter’s oembed rate limit. They should be repaired now. (Note: between this and bookbookgoose.com I’m on a bit of a random kick lately. There’s a method to this madness!)
3 0.99853593 94 hilary mason data-2013-03-08-Speaking: Title Slides + Twitter = You Win
Introduction: Speaking: Title Slides + Twitter = You Win Posted: March 8, 2013 | Author: Hilary Mason | Filed under: speaking | Tags: slides , speaking , title , twitter | 2 Comments » Your title slide should focus on the title of the talk. It should also include your name and affiliation, your logo if you have a cute one, possibly your blog or e-mail address if you want people to get in touch, and your twitter handle. Here’s one of mine: I usually mention that the beginning of the talk that if people have questions they can tweet them at me. This isn’t just because Twitter is a great way to get questions from people too shy to speak up (or who don’t get an opportunity). Here’s the hack: letting people know that you’ll be reading everything they say about your talk on Twitter makes them more likely to say nice things. Further, in a multi-track conference, people who weren’t actually in your talk (or were there but not paying a lot of attention) will judge your
4 0.9974376 34 hilary mason data-2009-10-16-Data: first and last names from the US Census
Introduction: Data: first and last names from the US Census Posted: October 16, 2009 | Author: hilary | Filed under: blog | Tags: data , dataset , mysql , sql | 1 Comment » I’ve found myself in need of a name distribution for a few projects recently, so I thought I would post it here so I won’t have to go looking for it again. The data is available from the US Census Bureau (from 1990 census) here , and I have it here in a friendly MySQL *.sql format (it will create the tables and insert the data). There are three tables: male first names, female first names, and surnames. I’ve noted several issues in the data that are likely the result of typos, so make sure to do your own validation if your application requires it. The format is simple: the name frequency (percentage of people in the sampled population with that name) cumulative frequency (as you read down the list, the percentage of total population covered) rank If you want to use this to generate
5 0.8173992 105 hilary mason data-2013-07-05-Speaking: Spend at least 1-3 of the time practicing the talk
Introduction: Speaking: Spend at least 1/3 of the time practicing the talk Posted: July 5, 2013 | Author: Hilary Mason | Filed under: speaking | 3 Comments » This week we welcome a guest contribution. Matthew Trentacoste is a recovering academic and a computer scientist at Adobe, where he writes software to make pretty pictures. He’s constantly curious, often about data, and cooks a lot. You can follow his exploits at @mattttrent . In Hilary’s last post, she made the point that your slides != your talk . In a well-crafted talk, your message — in the form of the words you say — needs to dominate while the slides need to play a supporting role. Speak the important parts, and use your slides as a backdrop for what you’re saying. Hilary has provided a valuable strategy in her post, but how should someone approach crafting such a clearly-organized presentation? If you’re just getting started speaking, it can be a real challenge to make a coherent talk and along with slid
6 0.81476992 93 hilary mason data-2013-03-01-Speaking: Pick a Vague and Specific Title for Your Talk
7 0.76105428 91 hilary mason data-2013-02-22-Why YOU (an introverted nerd) Should Try Public Speaking
8 0.75685996 113 hilary mason data-2013-11-22-Speaking: Two Questions to Ask Before You Give a Talk
9 0.75039059 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
10 0.67789382 95 hilary mason data-2013-03-17-Speaking: Entertain, Don’t Teach
11 0.6569373 104 hilary mason data-2013-06-14-Speaking: Your Slides != Your Talk
12 0.65285695 83 hilary mason data-2013-01-10-Book Book — Goose!
13 0.64245242 13 hilary mason data-2008-01-22-Create a group Twitter account
14 0.60395557 100 hilary mason data-2013-04-05-Speaking: 1 Kitten per Equation
15 0.59628022 82 hilary mason data-2013-01-08-Bitly Social Data APIs
16 0.58549273 80 hilary mason data-2012-12-28-Getting Started with Data Science
17 0.58433723 37 hilary mason data-2009-11-25-IgniteNYC: How to Replace Yourself with a Very Small Shell Script
18 0.5678817 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
19 0.56721056 64 hilary mason data-2011-10-10-I’m in Glamour Magazine!
20 0.56620061 48 hilary mason data-2010-09-04-Twitter Succeeds Because it Fails