hilary_mason_data hilary_mason_data-2013 hilary_mason_data-2013-101 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Et tu, Google? Posted: April 14, 2013 | Author: Hilary Mason | Filed under: blog | Tags: google , search | 8 Comments » In 2008, cuil , a search engine startup, displayed my bio alongside a photo of deceased actress Hilary Mason . In January 2013, Bing confused us , this time putting my photo next to her bio (they fixed it after a suitable amount of mocking on Twitter). Today, Google did the same thing . ( live search link ) Today I win the internet? If you zoom in on the bio section, you can clearly see that it’s her bio with a photo of me (originally from Crain’s New York 40 under Forty ). Further, if you go into her filmography, you continue to see my photo. I’m most proud of my starring role in the amazing film Robot Jox . (bottom right of the image below) I know that entity disambiguation is a hard problem. I’ve worked on it, though never with the kind of resources that I imagine Google can bring to it. And yet, this
sentIndex sentText sentNum sentScore
1 Posted: April 14, 2013 | Author: Hilary Mason | Filed under: blog | Tags: google , search | 8 Comments » In 2008, cuil , a search engine startup, displayed my bio alongside a photo of deceased actress Hilary Mason . [sent-2, score-1.921]
2 In January 2013, Bing confused us , this time putting my photo next to her bio (they fixed it after a suitable amount of mocking on Twitter). [sent-3, score-1.065]
3 If you zoom in on the bio section, you can clearly see that it’s her bio with a photo of me (originally from Crain’s New York 40 under Forty ). [sent-6, score-1.332]
4 Further, if you go into her filmography, you continue to see my photo. [sent-7, score-0.181]
5 I’m most proud of my starring role in the amazing film Robot Jox . [sent-8, score-0.307]
6 (bottom right of the image below) I know that entity disambiguation is a hard problem. [sent-9, score-0.428]
7 I’ve worked on it, though never with the kind of resources that I imagine Google can bring to it. [sent-10, score-0.693]
8 Note: It’s also been pointed out to me that there’s a slim possibility that Google’s confusion stems from my own post about Bing’s error, in which case, this post will certainly make the confusion worse. [sent-12, score-0.947]
wordName wordTfidf (topN-words)
[('bio', 0.456), ('google', 0.331), ('confusion', 0.267), ('photo', 0.25), ('bing', 0.228), ('bring', 0.205), ('search', 0.172), ('today', 0.166), ('originally', 0.114), ('continue', 0.114), ('robot', 0.114), ('proud', 0.114), ('film', 0.114), ('pointed', 0.114), ('actress', 0.114), ('confused', 0.114), ('cuil', 0.114), ('deceased', 0.114), ('entity', 0.114), ('error', 0.114), ('section', 0.114), ('disambiguation', 0.103), ('clearly', 0.103), ('engine', 0.103), ('post', 0.102), ('resources', 0.095), ('live', 0.095), ('alongside', 0.095), ('imagine', 0.095), ('certainly', 0.095), ('yet', 0.095), ('putting', 0.095), ('win', 0.095), ('worked', 0.088), ('amount', 0.088), ('startup', 0.083), ('link', 0.083), ('amazing', 0.079), ('image', 0.075), ('note', 0.072), ('though', 0.072), ('kind', 0.072), ('hard', 0.072), ('see', 0.067), ('never', 0.066), ('april', 0.066), ('say', 0.066), ('right', 0.064), ('york', 0.064), ('next', 0.062)]
simIndex simValue blogId blogTitle
same-blog 1 0.9999997 101 hilary mason data-2013-04-14-Et tu, Google?
Introduction: Et tu, Google? Posted: April 14, 2013 | Author: Hilary Mason | Filed under: blog | Tags: google , search | 8 Comments » In 2008, cuil , a search engine startup, displayed my bio alongside a photo of deceased actress Hilary Mason . In January 2013, Bing confused us , this time putting my photo next to her bio (they fixed it after a suitable amount of mocking on Twitter). Today, Google did the same thing . ( live search link ) Today I win the internet? If you zoom in on the bio section, you can clearly see that it’s her bio with a photo of me (originally from Crain’s New York 40 under Forty ). Further, if you go into her filmography, you continue to see my photo. I’m most proud of my starring role in the amazing film Robot Jox . (bottom right of the image below) I know that entity disambiguation is a hard problem. I’ve worked on it, though never with the kind of resources that I imagine Google can bring to it. And yet, this
2 0.39383575 88 hilary mason data-2013-01-29-I’m a Dead Celebrity!
Introduction: I’m a Dead Celebrity! Posted: January 29, 2013 | Author: Hilary Mason | Filed under: blog | 4 Comments » Hilary Mason, Bing Celebrity I have a Google alert set up for my name, and over the weekend it sent me here . Update: Bing has removed the page and now redirects to a regular search. It’s a page on Bing Celebrities, merging my information with information about Hilary Mason, the (now deceased) British actress . According to this page, I have starred in movies before I was born and made videos after I died. It’s my photo and her filmography. It’s creepy, but it’s also intriguing. How does this happen? The data is credited to AMG and inbaseline , whose domain, though linked directly from Bing, does not resolve. Entity disambiguation is certainly a challenge, but I expect more from Microsoft, with so much data and so many brains. This kind of error makes it extremely clear that identity is not a solved problem . I’ve written a bit about iden
3 0.23921989 97 hilary mason data-2013-03-23-Why Google Now is Awesome
Introduction: Why Google Now is Awesome Posted: March 23, 2013 | Author: Hilary Mason | Filed under: blog | Tags: google | 11 Comments » Google Now is an extension to Google’s Android search app that uses all of the data that Google has about you along with what it can guess about your current context to present the information it thinks you need when it thinks you need it. It’ll tell you to leave a bit early to make your next calendar event because of heavy traffic, or that it’s a friend’s birthday, or that there’s a cool cafe nearby where you are. I think it’s amazing. It’s amazing because this is the first Google product that takes ALL OF THE DATA that they have about us and actually makes it useful for us . Not for advertisers. Finally.
4 0.19926047 4 hilary mason data-2007-06-11-Teaching Search Techniques with Google Games
Introduction: Teaching Search Techniques with Google Games Posted: June 11, 2007 | Author: hilary | Filed under: blog | Tags: games , search | 4 Comments » Educators routinely discuss how students have trouble evaluating and using the results of their Google searches. There are really two parts to this problem, though, and while it’s true that students may struggle to identify reliable sources, before we can address that, we need to teach them how to write good queries. It’s that old computer science maxim: Garbage In, Garbage Out . I like to teach students how to write interesting queries by playing games. This games force students to think about the queries they are writing, and not the results. I have no scientific proof of the results, but I do know that it keeps them entertained and thinking for a while! My favorite games: Google Whack – The classic! Find a two-word query, with no punctuation, that return one and only one result. The Google Whacks on the
5 0.13694207 69 hilary mason data-2012-01-02-Why do I miss google calendar invites?
Introduction: Why do I miss google calendar invites? Posted: January 2, 2012 | Author: Hilary Mason | Filed under: blog | Tags: calendar , configuration , google | 2 Comments » I keep missing Google calendar invites on both my personal and work accounts. I’ve had my google account for years (since 2004?) and assumed it was some quirk of how I had configured something along the way. Today I was following Google’s instructions for syncing calendars with an iOS device and discovered that if you click calendar settings (which means click the gear icon then ‘calendar settings’), then ‘calendar’, then ‘notifications’ next to the calendar that you care about, you can turn on e-mail and SMS notifications for any given calendar. (I’ll save my ranting about the number of clicks to find and configuration anything on google’s properties right now for another time.) I’m sharing this on the theory that I’m not the only one with this particular frustration. I hope it saves someon
6 0.13029438 7 hilary mason data-2007-07-30-Tip: How to Search Google for Ideas
7 0.10323174 79 hilary mason data-2012-11-05-Where’s the API that can tell me that this photo contains a puppy and a can of Coke?
8 0.092181832 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
9 0.073721856 3 hilary mason data-2007-06-08-The Best Time to Search for Academic Jobs
10 0.070258707 74 hilary mason data-2012-08-19-Why I love New York City
11 0.069534749 43 hilary mason data-2010-05-27-E-mail automation, questions and answers
12 0.064731307 82 hilary mason data-2013-01-08-Bitly Social Data APIs
13 0.06404537 40 hilary mason data-2010-02-16-Conference: Search and Social Media 2010
14 0.060418185 67 hilary mason data-2011-10-31-Happy Halloween
15 0.059915964 13 hilary mason data-2008-01-22-Create a group Twitter account
16 0.058562905 64 hilary mason data-2011-10-10-I’m in Glamour Magazine!
17 0.056002609 103 hilary mason data-2013-06-04-Lucene Revolution Keynote: Search is Not a Solved Problem
18 0.053077124 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
19 0.050892532 75 hilary mason data-2012-08-22-DataGotham: The Empire State of Data
20 0.049784906 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
topicId topicWeight
[(0, -0.188), (1, -0.02), (2, -0.346), (3, -0.282), (4, 0.244), (5, -0.321), (6, -0.043), (7, -0.013), (8, -0.014), (9, 0.053), (10, -0.044), (11, -0.067), (12, 0.032), (13, -0.037), (14, -0.091), (15, 0.019), (16, -0.243), (17, -0.185), (18, -0.111), (19, -0.146), (20, 0.15), (21, 0.11), (22, 0.057), (23, -0.031), (24, -0.019), (25, -0.01), (26, -0.084), (27, 0.159), (28, 0.014), (29, -0.11), (30, -0.056), (31, -0.079), (32, -0.02), (33, -0.079), (34, -0.011), (35, 0.045), (36, 0.066), (37, 0.086), (38, -0.026), (39, -0.038), (40, -0.064), (41, -0.054), (42, -0.078), (43, 0.002), (44, -0.016), (45, -0.012), (46, 0.004), (47, 0.043), (48, -0.018), (49, 0.013)]
simIndex simValue blogId blogTitle
same-blog 1 0.98491043 101 hilary mason data-2013-04-14-Et tu, Google?
Introduction: Et tu, Google? Posted: April 14, 2013 | Author: Hilary Mason | Filed under: blog | Tags: google , search | 8 Comments » In 2008, cuil , a search engine startup, displayed my bio alongside a photo of deceased actress Hilary Mason . In January 2013, Bing confused us , this time putting my photo next to her bio (they fixed it after a suitable amount of mocking on Twitter). Today, Google did the same thing . ( live search link ) Today I win the internet? If you zoom in on the bio section, you can clearly see that it’s her bio with a photo of me (originally from Crain’s New York 40 under Forty ). Further, if you go into her filmography, you continue to see my photo. I’m most proud of my starring role in the amazing film Robot Jox . (bottom right of the image below) I know that entity disambiguation is a hard problem. I’ve worked on it, though never with the kind of resources that I imagine Google can bring to it. And yet, this
2 0.86677492 88 hilary mason data-2013-01-29-I’m a Dead Celebrity!
Introduction: I’m a Dead Celebrity! Posted: January 29, 2013 | Author: Hilary Mason | Filed under: blog | 4 Comments » Hilary Mason, Bing Celebrity I have a Google alert set up for my name, and over the weekend it sent me here . Update: Bing has removed the page and now redirects to a regular search. It’s a page on Bing Celebrities, merging my information with information about Hilary Mason, the (now deceased) British actress . According to this page, I have starred in movies before I was born and made videos after I died. It’s my photo and her filmography. It’s creepy, but it’s also intriguing. How does this happen? The data is credited to AMG and inbaseline , whose domain, though linked directly from Bing, does not resolve. Entity disambiguation is certainly a challenge, but I expect more from Microsoft, with so much data and so many brains. This kind of error makes it extremely clear that identity is not a solved problem . I’ve written a bit about iden
3 0.48663941 97 hilary mason data-2013-03-23-Why Google Now is Awesome
Introduction: Why Google Now is Awesome Posted: March 23, 2013 | Author: Hilary Mason | Filed under: blog | Tags: google | 11 Comments » Google Now is an extension to Google’s Android search app that uses all of the data that Google has about you along with what it can guess about your current context to present the information it thinks you need when it thinks you need it. It’ll tell you to leave a bit early to make your next calendar event because of heavy traffic, or that it’s a friend’s birthday, or that there’s a cool cafe nearby where you are. I think it’s amazing. It’s amazing because this is the first Google product that takes ALL OF THE DATA that they have about us and actually makes it useful for us . Not for advertisers. Finally.
4 0.45966402 4 hilary mason data-2007-06-11-Teaching Search Techniques with Google Games
Introduction: Teaching Search Techniques with Google Games Posted: June 11, 2007 | Author: hilary | Filed under: blog | Tags: games , search | 4 Comments » Educators routinely discuss how students have trouble evaluating and using the results of their Google searches. There are really two parts to this problem, though, and while it’s true that students may struggle to identify reliable sources, before we can address that, we need to teach them how to write good queries. It’s that old computer science maxim: Garbage In, Garbage Out . I like to teach students how to write interesting queries by playing games. This games force students to think about the queries they are writing, and not the results. I have no scientific proof of the results, but I do know that it keeps them entertained and thinking for a while! My favorite games: Google Whack – The classic! Find a two-word query, with no punctuation, that return one and only one result. The Google Whacks on the
5 0.37094483 69 hilary mason data-2012-01-02-Why do I miss google calendar invites?
Introduction: Why do I miss google calendar invites? Posted: January 2, 2012 | Author: Hilary Mason | Filed under: blog | Tags: calendar , configuration , google | 2 Comments » I keep missing Google calendar invites on both my personal and work accounts. I’ve had my google account for years (since 2004?) and assumed it was some quirk of how I had configured something along the way. Today I was following Google’s instructions for syncing calendars with an iOS device and discovered that if you click calendar settings (which means click the gear icon then ‘calendar settings’), then ‘calendar’, then ‘notifications’ next to the calendar that you care about, you can turn on e-mail and SMS notifications for any given calendar. (I’ll save my ranting about the number of clicks to find and configuration anything on google’s properties right now for another time.) I’m sharing this on the theory that I’m not the only one with this particular frustration. I hope it saves someon
6 0.27031165 79 hilary mason data-2012-11-05-Where’s the API that can tell me that this photo contains a puppy and a can of Coke?
7 0.2391361 7 hilary mason data-2007-07-30-Tip: How to Search Google for Ideas
8 0.2182105 103 hilary mason data-2013-06-04-Lucene Revolution Keynote: Search is Not a Solved Problem
9 0.19576378 67 hilary mason data-2011-10-31-Happy Halloween
10 0.19236483 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
11 0.16425176 3 hilary mason data-2007-06-08-The Best Time to Search for Academic Jobs
12 0.15720978 82 hilary mason data-2013-01-08-Bitly Social Data APIs
13 0.14937547 43 hilary mason data-2010-05-27-E-mail automation, questions and answers
14 0.14040685 74 hilary mason data-2012-08-19-Why I love New York City
15 0.13740501 60 hilary mason data-2011-08-21-What do you read that changes the way you think?
16 0.13039179 47 hilary mason data-2010-08-23-New York Times: Reinventing E-mail, One Message at a Time
17 0.12913379 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
18 0.12227263 40 hilary mason data-2010-02-16-Conference: Search and Social Media 2010
19 0.11946824 100 hilary mason data-2013-04-05-Speaking: 1 Kitten per Equation
20 0.11687561 84 hilary mason data-2013-01-17-Need Data? Start Here
topicId topicWeight
[(2, 0.085), (7, 0.658), (56, 0.113), (63, 0.015)]
simIndex simValue blogId blogTitle
same-blog 1 0.96224093 101 hilary mason data-2013-04-14-Et tu, Google?
Introduction: Et tu, Google? Posted: April 14, 2013 | Author: Hilary Mason | Filed under: blog | Tags: google , search | 8 Comments » In 2008, cuil , a search engine startup, displayed my bio alongside a photo of deceased actress Hilary Mason . In January 2013, Bing confused us , this time putting my photo next to her bio (they fixed it after a suitable amount of mocking on Twitter). Today, Google did the same thing . ( live search link ) Today I win the internet? If you zoom in on the bio section, you can clearly see that it’s her bio with a photo of me (originally from Crain’s New York 40 under Forty ). Further, if you go into her filmography, you continue to see my photo. I’m most proud of my starring role in the amazing film Robot Jox . (bottom right of the image below) I know that entity disambiguation is a hard problem. I’ve worked on it, though never with the kind of resources that I imagine Google can bring to it. And yet, this
2 0.2021434 114 hilary mason data-2013-12-18-Using Twitter’s Lead-Gen Card to Recruit Beta Testers
Introduction: Using Twitter’s Lead-Gen Card to Recruit Beta Testers Posted: December 18, 2013 | Author: Hilary Mason | Filed under: blog | Tags: email , hack , twitter | 12 Comments » It turns out that it’s pretty easy to co-opt Twitter’s Lead Generation card for anything where you want to gather a bunch of e-mail addresses from your Twitter community. I was looking for people willing to alpha test a little side project of mine, and it worked great and didn’t cost anything. The tweet itself: Love tech discussion but looking for a better community? Help me beta test a side project! https://t.co/H3DYjbCy19 — Hilary Mason (@hmason) December 12, 2013 I created it pretty easily: First, go to ads.twitter.com , log in, and go to “creatives”, then “cards”. Click “Create Lead Generation Card”. It’s a big blue button. You can include a title and a short description. Curiously, you can also include a 600px by 150px image. This seems like an opportunity to
3 0.19254667 87 hilary mason data-2013-01-28-Startups: Why to Share Data with Academics
Introduction: Startups: Why to Share Data with Academics Posted: January 28, 2013 | Author: Hilary Mason | Filed under: blog | 5 Comments » Last week I wrote a bit about how to share data with academics . This is the complimentary piece, on why you should invest the time and energy in sharing your data with the academic community. As I was talking to people about this topic it became clear that there are really two different questions people ask. First, why do this at all? And second, what do I tell my boss? Let’s start with the second one. This is what you should tell your boss: Academic research based on our work is a great press opportunity and demonstrates that credible people outside of our company find our work interesting. Having researchers work on our data is an easy way to access highly educated brainpower, for free, that in no way competes with us. Who knows what interesting stuff they’ll come up with? Personal relationships with university faculty ar
4 0.19082008 109 hilary mason data-2013-09-30-Need actual random numbers? Meet the NIST randomness beacon.
Introduction: Need actual random numbers? Meet the NIST randomness beacon. Posted: September 30, 2013 | Author: Hilary Mason | Filed under: projects | Tags: beacon , python , random , randomness , randomnumbers | 5 Comments » I wrote a python module that wraps that NIST Randomness Beacon , making it simple to get truly random numbers in python. It’s easy to use: b = Beacon() print b.last_record() print b.previous_record() #and so on There’s also a handy generator for getting a set of n random numbers. (One of the best gifts I ever got was a copy of 1,000,000 Random Numbers , and I’ve been intrigued ever since.) Please note that this the randomness beacon is not intended to be a source of cryptographic keys — indeed, it’s a public set of numbers, so I wouldn’t recommend doing anything that could be compromised by someone else having the access to the exact same set of numbers . Rather, this is interesting precisely for the scientific opportunities that
5 0.18511897 58 hilary mason data-2011-06-22-My Head is Open Source!
Introduction: My Head is Open Source! Posted: June 22, 2011 | Author: Hilary Mason | Filed under: blog | Tags: 3d , makerbot , opensource | 8 Comments » Last night I visited friends at Makerbot , where artist-in-residence Jonathan Monaghan scanned my head with a high-resolution laser scanner. The model is available on Thingiverse and can be printed on your friendly neighborhood makerbot or other 3d printer. There are lots of other awesome models of people and things to play with, including Stephen Colbert’s head . I look forward to the emergence of plastic clone head armies! Edit: Please note: thanks for asking, but brains are not included.
6 0.17632346 7 hilary mason data-2007-07-30-Tip: How to Search Google for Ideas
7 0.17290252 105 hilary mason data-2013-07-05-Speaking: Spend at least 1-3 of the time practicing the talk
8 0.16694033 85 hilary mason data-2013-01-19-Startups: How to Share Data with Academics
9 0.16326259 80 hilary mason data-2012-12-28-Getting Started with Data Science
10 0.15878615 24 hilary mason data-2009-01-31-WordPress tip: Move comments from one post to another post
11 0.1585824 40 hilary mason data-2010-02-16-Conference: Search and Social Media 2010
12 0.15513746 82 hilary mason data-2013-01-08-Bitly Social Data APIs
13 0.15338862 46 hilary mason data-2010-08-15-Should you attend Hadoop World? Yes.
14 0.15231875 92 hilary mason data-2013-02-25-A (short) List of Data Science Blogs
15 0.15223636 81 hilary mason data-2013-01-03-Interview Questions for Data Scientists
16 0.15181126 83 hilary mason data-2013-01-10-Book Book — Goose!
17 0.1494775 34 hilary mason data-2009-10-16-Data: first and last names from the US Census
18 0.14834274 91 hilary mason data-2013-02-22-Why YOU (an introverted nerd) Should Try Public Speaking
19 0.14256328 90 hilary mason data-2013-02-18-One Random Tweet, please.
20 0.1412982 116 hilary mason data-2014-04-09-Come speak at DataGotham 2014!