high_scalability high_scalability-2009 high_scalability-2009-642 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Hey, we're moving up in the world, jumping from 19th place to 3rd place. In case you aren't sure what I'm talking about, Jurgen Appelo goes through this massive effort of ranking blogs according to Google PageRank, Technorati Authority, Alexa Rank, Google links, and Twitter Grader Rank. Through some obviously mistaken calculations HighScalability comes out #3. Given all the superb competition I'm not exactly sure how that can be. Well, thanks to all the excellent people who contribute and all the even more excellent people who read. Now at least I have something worthy to put on my tombstone :-)
sentIndex sentText sentNum sentScore
1 Hey, we're moving up in the world, jumping from 19th place to 3rd place. [sent-1, score-0.439]
2 In case you aren't sure what I'm talking about, Jurgen Appelo goes through this massive effort of ranking blogs according to Google PageRank, Technorati Authority, Alexa Rank, Google links, and Twitter Grader Rank. [sent-2, score-0.673]
3 Through some obviously mistaken calculations HighScalability comes out #3. [sent-3, score-0.402]
4 Given all the superb competition I'm not exactly sure how that can be. [sent-4, score-0.664]
5 Well, thanks to all the excellent people who contribute and all the even more excellent people who read. [sent-5, score-0.907]
wordName wordTfidf (topN-words)
[('appelo', 0.296), ('jurgen', 0.296), ('technorati', 0.279), ('jumping', 0.256), ('alexa', 0.241), ('superb', 0.241), ('pagerank', 0.225), ('worthy', 0.221), ('authority', 0.214), ('rank', 0.205), ('contribute', 0.177), ('excellent', 0.173), ('sure', 0.17), ('calculations', 0.162), ('obviously', 0.151), ('competition', 0.146), ('thanks', 0.138), ('hey', 0.129), ('google', 0.125), ('links', 0.123), ('talking', 0.121), ('effort', 0.12), ('exactly', 0.107), ('massive', 0.102), ('highscalability', 0.102), ('people', 0.099), ('least', 0.095), ('place', 0.093), ('twitter', 0.092), ('moving', 0.09), ('comes', 0.089), ('goes', 0.086), ('given', 0.086), ('case', 0.074), ('put', 0.072), ('world', 0.058), ('something', 0.058), ('well', 0.05), ('even', 0.048)]
simIndex simValue blogId blogTitle
same-blog 1 1.0000001 642 high scalability-2009-06-29-HighScalability Rated #3 Blog for Developers
2 0.27942032 394 high scalability-2008-09-25-HighScalability.com Rated 16th Best Blog for Development Managers
Introduction: Jurgen Appelo of Noop.nl asked for nominations for top blogs and then performed a sophisticated weighting of their popularity based on page range, trust authority, Alexa rank, Google hits, and number of comments. When all the results poured out of the blender HighScalability was ranked 16th. Not bad! Joel on Software was number one, of course. The next few were: 2) Coding Horror by Jeff Atwood 3) Seth's Blog by Seth Godin and 4) Paul Graham: Essays Paul Graham. All excellent blogs. Very cool.
3 0.15357387 18 high scalability-2007-07-16-Paper: MySQL Scale-Out by application partitioning
Introduction: MySQL Scale-Out by application partitioning by Oli Sennhauser Eventually every database system hits its limits. Especially on the Internet, where you have millions of users who theoretically access your database simultaneously, eventually your IO system will be a bottleneck. [A] promising but more complex solution with nearly no scale-out limits is application partitioning. If and when you get into the top-1000 rank on Alexa [1], you have to think about such solutions. A Quick Hit of What's Inside Horizontal application partitioning, Vertical application partitioning, Disk IO calculations, How to partition an entity
4 0.10019814 837 high scalability-2010-06-07-Six Ways Twitter May Reach its Big Hairy Audacious Goal of One Billion Users
Introduction: Twitter has a big hairy audacious goal of reaching one billion users by 2013. Three forces stand against Twitter. The world will end in 2012. But let's be optimistic and assume we'll make it. Next is Facebook. Currently Facebook is the user leader with over 400 million users. Will Facebook stumble or will they rocket to one billion users before Twitter? And lastly, there's Twitter's "low" starting point and "slow" growth rate. Twitter currently has 106 million registered users and adds about 300,000 new users a day. That doesn't add up to a billion in three years. Twitter needs to triple the number of registered users they add per day. How will Twitter reach its goal of over one billion users served? From recent infrastructure announcements and information gleaned at Chirp (videos) and other talks, it has become a little clearer how they hope to reach their billion user goal: 1) Make a Big Hairy Audacious Goal 2) Hire Lots of Quality People 3) Hug Developers and Users 4) D
5 0.091334753 1416 high scalability-2013-03-04-NoSQL Style - A Gangnam Style Parody
Introduction: Listen up all you IT people...NoSQL, it's the rage now, so turn the page now and boost your stack...Hey, mighty people...Go, go, go, hey, hey, hey, hey, hey, hey...Go NoSQL style... I for one feel both edified and entertained...can't wait for the Harlem Shake version.
6 0.08541587 845 high scalability-2010-06-22-Exploring the software behind Facebook, the world’s largest site
7 0.083926655 56 high scalability-2007-08-03-Running Hadoop MapReduce on Amazon EC2 and Amazon S3
8 0.071657047 978 high scalability-2011-01-26-Google Pro Tip: Use Back-of-the-envelope-calculations to Choose the Best Design
9 0.069660813 117 high scalability-2007-10-08-Paper: Understanding and Building High Availability-Load Balanced Clusters
10 0.066159785 166 high scalability-2007-11-27-Solving the Client Side API Scalability Problem with a Little Game Theory
11 0.065575384 132 high scalability-2007-10-25-Who can answer or analyze the image store and visit solution about alibaba.com?Thanks
12 0.063901097 1535 high scalability-2013-10-21-Google's Sanjay Ghemawat on What Made Google Google and Great Big Data Career Advice
13 0.062039014 765 high scalability-2010-01-25-Let's Welcome our Neo-Feudal Overlords
14 0.060680121 23 high scalability-2007-07-24-Major Websites Down: Or Why You Want to Run in Two or More Data Centers.
15 0.060416006 1253 high scalability-2012-05-28-The Anatomy of Search Technology: Crawling using Combinators
16 0.060185634 1084 high scalability-2011-07-22-Stuff The Internet Says On Scalability For July 22, 2011
17 0.053643469 335 high scalability-2008-05-30-Is "Scaling Engineer" a new job title?
18 0.053387791 517 high scalability-2009-02-21-Google AppEngine - A Second Look
19 0.053003386 750 high scalability-2009-12-16-Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud
topicId topicWeight
[(0, 0.064), (1, 0.05), (2, 0.008), (3, 0.037), (4, 0.024), (5, -0.035), (6, -0.053), (7, 0.054), (8, 0.048), (9, 0.006), (10, -0.025), (11, -0.01), (12, -0.011), (13, -0.002), (14, 0.018), (15, -0.024), (16, -0.01), (17, -0.017), (18, 0.025), (19, -0.025), (20, 0.023), (21, 0.002), (22, 0.026), (23, -0.031), (24, -0.023), (25, 0.042), (26, 0.048), (27, -0.004), (28, -0.043), (29, 0.02), (30, 0.003), (31, -0.079), (32, -0.004), (33, 0.025), (34, -0.068), (35, -0.001), (36, 0.025), (37, -0.007), (38, -0.005), (39, 0.025), (40, 0.038), (41, 0.021), (42, -0.005), (43, -0.0), (44, -0.015), (45, 0.035), (46, 0.046), (47, -0.035), (48, -0.027), (49, 0.002)]
simIndex simValue blogId blogTitle
same-blog 1 0.97628611 642 high scalability-2009-06-29-HighScalability Rated #3 Blog for Developers
2 0.73405236 394 high scalability-2008-09-25-HighScalability.com Rated 16th Best Blog for Development Managers
Introduction: Jurgen Appelo of Noop.nl asked for nominations for top blogs and then performed a sophisticated weighting of their popularity based on page range, trust authority, Alexa rank, Google hits, and number of comments. When all the results poured out of the blender HighScalability was ranked 16th. Not bad! Joel on Software was number one, of course. The next few were: 2) Coding Horror by Jeff Atwood 3) Seth's Blog by Seth Godin and 4) Paul Graham: Essays Paul Graham. All excellent blogs. Very cool.
3 0.66643679 242 high scalability-2008-02-07-Looking for good business examples of compaines using Hadoop
Introduction: I have read the blog about Mailtrust/Rackspace as well as the interesting things with Google and Yahoo. Who else is using Hadoop/MapReduce to solve business problems? TIA johnmwillis.com
Introduction: In a People of ACM interview, Sanjay Ghemawat, a Google Fellow in the Systems Infrastructure Group (MapReduce, BigTable, Spanner, GFS, etc), talks about a few interesting aspects of Google's culture. What Made Google Google Progress is a modern idea. The conviction that the future can be changed for the better through individual advancement and action has over hundreds of years driven an exponential growth in the technome. What drives progress? Challenges. Individuals finding and defeating a challenge. There's usually something someone wants to do so badly that they put in the effort, the thought, and the money into solving all the problems. The results are often something new and amazing. And so it was for Google: The main motivation behind the development of much of Google's infrastructure was the challenge of keeping up with ever-growing data sets. For example, at the same time Google's web search was gaining usage very quickly, we were also scaling up the size of ou
Introduction: Joseph Smarr, former CTO of Plaxo (which explains why I recognized his picture), in I'm a technical lead on the Google+ team. Ask me anything, reveals the stack used for building Google+: Our stack is pretty standard fare for Google apps these days: we use Java servlets for our server code and JavaScript for the browser-side of the UI, largely built with the (open-source) Closure framework, including Closure's JavaScript compiler and template system. A couple nifty tricks we do: we use the HTML5 History API to maintain pretty-looking URLs even though it's an AJAX app (falling back on hash-fragments for older browsers); and we often render our Closure templates server-side so the page renders before any JavaScript is loaded, then the JavaScript finds the right DOM nodes and hooks up event handlers, etc. to make it responsive (as a result, if you're on a slow connection and you click on stuff really fast, you may notice a lag before it does anything, but luckily most people don't run
6 0.61644644 363 high scalability-2008-08-12-Strategy: Limit The New, Not The Old
7 0.61177284 1010 high scalability-2011-03-24-Strategy: Disk Backup for Speed, Tape Backup to Save Your Bacon, Just Ask Google
8 0.61153322 1143 high scalability-2011-11-16-Google+ Infrastructure Update - the JavaScript Story
9 0.61045694 201 high scalability-2008-01-04-For $5 Million You Can Buy Enough Storage to Compete with Google
10 0.59550023 166 high scalability-2007-11-27-Solving the Client Side API Scalability Problem with a Little Game Theory
11 0.57909173 618 high scalability-2009-06-05-Google Wave Architecture
12 0.57126087 210 high scalability-2008-01-13-A Note on How to Create Teasers When Posting
13 0.56827527 408 high scalability-2008-10-10-Useful Corporate Blogs that Talk About Scalability
14 0.56024909 405 high scalability-2008-10-07-Help a Scoble out. What should Robert ask in his scalability interview?
15 0.55718094 747 high scalability-2009-11-26-What I'm Thankful For on Thanksgiving
16 0.55211413 837 high scalability-2010-06-07-Six Ways Twitter May Reach its Big Hairy Audacious Goal of One Billion Users
17 0.54884726 1107 high scalability-2011-08-29-The Three Ages of Google - Batch, Warehouse, Instant
18 0.54686666 1612 high scalability-2014-03-14-Stuff The Internet Says On Scalability For March 14th, 2014
19 0.54510134 1350 high scalability-2012-10-29-Gone Fishin' Two
20 0.54451358 409 high scalability-2008-10-13-Challenges from large scale computing at Google
topicId topicWeight
[(2, 0.119), (61, 0.108), (79, 0.146), (85, 0.069), (91, 0.415)]
simIndex simValue blogId blogTitle
same-blog 1 0.83475316 642 high scalability-2009-06-29-HighScalability Rated #3 Blog for Developers
2 0.77855283 921 high scalability-2010-10-18-NoCAP
Introduction: In this post I wanted to spend some time on the CAP theorem and clarify some of the confusion that I often see when people associate CAP with scalability without fully understanding the implications that come with it and the alternative approaches. You can read the full article here
3 0.66354817 712 high scalability-2009-10-01-Moving Beyond End-to-End Path Information to Optimize CDN Performance
Introduction: You go through the expense of installing CDNs all over the globe to make sure users always have a node close by and you notice something curious and furious: clients still experience poor latencies. What's up with that? What do you do to find the problem? If you are Google you build a tool (WhyHigh) to figure out what's up. This paper is about the tool and the unexpected problem of high latencies on CDNs. The main problems they found: inefficient routing to nearby nodes and packet queuing. But more useful is the architecture of WhyHigh and how it goes about identifying bottlenecks. And even more useful is the general belief in creating sophisticated tools to understand and improve your service. That's what professionals do. From the abstract: Replicating content across a geographically distributed set of servers and redirecting clients to the closest server in terms of latency has emerged as a common paradigm for improving client performance. In this paper, we analyze latenc
4 0.63598549 826 high scalability-2010-05-12-The Rise of the Virtual Cellular Machines
Introduction: My apologies if you were looking for a post about cell phones. This post is about high density nanodevices. It's a follow up to How will memristors change everything? for those wishing to pursue these revolutionary ideas in more depth. This is one of those areas where if you are in the space then there's a lot of available information and if you are on the outside then it doesn't even seem to exist. Fortunately, Ben Chandler from The SyNAPSE Project was kind enough to point me to a great set of presentations given at the 12th IEEE CNNA - International Workshop on Cellular Nanoscale Networks and their Applications - Towards Megaprocessor Computing. WARNING: these papers contain extreme technical content. If you are like me and you aren't an electrical engineer, much of it may make a sort of surface sense, but the deep and twisty details will fly overhead. For the more software minded there are a couple more accessible presentations: Intelligent Machines built with Memristiv
5 0.61317742 1338 high scalability-2012-10-11-RAMCube: Exploiting Network Proximity for RAM-Based Key-Value Store
Introduction: RAMCube is a datacenter-oriented design for a RAM-based key-value store that supports thousands or tens of thousands of servers to offer up to hundreds of terabytes of RAM storage. Here's the PDF Paper describing the system and here's a video of the presentation given at HotCloud. The big idea is: RAMCube exploits the proximity of a BCube network to construct a symmetric MultiRing structure, restricting all failure detection and recovery traffic within a one-hop neighborhood, which addresses problems including false failure detection and recovery traffic congestion. In addition, RAMCube leverages BCube’s multiple paths between any pairs of servers to handle switch failures. A few notes: 75% of Facebook data is stored in memcache. RAM is 1000 times faster than disk. RAM is used in caches, but this increases application complexity as applications are responsible for cache consistency. Under a high workload a 1% cache miss rate can lead to a 10x performance penalty. So st
6 0.61152613 722 high scalability-2009-10-15-Hot Scalability Links for Oct 15 2009
7 0.58367378 453 high scalability-2008-12-01-Breakthrough Web-Tier Solutions with Record-Breaking Performance
8 0.53520757 1285 high scalability-2012-07-18-Disks Ain't Dead Yet: GraphChi - a disk-based large-scale graph computation
9 0.51916218 742 high scalability-2009-11-17-10 eBay Secrets for Planet Wide Scaling
11 0.45766345 651 high scalability-2009-07-02-Product: Project Voldemort - A Distributed Database
12 0.45448345 197 high scalability-2007-12-31-Product: collectd
13 0.44525456 1209 high scalability-2012-03-14-The Azure Outage: Time Is a SPOF, Leap Day Doubly So
14 0.43545568 283 high scalability-2008-03-18-Shared filesystem on EC2
15 0.43482649 1327 high scalability-2012-09-21-Stuff The Internet Says On Scalability For September 21, 2012
16 0.43399268 356 high scalability-2008-07-22-Scaling Bumper Sticker: A 1 Billion Page Per Month Facebook RoR App
17 0.43188435 561 high scalability-2009-04-08-N+1+caching is ok?
18 0.43003404 1242 high scalability-2012-05-09-Cell Architectures
19 0.42991433 1018 high scalability-2011-04-07-Paper: A Co-Relational Model of Data for Large Shared Data Banks
20 0.42700684 1535 high scalability-2013-10-21-Google's Sanjay Ghemawat on What Made Google Google and Great Big Data Career Advice