high_scalability high_scalability-2010 high_scalability-2010-854 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Facebook serves 3 billion Like buttons a day says VentureBeat. CloudScaling reports: Rumor Mill: Google EC2 Competitor Coming in 2010? It looks like GAE for PaaS and an EC2 clone for IaaS. Tweets of gold: alandipert : scalability is a drug seldo : Scalability lesson #23: if any part of your system involves a list that gets bigger over time, eventually that list will become too big. obfuscurity : Her: "Go look at the pictures on the database." Me: "You mean our fileserver?" Her: "Whatever." luiscab : Ouch, I just read on an Info Mgmt rag that Hadoop could easily be an acronym for "Heck, Another Darn Obscure Open-source Project." sanity : Depressed about how much time I've had to spend searching for the right database solution for a new project. Each has it's flaws ioshints : You cannot take a car, grow it 10 times and expect to get a mining truck. A contentious thread on Hacker News: Mong
sentIndex sentText sentNum sentScore
1 It looks like GAE for PaaS and an EC2 clone for IaaS. [sent-3, score-0.226]
2 Tweets of gold: alandipert : scalability is a drug seldo : Scalability lesson #23: if any part of your system involves a list that gets bigger over time, eventually that list will become too big. [sent-4, score-0.085]
3 " luiscab : Ouch, I just read on an Info Mgmt rag that Hadoop could easily be an acronym for "Heck, Another Darn Obscure Open-source Project. [sent-8, score-0.25]
4 " sanity : Depressed about how much time I've had to spend searching for the right database solution for a new project. [sent-9, score-0.1]
5 A contentious thread on Hacker News: MongoDB Performance & Durability . [sent-11, score-0.128]
6 This wonderfully varied and detailed talk that isn't just about Erlang, it's about building distributed systems in general. [sent-15, score-0.189]
7 How to model data is one of the big questions with NoSQL and this looks like a very solid slide deck on the nuts and bolts. [sent-18, score-0.413]
8 com is a new StackOverflow style site with scalability as the topic. [sent-23, score-0.085]
9 Yottaa explains how they build Scalable Event Analytics with Ruby on Rails & MongoDB . [sent-25, score-0.126]
10 Roberto Zicari compares and contrasts odbms systems to the new nosql datastores . [sent-26, score-0.34]
11 It's about choice and fitting the data store to the task. [sent-28, score-0.114]
12 It literally looks something like out of Mission Impossible. [sent-30, score-0.217]
13 Also more good stuff at Data Center Knowledge: Schooner: Flash-Driven Performance Gains , Are Massively Multi-Core Servers Game Changers? [sent-31, score-0.131]
14 After you’ve finished experimenting with small areas and start moving to a global map, you find that disk IO is by far the most important thing Relational Data, Document Databases and Schema Design by Mathias Meyer. [sent-33, score-0.197]
15 It's easy to say that all the nice features document databases offer are just aiming for one thing, to scale up. [sent-34, score-0.233]
16 bg - You don't have to learn cloud computing with their cPanel/WHM control panel. [sent-40, score-0.096]
wordName wordTfidf (topN-words)
[('geographic', 0.163), ('erlang', 0.141), ('rag', 0.136), ('contrasts', 0.136), ('diggwho', 0.136), ('looks', 0.132), ('stuff', 0.131), ('contentious', 0.128), ('designby', 0.128), ('mathias', 0.128), ('ouch', 0.128), ('explains', 0.126), ('schema', 0.123), ('rumor', 0.122), ('petascale', 0.122), ('mill', 0.122), ('zicari', 0.122), ('document', 0.119), ('shine', 0.118), ('aiming', 0.114), ('acronym', 0.114), ('fitting', 0.114), ('nosql', 0.111), ('obscure', 0.111), ('national', 0.111), ('roi', 0.108), ('gross', 0.106), ('experimenting', 0.104), ('competitor', 0.104), ('toadvertisea', 0.104), ('usfor', 0.104), ('mongodb', 0.103), ('sanity', 0.1), ('nuts', 0.1), ('andy', 0.1), ('varied', 0.097), ('computing', 0.096), ('clone', 0.094), ('finished', 0.093), ('compares', 0.093), ('stackoverflow', 0.093), ('wonderfully', 0.092), ('pleasecontact', 0.092), ('deck', 0.092), ('gold', 0.089), ('slide', 0.089), ('voltdb', 0.088), ('heck', 0.087), ('literally', 0.085), ('scalability', 0.085)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999982 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
Introduction: Facebook serves 3 billion Like buttons a day says VentureBeat. CloudScaling reports: Rumor Mill: Google EC2 Competitor Coming in 2010? It looks like GAE for PaaS and an EC2 clone for IaaS. Tweets of gold: alandipert : scalability is a drug seldo : Scalability lesson #23: if any part of your system involves a list that gets bigger over time, eventually that list will become too big. obfuscurity : Her: "Go look at the pictures on the database." Me: "You mean our fileserver?" Her: "Whatever." luiscab : Ouch, I just read on an Info Mgmt rag that Hadoop could easily be an acronym for "Heck, Another Darn Obscure Open-source Project." sanity : Depressed about how much time I've had to spend searching for the right database solution for a new project. Each has it's flaws ioshints : You cannot take a car, grow it 10 times and expect to get a mining truck. A contentious thread on Hacker News: Mong
2 0.16212033 954 high scalability-2010-12-06-What the heck are you actually using NoSQL for?
Introduction: It's a truism that we should choose the right tool for the job . Everyone says that. And who can disagree? The problem is this is not helpful advice without being able to answer more specific questions like: What jobs are the tools good at? Will they work on jobs like mine? Is it worth the risk to try something new when all my people know something else and we have a deadline to meet? How can I make all the tools work together? In the NoSQL space this kind of real-world data is still a bit vague. When asked, vendors tend to give very general answers like NoSQL is good for BigData or key-value access. What does that mean for for the developer in the trenches faced with the task of solving a specific problem and there are a dozen confusing choices and no obvious winner? Not a lot. It's often hard to take that next step and imagine how their specific problems could be solved in a way that's worth taking the trouble and risk. Let's change that. What problems are you using NoSQL to sol
3 0.15133083 1064 high scalability-2011-06-20-35+ Use Cases for Choosing Your Next NoSQL Database
Introduction: We've asked What The Heck Are You Actually Using NoSQL For? . We've asked 101 Questions To Ask When Considering A NoSQL Database . We've even had a webinar What Should I Do? Choosing SQL, NoSQL or Both for Scalable Web Applications . Now we get to the point of considering use cases and which systems might be appropriate for those use cases. What are your options? First, let's cover what are the various data models. These have been adapted from Emil Eifrem and NoSQL databases . Document Databases Lineage: Inspired by Lotus Notes. Data model: Collections of documents, which contain key-value collections. Example: CouchDB, MongoDB Good at: Natural data modeling. Programmer friendly. Rapid development. Web friendly, CRUD. Graph Databases Lineage: Euler and graph theory. Data model: Nodes & relationships, both which can hold key-value pairs Example: AllegroGraph, InfoGrid, Neo4j Good at: Rock complicated graph problems. Fast.
4 0.1379891 1089 high scalability-2011-07-29-Stuff The Internet Says On Scalability For July 29, 2011
Introduction: Submitted for your end of July scaling pleasure: YouTube : 3 billion videos viewed a day; 48 hours of footage uploaded every minute. 64 core Tilera chip . Google wants to be your CDN. They figure the only way to make the web faster...is to host it. Page Speed Service - Web Performance, Delivered . An eventually for pay service that caches your website and distributes it around the world. No cost information. Your speed may vary. See the longish list of limitations . Nobody said anything interesting on scalability this week! A disaster of non-quotable proportions. If I missed something, now is your chance. Moving an Elephant: Large Scale Hadoop Data Migration at Facebook . Paul Yang describes the greatest westward expansion since the land bridge across the Bering Strait. It's a story of moving a 30PB Hadoop cluster from an over populated datacenter to the wide open spaces of a new continent. Unlike the early settlers, Facebook did not move the boxes over, that would dis
5 0.13269834 1040 high scalability-2011-05-13-Stuff The Internet Says On Scalability For May 13, 2011
Introduction: Submitted for your reading pleasure on this beautiful blue sky, birds chirping Friday morning: Unlocked Achievements: Twitter: 900,000 Apps, 600,000 Developers And 13 Billion API Requests Per Day ; Rackspace Now Has 70,000 Servers Quotable Quotes for the speed of light Alex: @LusciousPear According to everyone I talk to, in aggregate, every single #nosql company is irrelevant, doing great, making tons of money, and broke. XIX Give a man a GO compiler and he eats for a day. Give a man a C compiler and he feeds upon php/python/jscript/lua/etc for the rest of his life. @movingsideways This tweet features a robust architecture and high scalability. @serenestudios Human touch trumps code 'scalability'? People matter more than code - we agree. @SunGardFS At current growth rates, server farms will consume more energy than all air travel by 2020 @marcloney After the hype of NoSQL, are we just going to end up with a bunch of enterprise-class produ
7 0.12905854 935 high scalability-2010-11-05-Hot Scalability Links For November 5th, 2010
8 0.1284214 848 high scalability-2010-06-25-Hot Scalability Links for June 25, 2010
10 0.12159979 851 high scalability-2010-07-02-Hot Scalability Links for July 2, 2010
12 0.11779046 1606 high scalability-2014-03-05-10 Things You Should Know About Running MongoDB at Scale
13 0.11673769 1036 high scalability-2011-05-06-Stuff The Internet Says On Scalability For May 6th, 2011
15 0.11479013 931 high scalability-2010-10-28-Notes from A NOSQL Evening in Palo Alto
18 0.11367391 750 high scalability-2009-12-16-Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud
19 0.11213121 811 high scalability-2010-04-16-Hot Scalability Links for April 16, 2010
20 0.1110434 445 high scalability-2008-11-14-Useful Cloud Computing Blogs
topicId topicWeight
[(0, 0.226), (1, 0.056), (2, 0.057), (3, 0.091), (4, 0.029), (5, 0.057), (6, -0.054), (7, -0.022), (8, 0.079), (9, 0.005), (10, -0.017), (11, -0.039), (12, -0.031), (13, 0.004), (14, -0.005), (15, -0.036), (16, 0.056), (17, -0.02), (18, -0.051), (19, 0.011), (20, -0.003), (21, -0.025), (22, -0.003), (23, -0.029), (24, 0.08), (25, -0.044), (26, 0.025), (27, 0.06), (28, -0.027), (29, 0.047), (30, 0.036), (31, 0.003), (32, 0.012), (33, 0.005), (34, 0.022), (35, -0.002), (36, 0.015), (37, -0.013), (38, -0.029), (39, -0.019), (40, -0.009), (41, -0.034), (42, -0.009), (43, 0.063), (44, -0.007), (45, -0.038), (46, -0.031), (47, 0.059), (48, 0.004), (49, -0.015)]
simIndex simValue blogId blogTitle
same-blog 1 0.95876557 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
Introduction: Facebook serves 3 billion Like buttons a day says VentureBeat. CloudScaling reports: Rumor Mill: Google EC2 Competitor Coming in 2010? It looks like GAE for PaaS and an EC2 clone for IaaS. Tweets of gold: alandipert : scalability is a drug seldo : Scalability lesson #23: if any part of your system involves a list that gets bigger over time, eventually that list will become too big. obfuscurity : Her: "Go look at the pictures on the database." Me: "You mean our fileserver?" Her: "Whatever." luiscab : Ouch, I just read on an Info Mgmt rag that Hadoop could easily be an acronym for "Heck, Another Darn Obscure Open-source Project." sanity : Depressed about how much time I've had to spend searching for the right database solution for a new project. Each has it's flaws ioshints : You cannot take a car, grow it 10 times and expect to get a mining truck. A contentious thread on Hacker News: Mong
2 0.85987687 935 high scalability-2010-11-05-Hot Scalability Links For November 5th, 2010
Introduction: So much good stuff this week... Adrian Cockcroft Compares NoSQL Availability Models . Let's risk feeding the CAP trolls, and try to get some insight into the differences between the many NoSQL contenders . Adrian asks how each NoSQL product will add a movie to its favorites list, read it back, and how this works across availability zones. Much trickier than it sounds with multiple writers. Cassandra and MongoDB answer back. Stuff the Internet Says: @jerng : Reading up on scalability. WHY THE HELL FOR? Because I want to know the future. @freerangedata : The #nosql options are the micro brews/craft beers of data stores. So many good ones, so little time to try them all. @edward_ribeiro : Soon, Darwinism will start to play its role on #NoSQL systems. You know, only the fittest will survive. @connectionreq : I'm always wowed when I hear how Facebook abuses their MySQL databases in crazy ways @louismrose : This is the kind of scalability we should be wo
3 0.85108888 1007 high scalability-2011-03-18-Stuff The Internet Says On Scalability For March 18, 2011
Introduction: Submitted for your reading pleasure on this day of wind and rain... The cloud is falling. Or at least shared networked storage as Reddit has a couple of long periods of downtime. Good write up at Why reddit was down for 6 of the last 24 hours. Upshot: Reddit is moving off EBS to local disks. Quotable Quotes: @thomleggett : 32 or so joins - the sweet-spot of suck for MySQL - @emileifrem #nosql #neo4j #qconlondon @jkalucki : The [Twitter] Streaming API pushes 100MB in less than a second. @LusciousPear : Finding the true test of a DB is recovery when things go wrong, not "who's most web-scale" on paper #nosql @TimelessP : Functionality, scalability, security... pick two. @beaknit : #ccevent #nosql @adrianco: A year from oracle to simpledb. A week from simpledb to cassandra. Mental shift biggest hurdle. Heather Willems captured this very cool Ogilvy Note , which is a visual representation of a panel talk on Scalability: Covering Your Rear with a Good
4 0.83715463 1040 high scalability-2011-05-13-Stuff The Internet Says On Scalability For May 13, 2011
Introduction: Submitted for your reading pleasure on this beautiful blue sky, birds chirping Friday morning: Unlocked Achievements: Twitter: 900,000 Apps, 600,000 Developers And 13 Billion API Requests Per Day ; Rackspace Now Has 70,000 Servers Quotable Quotes for the speed of light Alex: @LusciousPear According to everyone I talk to, in aggregate, every single #nosql company is irrelevant, doing great, making tons of money, and broke. XIX Give a man a GO compiler and he eats for a day. Give a man a C compiler and he feeds upon php/python/jscript/lua/etc for the rest of his life. @movingsideways This tweet features a robust architecture and high scalability. @serenestudios Human touch trumps code 'scalability'? People matter more than code - we agree. @SunGardFS At current growth rates, server farms will consume more energy than all air travel by 2020 @marcloney After the hype of NoSQL, are we just going to end up with a bunch of enterprise-class produ
5 0.815835 848 high scalability-2010-06-25-Hot Scalability Links for June 25, 2010
Introduction: Royans Tharakan is blogging like a mad man at the Velocity Conference . Read a summary of many of the presentations on his blog . Zuckerberg almost guarantees 1 billion Facebook users . And I almost believe him. Northscale introduces Membase , a new distributed key-value NoSQL competitor featuring a memcache compatible interface, yet is persistent like a database. Hopefully we'll have more on their internals later. Notable Tweets: Aaron Cordova - scalability means "can change size" and also "works at large sizes" - this conflates two orthogonal features of cloud computing. Jaime Garcia Reinoso - I t's the scalability, stupid! Alex Averbuch - when I read/hear "unlimited/inifinite scalability" I stop reading/listening and start thinking about cake. Dennis Clark - I used to smirk at developers whose main DB experience was in MUMPS or Pick, until I realized those are old-school #NoSQL engines. Hypertable vs. HBase Performance Evaluation .
6 0.81494159 1036 high scalability-2011-05-06-Stuff The Internet Says On Scalability For May 6th, 2011
7 0.80683011 1067 high scalability-2011-06-24-Stuff The Internet Says On Scalability For June 24, 2011
8 0.80467635 883 high scalability-2010-08-20-Hot Scalability Links For Aug 20, 2010
9 0.79595667 940 high scalability-2010-11-12-Stuff the Internet Says on Scalability For November 12th, 2010
10 0.79418755 1089 high scalability-2011-07-29-Stuff The Internet Says On Scalability For July 29, 2011
11 0.77987003 1411 high scalability-2013-02-22-Stuff The Internet Says On Scalability For February 22, 2013
12 0.77271181 1210 high scalability-2012-03-16-Stuff The Internet Says On Scalability For March 16, 2012
13 0.77017194 1147 high scalability-2011-11-25-Stuff The Internet Says On Scalability For November 25, 2011
14 0.76901978 893 high scalability-2010-09-03-Hot Scalability Links For Sep 3, 2010
15 0.76563889 1327 high scalability-2012-09-21-Stuff The Internet Says On Scalability For September 21, 2012
16 0.76529866 1024 high scalability-2011-04-15-Stuff The Internet Says On Scalability For April 15, 2011
17 0.76115894 984 high scalability-2011-02-04-Stuff The Internet Says On Scalability For February 4, 2011
18 0.76057845 1137 high scalability-2011-11-04-Stuff The Internet Says On Scalability For November 4, 2011
19 0.76038241 1129 high scalability-2011-09-30-Stuff The Internet Says On Scalability For September 30, 2011
20 0.76029694 1015 high scalability-2011-04-01-Stuff The Internet Says On Scalability For April 1, 2011
topicId topicWeight
[(1, 0.101), (2, 0.193), (10, 0.046), (40, 0.017), (56, 0.288), (61, 0.141), (79, 0.092), (94, 0.053)]
simIndex simValue blogId blogTitle
1 0.93840045 779 high scalability-2010-02-16-Seven Signs You May Need a NoSQL Database
Introduction: While exploring deep into some dusty old library stacks, I dug up Nostradamus' long lost NoSQL codex. What are the chances? Strangely, it also gave the plot to the next Dan Brown novel, but I left that out for reasons of sanity. About NoSQL, here is what Nosty (his friends call him Nosty) predicted are the signs you may need a NoSQL database... You noticed a lot of your database fields are really serialized complex objects in disguise . Why bother with a RDBMS at all then? Storing serialized objects in a relational database is like being on the pill while trying to get pregnant, a bit counter productive. Just use a schemaless database from the start. Using a standard query language has become too confining . You just want to be free. SQL is so easy, so convenient, and so standard, it's really not a challenge anymore. You need to be different. Then NoSQL is for you. Each has their own completely different query mechanism . Your toolbox only contains a hammer . Hammers wh
2 0.9229297 1022 high scalability-2011-04-13-Paper: NoSQL Databases - NoSQL Introduction and Overview
Introduction: Christof Strauch, from Stuttgart Media University, has written an incredible 120+ page paper titled NoSQL Databases as an introduction and overview to NoSQL databases . The paper was written between 2010-06 and 2011-02, so it may be a bit out of date, but if you are looking to take in the NoSQL world in one big gulp, this is your chance. I asked Christof to give us a short taste of what he was trying to accomplish in his paper: The paper aims at giving a systematic and thorough introduction and overview of the NoSQL field by assembling information dispersed among blogs, wikis and scientific papers. It firstly discusses reasons, rationales and motives for the development and usage of nonrelational database systems. These can be summarized by the need for high scalability, the processing of large amounts of data, the ability to distribute data among many (often commodity) servers, consequently a distribution-aware design of DBMSs. The paper then introduces fundamental concepts,
3 0.90608209 941 high scalability-2010-11-15-How Google's Instant Previews Reduces HTTP Requests
Introduction: In a strange case of synchronicity, Google just published Instant Previews: Under the hood , a very well written blog post by Matías Pelenur of the Instant Previews team, giving some fascinating inside details on how Google implemented Instant Previews . It's syncronicty because I had just posted Strategy: Biggest Performance Impact Is To Reduce The Number Of HTTP Requests and one of the major ideas behind the design Instant Previews is to reduce the number of HTTP requests through a few well chosen tricks. Cosmic! Some of what Google does to reduce HTTP requests: Data URIs , which are are base64 encodings of image data, are used instead of static images that are served from the server. This means the whole preview can be pieced together from image slices in one request as both the data and the image are returned in the same request. Google found that even though base64 encoding adds about 33% to the size of the image, tests showed that gzip-compressed data URIs are compara
same-blog 4 0.88864535 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
Introduction: Facebook serves 3 billion Like buttons a day says VentureBeat. CloudScaling reports: Rumor Mill: Google EC2 Competitor Coming in 2010? It looks like GAE for PaaS and an EC2 clone for IaaS. Tweets of gold: alandipert : scalability is a drug seldo : Scalability lesson #23: if any part of your system involves a list that gets bigger over time, eventually that list will become too big. obfuscurity : Her: "Go look at the pictures on the database." Me: "You mean our fileserver?" Her: "Whatever." luiscab : Ouch, I just read on an Info Mgmt rag that Hadoop could easily be an acronym for "Heck, Another Darn Obscure Open-source Project." sanity : Depressed about how much time I've had to spend searching for the right database solution for a new project. Each has it's flaws ioshints : You cannot take a car, grow it 10 times and expect to get a mining truck. A contentious thread on Hacker News: Mong
5 0.88703215 732 high scalability-2009-10-29-Digg - Looking to the Future with Cassandra
Introduction: Digg has been researching ways to scale our database infrastructure for some time now. We’ve adopted a traditional vertically partitioned master-slave configuration with MySQL, and also investigated sharding MySQL with IDDB . Ultimately, these solutions left us wanting. In the case of the traditional architecture, the lack of redundancy on the write masters is painful, and both approaches have significant management overhead to keep running. Since it was already necessary to abandon data normalization and consistency to make these approaches work, we felt comfortable looking at more exotic, non-relational data stores. After considering HBase, Hypertable, Cassandra, Tokyo Cabinet/Tyrant, Voldemort, and Dynomite, we settled on Cassandra . Each system has its own strengths and weaknesses, but Cassandra has a good blend of everything. It offers column-oriented data storage, so you have a bit more structure than plain key/value stores. It operates in a distributed, highly available,
6 0.8839618 1394 high scalability-2013-01-25-Stuff The Internet Says On Scalability For January 25, 2013
7 0.8592509 446 high scalability-2008-11-18-Scalability Perspectives #2: Van Jacobson – Content-Centric Networking
8 0.85387802 67 high scalability-2007-08-17-What is the best hosting option?
9 0.84076685 659 high scalability-2009-07-20-A Scalability Lament
10 0.8403914 759 high scalability-2010-01-11-Strategy: Don't Use Polling for Real-time Feeds
11 0.83248198 45 high scalability-2007-07-30-Product: SmarterStats
12 0.80223697 1322 high scalability-2012-09-14-Stuff The Internet Says On Scalability For September 14, 2012
13 0.79587221 815 high scalability-2010-04-27-Paper: Dapper, Google's Large-Scale Distributed Systems Tracing Infrastructure
14 0.74272984 479 high scalability-2008-12-29-Platform virtualization - top 25 providers (software, hardware, combined)
15 0.74160302 1565 high scalability-2013-12-16-22 Recommendations for Building Effective High Traffic Web Software
16 0.72525764 1180 high scalability-2012-01-24-The State of NoSQL in 2012
17 0.71933436 475 high scalability-2008-12-22-SLAs in the SaaS space
18 0.71486825 1183 high scalability-2012-01-30-37signals Still Happily Scaling on Moore RAM and SSDs
19 0.71344984 851 high scalability-2010-07-02-Hot Scalability Links for July 2, 2010
20 0.7110402 981 high scalability-2011-02-01-Google Strategy: Tree Distribution of Requests and Responses