Introduction: Geir Magnusson from 10gen presented a talk titled “Cloud Data Persistence, or ‘We’re in a database renaissance - pay attention’” today at QCon London 2009. The main message of his talk was that “physical limitations of today’s technology combined with the computational complexity of conventional relational databases are driving databases into new exciting spaces”, or, to put it more simply, the database landscape is changing and we should keep an eye on it.
sentIndex sentText sentNum sentScore
1 Geir Magnusson from 10gen presented a talk titled “Cloud Data Persistence, or ‘We’re in a database renaissance - pay attention’” today at QCon London 2009. [sent-1, score-1.008]
wordName wordTfidf (topN-words)
[('magnusson', 0.34), ('titled', 0.27), ('qcon', 0.253), ('spaces', 0.238), ('eyes', 0.235), ('re', 0.216), ('landscape', 0.212), ('conventional', 0.203), ('today', 0.197), ('london', 0.179), ('driving', 0.178), ('talk', 0.177), ('combined', 0.175), ('presented', 0.172), ('simpler', 0.171), ('persistence', 0.171), ('limitations', 0.169), ('exciting', 0.163), ('computational', 0.16), ('databases', 0.159), ('attention', 0.149), ('message', 0.129), ('changing', 0.121), ('complexity', 0.12), ('main', 0.115), ('relational', 0.11), ('pay', 0.109), ('physical', 0.104), ('database', 0.083), ('put', 0.083), ('technology', 0.079), ('keep', 0.064), ('cloud', 0.055), ('new', 0.033), ('data', 0.025)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 537 high scalability-2009-03-12-QCon London 2009: Database projects to watch closely
Introduction: Geir Magnusson from 10gen presented a talk titled “Cloud Data Persistence, or ‘We’re in a database renaissance - pay attention’” today at QCon London 2009. The main message of his talk was that “physical limitations of today’s technology combined with the computational complexity of conventional relational databases are driving databases into new exciting spaces”, or, to put it more simply, the database landscape is changing and we should keep an eye on it.
2 0.23809415 544 high scalability-2009-03-18-QCon London 2009: Upgrading Twitter without service disruptions
Introduction: Evan Weaver from Twitter presented a talk on Twitter software upgrades, titled Improving running components as part of the Systems that never stop track at QCon London 2009 conference last Friday. The talk focused on several upgrades performed since last May, while Twitter was experiencing serious performance problems.
3 0.10311224 154 high scalability-2007-11-15-Lessons from Yahoo, eBay, Orbitz, LinkedIn architecture
Introduction: I'm moving this from the forum section to the front page. Just FYI, any registered user can Submit a Link to this blog. You don't have to use the forums. In The Architectures You've Always Wondered About track at the QCon conference, Second Life, eBay, Yahoo, LinkedIn and Orbitz presented how they dealt with different aspects of their applications, such as scalability. I learned quite a few lessons that day that I thought were worth sharing.
4 0.096251704 409 high scalability-2008-10-13-Challenges from large scale computing at Google
Introduction: From Greg Linden on a talk Google Fellow Jeff Dean gave last week at University of Washington Computer Science titled "Research Challenges Inspired by Large-Scale Computing at Google" : Coming away from the talk, the biggest points for me were the considerable interest in reducing costs (especially reducing power costs), the suggestion that the Google cluster may eventually contain 10M machines at 1k locations, and the call to action for researchers on distributed systems and databases to think orders of magnitude bigger than they often are, not about running on hundreds of machines in one location, but hundreds of thousands of machines across many locations.
5 0.083210491 762 high scalability-2010-01-18-The Missing Piece in the Virtualization Stack (Part 1)
Introduction: This and the next post will discuss how virtualization and cloud computing, as we know them today, are only a small part of the solution for today’s IT inefficiencies. While new technologies and delivery models have made it much simpler to manage the infrastructure, this is not where our core inefficiencies lie. Virtualization principles must be extended to higher levels of the application stack, to make it easier for all of us to manage, tune and integrate applications. Otherwise we will continue to spend most of our time on things that don’t provide real value to the business. Read the full article here
6 0.08042644 693 high scalability-2009-09-03-Storage Systems for High Scalable Systems presentation
7 0.07266748 173 high scalability-2007-12-05-Easier Production Releases
8 0.072104804 236 high scalability-2008-02-03-Ideas on how to scale a shared inventory database???
9 0.071246974 626 high scalability-2009-06-10-Paper: Graph Databases and the Future of Large-Scale Knowledge Management
10 0.070965536 354 high scalability-2008-07-20-The clouds are coming
11 0.069703899 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
12 0.069400154 264 high scalability-2008-03-03-Read This Site and Ace Your Next Interview!
13 0.062658347 1110 high scalability-2011-09-06-Big Data Application Platform
14 0.061809048 441 high scalability-2008-11-13-CloudCamp London 2: private clouds and standardisation
15 0.058775298 252 high scalability-2008-02-18-limit on the number of databases open
16 0.05842185 1170 high scalability-2012-01-06-Stuff The Internet Says On Scalability For January 6, 2012
17 0.057954632 935 high scalability-2010-11-05-Hot Scalability Links For November 5th, 2010
18 0.057418477 1440 high scalability-2013-04-15-Scaling Pinterest - From 0 to 10s of Billions of Page Views a Month in Two Years
19 0.056987204 873 high scalability-2010-08-06-Hot Scalability Links for Aug 6, 2010
20 0.055567238 733 high scalability-2009-10-29-Paper: No Relation: The Mixed Blessings of Non-Relational Databases
topicId topicWeight
[(0, 0.08), (1, 0.015), (2, 0.017), (3, 0.044), (4, -0.002), (5, 0.044), (6, -0.041), (7, -0.035), (8, 0.005), (9, -0.007), (10, 0.003), (11, 0.01), (12, -0.005), (13, 0.041), (14, 0.012), (15, -0.011), (16, 0.023), (17, 0.005), (18, 0.003), (19, 0.014), (20, -0.02), (21, -0.013), (22, -0.018), (23, -0.004), (24, 0.042), (25, -0.003), (26, -0.049), (27, -0.017), (28, -0.015), (29, 0.032), (30, -0.005), (31, 0.011), (32, -0.018), (33, 0.027), (34, -0.006), (35, 0.023), (36, 0.015), (37, 0.001), (38, 0.036), (39, 0.008), (40, 0.05), (41, -0.022), (42, -0.026), (43, 0.024), (44, -0.042), (45, -0.021), (46, -0.016), (47, 0.011), (48, -0.025), (49, 0.112)]
simIndex simValue blogId blogTitle
same-blog 1 0.93340367 537 high scalability-2009-03-12-QCon London 2009: Database projects to watch closely
Introduction: Geir Magnusson from 10gen presented a talk titled “Cloud Data Persistence, or ‘We’re in a database renaissance - pay attention’” today at QCon London 2009. The main message of his talk was that “physical limitations of today’s technology combined with the computational complexity of conventional relational databases are driving databases into new exciting spaces”, or, to put it more simply, the database landscape is changing and we should keep an eye on it.
2 0.63715076 974 high scalability-2011-01-18-Paper: Relational Cloud: A Database-as-a-Service for the Cloud
Introduction: The Relational Cloud Project is an effort by a group of researchers at MIT to investigate technologies and challenges related to Database-as-a-Service within cloud computing. They are trying to figure out how the advantages of the DaaS (Database-as-a-Service) model, which we've seen arise in other areas like OLAP and NoSQL, can be applied to relational databases. The DaaS advantages as they see them are: 1) predictable costs, proportional to the quality of service and actual workloads, 2) lower technical complexity, thanks to a unified and simplified service access interface, and 3) virtually infinite resources ready at hand. Their approach is described in the paper Relational Cloud: A Database-as-a-Service for the Cloud. From the abstract: This paper introduces a new transactional “database-as-a-service” (DBaaS) called Relational Cloud. A DBaaS promises to move much of the operational burden of provisioning, configuration, scaling, performance tun
3 0.61739695 1054 high scalability-2011-06-06-NoSQL Pain? Learn How to Read-write Scale Without a Complete Re-write
Introduction: Lately I've been reading about more cases where people have started to realize the limitations of the NoSQL promise of database scalability. Note the references below: Why does Quora use MySQL as the data store instead of NoSQLs such as Cassandra, MongoDB, CouchDB etc? Why did Diaspora abandon MongoDB for MySQL? How scalable is CouchDB in practice, not just in theory? Take MongoDB for example. It's damn fast, but it doesn't really know how to save data reliably to disk. I've had it set up in a replica pair to mitigate that risk. Guess what - both servers in the pair failed and corrupted their data files on the same day. It appears that for many, the switch to NoSQL can be rather painful. IMO that doesn't necessarily mean that NoSQL is wrong in general, but it's a combination of 1) lack of maturity 2) not the right tool for the job. That raises the question: what's the alternative solution? In the following post I tried to summarize the lessons from
4 0.6172843 1025 high scalability-2011-04-16-The NewSQL Market Breakdown
Introduction: Matt Aslett from the 451 Group coined the term “NewSQL”. On the definition of NewSQL, Aslett writes: “NewSQL” is our shorthand for the various new scalable/high performance SQL database vendors. We have previously referred to these products as ‘ScalableSQL’ to differentiate them from the incumbent relational database products. Since this implies horizontal scalability, which is not necessarily a feature of all the products, we adopted the term ‘NewSQL’ in the new report. And to clarify, like NoSQL, NewSQL is not to be taken too literally: the new thing about the NewSQL vendors is the vendor, not the SQL. As with NoSQL, under the NewSQL umbrella you can see various providers, with various solutions. I think these can be divided into several sub-types: New MySQL storage engines. These give MySQL users the same programming interface, but scale very well. You can find Xeround or Akiban in this field. The good part is that you still use MySQL, but on the downside it’s n
5 0.60857958 885 high scalability-2010-08-23-Building a Scalable Key-Value Database: Project Hydracus
Introduction: The world of NoSQL and alternative database implementations (i.e. non-relational) is deeply fascinating to me. I can’t help but be swept up in the whirl of planet-scale web development scalability techniques and the evolution of how developers think about building their applications knowing that with success comes the inevitable need to scale to levels almost unimaginable just five or ten years ago. I’m going to make a prediction: Developers will be expected to understand the fundamentals of how different database systems can be applied within a single application; their strengths and weaknesses, and when it is appropriate to leverage them. There’s a lot out there about scaling relational databases. Partitioning your data over application-managed shards is a topic that has seen its fair share of attention. But I think there’s a whole slew of NoSQL databases that have been built and designed by some very smart people that are new enough that their internals aren’t we
6 0.60022146 236 high scalability-2008-02-03-Ideas on how to scale a shared inventory database???
7 0.59363973 607 high scalability-2009-05-26-Database Optimize patterns
8 0.58583933 137 high scalability-2007-10-30-Database parallelism choices greatly impact scalability
9 0.58368975 782 high scalability-2010-02-23-When to migrate your database?
10 0.58244628 1007 high scalability-2011-03-18-Stuff The Internet Says On Scalability For March 18, 2011
11 0.58147973 1546 high scalability-2013-11-11-Ask HS: What is a good OLAP database choice with node.js?
12 0.57608575 935 high scalability-2010-11-05-Hot Scalability Links For November 5th, 2010
13 0.56715786 867 high scalability-2010-07-27-YeSQL: An Overview of the Various Query Semantics in the Post Only-SQL World
14 0.56267381 940 high scalability-2010-11-12-Stuff the Internet Says on Scalability For November 12th, 2010
15 0.55855829 749 high scalability-2009-12-15-The Common Principles Behind the NOSQL Alternatives
16 0.55405134 567 high scalability-2009-04-14-Challanges for Developing Enterprise Application on the Cloud
17 0.55210209 961 high scalability-2010-12-21-SQL + NoSQL = Yes !
18 0.55185586 1092 high scalability-2011-08-04-Jim Starkey is Creating a Brave New World by Rethinking Databases for the Cloud
19 0.54975826 580 high scalability-2009-04-24-INFOSCALE 2009 in June in Hong Kong
20 0.54872829 675 high scalability-2009-08-08-1dbase vs. many and cloud hosting vs. dedicated server(s)?
topicId topicWeight
[(2, 0.168), (14, 0.498), (61, 0.182)]
simIndex simValue blogId blogTitle
1 0.8484233 599 high scalability-2009-05-14-Who Has the Most Web Servers?
Introduction: An interesting post on DataCenterKnowledge! 1&1 Internet: 55,000 servers; Rackspace: 50,038 servers; The Planet: 48,500 servers; Akamai Technologies: 48,000 servers; OVH: 40,000 servers; SBC Communications: 29,193 servers; Verizon: 25,788 servers; Time Warner Cable: 24,817 servers; SoftLayer: 21,000 servers; AT&T: 20,268 servers; iWeb: 10,000 servers. How about Google, Microsoft, Amazon, eBay, Yahoo, GoDaddy, Facebook? Check out the post on DataCenterKnowledge and of course here on highscalability.com!
2 0.8336966 441 high scalability-2008-11-13-CloudCamp London 2: private clouds and standardisation
Introduction: CloudCamp returned to London yesterday, organised with the help of Skills Matter at the Crypt on Clerkenwell Green. The main topics of this cloud/grid computing community meeting were service-level agreements, connecting private and public clouds and standardisation issues.
3 0.65453893 405 high scalability-2008-10-07-Help a Scoble out. What should Robert ask in his scalability interview?
Introduction: One of the cool things about Mr. Scoble is he doesn't pretend to know everything, which can be a deadly boring affliction in this field. In this case Robert is asking for help in an upcoming interview. Maybe we can help? Here's Robert's plight: I’m really freaked out. I have one of the biggest interviews of my life coming up and I’m way underqualified to host it. It’s on Thursday and it’s about Scalability and Performance of Web Services. Look at who will be on. Matt Mullenweg, founder of Automattic, the company behind WordPress (and behind this blog). Paul Buchheit, one of the founders of FriendFeed and the creator of Gmail (he’s also the guy who gave Google the “don’t be evil” admonition). Nat Brown, CTO of iLike, which got six million users on Facebook in about 10 days. What would you ask?
4 0.59502411 495 high scalability-2009-01-17-Intro to Caching,Caching algorithms and caching frameworks part 1
Introduction: Informative and well organized post on caching. Talks about: Why do we need cache?, What is Cache?, Cache Hit, Cache Miss, Storage Cost, Retrieval Cost, Invalidation, Replacement Policy, Optimal Replacement Policy, Caching Algorithms, Least Frequently Used (LFU), Least Recently Used (LRU), Least Recently Used 2 (LRU2), Two Queues, Adaptive Replacement Cache (ARC), Most Recently Used (MRU), First in First out (FIFO), Distributed caching, Measuring Cache.
5 0.58804208 981 high scalability-2011-02-01-Google Strategy: Tree Distribution of Requests and Responses
Introduction: If a large number of leaf node machines send requests to a central root node then that root node can become overwhelmed: The CPU becomes a bottleneck, for either processing requests or sending replies, because it can't possibly deal with the flood of requests. The network interface becomes a bottleneck because a wide fan-in causes TCP drops and retransmissions, which causes latency. Then clients start retrying requests which quickly causes a spiral of death in an undisciplined system. One solution to this problem is a strategy given by Dr. Jeff Dean, Head of Google's School of Infrastructure Wizardry, in this Stanford video presentation: Tree Distribution of Requests and Responses. Instead of having a root node connected to leaves in a flat topology, the idea is to create a tree of nodes. So a root node talks to a number of parent nodes and the parent nodes talk to a number of leaf nodes. Requests are pushed down the tree through the parents and only hit a subset
6 0.58014148 487 high scalability-2009-01-08-Paper: Sharding with Oracle Database
7 0.54495275 725 high scalability-2009-10-21-Manage virtualized sprawl with VRMs
8 0.53227174 1253 high scalability-2012-05-28-The Anatomy of Search Technology: Crawling using Combinators
same-blog 9 0.53212166 537 high scalability-2009-03-12-QCon London 2009: Database projects to watch closely
10 0.46128312 694 high scalability-2009-09-04-Hot Links for 2009-9-4
11 0.44420955 744 high scalability-2009-11-24-Hot Scalability Links for Nov 24 2009
12 0.43227792 322 high scalability-2008-05-19-Conference: Infoscale 2008 in Italy (June 4-6)
13 0.42639667 99 high scalability-2007-09-23-HA for switches
14 0.42629173 208 high scalability-2008-01-11-FTP Sanity: Redundancy, archiving, consolidation.
15 0.4256815 332 high scalability-2008-05-28-Job queue and search engine
16 0.42246181 1287 high scalability-2012-07-20-Stuff The Internet Says On Scalability For July 20, 2012
17 0.41932046 1278 high scalability-2012-07-06-Stuff The Internet Says On Scalability For July 6, 2012
18 0.41408187 930 high scalability-2010-10-28-NoSQL Took Away the Relational Model and Gave Nothing Back
19 0.41360706 173 high scalability-2007-12-05-Easier Production Releases
20 0.41210842 177 high scalability-2007-12-08-thesimsonstage.ea.com
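Illustration for the entry "Google Strategy: Tree Distribution of Requests and Responses" listed above: a minimal sketch of how a layer of parent nodes narrows the fan-in at the root. The function names, the fanout value, and the idea that each parent merges its leaves' replies by summing them are assumptions made for this example; the excerpt only says that the root talks to parents and parents talk to leaves.

```python
from typing import List, Tuple


def chunk(items: List[int], size: int) -> List[List[int]]:
    """Split items into consecutive groups of at most `size` elements."""
    return [items[i:i + size] for i in range(0, len(items), size)]


def flat_query(leaf_values: List[int]) -> Tuple[int, int]:
    """Flat topology: every leaf replies directly to the root.

    Returns (merged result, number of replies the root receives).
    """
    return sum(leaf_values), len(leaf_values)


def tree_query(leaf_values: List[int], fanout: int) -> Tuple[int, int]:
    """Two-level tree: each hypothetical parent handles `fanout` leaves.

    Each parent merges the replies of its own leaves (assumed here to be
    a simple sum), so the root only receives one reply per parent.
    Returns (merged result, number of replies the root receives).
    """
    parent_replies = [sum(group) for group in chunk(leaf_values, fanout)]
    return sum(parent_replies), len(parent_replies)


if __name__ == "__main__":
    leaves = list(range(1000))  # one made-up value per leaf machine
    print("flat:", flat_query(leaves))
    print("tree:", tree_query(leaves, fanout=50))
```

With 1000 leaves and a fanout of 50, both topologies produce the same merged result, but the root receives 20 replies instead of 1000, which is the fan-in reduction the excerpt describes.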