high_scalability high_scalability-2009 high_scalability-2009-737 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: NorthScale's Steven Yen in his highly entertaining NoSQL is a Horseless Carriage presentation has come up with a NoSQL taxonomy that thankfully focuses a little more on what NoSQL is, than what it isn't : key‐value‐cache memcached, repcached, coherence, infinispan, eXtreme scale, jboss cache, velocity, terracoqa key‐value‐store keyspace, flare, schema‐free, RAMCloud eventually‐consistent key‐value‐store dynamo, voldemort, Dynomite, SubRecord, Mo8onDb, Dovetaildb ordered‐key‐value‐store tokyo tyrant, lightcloud, NMDB, luxio, memcachedb, actord data‐structures server redis tuple‐store gigaspaces, coord, apache river object database ZopeDB, db4o, Shoal document store CouchDB, Mongo, Jackrabbit, XML Databases, ThruDB, CloudKit, Perservere, Riak Basho, Scalaris wide columnar store BigTable, Hbase, Cassandra, Hypertable, KAI, OpenNeptune, Qbase, KDI "Who will win?"
sentIndex sentText sentNum sentScore
1 He answers: t he most approachable API with enough power will win . [sent-3, score-0.126]
2 Steven touts the contender with the most devastating knock out punch will be document stores because "everyone groks documents. [sent-4, score-0.434]
3 " Though the thought is there will be just a few winners and products will converge in functionality. [sent-5, score-0.202]
4 Steven is banking on the "worse is better" model of dominance, which is hard to argue with as it has been so successful an adoption pattern in our field. [sent-6, score-0.178]
5 The convergence idea is something I also agree with. [sent-7, score-0.169]
6 Over time they will merge together to become more full featured offerings. [sent-9, score-0.161]
7 The key question though is what is enough power to win? [sent-10, score-0.245]
8 Just getting a value back for a key won't be enough. [sent-11, score-0.372]
9 Related Articles NoSQL is a horseless carriage blog post by Steven Yen. [sent-13, score-0.566]
wordName wordTfidf (topN-words)
[('steven', 0.409), ('horseless', 0.297), ('carriage', 0.269), ('value', 0.207), ('key', 0.165), ('flare', 0.134), ('lightcloud', 0.134), ('yen', 0.134), ('repcached', 0.134), ('masquerading', 0.134), ('kai', 0.126), ('win', 0.126), ('devastating', 0.121), ('thrudb', 0.121), ('dominance', 0.121), ('taxonomy', 0.121), ('dynomite', 0.121), ('northscale', 0.121), ('touts', 0.116), ('thankfully', 0.116), ('infinispan', 0.116), ('articlesnosql', 0.116), ('tyrant', 0.112), ('nosql', 0.11), ('punch', 0.109), ('memcachedb', 0.109), ('converge', 0.102), ('basho', 0.102), ('winners', 0.1), ('banking', 0.1), ('entertaining', 0.097), ('convergence', 0.097), ('hypertable', 0.097), ('columnar', 0.095), ('mongo', 0.093), ('organizing', 0.092), ('featured', 0.089), ('voldemort', 0.089), ('knock', 0.088), ('meetup', 0.086), ('focuses', 0.084), ('though', 0.08), ('coherence', 0.079), ('argue', 0.078), ('velocity', 0.075), ('riak', 0.073), ('agree', 0.072), ('merge', 0.072), ('xml', 0.07), ('extreme', 0.07)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 737 high scalability-2009-11-05-A Yes for a NoSQL Taxonomy
Introduction: NorthScale's Steven Yen in his highly entertaining NoSQL is a Horseless Carriage presentation has come up with a NoSQL taxonomy that thankfully focuses a little more on what NoSQL is, than what it isn't : key‐value‐cache memcached, repcached, coherence, infinispan, eXtreme scale, jboss cache, velocity, terracoqa key‐value‐store keyspace, flare, schema‐free, RAMCloud eventually‐consistent key‐value‐store dynamo, voldemort, Dynomite, SubRecord, Mo8onDb, Dovetaildb ordered‐key‐value‐store tokyo tyrant, lightcloud, NMDB, luxio, memcachedb, actord data‐structures server redis tuple‐store gigaspaces, coord, apache river object database ZopeDB, db4o, Shoal document store CouchDB, Mongo, Jackrabbit, XML Databases, ThruDB, CloudKit, Perservere, Riak Basho, Scalaris wide columnar store BigTable, Hbase, Cassandra, Hypertable, KAI, OpenNeptune, Qbase, KDI "Who will win?"
2 0.19810896 736 high scalability-2009-11-04-Damn, Which Database do I Use Now?
Introduction: With so many database options available these days, like for the rest of life, it's natural to wonder how it all fits together . Amazon complicated, or rather expanded the available options by introducing RDS, their relational database service . RDS is MySQL safely cocooned as a manageable cloud element, resting boldly within an energy providing elastic CPU pool , supported by a virtually infinite supply of very capable virtualized storage . MySQL in AWS is now easy to start, stop, monitor, backup, snapshot, expand, and effortlessly move up and down the instance hierarchy. What it's not, contrary to what you might expect, is a scale-out solution, it's a scale-up solution. You get more by buying a bigger instance, not by horizontally adding more instances. There's a limit . Admittedly a larger limit now with Amazon's new high memory instances . That's OK, well maybe not for people who helped grow Amazon's ecosystem by offering a similar product, but so many projects use M
3 0.16180231 670 high scalability-2009-08-05-Anti-RDBMS: A list of distributed key-value stores
Introduction: Update 8: Introducing MongoDB by Eliot Horowit . Update 7: The Future of Scalable Databases by Robin Mathew. Update 6: NoSQL : If Only it Was that Easy . BJ Clark lays down the law on which databases are scalable: Tokyo - NO, Redis - NO, Voldemort - YES, MongoDB - Not Yet, Cassandra - Probably, Amazon S3 - YES * 2, MySQL - NO. The real thing to point out is that if you are being held back from making something super awesome because you can’t choose a database, you are doing it wrong. Update 5: Exciting stuff happening in Japan at this Key-Value Storage meeting in Tokyo . Presentations on Groonga, Senna, Lux IO, Tokyo-Cabinet, Tx, repcached, Kai, Cagra, kumofs, ROMA, and Flare. Update 4: NoSQL and the Relational Model: don’t throw the baby out with the bathwater by Matthew Willson. So my key point is, this kind of modelling is WORTH DOING, regardless of which database tool you end up using for physical storage. Update 3: Choosing a non-relational database
Introduction: This is a guest post by Doug Judd , original creator of Hypertable and the CEO of Hypertable, Inc. Hypertable delivers 2X better throughput in most tests -- HBase fails 41 and 167 billion record insert tests, overwhelmed by garbage collection -- Both systems deliver similar results for random read uniform test We recently conducted a test comparing the performance of Hypertable ( @hypertable ) version 0.9.5.5 to that of HBase ( @HBase ) version 0.90.4 (CDH3u2) running Zookeeper 3.3.4. In this post, we summarize the results and offer explanations for the discrepancies. For the full test report, see Hypertable vs. HBase II . Introduction Hypertable and HBase are both open source, scalable databases modeled after Google's proprietary Bigtable database. The primary difference between the two systems is that Hypertable is written in C++, while HBase is written in Java. We modeled this test after the one described in section 7 of the Bigtable paper and tuned both systems fo
5 0.11752261 931 high scalability-2010-10-28-Notes from A NOSQL Evening in Palo Alto
Introduction: I along with 180 other people and veritable who's who of NoSQL vendors, attended the A NoSQL Evening in Palo Alto NoSQL Meetup on Tuesday. The format was a panel of 10 vendors--10gen, Basho, CouchOne, Cloudant, Cloudera, GoGrid, InfiniteGraph, Membase, Riptano, Scality--sitting in two rows of chairs in front of what seemed like a pretty diverse audience. Tim Anglade (founder, A NOSQL Summer) moderated. Tim kept things moving by asking a few leading questions and the panel chimed in with answers. Quite a few questions came from the audience, which was refreshing. Overall a genial evening with some good discussion. I was pleased that the panel members didn't just automatically slip into marketing speak. Most of the discussions were on point rather than just another excuse to hit the talking points. There were some complaints about the talk not being technical enough, but I don't think that was really the purpose of this kind of talk. The panel format is excellent at giving a wide ra
6 0.113566 739 high scalability-2009-11-09-10 NoSQL Systems Reviewed
7 0.1109203 696 high scalability-2009-09-07-Product: Infinispan - Open Source Data Grid
8 0.10969818 708 high scalability-2009-09-17-Infinispan narrows the gap between open source and commercial data caches
9 0.10261817 954 high scalability-2010-12-06-What the heck are you actually using NoSQL for?
10 0.092279062 528 high scalability-2009-03-06-Product: Lightcloud - Key-Value Database
11 0.086207949 875 high scalability-2010-08-09-NoSQL on the Microsoft Platform
12 0.083690941 1022 high scalability-2011-04-13-Paper: NoSQL Databases - NoSQL Introduction and Overview
13 0.082378961 512 high scalability-2009-02-14-Scaling Digg and Other Web Applications
14 0.0818156 910 high scalability-2010-09-30-Facebook and Site Failures Caused by Complex, Weakly Interacting, Layered Systems
15 0.080658711 634 high scalability-2009-06-20-Building a data cycle at LinkedIn with Hadoop and Project Voldemort
16 0.079844028 961 high scalability-2010-12-21-SQL + NoSQL = Yes !
17 0.078386895 1180 high scalability-2012-01-24-The State of NoSQL in 2012
18 0.074224979 732 high scalability-2009-10-29-Digg - Looking to the Future with Cassandra
19 0.072806716 1204 high scalability-2012-03-06-Ask For Forgiveness Programming - Or How We'll Program 1000 Cores
20 0.071970515 647 high scalability-2009-07-02-Hypertable is a New BigTable Clone that Runs on HDFS or KFS
topicId topicWeight
[(0, 0.09), (1, 0.041), (2, 0.012), (3, 0.038), (4, 0.043), (5, 0.091), (6, -0.042), (7, -0.039), (8, 0.038), (9, -0.021), (10, -0.011), (11, -0.01), (12, -0.018), (13, 0.021), (14, -0.003), (15, -0.023), (16, 0.034), (17, -0.032), (18, -0.068), (19, -0.058), (20, -0.041), (21, -0.004), (22, 0.05), (23, 0.013), (24, 0.035), (25, -0.04), (26, 0.015), (27, 0.044), (28, -0.035), (29, -0.042), (30, -0.015), (31, 0.025), (32, 0.016), (33, -0.036), (34, 0.02), (35, -0.018), (36, -0.009), (37, -0.024), (38, -0.01), (39, -0.031), (40, 0.008), (41, 0.023), (42, 0.038), (43, 0.023), (44, -0.042), (45, 0.033), (46, -0.019), (47, 0.006), (48, 0.031), (49, 0.008)]
simIndex simValue blogId blogTitle
same-blog 1 0.95832622 737 high scalability-2009-11-05-A Yes for a NoSQL Taxonomy
Introduction: NorthScale's Steven Yen in his highly entertaining NoSQL is a Horseless Carriage presentation has come up with a NoSQL taxonomy that thankfully focuses a little more on what NoSQL is, than what it isn't : key‐value‐cache memcached, repcached, coherence, infinispan, eXtreme scale, jboss cache, velocity, terracoqa key‐value‐store keyspace, flare, schema‐free, RAMCloud eventually‐consistent key‐value‐store dynamo, voldemort, Dynomite, SubRecord, Mo8onDb, Dovetaildb ordered‐key‐value‐store tokyo tyrant, lightcloud, NMDB, luxio, memcachedb, actord data‐structures server redis tuple‐store gigaspaces, coord, apache river object database ZopeDB, db4o, Shoal document store CouchDB, Mongo, Jackrabbit, XML Databases, ThruDB, CloudKit, Perservere, Riak Basho, Scalaris wide columnar store BigTable, Hbase, Cassandra, Hypertable, KAI, OpenNeptune, Qbase, KDI "Who will win?"
2 0.82232356 875 high scalability-2010-08-09-NoSQL on the Microsoft Platform
Introduction: NoSQL is a trend that is gaining steam primarily in the world of Open Source. There are numerous NoSQL solutions available for all levels of complexity: from queryable distributed solutions like MongoDB to simpler distributed key-value storage solutions like Cassandra. Then there’s Riak, Tokyo Cabinet, Voldemort, CouchDB, and Redis. However, very few of these packaged NoSQL products are available for the other end of the platform market: Microsoft Windows. I’m going to outline what’s available now and briefly touch on some opportunities that are still available to the daring Microsoft engineer. You can read the full story here .
3 0.7856515 670 high scalability-2009-08-05-Anti-RDBMS: A list of distributed key-value stores
Introduction: Update 8: Introducing MongoDB by Eliot Horowit . Update 7: The Future of Scalable Databases by Robin Mathew. Update 6: NoSQL : If Only it Was that Easy . BJ Clark lays down the law on which databases are scalable: Tokyo - NO, Redis - NO, Voldemort - YES, MongoDB - Not Yet, Cassandra - Probably, Amazon S3 - YES * 2, MySQL - NO. The real thing to point out is that if you are being held back from making something super awesome because you can’t choose a database, you are doing it wrong. Update 5: Exciting stuff happening in Japan at this Key-Value Storage meeting in Tokyo . Presentations on Groonga, Senna, Lux IO, Tokyo-Cabinet, Tx, repcached, Kai, Cagra, kumofs, ROMA, and Flare. Update 4: NoSQL and the Relational Model: don’t throw the baby out with the bathwater by Matthew Willson. So my key point is, this kind of modelling is WORTH DOING, regardless of which database tool you end up using for physical storage. Update 3: Choosing a non-relational database
4 0.77720243 872 high scalability-2010-08-05-Pairing NoSQL and Relational Data Storage: MySQL with MongoDB
Introduction: I’ve largely steered clear of publicly commenting on the “NoSQL vs. Relational” conflict. Keeping in mind that this argument is more about currently available solutions and the features their developers have chosen to build in, I’d like to dig into this and provide a decidedly neutral viewpoint. In fact, by erring on the side of caution, I’ve inadvertently given myself plenty of time to consider the pros and cons of both data storage approaches, and although my mind was initially swaying toward the NoSQL camp, I can say with a fair amount of certainty, that I’ve found a good compromise. You can read the full store here .
5 0.76867908 874 high scalability-2010-08-07-ArchCamp: Scalable Databases (NoSQL)
Introduction: ArchCamp: Scalable Databasess (NoSQL) The ArchCamp unconference was held this past Friday at HackerDojo in Mountain View, CA. There was plenty of pizza, beer, and great conversation. This session started out free-form, but shaped up pretty quickly into a discussion of the popular open source scalable NoSQL databases and the architectural categories in which they belong.
6 0.76815212 931 high scalability-2010-10-28-Notes from A NOSQL Evening in Palo Alto
7 0.7603901 739 high scalability-2009-11-09-10 NoSQL Systems Reviewed
8 0.75303847 745 high scalability-2009-11-25-Brian Aker's Hilarious NoSQL Stand Up Routine
9 0.72623235 930 high scalability-2010-10-28-NoSQL Took Away the Relational Model and Gave Nothing Back
10 0.71379876 935 high scalability-2010-11-05-Hot Scalability Links For November 5th, 2010
11 0.699085 779 high scalability-2010-02-16-Seven Signs You May Need a NoSQL Database
12 0.69626164 1022 high scalability-2011-04-13-Paper: NoSQL Databases - NoSQL Introduction and Overview
13 0.69235235 1180 high scalability-2012-01-24-The State of NoSQL in 2012
14 0.6724385 961 high scalability-2010-12-21-SQL + NoSQL = Yes !
15 0.65274364 1054 high scalability-2011-06-06-NoSQL Pain? Learn How to Read-write Scale Without a Complete Re-write
16 0.64614546 736 high scalability-2009-11-04-Damn, Which Database do I Use Now?
17 0.64497298 732 high scalability-2009-10-29-Digg - Looking to the Future with Cassandra
18 0.64214355 954 high scalability-2010-12-06-What the heck are you actually using NoSQL for?
19 0.63815624 1169 high scalability-2012-01-05-Shutterfly Saw a Speedup of 500% With Flashcache
20 0.63405085 770 high scalability-2010-02-03-NoSQL Means Never Having to Store Blobs Again
topicId topicWeight
[(1, 0.065), (2, 0.117), (10, 0.056), (49, 0.421), (56, 0.04), (61, 0.108), (79, 0.092)]
simIndex simValue blogId blogTitle
1 0.85418385 843 high scalability-2010-06-16-WTF is Elastic Data Grid? (By Example)
Introduction: Forrester released their new wave report: T he Forrester Wave™: Elastic Caching Platforms, Q2 2010 where they listed GigaSpaces, IBM, Oracle, and Terracotta as leading vendors in the field. In this post I'd like to take some time to explain what some of these terms mean, and why they’re important to you. I’ll start with a definition of Elastic Data Grid (Elastic Caching), how it is different then other caching and NoSQL alternatives, and more importantly -- I'll illustrate how it works through some real code examples. You can read the full story here .
same-blog 2 0.78694916 737 high scalability-2009-11-05-A Yes for a NoSQL Taxonomy
Introduction: NorthScale's Steven Yen in his highly entertaining NoSQL is a Horseless Carriage presentation has come up with a NoSQL taxonomy that thankfully focuses a little more on what NoSQL is, than what it isn't : key‐value‐cache memcached, repcached, coherence, infinispan, eXtreme scale, jboss cache, velocity, terracoqa key‐value‐store keyspace, flare, schema‐free, RAMCloud eventually‐consistent key‐value‐store dynamo, voldemort, Dynomite, SubRecord, Mo8onDb, Dovetaildb ordered‐key‐value‐store tokyo tyrant, lightcloud, NMDB, luxio, memcachedb, actord data‐structures server redis tuple‐store gigaspaces, coord, apache river object database ZopeDB, db4o, Shoal document store CouchDB, Mongo, Jackrabbit, XML Databases, ThruDB, CloudKit, Perservere, Riak Basho, Scalaris wide columnar store BigTable, Hbase, Cassandra, Hypertable, KAI, OpenNeptune, Qbase, KDI "Who will win?"
3 0.70147628 400 high scalability-2008-10-01-The Pattern Bible for Distributed Computing
Introduction: Software design patterns are an emerging tool for guiding and documenting system design. Patterns usually describe software abstractions used by advanced designers and programmers in their software. Patterns can provide guidance for designing highly scalable distributed systems. Let's see how! Patterns are in essence solutions to problems. Most of them are expressed in a format called Alexandrian form which draws on constructs used by Christopher Alexander. There are variants but most look like this: The pattern name The problem the pattern is trying to solve Context Solution Examples Design rationale: This tells where the pattern came from, why it works, and why experts use it Patterns rarely stand alone. Each pattern works on a context, and transforms the system in that context to produce a new system in a new context. New problems arise in the new system and context, and the next ‘‘layer’’ of patterns can be applied. A pattern language is a structured col
4 0.61167949 735 high scalability-2009-11-01-Squeeze more performance from Parallelism
Introduction: In many posts, such as: The Future of the Parallelism and its Challenges I mentioned that synchronization the access to the shared resource is the major challenge to write parallel code. The synchronization and coordination take long time from the overall execution time, which reduce the benefits of the parallelism; the synchronization and coordination also reduce the scalability. There are many forms of synchronization and coordination, such as: Create Task object in frameworks such as: Microsoft TPL, Intel TDD, and Parallel Runtime Library. Create and enqueue task objects require synchronization that it’s takes long time especially if we create it into recursive work such as: Quick Sort algorithm. Synchronization the access to shared data. But there are a few techniques to avoid these issues, such as: Shared-Nothing, Actor Model, and Hyper Object (A.K.A. Combinable Object). Simply if we reduce the shared data by re-architect our code this will gives us a huge benefits
5 0.60131663 321 high scalability-2008-05-17-WebSphere Commerce High Availability and Performance Configurations
Introduction: Nobody came up with an example of a website powered by a Websphere product (which has a community edition) and backed up by a DB2 database. I guess you all know about usopen.org so here's the story: While the re-emergence of 35-year-old Andre Agassi and the continued dominance of wunderkind Maria Sharapova have highlighted the on-court headlines at this year's U.S. Open Tennis Championships in Flushing Meadows, N.Y., IBM is hoping its new Power5 chip-based IT support for USOpen.org can make news among those more interested in .NET than tennis nets. Big Blue has partnered with the U.S. Tennis Association and the U.S. Open -- the most prestigious tennis tournament in the U.S. -- since 1992. Together, they launched USOpen.org in 1995 so racket heads could follow the matches online. The iSeries' role this year is in powering a Web-based end-user application called "Point Tracker," a graphics tool using autonomic technology that recreates the trajectory of every shot. On-c
6 0.55702442 1051 high scalability-2011-06-01-Why is your network so slow? Your switch should tell you.
8 0.55047768 749 high scalability-2009-12-15-The Common Principles Behind the NOSQL Alternatives
9 0.5327009 1311 high scalability-2012-08-24-Stuff The Internet Says On Scalability For August 24, 2012
10 0.52275938 183 high scalability-2007-12-12-Report from OpenSocial Meetup at Google
11 0.49909735 823 high scalability-2010-05-05-How will memristors change everything?
12 0.48341054 1359 high scalability-2012-11-15-Gone Fishin': Justin.Tv's Live Video Broadcasting Architecture
13 0.48281491 796 high scalability-2010-03-16-Justin.tv's Live Video Broadcasting Architecture
14 0.48159835 35 high scalability-2007-07-28-Product: FastStats Log Analyzer
15 0.47692823 568 high scalability-2009-04-14-Designing a Scalable Twitter
16 0.46924663 1482 high scalability-2013-06-26-Leveraging Cloud Computing at Yelp - 102 Million Monthly Vistors and 39 Million Reviews
17 0.46440727 399 high scalability-2008-10-01-Joyent - Cloud Computing Built on Accelerators
18 0.44089836 119 high scalability-2007-10-10-WAN Accelerate Your Way to Lightening Fast Transfers Between Data Centers
19 0.43853289 1258 high scalability-2012-06-05-Thesis: Concurrent Programming for Scalable Web Architectures
20 0.43775147 916 high scalability-2010-10-07-Hot Scalability Links For Oct 8, 2010