high_scalability high_scalability-2009 high_scalability-2009-718 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Update: Short presentation NYC by Bryan Fink demonstrating the riak web-shaped data storage engine Riak is another new and interesting key-value store entrant. Some of the features it offers are: Document-oriented Scalable, decentralized key-value store Standard get , put , and delete operations. Distributed, fault-tolerant storage solution. Configurable levels of consistency, availability, and partition tolerance Support for Erlang, Ruby, PHP, Javascript, Java, Python, HTTP open source and NoSQL Pluggable backends Eventing system Monitoring Inter-cluster replication Links between records that can be traversed. Map/Reduce. Functions are executed on the data node. One interesting difference is that a list keys are required to specify which values are operated on as apposed to running calculations on all values. Related Articles Hacker News Thread . More juicy details on how Riak compares to Cassandra, mongodb, couchdb, etc.
sentIndex sentText sentNum sentScore
1 Update: Short presentation NYC by Bryan Fink demonstrating the riak web-shaped data storage engine Riak is another new and interesting key-value store entrant. [sent-1, score-0.532]
2 Some of the features it offers are: Document-oriented Scalable, decentralized key-value store Standard get , put , and delete operations. [sent-2, score-0.591]
3 Configurable levels of consistency, availability, and partition tolerance Support for Erlang, Ruby, PHP, Javascript, Java, Python, HTTP open source and NoSQL Pluggable backends Eventing system Monitoring Inter-cluster replication Links between records that can be traversed. [sent-4, score-0.49]
4 One interesting difference is that a list keys are required to specify which values are operated on as apposed to running calculations on all values. [sent-7, score-1.515]
5 More juicy details on how Riak compares to Cassandra, mongodb, couchdb, etc. [sent-9, score-0.487]
wordName wordTfidf (topN-words)
[('riak', 0.32), ('operated', 0.24), ('demonstrating', 0.24), ('articleshacker', 0.24), ('bryan', 0.229), ('apposed', 0.229), ('compares', 0.201), ('juicy', 0.198), ('decentralized', 0.181), ('tolerant', 0.169), ('delete', 0.166), ('calculations', 0.161), ('specify', 0.159), ('executed', 0.155), ('erlang', 0.153), ('couchdb', 0.146), ('values', 0.13), ('keys', 0.13), ('records', 0.13), ('partition', 0.124), ('difference', 0.123), ('ruby', 0.121), ('javascript', 0.117), ('levels', 0.117), ('functions', 0.117), ('interesting', 0.115), ('python', 0.114), ('mongodb', 0.111), ('offers', 0.107), ('etc', 0.104), ('news', 0.101), ('cassandra', 0.1), ('php', 0.1), ('consistency', 0.097), ('storage', 0.097), ('short', 0.094), ('details', 0.088), ('list', 0.086), ('required', 0.086), ('related', 0.082), ('update', 0.078), ('java', 0.074), ('availability', 0.072), ('put', 0.072), ('http', 0.067), ('features', 0.065), ('source', 0.061), ('open', 0.058), ('running', 0.056), ('distributed', 0.045)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 718 high scalability-2009-10-08-Riak - web-shaped data storage system
Introduction: Update: Short presentation NYC by Bryan Fink demonstrating the riak web-shaped data storage engine Riak is another new and interesting key-value store entrant. Some of the features it offers are: Document-oriented Scalable, decentralized key-value store Standard get , put , and delete operations. Distributed, fault-tolerant storage solution. Configurable levels of consistency, availability, and partition tolerance Support for Erlang, Ruby, PHP, Javascript, Java, Python, HTTP open source and NoSQL Pluggable backends Eventing system Monitoring Inter-cluster replication Links between records that can be traversed. Map/Reduce. Functions are executed on the data node. One interesting difference is that a list keys are required to specify which values are operated on as apposed to running calculations on all values. Related Articles Hacker News Thread . More juicy details on how Riak compares to Cassandra, mongodb, couchdb, etc.
2 0.12059759 1262 high scalability-2012-06-11-Monday Fun: Seven Databases in Song
Introduction: If you understand things best when they're formatted as a musical, this video is for you. It teaches the essentials of PostgreSQL, Riak, HBase, MongoDB, CouchDB, Neo4J and Redis in the style of My Fair Lady. And for a change, it's very SFW.
3 0.10664842 875 high scalability-2010-08-09-NoSQL on the Microsoft Platform
Introduction: NoSQL is a trend that is gaining steam primarily in the world of Open Source. There are numerous NoSQL solutions available for all levels of complexity: from queryable distributed solutions like MongoDB to simpler distributed key-value storage solutions like Cassandra. Then there’s Riak, Tokyo Cabinet, Voldemort, CouchDB, and Redis. However, very few of these packaged NoSQL products are available for the other end of the platform market: Microsoft Windows. I’m going to outline what’s available now and briefly touch on some opportunities that are still available to the daring Microsoft engineer. You can read the full story here .
4 0.098972954 1089 high scalability-2011-07-29-Stuff The Internet Says On Scalability For July 29, 2011
Introduction: Submitted for your end of July scaling pleasure: YouTube : 3 billion videos viewed a day; 48 hours of footage uploaded every minute. 64 core Tilera chip . Google wants to be your CDN. They figure the only way to make the web faster...is to host it. Page Speed Service - Web Performance, Delivered . An eventually for pay service that caches your website and distributes it around the world. No cost information. Your speed may vary. See the longish list of limitations . Nobody said anything interesting on scalability this week! A disaster of non-quotable proportions. If I missed something, now is your chance. Moving an Elephant: Large Scale Hadoop Data Migration at Facebook . Paul Yang describes the greatest westward expansion since the land bridge across the Bering Strait. It's a story of moving a 30PB Hadoop cluster from an over populated datacenter to the wide open spaces of a new continent. Unlike the early settlers, Facebook did not move the boxes over, that would dis
5 0.09890566 1559 high scalability-2013-12-06-Stuff The Internet Says On Scalability For December 6th, 2013
Introduction: Hey, it's HighScalability time: Test your sense of scale. Is this image of something microscopic or macroscopic? Find out . 72 : Intel's 72 core x86 Processor; One Trillion : number of fonts served by Google. Quotable Quotes: West-Eberhard : The gene does not lead, it follows. @waldojaquith : To an ant, gravity is nothing, but surface tension is a powerful force. When you change scale, you play by different rules. Nicholas Christakis : The spread of germs is the price we pay for the spread of ideas. We assemble ourselves into networks to facilitate the flow information but we pay a price, the spread of disease. James Mickens : When you debug a distributed system or an OS kernel, you do it Texas-style. You gather some mean, stoic people, people who have seen things die, and you get some primitive tools, like a compass and a rucksack and a stick that’s pointed on one end, and you walk into the wilderness and you look for troub
6 0.09732195 401 high scalability-2008-10-04-Is MapReduce going mainstream?
7 0.090862289 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
8 0.087765843 651 high scalability-2009-07-02-Product: Project Voldemort - A Distributed Database
9 0.0862737 971 high scalability-2011-01-10-Riak's Bitcask - A Log-Structured Hash Table for Fast Key-Value Data
10 0.081526414 739 high scalability-2009-11-09-10 NoSQL Systems Reviewed
11 0.079807803 417 high scalability-2008-10-15-Outside.in Scales Up with Engine Yard and moving from PHP to Ruby on Rails
12 0.079613477 625 high scalability-2009-06-10-Managing cross partition transactions in a distributed KV system
13 0.079005256 318 high scalability-2008-05-14-New Facebook Chat Feature Scales to 70 Million Users Using Erlang
14 0.078869887 1147 high scalability-2011-11-25-Stuff The Internet Says On Scalability For November 25, 2011
15 0.078765221 954 high scalability-2010-12-06-What the heck are you actually using NoSQL for?
16 0.077026747 459 high scalability-2008-12-03-Java World Interview on Scalability and Other Java Scalability Secrets
17 0.075205848 230 high scalability-2008-01-29-Speed up (Oracle) database code with result caching
18 0.074756309 203 high scalability-2008-01-07-How Ruby on Rails Survived a 550k Pageview Digging
19 0.074665383 498 high scalability-2009-01-20-Product: Amazon's SimpleDB
20 0.074155822 649 high scalability-2009-07-02-Product: Facebook's Cassandra - A Massive Distributed Store
topicId topicWeight
[(0, 0.107), (1, 0.04), (2, -0.015), (3, 0.031), (4, 0.036), (5, 0.087), (6, 0.006), (7, -0.012), (8, 0.031), (9, 0.03), (10, 0.003), (11, -0.012), (12, -0.013), (13, -0.047), (14, -0.049), (15, -0.021), (16, 0.014), (17, -0.014), (18, -0.023), (19, -0.077), (20, 0.004), (21, 0.0), (22, -0.015), (23, -0.006), (24, -0.031), (25, -0.039), (26, 0.055), (27, 0.028), (28, -0.014), (29, -0.035), (30, -0.005), (31, -0.044), (32, -0.044), (33, -0.012), (34, 0.049), (35, -0.06), (36, -0.016), (37, -0.016), (38, -0.06), (39, 0.05), (40, -0.059), (41, -0.063), (42, 0.037), (43, 0.065), (44, 0.043), (45, -0.033), (46, 0.033), (47, -0.005), (48, -0.031), (49, -0.007)]
simIndex simValue blogId blogTitle
same-blog 1 0.95147473 718 high scalability-2009-10-08-Riak - web-shaped data storage system
Introduction: Update: Short presentation NYC by Bryan Fink demonstrating the riak web-shaped data storage engine Riak is another new and interesting key-value store entrant. Some of the features it offers are: Document-oriented Scalable, decentralized key-value store Standard get , put , and delete operations. Distributed, fault-tolerant storage solution. Configurable levels of consistency, availability, and partition tolerance Support for Erlang, Ruby, PHP, Javascript, Java, Python, HTTP open source and NoSQL Pluggable backends Eventing system Monitoring Inter-cluster replication Links between records that can be traversed. Map/Reduce. Functions are executed on the data node. One interesting difference is that a list keys are required to specify which values are operated on as apposed to running calculations on all values. Related Articles Hacker News Thread . More juicy details on how Riak compares to Cassandra, mongodb, couchdb, etc.
2 0.6076287 651 high scalability-2009-07-02-Product: Project Voldemort - A Distributed Database
Introduction: Update: Presentation from the NoSQL conference : slides , video 1 , video 2 . Project Voldemort is an open source implementation of the basic parts of Dynamo (Amazon’s Highly Available Key-value Store) distributed key-value storage system. LinkedIn is using it in their production environment for "certain high-scalability storage problems where simple functional partitioning is not sufficient." From their website: Data is automatically replicated over multiple servers. Data is automatically partitioned so each server contains only a subset of the total data Server failure is handled transparently Pluggable serialization is supported to allow rich keys and values including lists and tuples with named fields, as well as to integrate with common serialization frameworks like Protocol Buffers, Thrift, and Java Serialization Data items are versioned to maximize data integrity in failure scenarios without compromising availability of the system Each node is independent o
3 0.60049194 545 high scalability-2009-03-19-Product: Redis - Not Just Another Key-Value Store
Introduction: With the introduction of Redis your options in the key-value space just grew and your choice of which to pick just got a lot harder. But when you think about it, that's not a bad position to be in at all. Redis (REmote DIctionary Server) - a key-value database. It's similar to memcached but the dataset is not volatile, and values can be strings, exactly like in memcached, but also lists and sets with atomic operations to push/pop elements. The key points are: open source; speed (benchmarked performing 110,000 SET operations, and 81,000 GETs, per second); persistence, but in an asynchronous way taking everything in memory; support for higher level data structures and atomic operations. The home page is well organized so I'll spare the excessive-copying-to-make-this-post-longer. For a good overview of Redis take a look at Antonio Cangiano's article: Introducing Redis: a fast key-value database . If you are looking at a way to understand how Redis is different than something like
4 0.5927968 979 high scalability-2011-01-27-Comet - An Example of the New Key-Code Databases
Introduction: Comet is an active distributed key-value store built at the University of Washington. The paper describing Comet is Comet: An active distributed key-value store , there are also slides , and a MP3 of a presentation given at OSDI '10 . Here's a succinct overview of Comet : Today's cloud storage services, such as Amazon S3 or peer-to-peer DHTs, are highly inflexible and impose a variety of constraints on their clients: specific replication and consistency schemes, fixed data timeouts, limited logging, etc. We witnessed such inflexibility first-hand as part of our Vanish work, where we used a DHT to store encryption keys temporarily. To address this issue, we built Comet, an extensible storage service that allows clients to inject snippets of code that control their data's behavior inside the storage service. I found this paper quite interesting because it takes the initial steps of collocating code with a key-value store, which turns it into what might called a key-code
5 0.56724536 649 high scalability-2009-07-02-Product: Facebook's Cassandra - A Massive Distributed Store
Introduction: Update 2: Presentation from the NoSQL conference : slides , video . Update: Why you won't be building your killer app on a distributed hash table by Jonathan Ellis. Why I think Cassandra is the most promising of the open-source distributed databases --you get a relatively rich data model and a distribution model that supports efficient range queries. These are not things that can be grafted on top of a simpler DHT foundation, so Cassandra will be useful for a wider variety of applications. James Hamilton has published a thorough summary of Facebook's Cassandra, another scalable key-value store for your perusal. It's open source and is described as a "BigTable data model running on a Dynamo-like infrastructure." Cassandra is used in Facebook as an email search system containing 25TB and over 100m mailboxes. Google Code for Cassandra - A Structured Storage System on a P2P Network SIGMOD 2008 Presentation . Video Presentation at Facebook Facebook Engineering Blo
6 0.55654407 1210 high scalability-2012-03-16-Stuff The Internet Says On Scalability For March 16, 2012
7 0.55449456 1459 high scalability-2013-05-16-Paper: Warp: Multi-Key Transactions for Key-Value Stores
8 0.55354214 756 high scalability-2009-12-30-Terrastore - Scalable, elastic, consistent document store.
9 0.55266422 1194 high scalability-2012-02-16-A Super Short on the Youporn Stack - 300K QPS and 100 Million Page Views Per Day
10 0.55027622 710 high scalability-2009-09-20-PaxosLease: Diskless Paxos for Leases
11 0.54816365 967 high scalability-2011-01-03-Stuff The Internet Says On Scalability For January 3, 2010
12 0.54515934 433 high scalability-2008-10-29-CTL - Distributed Control Dispatching Framework
13 0.54376978 1262 high scalability-2012-06-11-Monday Fun: Seven Databases in Song
14 0.54256952 1220 high scalability-2012-04-02-YouPorn - Targeting 200 Million Views a Day and Beyond
15 0.53670156 1151 high scalability-2011-12-05-Stuff The Internet Says On Scalability For December 5, 2011
16 0.5355311 510 high scalability-2009-02-09-Paper: Consensus Protocols: Two-Phase Commit
17 0.53053737 787 high scalability-2010-03-03-Hot Scalability Links for March 3, 2010
18 0.53031486 670 high scalability-2009-08-05-Anti-RDBMS: A list of distributed key-value stores
19 0.53025502 1022 high scalability-2011-04-13-Paper: NoSQL Databases - NoSQL Introduction and Overview
20 0.51918072 1142 high scalability-2011-11-14-Using Gossip Protocols for Failure Detection, Monitoring, Messaging and Other Good Things
topicId topicWeight
[(1, 0.177), (2, 0.172), (61, 0.153), (67, 0.291), (85, 0.081)]
simIndex simValue blogId blogTitle
1 0.85334271 1422 high scalability-2013-03-12-If Your System was a Symphony it Might Sound Like This...
Introduction: I am in no way a music expert, but when I listen to Symphony No. 4 by Charles Ives , I imagine it's what a complex software/hardware system might sound like if we could hear its inner workings. Ives uses a lot of riotously competing rhythms in this work. It can sound discordant, yet the effect is deeply layered and eventually harmonious, just like the systems we use, create, and become part of. I was pointed to this piece by someone who said there were two conductors. I'd never heard of such a thing! So I was intrigued. The first version of the performance sounds and looks great, but it unfortunately does not use two conductors. The second version uses two conductors, but is unfortunately just a snippet. It's strikingly odd to see two conductors, but I imagine different parts of our systems using different conductors too, running at different rhythms, sometimes slow, sometimes fast, sometimes there are outbursts, sometimes in vicious conflict. Yet conceptually it all stills seem
same-blog 2 0.84253728 718 high scalability-2009-10-08-Riak - web-shaped data storage system
Introduction: Update: Short presentation NYC by Bryan Fink demonstrating the riak web-shaped data storage engine Riak is another new and interesting key-value store entrant. Some of the features it offers are: Document-oriented Scalable, decentralized key-value store Standard get , put , and delete operations. Distributed, fault-tolerant storage solution. Configurable levels of consistency, availability, and partition tolerance Support for Erlang, Ruby, PHP, Javascript, Java, Python, HTTP open source and NoSQL Pluggable backends Eventing system Monitoring Inter-cluster replication Links between records that can be traversed. Map/Reduce. Functions are executed on the data node. One interesting difference is that a list keys are required to specify which values are operated on as apposed to running calculations on all values. Related Articles Hacker News Thread . More juicy details on how Riak compares to Cassandra, mongodb, couchdb, etc.
3 0.8121165 898 high scalability-2010-09-09-6 Scalability Lessons
Introduction: Jesper Söderlund not only put together a few interesting scalability patterns , he also came up with a few interesting scalability lessons : Lesson #1 . Put Smarty compile and template caches on an active-active DRBD cluster with high load and your servers will DIE! Lesson #2 . Don't use out-of-the-box configurations. Lesson #3 . Single points of contention will eventually become a bottleneck. Lesson #4 . Plan in advance. Lesson #5 . Offload your databases as much as possible. Lesson #6 . File systems matter and can run out of space / inodes. For more details and explanations see the original post.
4 0.7408542 86 high scalability-2007-09-09-Clustering Solution
Introduction: Hi, I'm i nterested in peop l es thoughts on the best choice for a database clustering so l ution. I have a database that is most l y varchars and numbers that doesn't store any b i nary data at a l l. It's used at about 70% read and 30% wr i tes - though we're using memcached at the moment so it's not rea l ly hit that hard. We're current l y using mysql w i th m/cluster, but are interested in a new so l ution. Possib l e candidate so far are unic l uster (which doesn't seem mature yet.) or DRBD. Had anyone had a simi l ar experience and can make any suggest i ons? Thanks
5 0.67476964 950 high scalability-2010-11-30-NoCAP – Part III – GigaSpaces clustering explained..
Introduction: In many of the recent discussions on the design of large scale systems (a.k.a. Web Scale) it was argued that the right set of tradeoffs for building large scale systems would be to give away C onsistency for A vailability and P artition tolerance. Those arguments relied on the foundation of the CAP theorem developed in early 2000-2002. One of the core principals behind the CAP theorem is that you must choose two out of the three CAP properties. In many of the transactional systems giving away consistency is either impossible or yields a huge complexity in the design of those systems. In this series of posts, I've tried to suggest a different set of tradeoffs in which we could achieve scalability without compromising on consistency. I also argued that rather than choosing only two out of the three CAP properties we could choose various degrees of all three. The degrees would be determined by the most likely availability and partition tolerance scenarios in our specific application.
6 0.67427874 1031 high scalability-2011-04-28-PaaS on OpenStack - Run Applications on Any Cloud, Any Time Using Any Thing
7 0.67379397 1399 high scalability-2013-02-05-Ask HighScalability: Memcached and Relations
8 0.67356223 423 high scalability-2008-10-19-Alternatives to Google App Engine
9 0.6716612 298 high scalability-2008-04-07-Lazy web sites run faster
10 0.66833013 218 high scalability-2008-01-17-Moving old to new. Do not be afraid of the re-write -- but take some help
11 0.66635484 775 high scalability-2010-02-10-ElasticSearch - Open Source, Distributed, RESTful Search Engine
12 0.66601884 64 high scalability-2007-08-10-How do we make a large real-time search engine?
13 0.66543704 36 high scalability-2007-07-28-Product: Web Log Expert
14 0.66352761 337 high scalability-2008-05-31-memcached and Storage of Friend list
15 0.66350847 426 high scalability-2008-10-22-Server load balancing architectures, Part 1: Transport-level load balancing
16 0.66256088 776 high scalability-2010-02-12-Hot Scalability Links for February 12, 2010
17 0.66174829 833 high scalability-2010-06-01-Sponsored Post: Get Your High Scalability Fix at Digg
18 0.6615954 928 high scalability-2010-10-26-Scaling DISQUS to 75 Million Comments and 17,000 RPS
19 0.66156846 626 high scalability-2009-06-10-Paper: Graph Databases and the Future of Large-Scale Knowledge Management
20 0.66147578 787 high scalability-2010-03-03-Hot Scalability Links for March 3, 2010