high_scalability high_scalability-2010 high_scalability-2010-921 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: In this post i wanted to spend sometime on the CAP theorem and clarify some of the confusion that i often see when people associate CAP with scalability without fully understanding the implications that comes with it and the alternative approaches You can read the full article here
sentIndex sentText sentNum sentScore
wordName wordTfidf (topN-words)
[('cap', 0.397), ('articlehere', 0.376), ('clarify', 0.359), ('sometime', 0.325), ('confusion', 0.304), ('associate', 0.298), ('theorem', 0.25), ('implications', 0.212), ('alternative', 0.171), ('wanted', 0.167), ('understanding', 0.154), ('spend', 0.142), ('fully', 0.125), ('comes', 0.12), ('often', 0.099), ('full', 0.093), ('post', 0.08), ('read', 0.074), ('people', 0.067), ('without', 0.066), ('see', 0.064), ('scalability', 0.062)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 921 high scalability-2010-10-18-NoCAP
Introduction: In this post i wanted to spend sometime on the CAP theorem and clarify some of the confusion that i often see when people associate CAP with scalability without fully understanding the implications that comes with it and the alternative approaches You can read the full article here
2 0.20175262 950 high scalability-2010-11-30-NoCAP – Part III – GigaSpaces clustering explained..
Introduction: In many of the recent discussions on the design of large scale systems (a.k.a. Web Scale) it was argued that the right set of tradeoffs for building large scale systems would be to give away C onsistency for A vailability and P artition tolerance. Those arguments relied on the foundation of the CAP theorem developed in early 2000-2002. One of the core principals behind the CAP theorem is that you must choose two out of the three CAP properties. In many of the transactional systems giving away consistency is either impossible or yields a huge complexity in the design of those systems. In this series of posts, I've tried to suggest a different set of tradeoffs in which we could achieve scalability without compromising on consistency. I also argued that rather than choosing only two out of the three CAP properties we could choose various degrees of all three. The degrees would be determined by the most likely availability and partition tolerance scenarios in our specific application.
3 0.17245764 762 high scalability-2010-01-18-The Missing Piece in the Virtualization Stack (Part 1)
Introduction: This and the next post will discuss how virtualization and cloud computing, as we know it today, is only a small part of the solution for today’s IT inefficiencies. While new technologies and delivery models have made it much simpler to manage the infrastructure, this is not where our core inefficiencies lie. Virtualization principles must be extended to higher levels of the application stack, to make it easier for all of us to manage, tune and integrate applications. Otherwise we will continue to spend most of our time on things that don’t provide real value to the business. Read the full article here
4 0.1441413 926 high scalability-2010-10-24-Hot Scalability Links For Oct 24, 2010
Introduction: On a cold and rainy Fall day, a day stolen from winter rather than our usual gorgeous Indian Summers , a day not even the SF Giants winning the pennant can help warm, here are some hot links to read by a digital flame: Using MySQL as a NoSQL - A story for exceeding 750,000 qps on a commodity server by Yoshinori Matsunobu. Wonderfully detailed post on how you can lookup a row by ID really fast if you bypass all the typical MySQL query parsing overhead. Minecraftwiki.net and minecraftforum.net now serve more traffic than Slashdot and Stackoverflow! 1 million pageviews and 100k uniques per day, per site; 10TB of bandwidth a month; 4+ machines running Varnish, HAProxy, PHP, MySQL, Nginx. Stuff the Internet Says: @ old_sound : Somebody make me a t-shirt that says "I've read the CAP theorem and I liked it" @dscape : How relevant do I think the CAP theorem is? Not at all. I honestly hate conversations where anyone talks about crap.. cap, sorry. @humidbei
5 0.11286769 658 high scalability-2009-07-17-Against all the odds
Introduction: This article not about Mariah Carey, or its song. It's about Storing System, Database. First let's describe what means by odds: In my social network, I found 93% of the mainstream developers sanctify the database, or at least consider it in any data persistence challenge as the ultimate, superhero, and undefeatable solution. I think this problem come from the education, personally, and some companies also I think it's involved in this. To start to fix this bad thinking, we all should agree in the following points: Every challenge have its own solutions, so whatever you want to save/persistent, there are always many solutions. For example the Web search engines, such as: Google, Kngine, Yahoo, Bing don't use database at all instead we use Indexes (Index file) for better performance. The Database in general whatever the vendor it's slow compared with other solutions such as: Key-Value storing system, Index file, DHT. The Database currently employ Relation Data model
6 0.091820493 787 high scalability-2010-03-03-Hot Scalability Links for March 3, 2010
8 0.084670246 1546 high scalability-2013-11-11-Ask HS: What is a good OLAP database choice with node.js?
9 0.067145683 809 high scalability-2010-04-13-Strategy: Saving Your Butt With Deferred Deletes
10 0.06213345 1611 high scalability-2014-03-12-Paper: Scalable Eventually Consistent Counters over Unreliable Networks
11 0.057618853 96 high scalability-2007-09-18-Amazon Architecture
12 0.056514293 1403 high scalability-2013-02-08-Stuff The Internet Says On Scalability For February 8, 2013
13 0.055798348 62 high scalability-2007-08-08-Partial String Matching
14 0.054372996 734 high scalability-2009-10-30-Hot Scalabilty Links for October 30 2009
15 0.051348656 925 high scalability-2010-10-22-Paper: Netflix’s Transition to High-Availability Storage Systems
16 0.04934673 1303 high scalability-2012-08-13-Ask HighScalability: Facing scaling issues with news feeds on Redis. Any advice?
17 0.04910332 954 high scalability-2010-12-06-What the heck are you actually using NoSQL for?
18 0.046569213 806 high scalability-2010-04-08-Hot Scalability Links for April 8, 2010
19 0.046370659 1025 high scalability-2011-04-16-The NewSQL Market Breakdown
20 0.045839094 1374 high scalability-2012-12-18-Georeplication: When Bad Things Happen to Good Systems
topicId topicWeight
[(0, 0.048), (1, 0.01), (2, 0.006), (3, 0.023), (4, 0.005), (5, 0.024), (6, -0.04), (7, -0.004), (8, 0.009), (9, -0.021), (10, -0.012), (11, 0.02), (12, -0.038), (13, 0.001), (14, -0.005), (15, -0.01), (16, 0.047), (17, -0.015), (18, -0.016), (19, -0.009), (20, 0.023), (21, 0.002), (22, -0.009), (23, 0.012), (24, -0.024), (25, -0.016), (26, -0.002), (27, 0.003), (28, -0.004), (29, 0.012), (30, 0.009), (31, -0.01), (32, 0.008), (33, 0.063), (34, -0.033), (35, 0.005), (36, -0.018), (37, -0.005), (38, 0.035), (39, 0.004), (40, 0.027), (41, -0.008), (42, 0.017), (43, -0.048), (44, 0.005), (45, 0.012), (46, 0.037), (47, -0.027), (48, -0.007), (49, -0.021)]
simIndex simValue blogId blogTitle
same-blog 1 0.9559924 921 high scalability-2010-10-18-NoCAP
Introduction: In this post i wanted to spend sometime on the CAP theorem and clarify some of the confusion that i often see when people associate CAP with scalability without fully understanding the implications that comes with it and the alternative approaches You can read the full article here
2 0.7372905 950 high scalability-2010-11-30-NoCAP – Part III – GigaSpaces clustering explained..
Introduction: In many of the recent discussions on the design of large scale systems (a.k.a. Web Scale) it was argued that the right set of tradeoffs for building large scale systems would be to give away C onsistency for A vailability and P artition tolerance. Those arguments relied on the foundation of the CAP theorem developed in early 2000-2002. One of the core principals behind the CAP theorem is that you must choose two out of the three CAP properties. In many of the transactional systems giving away consistency is either impossible or yields a huge complexity in the design of those systems. In this series of posts, I've tried to suggest a different set of tradeoffs in which we could achieve scalability without compromising on consistency. I also argued that rather than choosing only two out of the three CAP properties we could choose various degrees of all three. The degrees would be determined by the most likely availability and partition tolerance scenarios in our specific application.
3 0.70395195 747 high scalability-2009-11-26-What I'm Thankful For on Thanksgiving
Introduction: I try to keep this blog targeted and on topic. So even though I may be thankful for the song of the tinniest sparrow at sunrise , I'll save you from all that. It's hard to tie scalability and the giving of thanks together, especially as it sometimes occurs to me that this blog may be a self-indulgent waste of time. But I think I found a sentiment in A New THEORY of AWESOMENESS and MIRACLES by James Bridle that manages to marry the topic of this blog and giving thanks meaningfully together: I distrust commercial definitions of innovation, and particularly of awesomeness. It’s an overused term. When I think of awesomeness, I want something awe-inspiring, vast and mind-expanding. So I started thinking about things that I think are awesome, or miraculous, and for me, it kept coming back to scale and complexity. We’re not actually very good about thinking about scale and complexity in real terms, so we have to use metaphors and examples. Douglas Adams writes s
Introduction: We are not yet at the End of History for database theory as Peter Bailis and the Database Group at UC Berkeley continue to prove with a great companion blog post to their new paper: Scalable Atomic Visibility with RAMP Transactions . I like the approach of pairing a blog post with a paper. A paper almost by definition is formal, but a blog post can help put a paper in context and give it some heart. From the abstract: Databases can provide scalability by partitioning data across several servers. However, multi-partition, multi-operation transactional access is often expensive, employing coordination-intensive locking, validation, or scheduling mechanisms. Accordingly, many real-world systems avoid mechanisms that provide useful semantics for multi-partition operations . This leads to incorrect behavior for a large class of applications including secondary indexing, foreign key enforcement, and materialized view maintenance . In this work, we identify a new isolation mode
5 0.56492126 781 high scalability-2010-02-23-Sponsored Post: Job Openings - Squarespace
Introduction: There was a bit of drama earlier when I posted a free job opening for Zynga. It caused unfortunate and just plain wrong accusations. It also caused a number of requests for more free job posts, which I should have anticipated, but obviously I can't let this blog become cluttered with that kind of stuff. Earlier I tried a job board type service, but that never really worked out. So what to do? Someone suggested a sponsored post approach and I think that's a good compromise. It minimizes the noise, let's people know about work, and brings in a little revenue. It works like an advertisement. If you are interested please let me know. When we have any job openings there will be a sponsored post like this one, that you can easily ignore or pay attention to, depending on your situation. Squarespace Looking for Full-time Scaling Expert Interested in helping a cutting-edge, high-growth startup scale? Squarespace, which was profiled here last year in Squarespace Architecture - A Grid Handle
6 0.55898929 402 high scalability-2008-10-05-Paper: Scalability Design Patterns
7 0.55060673 926 high scalability-2010-10-24-Hot Scalability Links For Oct 24, 2010
8 0.53669858 648 high scalability-2009-07-02-It Must be Crap on Relational Dabases Week
10 0.53138745 877 high scalability-2010-08-12-Designing Web Applications for Scalability
11 0.52688086 625 high scalability-2009-06-10-Managing cross partition transactions in a distributed KV system
12 0.52620244 220 high scalability-2008-01-22-The high scalability community
13 0.52547592 749 high scalability-2009-12-15-The Common Principles Behind the NOSQL Alternatives
14 0.51523644 361 high scalability-2008-08-08-Separation into read-write only databases
17 0.50486439 864 high scalability-2010-07-24-4 New Podcasts for Scalable Summertime Reading
18 0.50147438 94 high scalability-2007-09-17-Blog: Adding Simplicity by Dan Pritchett
19 0.49424508 88 high scalability-2007-09-10-Blog: Scalable Web Architectures by Royans Tharakan
20 0.4930093 791 high scalability-2010-03-09-Sponsored Post: Job Openings - Squarespace
topicId topicWeight
[(2, 0.09), (61, 0.185), (91, 0.531)]
simIndex simValue blogId blogTitle
same-blog 1 0.81298566 921 high scalability-2010-10-18-NoCAP
Introduction: In this post i wanted to spend sometime on the CAP theorem and clarify some of the confusion that i often see when people associate CAP with scalability without fully understanding the implications that comes with it and the alternative approaches You can read the full article here
2 0.64914036 642 high scalability-2009-06-29-HighScalability Rated #3 Blog for Developers
Introduction: Hey we're moving up in the world, jumping from 19th place to 3rd place. In case you aren't sure what I'm talking about, Jurgen Appelo goes through this massive effort of ranking blogs according to Google PageRank, Technorati Authority, Alexa Rank, Google links, and Twitter Grader Rank. Through some obviously mistaken calculations HighScalability comes out #3. Given all the superb competition I'm not exactly sure how that can be. Well, thanks for all the excellent people who contribute and all the even more excellent people that read. Now at least I have something worthy to put on my tombstone :-)
3 0.51416266 712 high scalability-2009-10-01-Moving Beyond End-to-End Path Information to Optimize CDN Performance
Introduction: You go through the expense of installing CDNs all over the globe to make sure users always have a node close by and you notice something curious and furious: clients still experience poor latencies. What's up with that? What do you do to find the problem? If you are Google you build a tool (WhyHigh) to figure out what's up. This paper is about the tool and the unexpected problem of high latencies on CDNs. The main problems they found: inefficient routing to nearby nodes and packet queuing. But more useful is the architecture of WhyHigh and how it goes about identifying bottle necks. And even more useful is the general belief in creating sophisticated tools to understand and improve your service. That's what professionals do. From the abstract: Replicating content across a geographically distributed set of servers and redirecting clients to the closest server in terms of latency has emerged as a common paradigm for improving client performance. In this paper, we analyze latenc
4 0.46804291 722 high scalability-2009-10-15-Hot Scalability Links for Oct 15 2009
Introduction: Update: Social networks in the database: using a graph database . Anders Nawroth puts graphs through their paces by representing, traversing, and performing other common social network operations using a graph database. Update: Deployment with Capistrano by Charles Max Wood. Simple step-by-step for using Capistrano for deployment. Log-structured file systems: There's one in every SSD by Valerie Aurora. SSDs have totally changed the performance characteristics of storage! Disks are dead! Long live flash! An Engineer's Guide to Bandwidth by DGentry. I t's a rough world out there, and we need to to a better job of thinking about and testing under realistic network conditions. Analyzing air traffic performance with InfoBright and MonetDB by Vadim of the MySQL Performance Blog. Scalable Delivery of Stream Query Result by Zhou, Y ; Salehi, A ; Aberer, K. In this paper, we leverage Distributed Publish/Subscribe System (DPSS), a scalable data dissemination infrastruct
5 0.45044607 826 high scalability-2010-05-12-The Rise of the Virtual Cellular Machines
Introduction: My apologies if you were looking for a post about cell phones. This post is about high density nanodevices. It's a follow up to How will memristors change everything? for those wishing to pursue these revolutionary ideas in more depth. This is one of those areas where if you are in the space then there's a lot of available information and if you are on the outside then it doesn't even seem to exist. Fortunately, Ben Chandler from The SyNAPSE Project , was kind enough to point me to a great set of presentations given at the 12th IEEE CNNA - International Workshop on Cellular Nanoscale Networks and their Applications - Towards Megaprocessor Computing. WARNING: these papers contain extreme technical content. If you are like me and you aren't an electrical engineer, much of it may make a sort of surface sense, but the deep and twisty details will fly over head. For the more software minded there are a couple more accessible presentations: Intelligent Machines built with Memristiv
6 0.42537391 1338 high scalability-2012-10-11-RAMCube: Exploiting Network Proximity for RAM-Based Key-Value Store
7 0.42497104 453 high scalability-2008-12-01-Breakthrough Web-Tier Solutions with Record-Breaking Performance
8 0.37352678 197 high scalability-2007-12-31-Product: collectd
9 0.35790682 1285 high scalability-2012-07-18-Disks Ain't Dead Yet: GraphChi - a disk-based large-scale graph computation
10 0.35767925 208 high scalability-2008-01-11-FTP Sanity: Redundancy, archiving, consolidation.
11 0.3568759 1303 high scalability-2012-08-13-Ask HighScalability: Facing scaling issues with news feeds on Redis. Any advice?
12 0.35481617 549 high scalability-2009-03-26-Performance - When do I start worrying?
13 0.35185167 1201 high scalability-2012-02-29-Strategy: Put Mobile Video Into Cold Storage After 30 Days
14 0.34968054 930 high scalability-2010-10-28-NoSQL Took Away the Relational Model and Gave Nothing Back
15 0.34886762 173 high scalability-2007-12-05-Easier Production Releases
16 0.34807122 493 high scalability-2009-01-16-Just-In-Time Scalability: Agile Methods to Support Massive Growth (IMVU case study)
17 0.34688511 746 high scalability-2009-11-26-Kngine Snippet Search New Indexing Technology
18 0.34558019 793 high scalability-2010-03-10-Saying Yes to NoSQL; Going Steady with Cassandra at Digg
19 0.34432471 675 high scalability-2009-08-08-1dbase vs. many and cloud hosting vs. dedicated server(s)?
20 0.34432402 322 high scalability-2008-05-19-Conference: Infoscale 2008 in Italy (June 4-6)