high_scalability high_scalability-2008 high_scalability-2008-467 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: The SHOP.COM Cache System is now available at http://code.google.com/p/sccache/ The SHOP.COM Cache System is an object cache system that... * is an in-process cache and external, shared Cache * is horizontally scalable * stores cached objects to disk * supports associative keys * is non-transactional * can have any size key and any size data * does auto-GC based on TTL * is container and platform neutral It was built in-house at SHOP.COM (by me) and has powered our website for years. We are open-sourcing it in the hope that it will be useful to others and to get some help in its maintenance. This is our first open source attempt and we'd appreciate any help and comments.
sentIndex sentText sentNum sentScore
1 COM (by me) and has powered our website for years. [sent-9, score-0.299]
2 We are open-sourcing it in the hope that it will be useful to others and to get some help in its maintenance. [sent-10, score-0.578]
3 This is our first open source attempt and we'd appreciate any help and comments. [sent-11, score-0.789]
wordName wordTfidf (topN-words)
[('cache', 0.41), ('associative', 0.31), ('ttl', 0.268), ('sourcing', 0.255), ('container', 0.228), ('appreciate', 0.222), ('attempt', 0.213), ('size', 0.209), ('powered', 0.206), ('horizontally', 0.179), ('keys', 0.158), ('cached', 0.156), ('external', 0.153), ('hope', 0.152), ('help', 0.15), ('comments', 0.147), ('stores', 0.133), ('supports', 0.13), ('object', 0.129), ('objects', 0.128), ('others', 0.125), ('shared', 0.11), ('system', 0.11), ('useful', 0.108), ('platform', 0.097), ('website', 0.093), ('disk', 0.092), ('http', 0.081), ('built', 0.08), ('available', 0.077), ('source', 0.074), ('key', 0.074), ('open', 0.071), ('scalable', 0.065), ('based', 0.06), ('first', 0.059), ('get', 0.043), ('data', 0.027)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 467 high scalability-2008-12-16-[ANN] New Open Source Cache System
Introduction: The SHOP.COM Cache System is now available at http://code.google.com/p/sccache/ The SHOP.COM Cache System is an object cache system that... * is an in-process cache and external, shared Cache * is horizontally scalable * stores cached objects to disk * supports associative keys * is non-transactional * can have any size key and any size data * does auto-GC based on TTL * is container and platform neutral It was built in-house at SHOP.COM (by me) and has powered our website for years. We are open-sourcing it in the hope that it will be useful to others and to get some help in its maintenance. This is our first open source attempt and we'd appreciate any help and comments.
2 0.19943558 360 high scalability-2008-08-04-A Bunch of Great Strategies for Using Memcached and MySQL Better Together
Introduction: The primero recommendation for speeding up a website is almost always to add cache and more cache. And after that add a little more cache just in case. Memcached is almost always given as the recommended cache to use. What we don't often hear is how to effectively use a cache in our own products. MySQL hosted two excellent webinars (referenced below) on the subject of how to deploy and use memcached. The star of the show, other than MySQL of course, is Farhan Mashraqi of Fotolog. You may recall we did an earlier article on Fotolog in Secrets to Fotolog's Scaling Success , which was one of my personal favorites. Fotolog, as they themselves point out, is probably the largest site nobody has ever heard of, pulling in more page views than even Flickr. Fotolog has 51 instances of memcached on 21 servers with 175G in use and 254G available. As a large successful photo-blogging site they have very demanding performance and scaling requirements. To meet those requirements they've developed a
3 0.18089318 174 high scalability-2007-12-05-Product: Tugela Cache
Introduction: Tugela Cache is a cache system like memecached, but instead of storing data just in RAM, it stores data in the file system using a b-tree. You trade latency in order to have a very large cache. It's useful for sites that have caching requirements that exceed their available memory. It uses the same wire protocol as memcached so it can be dropped in without a hassle. From the website: As large MediaWiki deployments may gain performance using Memcached, at some level cost of RAM to store all objects becomes too high. In order to balance resource usage and make more use of our Apache server disks, Tugela, the distributed cached on-disk hash database, has arrived. Tugela Cache is derived from Memcached. Much of the code remains the same, but notably, these changes: Internal slab allocator replaced by BerkeleyDB B-Tree database. Expiry policy management moved to external program tugela-expire Much statistics code made obsolete. An interesting point brought up in the comme
4 0.15428048 436 high scalability-2008-11-02-Strategy: How to Manage Sessions Using Memcached
Introduction: Dormando shows an enlightened middle way for storing sessions in cache and the database. Sessions are a perfect cache candidate because they are transient, smallish, and since they are usually accessed on every page access removing all that load from the database is a good thing. But as Dormando points out session caches have problems. If you remove expiration times from the cache and you run out of memory then no more logins. If a cache server fails or needs to be upgrade then you just logged out a bunch of potentially angry users. The middle ground Dormando proposes is using both the cache and the database: Reads : read from the cache first, then the database. Typical cache logic. Writes : write to memcached every time, write to the database every N seconds (assuming the data has changed). There's a small chance of data loss, but you've still greatly reduced the database load while providing reliability. Nice solution.
5 0.15026277 577 high scalability-2009-04-22-Gear6 Web cache - the hardware solution for working with Memcache
Introduction: The Gear6 Web Cache hybrid DRAM-flash memory architecture allows for 5-10 times more memcache memory per unit of rack space than DRAM-only configurations, and cuts memory costs by 50%. Other software enhancements include a slab allocator that is more efficient than traditional memcache implementations due to its fine-grained bucket sizing. Gear6 Web Cache also supports object sizes greater than 1 megabyte and manages evictions based on the cost of replacing objects, depending on the size and frequency of object access. It intelligently places cache instances across DRAM and flash, taking into account their different characteristics, while at the same time monitoring their health and detecting and de�allocating faulty or failing memory. Gear6 Web Cache is a Memcached protocol compliant solution that scales and accelerates web applications, reduces memory footprint, enhances availability and implements comprehensive Memcached management features. Designed to work with all popular memcac
6 0.14574882 495 high scalability-2009-01-17-Intro to Caching,Caching algorithms and caching frameworks part 1
7 0.14327173 373 high scalability-2008-08-29-Product: ScaleOut StateServer is Memcached on Steroids
8 0.14318492 673 high scalability-2009-08-07-Strategy: Break Up the Memcache Dog Pile
9 0.1409896 248 high scalability-2008-02-13-What's your scalability plan?
10 0.13260953 1346 high scalability-2012-10-24-Saving Cash Using Less Cache - 90% Savings in the Caching Tier
11 0.13091213 703 high scalability-2009-09-12-How Google Taught Me to Cache and Cash-In
13 0.12621576 700 high scalability-2009-09-10-The technology behind Tornado, FriendFeed's web server
14 0.12496663 911 high scalability-2010-09-30-More Troubles with Caching
15 0.1208867 1633 high scalability-2014-04-16-Six Lessons Learned the Hard Way About Scaling a Million User System
16 0.12071896 589 high scalability-2009-05-05-Drop ACID and Think About Data
17 0.1183219 1205 high scalability-2012-03-07-Scale Indefinitely on S3 With These Secrets of the S3 Masters
18 0.11621769 662 high scalability-2009-07-27-Handle 700 Percent More Requests Using Squid and APC Cache
19 0.11590435 602 high scalability-2009-05-17-Scaling Django Web Apps by Mike Malone
20 0.11410557 164 high scalability-2007-11-22-Why not Cache from Intersystems?
topicId topicWeight
[(0, 0.116), (1, 0.065), (2, -0.04), (3, -0.089), (4, 0.015), (5, 0.077), (6, 0.008), (7, -0.023), (8, -0.074), (9, 0.023), (10, -0.001), (11, -0.053), (12, -0.007), (13, 0.124), (14, -0.104), (15, -0.063), (16, -0.068), (17, -0.051), (18, 0.043), (19, -0.046), (20, -0.122), (21, 0.067), (22, 0.116), (23, 0.099), (24, -0.093), (25, -0.006), (26, 0.083), (27, 0.082), (28, -0.06), (29, -0.034), (30, -0.06), (31, 0.007), (32, -0.014), (33, -0.049), (34, -0.058), (35, 0.039), (36, -0.01), (37, 0.02), (38, 0.064), (39, 0.04), (40, 0.041), (41, 0.02), (42, 0.004), (43, -0.032), (44, -0.029), (45, 0.065), (46, -0.079), (47, 0.027), (48, -0.015), (49, 0.038)]
simIndex simValue blogId blogTitle
same-blog 1 0.9900378 467 high scalability-2008-12-16-[ANN] New Open Source Cache System
Introduction: The SHOP.COM Cache System is now available at http://code.google.com/p/sccache/ The SHOP.COM Cache System is an object cache system that... * is an in-process cache and external, shared Cache * is horizontally scalable * stores cached objects to disk * supports associative keys * is non-transactional * can have any size key and any size data * does auto-GC based on TTL * is container and platform neutral It was built in-house at SHOP.COM (by me) and has powered our website for years. We are open-sourcing it in the hope that it will be useful to others and to get some help in its maintenance. This is our first open source attempt and we'd appreciate any help and comments.
2 0.91648602 495 high scalability-2009-01-17-Intro to Caching,Caching algorithms and caching frameworks part 1
Introduction: Informative and well organized post on caching . Talks about: Why do we need cache?, What is Cache?, Cache Hit, Cache Miss, Storage Cost, Retrieval Cost, Invalidation, Replacement Policy, Optimal Replacement Policy, Caching Algorithms, Least Frequently Used (LFU), Least Recently Used (LRU), Least Recently Used 2(LRU2), Two Queues, Adaptive Replacement Cache (ACR), Most Recently Used (MRU), First in First out (FIFO), Distributed caching, Measuring Cache.
3 0.84332514 436 high scalability-2008-11-02-Strategy: How to Manage Sessions Using Memcached
Introduction: Dormando shows an enlightened middle way for storing sessions in cache and the database. Sessions are a perfect cache candidate because they are transient, smallish, and since they are usually accessed on every page access removing all that load from the database is a good thing. But as Dormando points out session caches have problems. If you remove expiration times from the cache and you run out of memory then no more logins. If a cache server fails or needs to be upgrade then you just logged out a bunch of potentially angry users. The middle ground Dormando proposes is using both the cache and the database: Reads : read from the cache first, then the database. Typical cache logic. Writes : write to memcached every time, write to the database every N seconds (assuming the data has changed). There's a small chance of data loss, but you've still greatly reduced the database load while providing reliability. Nice solution.
4 0.8363955 174 high scalability-2007-12-05-Product: Tugela Cache
Introduction: Tugela Cache is a cache system like memecached, but instead of storing data just in RAM, it stores data in the file system using a b-tree. You trade latency in order to have a very large cache. It's useful for sites that have caching requirements that exceed their available memory. It uses the same wire protocol as memcached so it can be dropped in without a hassle. From the website: As large MediaWiki deployments may gain performance using Memcached, at some level cost of RAM to store all objects becomes too high. In order to balance resource usage and make more use of our Apache server disks, Tugela, the distributed cached on-disk hash database, has arrived. Tugela Cache is derived from Memcached. Much of the code remains the same, but notably, these changes: Internal slab allocator replaced by BerkeleyDB B-Tree database. Expiry policy management moved to external program tugela-expire Much statistics code made obsolete. An interesting point brought up in the comme
5 0.80670881 836 high scalability-2010-06-04-Strategy: Cache Larger Chunks - Cache Hit Rate is a Bad Indicator
Introduction: Isn't the secret to fast, scalable websites to cache everything ? Caching, if not the secret sauce of many a website, is it at least a popular condiment. But not so fast says Peter Zaitsev in Beyond great cache hit ratio . The point Peter makes is that we read about websites like Amazon and Facebook that can literally make hundreds of calls to satisfy a user request. Even if you have an awesome cache hit ratio, pages can still be slow because making and processing all those requests takes time. The solution is to remove requests all together . You do this by caching larger blocks so you have to make fewer requests. The post has a lot of good advice worth reading: 1) Make non cacheable blocks as small as possible, 2) Maximize amount of uses of the cache item, 3) Control invalidation, 4) Multi-Get.
6 0.78984159 673 high scalability-2009-08-07-Strategy: Break Up the Memcache Dog Pile
7 0.78194851 360 high scalability-2008-08-04-A Bunch of Great Strategies for Using Memcached and MySQL Better Together
8 0.76174194 708 high scalability-2009-09-17-Infinispan narrows the gap between open source and commercial data caches
9 0.75730771 359 high scalability-2008-07-29-Ehcache - A Java Distributed Cache
10 0.75406677 577 high scalability-2009-04-22-Gear6 Web cache - the hardware solution for working with Memcache
11 0.75061476 164 high scalability-2007-11-22-Why not Cache from Intersystems?
13 0.71635664 911 high scalability-2010-09-30-More Troubles with Caching
14 0.71357965 1346 high scalability-2012-10-24-Saving Cash Using Less Cache - 90% Savings in the Caching Tier
15 0.71099824 247 high scalability-2008-02-12-We want to cache a lot :) How do we go about it ?
16 0.68569887 696 high scalability-2009-09-07-Product: Infinispan - Open Source Data Grid
17 0.68296361 1633 high scalability-2014-04-16-Six Lessons Learned the Hard Way About Scaling a Million User System
18 0.68118358 703 high scalability-2009-09-12-How Google Taught Me to Cache and Cash-In
19 0.67344522 367 high scalability-2008-08-17-Strategy: Drop Memcached, Add More MySQL Servers
20 0.67138529 996 high scalability-2011-02-28-A Practical Guide to Varnish - Why Varnish Matters
topicId topicWeight
[(1, 0.203), (2, 0.195), (17, 0.258), (79, 0.128), (94, 0.066)]
simIndex simValue blogId blogTitle
1 0.9469595 506 high scalability-2009-02-03-10 More Rules for Even Faster Websites
Introduction: Update: How-To Minimize Load Time for Fast User Experiences . Shows how to analyze the bottlenecks preventing websites and blogs from loading quickly and how to resolve them. 80-90% of the end-user response time is spent on the frontend, so it makes sense to concentrate efforts there before heroically rewriting the backend. Take a shower before buying a Porsche, if you know what I mean. Steve Souders, author of High Performance Websites and Yslow , has ten more best practices to speed up your website : Split the initial payload Load scripts without blocking Don’t scatter scripts Split dominant content domains Make static content cookie-free Reduce cookie weight Minify CSS Optimize images Use iframes sparingly To www or not to www Sadly, according to String Theory, there are only 26.7 rules left, so get them while they're still in our dimension. Here are slides on the first few rules. Love the speeding dog slide. That's exactly what my dog looks like trav
2 0.94132948 631 high scalability-2009-06-15-Large-scale Graph Computing at Google
Introduction: To continue the graph theme Google has got into the act and released information on Pregel . Pregel does not appear to be a new type of potato chip. Pregel is instead a scalable infrastructure... ...to mine a wide range of graphs. In Pregel, programs are expressed as a sequence of iterations. In each iteration, a vertex can, independently of other vertices, receive messages sent to it in the previous iteration, send messages to other vertices, modify its own and its outgoing edges' states, and mutate the graph's topology. Currently, Pregel scales to billions of vertices and edges, but this limit will keep expanding. Pregel's applicability is harder to quantify, but so far we haven't come across a type of graph or a practical graph computing problem which is not solvable with Pregel. It computes over large graphs much faster than alternatives, and the application programming interface is easy to use. Implementing PageRank, for example, takes only about 15 lines of code. Developers
3 0.92955738 543 high scalability-2009-03-17-Sun to Announce Open Cloud APIs at CommunityOne
Introduction: One of the key items Sun will be talking about in today's cloud computing announcement (at 9AM EST/6AM PST) will be Sun's opening of the APIs that we'll use for the Sun Cloud. We're making these available so that those who are interested will be able to review and comment on these APIs. Continuing our commitment to openness, we're making these APIs available via the Creative Commons Version 3.0 license. ...
4 0.90704274 1225 high scalability-2012-04-09-Why My Slime Mold is Better than Your Hadoop Cluster
Introduction: Update : Organism without a brain creates external memories for navigation shows slime mold is even cooler than originally thought, storing a record of where it's been using slime: The authors conclude, the slime isn't just the mold's calling card. Instead, it's a way of marking the environment so that the organism can sense where it's been, and not expend effort on searches that won't pay off. Although the situation isn't an exact parallel, the authors make a comparison to the pheromone trails used by ants. In After Life: The Strange Science Of Decay there’s a truly incredible sequence of gorgeously shot video showing how creeping slime mold solves mazes and performs other other amazing feats of computation. Take a look at what simple one celled organisms can do: The whole video is really well done and shockingly revelatory. It’s the story of decay, how atoms created during the Big Bang and through countless supernova explosions are continually rearranged an
5 0.88564348 1467 high scalability-2013-05-30-Google Finds NUMA Up to 20% Slower for Gmail and Websearch
Introduction: When you have a large population of servers you have both the opportunity and the incentive to perform interesting studies. Authors from Google and the University of California in Optimizing Google’s Warehouse Scale Computers: The NUMA Experience conducted such a study, taking a look at how jobs run on clusters of machines using a NUMA architecture. Since NUMA is common on server class machines it's a topic of general interest for those looking to maximize machine utilization across clusters. Some of the results are surprising: The methodology of how to attribute such fine performance variations to NUMA effects within such a complex system is perhaps more interesting than the results themselves. Well worth reading just for that story. The performance swing due to NUMA is up to 15% on AMD Barcelona for Gmail backend and 20% on Intel Westmere for Web-search frontend. Memory locality is not always King. Because of the interaction between NUMA and cache sharing/contention it
7 0.86189026 465 high scalability-2008-12-14-Scaling MySQL on a 256-way T5440 server using Solaris ZFS and Java 1.7
8 0.84229159 956 high scalability-2010-12-08-How To Get Experience Working With Large Datasets
same-blog 9 0.81828749 467 high scalability-2008-12-16-[ANN] New Open Source Cache System
10 0.81513721 869 high scalability-2010-07-30-Hot Scalability Links for July 30, 2010
11 0.79063523 765 high scalability-2010-01-25-Let's Welcome our Neo-Feudal Overlords
12 0.77979821 1333 high scalability-2012-10-04-LinkedIn Moved from Rails to Node: 27 Servers Cut and Up to 20x Faster
13 0.7792275 507 high scalability-2009-02-03-Paper: Optimistic Replication
14 0.77047527 427 high scalability-2008-10-22-Server load balancing architectures, Part 2: Application-level load balancing
15 0.76919353 1392 high scalability-2013-01-23-Building Redundant Datacenter Networks is Not For Sissies - Use an Outside WAN Backbone
16 0.76656443 444 high scalability-2008-11-14-Private-Public Cloud
17 0.75776041 623 high scalability-2009-06-10-Dealing with multi-partition transactions in a distributed KV solution
18 0.75470531 15 high scalability-2007-07-16-Blog: MySQL Performance Blog - Everything about MySQL Performance.
19 0.75279325 1472 high scalability-2013-06-07-Stuff The Internet Says On Scalability For June 7, 2013
20 0.75252086 79 high scalability-2007-09-01-On-Demand Infinitely Scalable Database Seed the Amazon EC2 Cloud