high_scalability high_scalability-2007 high_scalability-2007-52 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Memcached is a high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load. Danga Interactive developed memcached to enhance the speed of LiveJournal.com, a site which was already doing 20 million+ dynamic page views per day for 1 million users with a bunch of webservers and a bunch of database servers. memcached dropped the database load to almost nothing, yielding faster page load times for users, better resource utilization, and faster access to the databases on a memcache miss.
sentIndex sentText sentNum sentScore
1 Memcached is a high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load. [sent-1, score-1.682]
2 Danga Interactive developed memcached to enhance the speed of LiveJournal. [sent-2, score-0.627]
3 com, a site which was already doing 20 million+ dynamic page views per day for 1 million users with a bunch of webservers and a bunch of database servers. [sent-3, score-2.028]
4 memcached dropped the database load to almost nothing, yielding faster page load times for users, better resource utilization, and faster access to the databases on a memcache miss. [sent-4, score-2.092]
wordName wordTfidf (topN-words)
[('alleviating', 0.332), ('bunch', 0.315), ('yielding', 0.27), ('webservers', 0.236), ('speeding', 0.226), ('enhance', 0.223), ('miss', 0.196), ('dynamic', 0.192), ('memcached', 0.19), ('intended', 0.188), ('generic', 0.185), ('page', 0.175), ('dropped', 0.169), ('utilization', 0.163), ('faster', 0.16), ('interactive', 0.153), ('memcache', 0.144), ('million', 0.143), ('views', 0.132), ('nature', 0.131), ('developed', 0.125), ('database', 0.122), ('object', 0.12), ('users', 0.113), ('nothing', 0.108), ('almost', 0.108), ('load', 0.108), ('resource', 0.096), ('speed', 0.089), ('already', 0.084), ('databases', 0.078), ('caching', 0.077), ('times', 0.075), ('day', 0.075), ('access', 0.071), ('site', 0.069), ('memory', 0.064), ('better', 0.058), ('per', 0.057), ('applications', 0.055), ('distributed', 0.051), ('web', 0.041), ('system', 0.034), ('use', 0.029)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 52 high scalability-2007-08-01-Product: Memcached
Introduction: Memcached is a high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load. Danga Interactive developed memcached to enhance the speed of LiveJournal.com, a site which was already doing 20 million+ dynamic page views per day for 1 million users with a bunch of webservers and a bunch of database servers. memcached dropped the database load to almost nothing, yielding faster page load times for users, better resource utilization, and faster access to the databases on a memcache miss.
2 0.20032194 360 high scalability-2008-08-04-A Bunch of Great Strategies for Using Memcached and MySQL Better Together
Introduction: The primero recommendation for speeding up a website is almost always to add cache and more cache. And after that add a little more cache just in case. Memcached is almost always given as the recommended cache to use. What we don't often hear is how to effectively use a cache in our own products. MySQL hosted two excellent webinars (referenced below) on the subject of how to deploy and use memcached. The star of the show, other than MySQL of course, is Farhan Mashraqi of Fotolog. You may recall we did an earlier article on Fotolog in Secrets to Fotolog's Scaling Success , which was one of my personal favorites. Fotolog, as they themselves point out, is probably the largest site nobody has ever heard of, pulling in more page views than even Flickr. Fotolog has 51 instances of memcached on 21 servers with 175G in use and 254G available. As a large successful photo-blogging site they have very demanding performance and scaling requirements. To meet those requirements they've developed a
3 0.15576445 927 high scalability-2010-10-26-Marrying memcached and NoSQL
Introduction: Memcached is one of the most common In-Memory cache implementation. It was originally developed by Danga Interactive for LiveJournal , but is now used by many other sites as a side cache to speed up read mostly operations. It gained popularity in the non-Java world, too, especially since it’s a language-neutral side cache for which few alternatives existed. As a side-cache, Memcache clients relies on the database as the system of record, The database is still used for write,update and complex query operations. Since the memcached specification includes no query operations, memcached is not a database alternative, unlike most of the NoSQL offerings. It also exclude memcache from being a real solution for write scalability. As a result of that many of the heavy sites started to move away from Memcache and replace it with other NoSQL alternatives as noted in a recent highscalability post MySQL And Memcached: End Of An Era? The transition away from memcached to NoSQL
4 0.13147098 118 high scalability-2007-10-09-High Load on production Webservers after Sourcecode sync
Introduction: Hi everybody :) We have a bunch of webservers (about 14 at th i s time) running Apache. As app l ication framework we're using PHP with the APC Cache insta l led to improve performance. For load ba l ancing we're using a Big F5 system with dynamic ratio (SNMP driven) To sync new/updated sourcecode we're using subversion to "automaticly" update these servers w i th our latest software relases. After updating the new source to these production servers the load of the mashines is ra i sing to hell. While updating the servers, they are sti l l in "production", serving webpages to the users. Otherwise the process of updating wou l d take ages. Most of the time we're only updating in the morning hours while less users are online, because of the above issue. My guess is, that the load is raising that h i gh, because APC needs to recompile a bunch of new f i les each time. Before and while compi l ing the performance simply is "bad". My goal would be t
5 0.12762107 638 high scalability-2009-06-26-PlentyOfFish Architecture
Introduction: Update 5 : PlentyOfFish Update - 6 Billion Pageviews And 32 Billion Images A Month Update 4 : Jeff Atwood costs out Markus' scale up approach against a scale out approach and finds scale up wanting. The discussion in the comments is as interesting as the article. My guess is Markus doesn't want to rewrite his software to work across a scale out cluster so even if it's more expensive scale up works better for his needs. Update 3 : POF now has 200 million images and serves 10,000 images served per second. They'll be moving to a 250,000 IOPS RamSan to handle the load. Also upgraded to a core database machine with 512 GB of RAM, 32 CPU’s, SQLServer 2008 and Windows 2008. Update 2 : This seems to be a POF Peer1 love fest infomercial . It's pretty content free, but the production values are high. Lots of quirky sounds and fish swimming on the screen. Update : by Facebook standards Read/WriteWeb says POF is worth a cool one billion dollars . It helps to talk like Dr. Evil whe
6 0.12666464 313 high scalability-2008-05-02-Friends for Sale Architecture - A 300 Million Page View-Month Facebook RoR App
7 0.12621365 1361 high scalability-2012-11-22-Gone Fishin': PlentyOfFish Architecture
8 0.11839807 106 high scalability-2007-10-02-Secrets to Fotolog's Scaling Success
9 0.11608372 554 high scalability-2009-04-04-Digg Architecture
10 0.11146598 1123 high scalability-2011-09-23-The Real News is Not that Facebook Serves Up 1 Trillion Pages a Month…
11 0.10279091 577 high scalability-2009-04-22-Gear6 Web cache - the hardware solution for working with Memcache
12 0.1025286 729 high scalability-2009-10-28-And the winner is: MySQL or Memcached or Tokyo Tyrant?
13 0.10137466 511 high scalability-2009-02-12-MySpace Architecture
14 0.10011063 294 high scalability-2008-04-01-How to update video views count effectively?
15 0.099974841 884 high scalability-2010-08-23-6 Ways to Kill Your Servers - Learning How to Scale the Hard Way
16 0.098576434 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
17 0.094970293 603 high scalability-2009-05-19-Scaling Memcached: 500,000+ Operations-Second with a Single-Socket UltraSPARC T2
18 0.09213008 367 high scalability-2008-08-17-Strategy: Drop Memcached, Add More MySQL Servers
19 0.089686662 436 high scalability-2008-11-02-Strategy: How to Manage Sessions Using Memcached
20 0.089535318 373 high scalability-2008-08-29-Product: ScaleOut StateServer is Memcached on Steroids
topicId topicWeight
[(0, 0.132), (1, 0.061), (2, -0.047), (3, -0.15), (4, 0.018), (5, 0.016), (6, -0.036), (7, -0.015), (8, -0.03), (9, 0.032), (10, 0.009), (11, -0.055), (12, -0.005), (13, 0.085), (14, -0.074), (15, -0.025), (16, -0.024), (17, -0.023), (18, 0.037), (19, 0.016), (20, -0.028), (21, 0.054), (22, -0.002), (23, -0.026), (24, 0.021), (25, -0.055), (26, -0.078), (27, 0.068), (28, -0.008), (29, -0.005), (30, -0.005), (31, 0.026), (32, -0.039), (33, -0.041), (34, -0.021), (35, 0.007), (36, -0.014), (37, 0.028), (38, 0.013), (39, 0.034), (40, -0.034), (41, 0.01), (42, -0.07), (43, 0.039), (44, 0.005), (45, -0.011), (46, 0.05), (47, 0.02), (48, 0.01), (49, 0.007)]
simIndex simValue blogId blogTitle
same-blog 1 0.97596645 52 high scalability-2007-08-01-Product: Memcached
Introduction: Memcached is a high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load. Danga Interactive developed memcached to enhance the speed of LiveJournal.com, a site which was already doing 20 million+ dynamic page views per day for 1 million users with a bunch of webservers and a bunch of database servers. memcached dropped the database load to almost nothing, yielding faster page load times for users, better resource utilization, and faster access to the databases on a memcache miss.
2 0.85336089 908 high scalability-2010-09-28-6 Strategies for Scaling BBC iPlayer
Introduction: The BBC's iPlayer site averages 8 million page views a day for 1.3 million users. Technical Architect Simon Frost describes how they scaled their site in Scaling the BBC iPlayer to handle demand : Use frameworks . Frameworks support component based development which makes it convenient for team development, but can introduce delays that have to be minimized. Zend/PHP is used because it supports components and is easy to recruit for. MySQL is used for program metadata. CouchDB is used for key-value access for fast read/write of user-focused data. Prove architecture before building it . Eliminate guesswork by coming up with alternate architectures and create prototypes to determine which option works best. Balance performance with factors like ease of development. Cache a lot . Data is cached in memcached for a few seconds to minutes. Short cache invalidation periods keep the data up to date for the users, but even these short periods make a huge difference in performance.
3 0.77577758 638 high scalability-2009-06-26-PlentyOfFish Architecture
Introduction: Update 5 : PlentyOfFish Update - 6 Billion Pageviews And 32 Billion Images A Month Update 4 : Jeff Atwood costs out Markus' scale up approach against a scale out approach and finds scale up wanting. The discussion in the comments is as interesting as the article. My guess is Markus doesn't want to rewrite his software to work across a scale out cluster so even if it's more expensive scale up works better for his needs. Update 3 : POF now has 200 million images and serves 10,000 images served per second. They'll be moving to a 250,000 IOPS RamSan to handle the load. Also upgraded to a core database machine with 512 GB of RAM, 32 CPU’s, SQLServer 2008 and Windows 2008. Update 2 : This seems to be a POF Peer1 love fest infomercial . It's pretty content free, but the production values are high. Lots of quirky sounds and fish swimming on the screen. Update : by Facebook standards Read/WriteWeb says POF is worth a cool one billion dollars . It helps to talk like Dr. Evil whe
4 0.77300662 1361 high scalability-2012-11-22-Gone Fishin': PlentyOfFish Architecture
Introduction: Other than StackOverflow , PlentyOfFish is perhaps the most spectacular example of scale-up architectures working for what your average sane person would consider a large system. It doesn't hurt that it's also a sexy story. Update 5 : PlentyOfFish Update - 6 Billion Pageviews And 32 Billion Images A Month Update 4 : Jeff Atwood costs out Markus' scale up approach against a scale out approach and finds scale up wanting. The discussion in the comments is as interesting as the article. My guess is Markus doesn't want to rewrite his software to work across a scale out cluster so even if it's more expensive scale up works better for his needs. Update 3 : POF now has 200 million images and serves 10,000 images served per second. They'll be moving to a 250,000 IOPS RamSan to handle the load. Also upgraded to a core database machine with 512 GB of RAM, 32 CPU’s, SQLServer 2008 and Windows 2008. Update 2 : This seems to be a POF Peer1 love fest infomercial . It's pretty cont
5 0.76634312 928 high scalability-2010-10-26-Scaling DISQUS to 75 Million Comments and 17,000 RPS
Introduction: This presentation and video by Jason Yan and David Cramer discusses how they scaled DISQUS, a comments as a service service for easily adding comments to your site and connecting communities. The presentation is very good, so here are just a few highlights: Traffic : 17,000 requests/second peak; 450,000 websites; 15 million profiles; 75 million comments; 250 million visitors; 40 million monthly users / developer. Forces : unpredictable traffic patterns because of celebrity gossip and events like disasters; discussion never expire which means they can't fit in memory; must always be up. Machines : 100 servers; 30% web servers (Appache + mod_wsgi); 10% databases (PostgreSQL); 25% cache servers (memcached); 20% load balancing / high availability (HAProxy + heartbeat); 15% Utility servers (Python scripts). Architecture : Requests are load balanced across an Apache cluster. Apache talks to memcached, HAProxy/pgbouncer to handle connection pooling to the database, and a ce
6 0.76051706 703 high scalability-2009-09-12-How Google Taught Me to Cache and Cash-In
7 0.74682426 511 high scalability-2009-02-12-MySpace Architecture
8 0.73997533 106 high scalability-2007-10-02-Secrets to Fotolog's Scaling Success
9 0.73514646 436 high scalability-2008-11-02-Strategy: How to Manage Sessions Using Memcached
10 0.7087484 360 high scalability-2008-08-04-A Bunch of Great Strategies for Using Memcached and MySQL Better Together
11 0.70515853 513 high scalability-2009-02-16-Handle 1 Billion Events Per Day Using a Memory Grid
12 0.70412683 602 high scalability-2009-05-17-Scaling Django Web Apps by Mike Malone
13 0.69302607 662 high scalability-2009-07-27-Handle 700 Percent More Requests Using Squid and APC Cache
14 0.6829344 554 high scalability-2009-04-04-Digg Architecture
15 0.67812878 136 high scalability-2007-10-28-Scaling Early Stage Startups
16 0.67549264 7 high scalability-2007-07-12-FeedBurner Architecture
17 0.6712485 998 high scalability-2011-03-03-Stack Overflow Architecture Update - Now at 95 Million Page Views a Month
18 0.66602278 1346 high scalability-2012-10-24-Saving Cash Using Less Cache - 90% Savings in the Caching Tier
19 0.66558844 1220 high scalability-2012-04-02-YouPorn - Targeting 200 Million Views a Day and Beyond
20 0.66333246 1620 high scalability-2014-03-27-Strategy: Cache Stored Procedure Results
topicId topicWeight
[(1, 0.117), (2, 0.278), (40, 0.026), (52, 0.181), (61, 0.017), (77, 0.062), (79, 0.063), (85, 0.121)]
simIndex simValue blogId blogTitle
1 0.89448869 1406 high scalability-2013-02-14-When all the Program's a Graph - Prismatic's Plumbing Library
Introduction: At some point as a programmer you might have the insight/fear that all programming is just doing stuff to other stuff. Then you may observe after coding the same stuff over again that stuff in a program often takes the form of interacting patterns of flows. Then you may think hey, a program isn't only useful for coding datastructures, but a program is a kind of datastructure and that with a meta level jump you could program a program in terms of flows over data and flow over other flows. That's the kind of stuff Prismatic is making available in the Graph extension to their plumbing package ( code examples ), which is described in an excellent post: Graph: Abstractions for Structured Computation . You may remember Prismatic from previous profile we did on HighScalability: Prismatic Architecture - Using Machine Learning On Social Networks To Figure Out What You Should Read On The Web . We learned how Prismatic, an interest driven content suggestion service, builds programs in
same-blog 2 0.89064401 52 high scalability-2007-08-01-Product: Memcached
Introduction: Memcached is a high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load. Danga Interactive developed memcached to enhance the speed of LiveJournal.com, a site which was already doing 20 million+ dynamic page views per day for 1 million users with a bunch of webservers and a bunch of database servers. memcached dropped the database load to almost nothing, yielding faster page load times for users, better resource utilization, and faster access to the databases on a memcache miss.
3 0.8889035 47 high scalability-2007-07-30-Product: Yslow to speed up your web pages
Introduction: Update : Speed up Apache - how I went from F to A in YSlow . Good example of using YSlow to speed up a website with solid code examples. Every layer in the multi-layer cake that is your website contributes to how long a page takes to display. YSlow , from Yahoo, is a cool tool for discovering how the ingredients of your site's top layer contribute to performance. YSlow analyzes web pages and tells you why they're slow based on the rules for high performance web sites. YSlow is a Firefox add-on integrated with the popular Firebug web development tool. YSlow gives you: Performance report card HTTP/HTML summary List of components in the page Tools including JSLint
Introduction: Scoble the Ubiquitous has a fascinating post on how Mogulus, a live video channel startup, uses S3/EC2 and doesn't own a single server. The trends that have been happening for a while now are going mainstream. To do great things you no longer need to start by creating a huge war chest. You can forage off the land, like any good mobile, light weight fighting unit. For a strategy hit he mentions the same needed change in perspective as Beau Lebens talked about when making FeedBlendr : One tip he gave us is that when using Amazon’s services you have to design your systems with the assumption that they will never be up and running. What he means by that is services are “volatile” and can go up and down without notice. So, he’s designed his systems to survive that. He told me that it meant his engineering teams had to be quite disciplined in designing their architecture.
5 0.8785516 164 high scalability-2007-11-22-Why not Cache from Intersystems?
Introduction: I have some experience with a very large OLTP system that is 7+ TB in size and performs very well for 30K+ concurrent users. It is built using Intersystems Cache based on the very old but very scalable MUMPS platform. Why don't I see more discussions about archiectures such as these in this forum? I am curious why this platform scales so much better then the typical RDBMS.
6 0.86919409 872 high scalability-2010-08-05-Pairing NoSQL and Relational Data Storage: MySQL with MongoDB
7 0.86170942 317 high scalability-2008-05-10-Hitting 300 SimbleDB Requests Per Second on a Small EC2 Instance
8 0.84403771 638 high scalability-2009-06-26-PlentyOfFish Architecture
9 0.84298694 1361 high scalability-2012-11-22-Gone Fishin': PlentyOfFish Architecture
10 0.84238631 1080 high scalability-2011-07-15-Stuff The Internet Says On Scalability For July 15, 2011
11 0.84125006 1451 high scalability-2013-05-03-Stuff The Internet Says On Scalability For May 3, 2013
12 0.83903646 118 high scalability-2007-10-09-High Load on production Webservers after Sourcecode sync
13 0.83761197 244 high scalability-2008-02-11-Yahoo Live's Scaling Problems Prove: Release Early and Often - Just Don't Screw Up
14 0.83505595 1032 high scalability-2011-05-02-Stack Overflow Makes Slow Pages 100x Faster by Simple SQL Tuning
15 0.83479714 1327 high scalability-2012-09-21-Stuff The Internet Says On Scalability For September 21, 2012
16 0.83370185 1432 high scalability-2013-04-01-Khan Academy Checkbook Scaling to 6 Million Users a Month on GAE
17 0.83210814 1239 high scalability-2012-05-04-Stuff The Internet Says On Scalability For May 4, 2012
18 0.83116734 1446 high scalability-2013-04-25-Paper: Making reliable distributed systems in the presence of software errors
19 0.82713038 1592 high scalability-2014-02-07-Stuff The Internet Says On Scalability For February 7th, 2014
20 0.82448113 513 high scalability-2009-02-16-Handle 1 Billion Events Per Day Using a Memory Grid