high_scalability high_scalability-2008 high_scalability-2008-314 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Update 6 : nginx_http_push_module . Turn nginx into a long-polling message queuing HTTP push server. Update 5 : In Load Balancer Update Barry describes how WordPress.com moved from Pound to Nginx and are now "regularly serving about 8-9k requests/second and about 1.2Gbit/sec through a few Nginx instances and have plenty of room to grow!". Update 4 : Nginx better than Pound for load balancing. Pound spikes at 80% CPU, Nginx uses 3% and is easier to understand and better documented. Update 3 : igvita.com combines two cool tools together for better performance in Nginx and Memcached, a 400% boost! . Update 2 : Software Project on Installing Nginx Web Server w/ PHP and SSL . Breaking away from mother Apache can be a scary proposition and this kind of getting started article really helps easy the separation. Update: Slicehost has some nice tutorials on setting up Nginx . From their website: Nginx ("engine x") is a high-performance HTTP server and reverse proxy, as wel
sentIndex sentText sentNum sentScore
1 Turn nginx into a long-polling message queuing HTTP push server. [sent-2, score-0.76]
2 2Gbit/sec through a few Nginx instances and have plenty of room to grow! [sent-5, score-0.103]
3 Update 4 : Nginx better than Pound for load balancing. [sent-7, score-0.053]
4 Pound spikes at 80% CPU, Nginx uses 3% and is easier to understand and better documented. [sent-8, score-0.175]
5 com combines two cool tools together for better performance in Nginx and Memcached, a 400% boost! [sent-10, score-0.105]
6 Breaking away from mother Apache can be a scary proposition and this kind of getting started article really helps easy the separation. [sent-13, score-0.277]
7 Update: Slicehost has some nice tutorials on setting up Nginx . [sent-14, score-0.051]
8 From their website: Nginx ("engine x") is a high-performance HTTP server and reverse proxy, as well as an IMAP/POP3/SMTP proxy server. [sent-15, score-0.247]
9 ru, Russia's second-most visited website, where it has been running in production for over two and a half years. [sent-17, score-0.075]
10 I currently have Nginx doing reverse proxy of over tens of millions of HTTP requests per day (thats a few hundred per second) on a single server. [sent-23, score-0.401]
11 At peak load it uses about 15MB RAM and 10% CPU on my particular configuration (FreeBSD 6). [sent-24, score-0.125]
12 Under the same kind of load, Apache falls over (after using 1000 or so processes and god knows how much RAM), Pound falls over (too many threads, and using 400MB+ of RAM for all the thread stacks), and Lighty leaks more than 20MB per hour (and uses more CPU, but not significantly more). [sent-25, score-0.589]
wordName wordTfidf (topN-words)
[('nginx', 0.708), ('pound', 0.299), ('igor', 0.201), ('proxy', 0.146), ('falls', 0.124), ('reverse', 0.101), ('inload', 0.1), ('innginx', 0.1), ('ippolito', 0.1), ('ssi', 0.1), ('updatebarry', 0.1), ('lighty', 0.094), ('servernginx', 0.094), ('thats', 0.094), ('russia', 0.09), ('slicehost', 0.084), ('ram', 0.082), ('proposition', 0.082), ('freebsd', 0.08), ('god', 0.08), ('leaks', 0.08), ('mirror', 0.075), ('visited', 0.075), ('regularly', 0.071), ('hello', 0.071), ('mother', 0.07), ('uses', 0.07), ('cpu', 0.069), ('apache', 0.068), ('http', 0.068), ('scary', 0.065), ('drupal', 0.065), ('beta', 0.063), ('boost', 0.061), ('weight', 0.061), ('stacks', 0.061), ('kind', 0.06), ('lighttpd', 0.059), ('configuration', 0.055), ('better', 0.053), ('room', 0.052), ('spikes', 0.052), ('queuing', 0.052), ('tens', 0.052), ('website', 0.052), ('combines', 0.052), ('plenty', 0.051), ('per', 0.051), ('setting', 0.051), ('clean', 0.05)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 314 high scalability-2008-05-03-Product: nginx
Introduction: Update 6 : nginx_http_push_module . Turn nginx into a long-polling message queuing HTTP push server. Update 5 : In Load Balancer Update Barry describes how WordPress.com moved from Pound to Nginx and are now "regularly serving about 8-9k requests/second and about 1.2Gbit/sec through a few Nginx instances and have plenty of room to grow!". Update 4 : Nginx better than Pound for load balancing. Pound spikes at 80% CPU, Nginx uses 3% and is easier to understand and better documented. Update 3 : igvita.com combines two cool tools together for better performance in Nginx and Memcached, a 400% boost! . Update 2 : Software Project on Installing Nginx Web Server w/ PHP and SSL . Breaking away from mother Apache can be a scary proposition and this kind of getting started article really helps easy the separation. Update: Slicehost has some nice tutorials on setting up Nginx . From their website: Nginx ("engine x") is a high-performance HTTP server and reverse proxy, as wel
2 0.34242466 1329 high scalability-2012-09-26-WordPress.com Serves 70,000 req-sec and over 15 Gbit-sec of Traffic using NGINX
Introduction: This is a guest post by Barry Abrahamson , Chief Systems Wrangler at Automattic, and Nginx's Coufounder Andrew Alexeev. WordPress.com serves more than 33 million sites attracting over 339 million people and 3.4 billion pages each month. Since April 2008, WordPress.com has experienced about 4.4 times growth in page views. WordPress.com VIP hosts many popular sites including CNN’s Political Ticker, NFL, Time Inc’s The Page, People Magazine’s Style Watch, corporate blogs for Flickr and KROQ, and many more. Automattic operates two thousand servers in twelve, globally distributed, data centers. WordPress.com customer data is instantly replicated between different locations to provide an extremely reliable and fast web experience for hundreds of millions of visitors. Problem WordPress.com, which began in 2005, started on shared hosting, much like all of the WordPress.org sites. It was soon moved to a single dedicated server and then to two servers. In late 2005, WordPress.com
3 0.24916156 808 high scalability-2010-04-12-Poppen.de Architecture
Introduction: This is a guest a post by Alvaro Videla describing their architecture for Poppen.de , a popular German dating site. This site is very much NSFW, so be careful before clicking on the link. What I found most interesting is how they manage to sucessfully blend a little of the old with a little of the new, using technologies like Nginx, MySQL, CouchDB, and Erlang, Memcached, RabbitMQ, PHP, Graphite, Red5, and Tsung. What is Poppen.de? Poppen.de (NSFW) is the top dating website in Germany, and while it may be a small site compared to giants like Flickr or Facebook, we believe it's a nice architecture to learn from if you are starting to get some scaling problems. The Stats 2.000.000 users 20.000 concurrent users 300.000 private messages per day 250.000 logins per day We have a team of eleven developers, two designers and two sysadmins for this project. Business Model The site works with a freemium model, where users can do for free things like: Search
4 0.21390386 1644 high scalability-2014-05-07-Update on Disqus: It's Still About Realtime, But Go Demolishes Python
Introduction: Our last article on Disqus: How Disqus Went Realtime With 165K Messages Per Second And Less Than .2 Seconds Latency , was a little out of date, but the folks at Disqus have been busy implementing, not talking, so we don't know a lot about what they are doing now, but we do have a short update in C1MM and NGINX by John Watson and an article Trying out this Go thing . So Disqus has grown a bit: 1.3 billion unique visitors 10 billion page views 500 million users engaged in discussions 3 million communities 25 million comments They are still all about realtime, but Go replaced Python in their Realtime system: Original Realtime backend was written in a pretty lightweight Python + gevent. The realtime service is a hybrid of CPU intensive tasks + lots of network IO. Gevent was handling the network IO without an issue, but at higher contention, the CPU was choking everything. Switching over to Go removed that contention, which was the primary issue that was being se
5 0.21228775 172 high scalability-2007-12-02-nginx: high performance smpt-pop-imap proxy
Introduction: nginx is a high performance smtp/pop/imap proxy that lets you do custom authorization and lookups and is very scalable. (just add nodes) Nginx by default is a reverse proxy and this is what it is doing here for pop/imap connections. It is also an excellelent reverse proxy for web servers. Advantage: You dont have to have a speacial database or ldap schema. Just an url to do auth and lookup with. A url that may be accessed by a unix or a tcp socket. Write your own auth handler - according to your own policy. For example: A user called atif tries to login with the pass testxyz. You pass this infomation to a URL such as socket:/var/tmp/xyz.sock or http://auth.corp.mailserver.net:someport/someurl The auth server replies with either a FAILURE such as Auth-Status: Invalid Login or password or with a success such as Auth-Status: OK Auth-Server: OneOfThe100Servers Auth-Port: optionalyAPort We have implemented it at our ISP and it has saves us a
6 0.17071338 1640 high scalability-2014-04-30-10 Tips for Optimizing NGINX and PHP-fpm for High Traffic Sites
7 0.1435962 1428 high scalability-2013-03-22-Stuff The Internet Says On Scalability For March 22, 2013
8 0.13724123 1287 high scalability-2012-07-20-Stuff The Internet Says On Scalability For July 20, 2012
9 0.13492143 1193 high scalability-2012-02-16-A Short on the Pinterest Stack for Handling 3+ Million Users
10 0.13340922 203 high scalability-2008-01-07-How Ruby on Rails Survived a 550k Pageview Digging
11 0.13333929 1638 high scalability-2014-04-28-How Disqus Went Realtime with 165K Messages Per Second and Less than .2 Seconds Latency
12 0.12642804 1516 high scalability-2013-09-13-Stuff The Internet Says On Scalability For September 13, 2013
13 0.12061736 1594 high scalability-2014-02-12-Paper: Network Stack Specialization for Performance
14 0.11982933 711 high scalability-2009-09-22-How Ravelry Scales to 10 Million Requests Using Rails
15 0.1167094 1443 high scalability-2013-04-19-Stuff The Internet Says On Scalability For April 19, 2013
16 0.11630289 313 high scalability-2008-05-02-Friends for Sale Architecture - A 300 Million Page View-Month Facebook RoR App
17 0.11277705 1296 high scalability-2012-08-02-Strategy: Use Spare Region Capacity to Survive Availability Zone Failures
18 0.10265251 300 high scalability-2008-04-07-Scalr - Open Source Auto-scaling Hosting on Amazon EC2
19 0.10093863 812 high scalability-2010-04-19-Strategy: Order Two Mediums Instead of Two Smalls and the EC2 Buffet
20 0.10024816 1469 high scalability-2013-06-03-GOV.UK - Not Your Father's Stack
topicId topicWeight
[(0, 0.114), (1, 0.044), (2, -0.027), (3, -0.111), (4, -0.016), (5, -0.042), (6, 0.036), (7, -0.028), (8, -0.002), (9, 0.027), (10, -0.018), (11, -0.05), (12, 0.062), (13, -0.038), (14, -0.086), (15, -0.051), (16, 0.035), (17, 0.007), (18, 0.005), (19, -0.021), (20, 0.004), (21, -0.013), (22, -0.056), (23, -0.035), (24, 0.048), (25, 0.028), (26, 0.012), (27, 0.026), (28, -0.026), (29, -0.027), (30, -0.02), (31, 0.001), (32, -0.063), (33, 0.077), (34, 0.062), (35, -0.083), (36, -0.031), (37, 0.057), (38, -0.039), (39, 0.034), (40, -0.044), (41, 0.039), (42, 0.034), (43, -0.106), (44, 0.008), (45, 0.032), (46, -0.043), (47, -0.022), (48, -0.024), (49, -0.009)]
simIndex simValue blogId blogTitle
same-blog 1 0.96528053 314 high scalability-2008-05-03-Product: nginx
Introduction: Update 6 : nginx_http_push_module . Turn nginx into a long-polling message queuing HTTP push server. Update 5 : In Load Balancer Update Barry describes how WordPress.com moved from Pound to Nginx and are now "regularly serving about 8-9k requests/second and about 1.2Gbit/sec through a few Nginx instances and have plenty of room to grow!". Update 4 : Nginx better than Pound for load balancing. Pound spikes at 80% CPU, Nginx uses 3% and is easier to understand and better documented. Update 3 : igvita.com combines two cool tools together for better performance in Nginx and Memcached, a 400% boost! . Update 2 : Software Project on Installing Nginx Web Server w/ PHP and SSL . Breaking away from mother Apache can be a scary proposition and this kind of getting started article really helps easy the separation. Update: Slicehost has some nice tutorials on setting up Nginx . From their website: Nginx ("engine x") is a high-performance HTTP server and reverse proxy, as wel
2 0.72658563 172 high scalability-2007-12-02-nginx: high performance smpt-pop-imap proxy
Introduction: nginx is a high performance smtp/pop/imap proxy that lets you do custom authorization and lookups and is very scalable. (just add nodes) Nginx by default is a reverse proxy and this is what it is doing here for pop/imap connections. It is also an excellelent reverse proxy for web servers. Advantage: You dont have to have a speacial database or ldap schema. Just an url to do auth and lookup with. A url that may be accessed by a unix or a tcp socket. Write your own auth handler - according to your own policy. For example: A user called atif tries to login with the pass testxyz. You pass this infomation to a URL such as socket:/var/tmp/xyz.sock or http://auth.corp.mailserver.net:someport/someurl The auth server replies with either a FAILURE such as Auth-Status: Invalid Login or password or with a success such as Auth-Status: OK Auth-Server: OneOfThe100Servers Auth-Port: optionalyAPort We have implemented it at our ISP and it has saves us a
3 0.63859183 138 high scalability-2007-10-30-Feedblendr Architecture - Using EC2 to Scale
Introduction: A man had a dream. His dream was to blend a bunch of RSS/Atom/RDF feeds into a single feed. The man is Beau Lebens of Feedville and like most dreamers he was a little short on coin. So he took refuge in the home of a cheap hosting provider and Beau realized his dream, creating FEEDblendr . But FEEDblendr chewed up so much CPU creating blended feeds that the cheap hosting provider ordered Beau to find another home. Where was Beau to go? He eventually found a new home in the virtual machine room of Amazon's EC2. This is the story of how Beau was finally able to create his one feeds safe within the cradle of affordable CPU cycles. Site: http://feedblendr.com/ The Platform EC2 (Fedora Core 6 Lite distro) S3 Apache PHP MySQL DynDNS (for round robin DNS) The Stats Beau is a developer with some sysadmin skills, not a web server admin, so a lot of learning was involved in creating FEEDblendr. FEEDblendr uses 2 EC2 instances. The same Amazon Instance (AMI) is
4 0.63801652 1329 high scalability-2012-09-26-WordPress.com Serves 70,000 req-sec and over 15 Gbit-sec of Traffic using NGINX
Introduction: This is a guest post by Barry Abrahamson , Chief Systems Wrangler at Automattic, and Nginx's Coufounder Andrew Alexeev. WordPress.com serves more than 33 million sites attracting over 339 million people and 3.4 billion pages each month. Since April 2008, WordPress.com has experienced about 4.4 times growth in page views. WordPress.com VIP hosts many popular sites including CNN’s Political Ticker, NFL, Time Inc’s The Page, People Magazine’s Style Watch, corporate blogs for Flickr and KROQ, and many more. Automattic operates two thousand servers in twelve, globally distributed, data centers. WordPress.com customer data is instantly replicated between different locations to provide an extremely reliable and fast web experience for hundreds of millions of visitors. Problem WordPress.com, which began in 2005, started on shared hosting, much like all of the WordPress.org sites. It was soon moved to a single dedicated server and then to two servers. In late 2005, WordPress.com
5 0.63387913 29 high scalability-2007-07-25-Product: lighttpd
Introduction: Lighttpd (pronounced "lighty") is a web server which is designed to be secure, fast, standards-compliant, and flexible while being optimized for speed-critical environments. Its low memory footprint (compared to other web servers), light CPU load and its speed goals make lighttpd suitable for servers that are suffering load problems, or for serving static media separately from dynamic content. lighttpd is free software / open source, and is distributed under the BSD license. lighttpd runs on GNU/Linux and other Unix-like operating systems and Microsoft Windows. Load-balancing FastCGI, SCGI and HTTP-proxy support chroot support select()-/poll()-based web server Support for more efficient event notification schemes like kqueue and epoll Conditional rewrites (mod_rewrite) SSL and TLS support, via openSSL. Authentication against an LDAP server rrdtool statistics Rule-based downloading with possibility of a script handling only authentication Server-side includes supp
6 0.63180697 74 high scalability-2007-08-23-Product: Varnish
7 0.61367595 1486 high scalability-2013-07-03-5 Rockin' Tips for Scaling PHP to 30,000 Concurrent Users Per Server
8 0.61299866 1615 high scalability-2014-03-19-Strategy: Three Techniques to Survive Traffic Surges by Quickly Scaling Your Site
9 0.59980577 312 high scalability-2008-04-30-Rather small site architecture.
10 0.59471142 290 high scalability-2008-03-28-How to Get DNS Names of a Web Server
11 0.59415185 1644 high scalability-2014-05-07-Update on Disqus: It's Still About Realtime, But Go Demolishes Python
12 0.58652824 996 high scalability-2011-02-28-A Practical Guide to Varnish - Why Varnish Matters
13 0.58100975 1193 high scalability-2012-02-16-A Short on the Pinterest Stack for Handling 3+ Million Users
14 0.56470484 1638 high scalability-2014-04-28-How Disqus Went Realtime with 165K Messages Per Second and Less than .2 Seconds Latency
15 0.55731457 808 high scalability-2010-04-12-Poppen.de Architecture
16 0.55429643 300 high scalability-2008-04-07-Scalr - Open Source Auto-scaling Hosting on Amazon EC2
17 0.55165458 662 high scalability-2009-07-27-Handle 700 Percent More Requests Using Squid and APC Cache
18 0.53140491 203 high scalability-2008-01-07-How Ruby on Rails Survived a 550k Pageview Digging
19 0.52788335 1517 high scalability-2013-09-16-The Hidden DNS Tax - Cascading Timeouts and Errors
20 0.52117115 134 high scalability-2007-10-26-Paper: Wikipedia's Site Internals, Configuration, Code Examples and Management Issues
topicId topicWeight
[(1, 0.104), (2, 0.163), (10, 0.061), (23, 0.011), (30, 0.029), (37, 0.267), (61, 0.164), (79, 0.077), (94, 0.014)]
simIndex simValue blogId blogTitle
1 0.94385666 1033 high scalability-2011-05-02-The Updated Big List of Articles on the Amazon Outage
Introduction: Since The Big List Of Articles On The Amazon Outage was published we've a had few updates that people might not have seen. Amazon of course released their Summary of the Amazon EC2 and Amazon RDS Service Disruption in the US East Region . Netlix shared their Lessons Learned from the AWS Outage as did Heroku ( How Heroku Survived the Amazon Outage ), Smug Mug ( How SmugMug survived the Amazonpocalypse ), and SimpleGeo ( How SimpleGeo Stayed Up During the AWS Downtime ). The curious thing from my perspective is the general lack of response to Amazon's explanation. I expected more discussion. There's been almost none that I've seen. My guess is very few people understand what Amazon was talking about enough to comment whereas almost everyone feels qualified to talk about the event itself. Lesson for crisis handlers : deep dive post-mortems that are timely, long, honestish, and highly technical are the most effective means of staunching the downward spiral of media attention.
2 0.90488923 1029 high scalability-2011-04-25-The Big List of Articles on the Amazon Outage
Introduction: Please see The Updated Big List Of Articles On The Amazon Outage for a new improved list. So many great articles have been written on the Amazon Outage. Some aim at being helpful, some chastise developers for being so stupid, some chastise Amazon for being so incompetent, some talk about the pain they and their companies have experienced, and some even predict the downfall of the cloud. Still others say we have seen a sea change in future of the cloud, a prediction that's hard to disagree with, though the shape of the change remains...cloudy. I'll try to keep this list update as more information comes out. There will be a lot for developers to consider going forward. If there's a resource you think should be added, just let me know. Amazon's Explanation of What Happened Summary of the Amazon EC2 and Amazon RDS Service Disruption in the US East Region Hackers News thread on AWS Service Disruption Post Mortem Quite Funny Commentary on the Summary Experiences f
3 0.83898246 311 high scalability-2008-04-29-Strategy: Sample to Reduce Data Set
Introduction: Update: Arjen links to video Supporting Scalable Online Statistical Processing which shows "rather than doing complete aggregates, use statistical sampling to provide a reasonable estimate (unbiased guess) of the result." When you have a lot of data, sampling allows you to draw conclusions from a much smaller amount of data. That's why sampling is a scalability solution. If you don't have to process all your data to get the information you need then you've made the problem smaller and you'll need fewer resources and you'll get more timely results. Sampling is not useful when you need a complete list that matches a specific criteria. If you need to know the exact set of people who bought a car in the last week then sampling won't help. But, if you want to know many people bought a car then you could take a sample and then create estimate of the full data-set. The difference is you won't really know the exact car count. You'll have a confidence interval saying how confident
4 0.81612837 965 high scalability-2010-12-29-Pinboard.in Architecture - Pay to Play to Keep a System Small
Introduction: How do you keep a system small enough, while still being successful, that a simple scale-up strategy becomes the preferred architecture? StackOverflow , for example, could stick with a tool chain they were comfortable with because they had a natural brake on how fast they could grow: there are only so many programmers in the world. If this doesn't work for you, here's another natural braking strategy to consider: charge for your service . Paul Houle summarized this nicely as: avoid scaling problems by building a service that's profitable at a small scale . This interesting point, one I hadn't properly considered before, was brought up by Maciej Ceglowski, co-founder of Pinboard.in , in an interview with Leo Laporte and Amber MacArthur on their their net@night show. Pinboard is a lean, mean, pay for bookmarking machine, a timely replacement for the nearly departed Delicious. And as a self professed anti-social bookmarking site, it emphasizes speed over socializing . Maciej
same-blog 5 0.81536311 314 high scalability-2008-05-03-Product: nginx
Introduction: Update 6 : nginx_http_push_module . Turn nginx into a long-polling message queuing HTTP push server. Update 5 : In Load Balancer Update Barry describes how WordPress.com moved from Pound to Nginx and are now "regularly serving about 8-9k requests/second and about 1.2Gbit/sec through a few Nginx instances and have plenty of room to grow!". Update 4 : Nginx better than Pound for load balancing. Pound spikes at 80% CPU, Nginx uses 3% and is easier to understand and better documented. Update 3 : igvita.com combines two cool tools together for better performance in Nginx and Memcached, a 400% boost! . Update 2 : Software Project on Installing Nginx Web Server w/ PHP and SSL . Breaking away from mother Apache can be a scary proposition and this kind of getting started article really helps easy the separation. Update: Slicehost has some nice tutorials on setting up Nginx . From their website: Nginx ("engine x") is a high-performance HTTP server and reverse proxy, as wel
6 0.78943497 1133 high scalability-2011-10-27-Strategy: Survive a Comet Strike in the East With Reserved Instances in the West
7 0.77682549 891 high scalability-2010-09-01-Scale-out vs Scale-up
8 0.76561952 103 high scalability-2007-09-28-Kosmos File System (KFS) is a New High End Google File System Option
9 0.76401567 329 high scalability-2008-05-27-Secure Remote Administration for Large-Scale Networks
10 0.76219648 1379 high scalability-2012-12-31-Designing for Resiliency will be so 2013
11 0.70939779 1107 high scalability-2011-08-29-The Three Ages of Google - Batch, Warehouse, Instant
12 0.70681757 1444 high scalability-2013-04-23-Facebook Secrets of Web Performance
13 0.70633936 1366 high scalability-2012-12-03-Resiliency is the New Normal - A Deep Look at What It Means and How to Build It
14 0.7050032 1415 high scalability-2013-03-04-7 Life Saving Scalability Defenses Against Load Monster Attacks
15 0.70464242 279 high scalability-2008-03-17-Microsoft's New Database Cloud Ready to Rumble with Amazon
16 0.70337164 150 high scalability-2007-11-12-Slashdot Architecture - How the Old Man of the Internet Learned to Scale
17 0.70320201 1191 high scalability-2012-02-13-Tumblr Architecture - 15 Billion Page Views a Month and Harder to Scale than Twitter
19 0.70317411 1461 high scalability-2013-05-20-The Tumblr Architecture Yahoo Bought for a Cool Billion Dollars
20 0.70048124 1089 high scalability-2011-07-29-Stuff The Internet Says On Scalability For July 29, 2011