high_scalability high_scalability-2007 high_scalability-2007-59 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: This scalability strategy is brought to you by Erik Osterman: My recommendations for anyone dealing with explosive growth on a limited budget with lots of cachable content (e.g. content capable of returning valid expiration headers) is employ a reverse proxy as mentioned in this article. In the last week, we had a site get AP'd, triggering 100K unique visitors to a single IIS server in under 5 hours. It took out the IIS server. Placing a single squid infront of the server handled the entire onslaught with a max server load of 0.10 on a modest Intel IV 3Ghz. It's trivial to implement for anyone interested...
sentIndex sentText sentNum sentScore
1 This scalability strategy is brought to you by Erik Osterman: My recommendations for anyone dealing with explosive growth on a limited budget with lots of cachable content (e. [sent-1, score-1.504]
2 content capable of returning valid expiration headers) is employ a reverse proxy as mentioned in this article. [sent-3, score-0.965]
3 In the last week, we had a site get AP'd, triggering 100K unique visitors to a single IIS server in under 5 hours. [sent-4, score-0.718]
4 Placing a single squid infront of the server handled the entire onslaught with a max server load of 0. [sent-6, score-1.219]
wordName wordTfidf (topN-words)
[('iis', 0.344), ('cachable', 0.234), ('infront', 0.234), ('onslaught', 0.234), ('iv', 0.22), ('triggering', 0.21), ('osterman', 0.202), ('ap', 0.182), ('placing', 0.182), ('explosive', 0.178), ('erik', 0.175), ('anyone', 0.168), ('headers', 0.167), ('modest', 0.164), ('employ', 0.162), ('expiration', 0.16), ('squid', 0.152), ('returning', 0.152), ('trivial', 0.151), ('valid', 0.141), ('max', 0.139), ('budget', 0.124), ('content', 0.124), ('recommendations', 0.119), ('mentioned', 0.118), ('intel', 0.116), ('brought', 0.112), ('visitors', 0.111), ('capable', 0.108), ('dealing', 0.108), ('week', 0.101), ('handled', 0.097), ('server', 0.094), ('took', 0.085), ('limited', 0.085), ('single', 0.077), ('growth', 0.077), ('implement', 0.076), ('unique', 0.075), ('last', 0.074), ('strategy', 0.072), ('lots', 0.066), ('entire', 0.06), ('site', 0.049), ('load', 0.038), ('scalability', 0.037), ('get', 0.028)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy
Introduction: This scalability strategy is brought to you by Erik Osterman: My recommendations for anyone dealing with explosive growth on a limited budget with lots of cachable content (e.g. content capable of returning valid expiration headers) is employ a reverse proxy as mentioned in this article. In the last week, we had a site get AP'd, triggering 100K unique visitors to a single IIS server in under 5 hours. It took out the IIS server. Placing a single squid infront of the server handled the entire onslaught with a max server load of 0.10 on a modest Intel IV 3Ghz. It's trivial to implement for anyone interested...
2 0.18027276 175 high scalability-2007-12-05-how to: Load Balancing with iis
Introduction: he l l o wor l d, can you te l l me how i can i mp l ement a l oad ba l anc i ng of a web s i te runn i ng under i i s - w i ndows server 2003/08
3 0.11870206 105 high scalability-2007-10-01-Statistics Logging Scalability
Introduction: My company is developing a centralized web platform to service our clients. We currently use about 3Mb/s on our uplink at our ISP serving web pages for about 100 clients. We'd like to offer them statistics that mean something to their businesses and have been contemplating writing our own statistics code to handle the task. All statistics would be gathered at the page view level and we're implementing a HttpModule in ASP.Net 2.0 to handle the gather of the data. That said, I'm curious to hear comments on writing this data (~500 bytes of log data/page request). We need to write this data somewhere and then build a process to aggregate the data into a warehouse application used in our reporting system. Google Analytics is out of the question because we do not want our hosting infrastructure dependant upon a remote server. Web Trends et al. are too expensive for our clients. I'm thinking of a couple of options. 1) Writing log data directly to a SQL Server 2000 db and havin
4 0.10832363 571 high scalability-2009-04-15-Using HTTP cache headers effectively
Introduction: Hi, Some time ago , martin fowler bloged about how HTTP cache headers can be very effectively used in web site design. http://www.martinfowler.com/bliki/SegmentationByFreshness.html How actively HTTP cache headers are considered in web site design? I think it is a great tool to reduce lot of load on server and should be considered before designing any complex caching strategy. Thoughts? Thanks, Unmesh
5 0.095154174 81 high scalability-2007-09-06-Scaling IMAP and POP3
Introduction: Another scalability strategy brought to you by Erik Osterman: Just thought I'd drop a brief suggestion to anyone building a large mail system. Our solution for scaling mail pickup was to develop a sharded architecture whereby accounts are spread across a cluster of servers, each with imap/pop3 capability. Then we use a cluster of reverse proxies (Perdition) speaking to the backend imap/pop3 servers . The benefit of this approach is you can use simply use round-robin or HA load balancing on the perdition servers that end users connect to (e.g. admins can easily move accounts around on the backend storage servers without affecting end users). Perdition manages routing users to the appropriate backend servers and has MySQL support. What we also liked about this approach was that it had no dependency on a distributed or networked file system, so less chance of corruption or data consistency issues. When an individual server reaches capacity, we just off load users to a less u
6 0.089526884 90 high scalability-2007-09-12-Technology behind mediatemple grid service
7 0.087547757 135 high scalability-2007-10-27-.Net2 and AJAX scalability?
8 0.087432317 74 high scalability-2007-08-23-Product: Varnish
9 0.077294379 177 high scalability-2007-12-08-thesimsonstage.ea.com
10 0.076552048 36 high scalability-2007-07-28-Product: Web Log Expert
11 0.075917773 662 high scalability-2009-07-27-Handle 700 Percent More Requests Using Squid and APC Cache
12 0.075030871 621 high scalability-2009-06-06-Graph server
13 0.06553784 453 high scalability-2008-12-01-Breakthrough Web-Tier Solutions with Record-Breaking Performance
14 0.062976688 72 high scalability-2007-08-22-Wikimedia architecture
15 0.061495185 856 high scalability-2010-07-12-Creating Scalable Digital Libraries
16 0.061402019 638 high scalability-2009-06-26-PlentyOfFish Architecture
17 0.060724877 1361 high scalability-2012-11-22-Gone Fishin': PlentyOfFish Architecture
18 0.058966745 30 high scalability-2007-07-26-Product: AWStats a Log Analyzer
19 0.058451273 199 high scalability-2008-01-01-S3 for image storing
20 0.058450706 176 high scalability-2007-12-07-Synchronizing databases in different geographic locations
topicId topicWeight
[(0, 0.059), (1, 0.031), (2, -0.026), (3, -0.074), (4, -0.008), (5, -0.031), (6, -0.015), (7, -0.018), (8, -0.008), (9, 0.05), (10, -0.017), (11, -0.024), (12, -0.046), (13, -0.019), (14, 0.009), (15, 0.002), (16, 0.035), (17, 0.017), (18, 0.006), (19, 0.001), (20, -0.016), (21, -0.01), (22, -0.031), (23, 0.005), (24, 0.009), (25, -0.021), (26, 0.004), (27, -0.012), (28, 0.001), (29, 0.021), (30, 0.01), (31, 0.014), (32, -0.009), (33, -0.035), (34, -0.027), (35, 0.027), (36, 0.048), (37, 0.013), (38, 0.005), (39, -0.039), (40, 0.012), (41, 0.039), (42, 0.009), (43, -0.03), (44, 0.003), (45, -0.028), (46, -0.017), (47, 0.02), (48, 0.03), (49, 0.013)]
simIndex simValue blogId blogTitle
same-blog 1 0.95595443 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy
Introduction: This scalability strategy is brought to you by Erik Osterman: My recommendations for anyone dealing with explosive growth on a limited budget with lots of cachable content (e.g. content capable of returning valid expiration headers) is employ a reverse proxy as mentioned in this article. In the last week, we had a site get AP'd, triggering 100K unique visitors to a single IIS server in under 5 hours. It took out the IIS server. Placing a single squid infront of the server handled the entire onslaught with a max server load of 0.10 on a modest Intel IV 3Ghz. It's trivial to implement for anyone interested...
2 0.72769821 251 high scalability-2008-02-18-How to deal with an I-O bottleneck to disk?
Introduction: A site I'm working with has an I/O bottleneck. They're using a static server to deliver all of the pictures/video content/zip downloads ecetera but now that the bandwith out of that server is approaching 50Mbit/second the latency on serving small files has increased to become unacceptable. I'm curious how other people have dealt with this situation. Seperating into two different servers would require a significant change to the sites architecutre (because the premise is that all uploads go into one server, all subdirectorie are created in one directory, etc.) and may not really solve the problem.
3 0.71246034 177 high scalability-2007-12-08-thesimsonstage.ea.com
Introduction: Cou l d anyone make an overv i ew of thesimsonstage.ea.com arch i tecture, i.e. some stats, w i ch techno l ogy thay use, how they imp l ement karaoke f l ash-based p l ayer, wh i ch med i a server they use, how many bandw i d t h does it need, etc. Any informat i on wi l l be he l pful. Thanks.
4 0.69954485 598 high scalability-2009-05-12-P2P server technology?
Introduction: Is there any type of server technology that allows visitors to a website to become part of the server? Like with bittorrent, users share some of their bandwidth, so would this be possible with web servers where a person goes to a website, downloads and runs the software which makes their internet connection and cpu and hdd become part of the web server?
5 0.69001639 175 high scalability-2007-12-05-how to: Load Balancing with iis
Introduction: he l l o wor l d, can you te l l me how i can i mp l ement a l oad ba l anc i ng of a web s i te runn i ng under i i s - w i ndows server 2003/08
6 0.68016219 26 high scalability-2007-07-25-Paper: Lightweight Web servers
7 0.67298812 620 high scalability-2009-06-05-SSL RPC API Scalability
8 0.66398901 319 high scalability-2008-05-14-Scaling an image upload service
9 0.6533075 176 high scalability-2007-12-07-Synchronizing databases in different geographic locations
10 0.65127164 262 high scalability-2008-02-26-Architecture to Allow High Availability File Upload
11 0.64791459 67 high scalability-2007-08-17-What is the best hosting option?
12 0.64390361 426 high scalability-2008-10-22-Server load balancing architectures, Part 1: Transport-level load balancing
13 0.60974348 70 high scalability-2007-08-22-How many machines do you need to run your site?
14 0.60634375 290 high scalability-2008-03-28-How to Get DNS Names of a Web Server
15 0.60327619 150 high scalability-2007-11-12-Slashdot Architecture - How the Old Man of the Internet Learned to Scale
16 0.58914906 379 high scalability-2008-09-04-Database question for upcoming project
17 0.58451343 571 high scalability-2009-04-15-Using HTTP cache headers effectively
18 0.5800367 856 high scalability-2010-07-12-Creating Scalable Digital Libraries
19 0.57386947 1288 high scalability-2012-07-23-Ask HighScalability: How Do I Build My MegaUpload + Itunes + YouTube Startup?
20 0.56918269 427 high scalability-2008-10-22-Server load balancing architectures, Part 2: Application-level load balancing
topicId topicWeight
[(2, 0.127), (61, 0.103), (85, 0.636)]
simIndex simValue blogId blogTitle
same-blog 1 0.95473653 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy
Introduction: This scalability strategy is brought to you by Erik Osterman: My recommendations for anyone dealing with explosive growth on a limited budget with lots of cachable content (e.g. content capable of returning valid expiration headers) is employ a reverse proxy as mentioned in this article. In the last week, we had a site get AP'd, triggering 100K unique visitors to a single IIS server in under 5 hours. It took out the IIS server. Placing a single squid infront of the server handled the entire onslaught with a max server load of 0.10 on a modest Intel IV 3Ghz. It's trivial to implement for anyone interested...
2 0.88418984 1049 high scalability-2011-05-31-Awesome List of Advanced Distributed Systems Papers
Introduction: As part of Dr. Indranil Gupta 's CS 525 Spring 2011 Advanced Distributed Systems class, he has collected an incredible list of resources on distributed systems . His research group is also doing some interesting work. The various topics include: Before there Were Clouds, Cloud Computing, P2P Systems, Basic Distributed Computing Concepts, Sensor Networks, Overlays and DHTs, Cloud Programming, Cloud Scheduling, Key-Value Stores, Storage, Sensor Net Routing, Geo-Distribution, P2P Apps, In-network processing, Epidemics, Probabilistic Membership Protocols, Distributed Monitoring and Management, Publish-Subscribe/CDNs, Measurement Studies, Old Wine: Stale or Vintage?, In Byzantium, Cloud Pricing, Other Industrial Systems, Structure of Networks, Completing the Circle, Green Clouds, Distributed Debugging, Flash!, The Middle or the End?, Availability-Aware Systems, Design Methodologies, Handling Stress, Sources of unreliability in networks, Handling Stress, Selfish algorithms, Securi
3 0.84851462 191 high scalability-2007-12-23-Synchronizing Memcached application
Introduction: I have an application with couple of web servers that uses MemcacheD. How can i synchronize concurrent put to the cache? The value of the entry is list. Atomic append operation could have been helpful, but unfortunately memcahe doesn't support atomic append.
4 0.80321097 143 high scalability-2007-11-06-Product: ChironFS
Introduction: If you are trying to create highly available file systems, especially across data centers, then ChironFS is one potential solution. It's relatively new, so there aren't lots of experience reports, but it looks worth considering. What is ChironFS and how does it work? Adapted from the ChironFS website: The Chiron Filesystem is a Fuse based filesystem that frees you from single points of failure. It's main purpose is to guarantee filesystem availability using replication. But it isn't a RAID implementation. RAID replicates DEVICES not FILESYSTEMS. Why not just use RAID over some network block device? Because it is a block device and if one server mounts that device in RW mode, no other server will be able to mount it in RW mode. Any real network may have many servers and offer a variety of services. Keeping everything running can become a real nightmare!
5 0.77831352 102 high scalability-2007-09-27-Product: Sequoia Database Clustering Technology
Introduction: Sequoia is a transparent middleware solution offering clustering, load balancing and failover services for any database. Sequoia is the continuation of the C-JDBC project. The database is distributed and replicated among several nodes and Sequoia balances the queries among these nodes. Sequoia handles node and network failures with transparent failover. It also provides support for hot recovery, online maintenance operations and online upgrades. Features in a nutshell No modification of existing applications or databases. Operational with any database providing a JDBC driver. High availability provided by advanced RAIDb technology. Transparent failover and recovery capabilities. Performance scalability with unique load balancing and query result caching features. Integrated JMX-based administration and monitoring. 100% Java implementation allowing portability across platforms with a JRE 1.4 or greater. Open source licensed under Apache v2 license. Professi
6 0.77226847 1039 high scalability-2011-05-12-Paper: Mind the Gap: Reconnecting Architecture and OS Research
7 0.76928002 820 high scalability-2010-05-03-100 Node Hazelcast cluster on Amazon EC2
8 0.76612049 447 high scalability-2008-11-19-High Definition Video Delivery on the Web?
9 0.69218332 1164 high scalability-2011-12-27-PlentyOfFish Update - 6 Billion Pageviews and 32 Billion Images a Month
10 0.68487895 1032 high scalability-2011-05-02-Stack Overflow Makes Slow Pages 100x Faster by Simple SQL Tuning
11 0.67047304 492 high scalability-2009-01-16-Database Sharding for startups
12 0.66448772 1500 high scalability-2013-08-12-100 Curse Free Lessons from Gordon Ramsay on Building Great Software
13 0.66345769 1577 high scalability-2014-01-13-NYTimes Architecture: No Head, No Master, No Single Point of Failure
14 0.65942407 1239 high scalability-2012-05-04-Stuff The Internet Says On Scalability For May 4, 2012
15 0.65292454 646 high scalability-2009-07-01-Podcast about Facebook's Cassandra Project and the New Wave of Distributed Databases
16 0.60341108 53 high scalability-2007-08-01-Product: MogileFS
17 0.60101891 1024 high scalability-2011-04-15-Stuff The Internet Says On Scalability For April 15, 2011
18 0.59381562 118 high scalability-2007-10-09-High Load on production Webservers after Sourcecode sync
19 0.58701575 638 high scalability-2009-06-26-PlentyOfFish Architecture
20 0.58408463 1361 high scalability-2012-11-22-Gone Fishin': PlentyOfFish Architecture