high_scalability high_scalability-2007 high_scalability-2007-175 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: he l l o wor l d, can you te l l me how i can i mp l ement a l oad ba l anc i ng of a web s i te runn i ng under i i s - w i ndows server 2003/08
sentIndex sentText sentNum sentScore
wordName wordTfidf (topN-words)
[('aload', 0.616), ('iis', 0.481), ('hello', 0.466), ('windows', 0.276), ('balancing', 0.218), ('site', 0.137), ('running', 0.123), ('server', 0.087), ('web', 0.08)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 175 high scalability-2007-12-05-how to: Load Balancing with iis
Introduction: he l l o wor l d, can you te l l me how i can i mp l ement a l oad ba l anc i ng of a web s i te runn i ng under i i s - w i ndows server 2003/08
2 0.19261512 632 high scalability-2009-06-15-starting small with growth in mind
Introduction: Hello all, I'm working on a web site that might totally flop or it might explode to be the next facebook/flickr/digg/etc. Since I really don't know how popular the site will be I don't want to spend a ton of money on the hardware/hosting right away but I want to be able to scale it easily if it does grow rapidly. With this in mind, what would be the best approach to launch the site? Thanks, Dan
3 0.18810976 105 high scalability-2007-10-01-Statistics Logging Scalability
Introduction: My company is developing a centralized web platform to service our clients. We currently use about 3Mb/s on our uplink at our ISP serving web pages for about 100 clients. We'd like to offer them statistics that mean something to their businesses and have been contemplating writing our own statistics code to handle the task. All statistics would be gathered at the page view level and we're implementing a HttpModule in ASP.Net 2.0 to handle the gather of the data. That said, I'm curious to hear comments on writing this data (~500 bytes of log data/page request). We need to write this data somewhere and then build a process to aggregate the data into a warehouse application used in our reporting system. Google Analytics is out of the question because we do not want our hosting infrastructure dependant upon a remote server. Web Trends et al. are too expensive for our clients. I'm thinking of a couple of options. 1) Writing log data directly to a SQL Server 2000 db and havin
4 0.1830737 178 high scalability-2007-12-10-1 Master, N Slaves
Introduction: Hello all, Reading the site you can note that "1 Master for writes, N Slaves for reads" scheme is used offen. How is this implemented? Who decides where writes and reads go? Something in application level or specific database proxies, like Slony-I? Thanks.
5 0.18027276 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy
Introduction: This scalability strategy is brought to you by Erik Osterman: My recommendations for anyone dealing with explosive growth on a limited budget with lots of cachable content (e.g. content capable of returning valid expiration headers) is employ a reverse proxy as mentioned in this article. In the last week, we had a site get AP'd, triggering 100K unique visitors to a single IIS server in under 5 hours. It took out the IIS server. Placing a single squid infront of the server handled the entire onslaught with a max server load of 0.10 on a modest Intel IV 3Ghz. It's trivial to implement for anyone interested...
6 0.13562134 200 high scalability-2008-01-02-WEB hosting Select
7 0.12318274 427 high scalability-2008-10-22-Server load balancing architectures, Part 2: Application-level load balancing
8 0.12122677 8 high scalability-2007-07-12-Should I use LAMP or Windows?
10 0.10464329 36 high scalability-2007-07-28-Product: Web Log Expert
11 0.097177781 135 high scalability-2007-10-27-.Net2 and AJAX scalability?
12 0.088543341 584 high scalability-2009-04-27-Some Questions from a newbie
13 0.086264975 30 high scalability-2007-07-26-Product: AWStats a Log Analyzer
14 0.079611897 35 high scalability-2007-07-28-Product: FastStats Log Analyzer
15 0.078030564 109 high scalability-2007-10-03-Save on a Load Balancer By Using Client Side Load Balancing
16 0.077842377 638 high scalability-2009-06-26-PlentyOfFish Architecture
17 0.077559479 366 high scalability-2008-08-17-Many updates against MySQL
18 0.076983929 1361 high scalability-2012-11-22-Gone Fishin': PlentyOfFish Architecture
19 0.072930217 66 high scalability-2007-08-16-What tech is used to build your favorite site?
20 0.072871424 426 high scalability-2008-10-22-Server load balancing architectures, Part 1: Transport-level load balancing
topicId topicWeight
[(0, 0.04), (1, 0.005), (2, -0.023), (3, -0.089), (4, -0.003), (5, -0.049), (6, -0.016), (7, -0.04), (8, 0.003), (9, 0.058), (10, -0.026), (11, -0.036), (12, -0.021), (13, -0.025), (14, 0.034), (15, -0.052), (16, 0.052), (17, 0.003), (18, 0.019), (19, 0.01), (20, 0.013), (21, -0.004), (22, -0.052), (23, -0.017), (24, 0.034), (25, -0.071), (26, -0.019), (27, -0.004), (28, -0.008), (29, -0.044), (30, 0.013), (31, 0.016), (32, 0.024), (33, -0.097), (34, -0.014), (35, 0.01), (36, 0.018), (37, 0.001), (38, -0.01), (39, -0.017), (40, -0.015), (41, 0.01), (42, -0.015), (43, 0.011), (44, 0.038), (45, -0.043), (46, 0.037), (47, 0.014), (48, 0.063), (49, -0.018)]
simIndex simValue blogId blogTitle
same-blog 1 0.96274245 175 high scalability-2007-12-05-how to: Load Balancing with iis
Introduction: he l l o wor l d, can you te l l me how i can i mp l ement a l oad ba l anc i ng of a web s i te runn i ng under i i s - w i ndows server 2003/08
2 0.73241323 8 high scalability-2007-07-12-Should I use LAMP or Windows?
Introduction: Hi, I stumb l ed on your s i te and I am th i nking about start i ng a website. I haven't rece i ved a good answer about what I shou l d use to bui l d i t, so I thought I wou l d give it a shot. I am a w i ndows guy. I know .Net and ASP and how to bu i ld web s i tes using that stack. But I not i ce most sites use LAMP and that's what most people ta l k about using. What's wrong w i th using Windows? .Net Programmer
3 0.70076942 232 high scalability-2008-01-29-When things aren't scalable
Introduction: OK, I know this site is for scalable web site design. But as there aren't any sites I can find for graceful failure under "slashdotted" like pressure I'll ask here. Does anyone have a sensible way, once you have a "web application" that either won't scale, or can't scale, that you can give some users a good consistent experience and bounce other users to a busy site page. I have seen sites do this to varying degrees, some of which work better than others, but no explanations beyond simply bouncing requests to a "we're busy page server" when you have more than a given number of connections. This is obviously useless as a web page likely requires multiple connection (ignoring keep-alive, pipelining etc) multiple connection to completely render properly. The normal problem is users getting a page and not the "furniture" for that page like images or css. Other problems are having to wait ages to get the busy page or the site being slow even if you do "get in". And some site let
4 0.6865012 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy
Introduction: This scalability strategy is brought to you by Erik Osterman: My recommendations for anyone dealing with explosive growth on a limited budget with lots of cachable content (e.g. content capable of returning valid expiration headers) is employ a reverse proxy as mentioned in this article. In the last week, we had a site get AP'd, triggering 100K unique visitors to a single IIS server in under 5 hours. It took out the IIS server. Placing a single squid infront of the server handled the entire onslaught with a max server load of 0.10 on a modest Intel IV 3Ghz. It's trivial to implement for anyone interested...
5 0.67364383 251 high scalability-2008-02-18-How to deal with an I-O bottleneck to disk?
Introduction: A site I'm working with has an I/O bottleneck. They're using a static server to deliver all of the pictures/video content/zip downloads ecetera but now that the bandwith out of that server is approaching 50Mbit/second the latency on serving small files has increased to become unacceptable. I'm curious how other people have dealt with this situation. Seperating into two different servers would require a significant change to the sites architecutre (because the premise is that all uploads go into one server, all subdirectorie are created in one directory, etc.) and may not really solve the problem.
6 0.66272712 632 high scalability-2009-06-15-starting small with growth in mind
7 0.66066325 321 high scalability-2008-05-17-WebSphere Commerce High Availability and Performance Configurations
8 0.65363365 71 high scalability-2007-08-22-Profiling WEB applications
9 0.63137132 262 high scalability-2008-02-26-Architecture to Allow High Availability File Upload
10 0.61936754 509 high scalability-2009-02-05-Product: HAProxy - The Reliable, High Performance TCP-HTTP Load Balancer
11 0.61793333 66 high scalability-2007-08-16-What tech is used to build your favorite site?
12 0.61677736 26 high scalability-2007-07-25-Paper: Lightweight Web servers
13 0.61585081 427 high scalability-2008-10-22-Server load balancing architectures, Part 2: Application-level load balancing
14 0.61398625 35 high scalability-2007-07-28-Product: FastStats Log Analyzer
15 0.60249448 298 high scalability-2008-04-07-Lazy web sites run faster
16 0.59545726 10 high scalability-2007-07-15-Book: Building Scalable Web Sites
17 0.58083004 620 high scalability-2009-06-05-SSL RPC API Scalability
18 0.58047038 426 high scalability-2008-10-22-Server load balancing architectures, Part 1: Transport-level load balancing
19 0.56172544 611 high scalability-2009-05-31-Need help on Site loading & database optimization - URGENT
20 0.55929118 118 high scalability-2007-10-09-High Load on production Webservers after Sourcecode sync
topicId topicWeight
[(2, 0.068), (61, 0.067), (93, 0.586)]
simIndex simValue blogId blogTitle
same-blog 1 0.77067298 175 high scalability-2007-12-05-how to: Load Balancing with iis
Introduction: he l l o wor l d, can you te l l me how i can i mp l ement a l oad ba l anc i ng of a web s i te runn i ng under i i s - w i ndows server 2003/08
2 0.59745967 403 high scalability-2008-10-06-Paper: Scaling Genome Sequencing - Complete Genomics Technology Overview
Introduction: Although the problem of scaling human genome sequencing is not exactly about building bigger, faster and more reliable websites it is most interesting in terms of scalability. The paper describes a new technology by the startup company Complete Genomics to sequence the full human genome for the fraction of the cost of earlier possibilities. Complete Genomics is building the world’s largest commercial human genome sequencing center to provide turnkey, outsourced complete human genome sequencing to customers worldwide. By 2010, their data center will contain approximately 60,000 processors with 30 petabytes of storage running their sequencing software on Linux clusters. Do you find this interesting and relevant to HighScalability.com?
3 0.43964171 349 high scalability-2008-07-10-Can cloud computing smite down evil zombie botnet armies?
Introduction: In the more cool stuff I've never heard of before department is something called Self Cleansing Intrusion Tolerance (SCIT). Botnets are created when vulnerable computers live long enough to become infected with the will to do the evil bidding of their evil masters. Security is almost always about removing vulnerabilities (a process which to outside observers often looks like a dog chasing its tail ). SCIT takes a different approach, it works on the availability angle. Something I never thought of before, but which makes a great deal of sense once I thought about it. With SCIT you stop and restart VM instances every minute (or whatever depending in your desired window vulnerability).... This short exposure window means worms and viri do not have long enough to fully infect a machine and carry out a coordinated attack. A machine is up for a while. Does work. And then is torn down again only to be reborn as a clean VM with no possibility of infection (unless of course the VM
4 0.37331489 1450 high scalability-2013-05-01-Myth: Eric Brewer on Why Banks are BASE Not ACID - Availability Is Revenue
Introduction: In NoSQL: Past, Present, Future Eric Brewer has a particularly fine section on explaining the often hard to understand ideas of BASE (Basically Available, Soft State, Eventually Consistent), ACID (Atomicity, Consistency, Isolation, Durability), CAP (Consistency Availability, Partition Tolerance), in terms of a pernicious long standing myth about the sanctity of consistency in banking. Myth : Money is important, so banks must use transactions to keep money safe and consistent, right? Reality : Banking transactions are inconsistent, particularly for ATMs. ATMs are designed to have a normal case behaviour and a partition mode behaviour. In partition mode Availability is chosen over Consistency. Why? 1) Availability correlates with revenue and consistency generally does not. 2) Historically there was never an idea of perfect communication so everything was partitioned. Your ATM transaction must go through so Availability is more important than
5 0.36597562 58 high scalability-2007-08-04-Product: Cacti
Introduction: Cacti is a network statistics graphing tool designed as a frontend to RRDtool's data storage and graphing functionality. It is intended to be intuitive and easy to use, as well as robust and scalable. It is generally used to graph time-series data like CPU load and bandwidth use. The frontend is written in PHP; it can handle multiple users, each with their own graph sets, so it is sometimes used by web hosting providers (especially dedicated server, virtual private server, and colocation providers) to display bandwidth statistics for their customers. It can be used to configure the data collection itself, allowing certain setups to be monitored without any manual configuration of RRDtool.
6 0.3456282 168 high scalability-2007-11-30-Strategy: Efficiently Geo-referencing IPs
7 0.32056949 30 high scalability-2007-07-26-Product: AWStats a Log Analyzer
8 0.29925489 1513 high scalability-2013-09-06-Stuff The Internet Says On Scalability For September 6, 2013
9 0.29872245 166 high scalability-2007-11-27-Solving the Client Side API Scalability Problem with a Little Game Theory
10 0.26375267 1198 high scalability-2012-02-24-Stuff The Internet Says On Scalability For February 24, 2012
11 0.24877319 944 high scalability-2010-11-17-Some Services are More Equal than Others
12 0.24078786 1330 high scalability-2012-09-28-Stuff The Internet Says On Scalability For September 28, 2012
13 0.22848876 573 high scalability-2009-04-16-Serving 250M quotes-day at CNBC.com with aiCache
14 0.21615736 1279 high scalability-2012-07-09-Data Replication in NoSQL Databases
15 0.19615957 1637 high scalability-2014-04-25-Stuff The Internet Says On Scalability For April 25th, 2014
16 0.1860887 1233 high scalability-2012-04-25-The Anatomy of Search Technology: blekko’s NoSQL database
17 0.15976675 1253 high scalability-2012-05-28-The Anatomy of Search Technology: Crawling using Combinators
18 0.15391378 322 high scalability-2008-05-19-Conference: Infoscale 2008 in Italy (June 4-6)
19 0.15291421 99 high scalability-2007-09-23-HA for switches
20 0.15189812 332 high scalability-2008-05-28-Job queue and search engine