high_scalability high_scalability-2008 high_scalability-2008-375 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Hi everyone, I'm researching on Scalability for a college paper, and found this site great, but it has too many tips, articles and the like, but I can't see a hierarchical organization of subjects, I would need something like a checklist of things or fields, or technologies to take into account when assesing scalability. So far I've identified these: - Hardware scalability: - scale out - scale up - Cache What types of cache are there? app-level, os-level, network-level, I/O-level? - Load Balancing - DB Clustering Am I missing something important? (I'm sure I am) I don't expect you to give a lecture here, but maybe point some things out, give me some useful links... Thanks!
sentIndex sentText sentNum sentScore
1 So far I've identified these: - Hardware scalability: - scale out - scale up - Cache What types of cache are there? [sent-2, score-0.721]
2 - Load Balancing - DB Clustering Am I missing something important? [sent-4, score-0.305]
3 (I'm sure I am) I don't expect you to give a lecture here, but maybe point some things out, give me some useful links. [sent-5, score-1.246]
wordName wordTfidf (topN-words)
[('college', 0.314), ('researching', 0.302), ('subjects', 0.278), ('checklist', 0.278), ('lecture', 0.257), ('hi', 0.238), ('identified', 0.211), ('hierarchical', 0.198), ('tips', 0.195), ('fields', 0.194), ('missing', 0.168), ('give', 0.165), ('organization', 0.156), ('account', 0.142), ('db', 0.138), ('something', 0.137), ('things', 0.135), ('articles', 0.127), ('maybe', 0.125), ('expect', 0.122), ('types', 0.12), ('far', 0.114), ('everyone', 0.112), ('scalability', 0.109), ('paper', 0.108), ('technologies', 0.106), ('useful', 0.105), ('sure', 0.101), ('scale', 0.098), ('found', 0.093), ('important', 0.084), ('cache', 0.08), ('hardware', 0.076), ('site', 0.073), ('ca', 0.073), ('point', 0.071), ('great', 0.063), ('like', 0.059), ('load', 0.057), ('see', 0.056), ('take', 0.052), ('would', 0.045), ('many', 0.044), ('need', 0.041)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 375 high scalability-2008-09-01-A Scalability checklist?
Introduction: Hi everyone, I'm researching on Scalability for a college paper, and found this site great, but it has too many tips, articles and the like, but I can't see a hierarchical organization of subjects, I would need something like a checklist of things or fields, or technologies to take into account when assesing scalability. So far I've identified these: - Hardware scalability: - scale out - scale up - Cache What types of cache are there? app-level, os-level, network-level, I/O-level? - Load Balancing - DB Clustering Am I missing something important? (I'm sure I am) I don't expect you to give a lecture here, but maybe point some things out, give me some useful links... Thanks!
2 0.25181082 54 high scalability-2007-08-02-Multilanguage Website
Introduction: Hi , someone can point me to some good resurce about how to bulid a multilanguage website ? the only resource i have found is this http://www.indiawebdevelopers.com/technology/multilanguage_support.asp thanks! p.s. great site ;)
3 0.13236912 91 high scalability-2007-09-13-Design Preparations for Scaling
Introduction: Hi there, what do you think is crucial in the code designing of a scalable site? How does one prepare for webfarms and clusters (e.g. in PHP)? Thanks, Stephan
4 0.09968783 8 high scalability-2007-07-12-Should I use LAMP or Windows?
Introduction: Hi, I stumb l ed on your s i te and I am th i nking about start i ng a website. I haven't rece i ved a good answer about what I shou l d use to bui l d i t, so I thought I wou l d give it a shot. I am a w i ndows guy. I know .Net and ASP and how to bu i ld web s i tes using that stack. But I not i ce most sites use LAMP and that's what most people ta l k about using. What's wrong w i th using Windows? .Net Programmer
5 0.098449275 69 high scalability-2007-08-21-What does the next generation data center look like?
Introduction: That's what people at the NGDC Conference get together and talk about. A lot of interesting subjects: data center virtualization HPC & grid; advanced facilitates management and planning; advanced network and services; applications; data center optimization and security; managing and protecting information. The Grid – Distributed Computing at Scale presentation is an interesting one.
6 0.093905352 620 high scalability-2009-06-05-SSL RPC API Scalability
7 0.09046904 948 high scalability-2010-11-24-Great Introductory Video on Scalability from Harvard Computer Science
8 0.090167053 6 high scalability-2007-07-11-Friendster Architecture
9 0.089636013 410 high scalability-2008-10-13-SQL Server 2008 Database Performance and Scalability
10 0.089145601 640 high scalability-2009-06-28-Google Voice Architecture
11 0.087975614 78 high scalability-2007-09-01-2 tier switch selection for colocation
12 0.084321603 623 high scalability-2009-06-10-Dealing with multi-partition transactions in a distributed KV solution
13 0.08297585 199 high scalability-2008-01-01-S3 for image storing
14 0.077165447 248 high scalability-2008-02-13-What's your scalability plan?
15 0.071937561 232 high scalability-2008-01-29-When things aren't scalable
16 0.069000289 1420 high scalability-2013-03-08-Stuff The Internet Says On Scalability For March 8, 2013
17 0.066009142 532 high scalability-2009-03-11-Sharding and Connection Pools
18 0.065730333 903 high scalability-2010-09-17-Hot Scalability Links For Sep 17, 2010
19 0.065581463 44 high scalability-2007-07-30-Product: Photobucket
20 0.065010227 1387 high scalability-2013-01-15-More Numbers Every Awesome Programmer Must Know
topicId topicWeight
[(0, 0.109), (1, 0.05), (2, -0.017), (3, -0.037), (4, 0.015), (5, -0.027), (6, -0.058), (7, -0.002), (8, 0.003), (9, 0.013), (10, -0.036), (11, -0.045), (12, -0.027), (13, 0.025), (14, 0.039), (15, -0.091), (16, 0.059), (17, -0.011), (18, 0.005), (19, 0.007), (20, 0.002), (21, 0.008), (22, -0.028), (23, 0.03), (24, -0.037), (25, -0.055), (26, 0.036), (27, -0.016), (28, -0.004), (29, -0.027), (30, 0.036), (31, -0.008), (32, 0.016), (33, -0.012), (34, -0.021), (35, 0.051), (36, 0.064), (37, -0.057), (38, -0.046), (39, 0.078), (40, 0.028), (41, 0.01), (42, -0.014), (43, -0.018), (44, 0.022), (45, 0.033), (46, -0.081), (47, 0.093), (48, -0.035), (49, -0.048)]
simIndex simValue blogId blogTitle
same-blog 1 0.93272632 375 high scalability-2008-09-01-A Scalability checklist?
Introduction: Hi everyone, I'm researching on Scalability for a college paper, and found this site great, but it has too many tips, articles and the like, but I can't see a hierarchical organization of subjects, I would need something like a checklist of things or fields, or technologies to take into account when assesing scalability. So far I've identified these: - Hardware scalability: - scale out - scale up - Cache What types of cache are there? app-level, os-level, network-level, I/O-level? - Load Balancing - DB Clustering Am I missing something important? (I'm sure I am) I don't expect you to give a lecture here, but maybe point some things out, give me some useful links... Thanks!
2 0.83768326 54 high scalability-2007-08-02-Multilanguage Website
Introduction: Hi , someone can point me to some good resurce about how to bulid a multilanguage website ? the only resource i have found is this http://www.indiawebdevelopers.com/technology/multilanguage_support.asp thanks! p.s. great site ;)
3 0.75198966 206 high scalability-2008-01-10-MONO ASP.NET. Will it make the web???
Introduction: I was wondering if it is already possible to scale a MONO's .NET website. I cannot see any real websites (with the term real I mean "a highly visited website") running mono. What do you think? Will MONO ASP.NET scale??? Is it worth planning a site to run with Mono asp.net? Or should we leave it to the future? What do you think?
4 0.7369988 91 high scalability-2007-09-13-Design Preparations for Scaling
Introduction: Hi there, what do you think is crucial in the code designing of a scalable site? How does one prepare for webfarms and clusters (e.g. in PHP)? Thanks, Stephan
5 0.71858007 167 high scalability-2007-11-27-Starting a website from scratch - what technologies should I use?
Introduction: Hi, if you were to design your own highly scalable website from scratch, what technologies would you use? Based on Web 2.0 popularity, LAMP seems to be high in the running. But would you tack on CakePHP? Drupal? or build your framework/CMS from scratch? What version of Linux runs best for a scalable website? Would you consider Windows and .NET? Java? Or do you want to throw a brick at me for even suggesting such heresies? Would you prefer Postgres, Tomcat, Perl, Python, or any of that other *NIX fancy stuff...why or why not? Please forget for the moment, "use what you know" argument. I am pretty versatile, and can look for an expert in whatever platform I choose. So all skills being equal, I'm looking for the best community support, the fastest development time and most importantly, the best scaling approach. Let's say, for fun, that I'm planning for the website to have as many messages going back & forth as an eBay. Definitely building this on a
6 0.6946342 199 high scalability-2008-01-01-S3 for image storing
7 0.67000121 8 high scalability-2007-07-12-Should I use LAMP or Windows?
8 0.6667614 121 high scalability-2007-10-14-Newbie in scalability design issues
9 0.66083837 611 high scalability-2009-05-31-Need help on Site loading & database optimization - URGENT
10 0.64438629 95 high scalability-2007-09-17-Scalable CMS?
11 0.64167887 1 high scalability-2007-07-06-Start Here
12 0.63257551 276 high scalability-2008-03-15-New Website Design Considerations
13 0.6255914 165 high scalability-2007-11-26-Scale to China
14 0.61893427 1349 high scalability-2012-10-29-Gone Fishin': Welcome to High Scalability
15 0.61587524 193 high scalability-2007-12-26-Finding an excellent LAMP developer
16 0.61134571 632 high scalability-2009-06-15-starting small with growth in mind
17 0.59696645 1453 high scalability-2013-05-07-Not Invented Here: A Comical Series on Scalability
18 0.58631277 2 high scalability-2007-07-08-Welcome to High Scalability
19 0.58057338 232 high scalability-2008-01-29-When things aren't scalable
20 0.57283747 51 high scalability-2007-07-31-Book: Scalable Internet Architectures
topicId topicWeight
[(2, 0.164), (10, 0.059), (61, 0.117), (66, 0.37), (79, 0.151)]
simIndex simValue blogId blogTitle
Introduction: I remain neutral, but time and again, when people talk Windows or SQL Server, they seem to consider them unreliable with limits around scalability, performance and availability. And then you start looking at some of the big boys you have listed here in the architectural section and most of them are on Linux, MySQL,Oracle platforms that we dont see Windows and SQL Server in there.. What are your thoughts ?
same-blog 2 0.70540798 375 high scalability-2008-09-01-A Scalability checklist?
Introduction: Hi everyone, I'm researching on Scalability for a college paper, and found this site great, but it has too many tips, articles and the like, but I can't see a hierarchical organization of subjects, I would need something like a checklist of things or fields, or technologies to take into account when assesing scalability. So far I've identified these: - Hardware scalability: - scale out - scale up - Cache What types of cache are there? app-level, os-level, network-level, I/O-level? - Load Balancing - DB Clustering Am I missing something important? (I'm sure I am) I don't expect you to give a lecture here, but maybe point some things out, give me some useful links... Thanks!
3 0.67164081 283 high scalability-2008-03-18-Shared filesystem on EC2
Introduction: Hi. I'm looking for a way to share files between EC2 nodes. Currently we are using glusterfs to do this. It has been reliable recently, but in the past it has crashed under high load and we've had trouble starting it up again. We've only been able to restart it by removing the files, restarting the cluster, and filing it up again with our files from backup. This takes ages, and will take even longer the more files we get. What worries me is that it seems to make each node a point of failure for the entire system. One node crashes and soon the entire cluster has crashed. The other problem is adding another node. It seems like you have to take down the whole thing, reconfigure to include the new node, and restart. This kind of defeats the horizontal scaling strategy. We are using 2 EC2 instances as web servers, 1 as a DB master, and 1 as a slave. GlusterFS is installed on the web server machines as well as the DB slave machine (we backup files to s3 from this machine). The files
4 0.62685364 185 high scalability-2007-12-13-Is premature scalation a real disease?
Introduction: Update 3: InfoQ's Big Architecture Up Front - A Case of Premature Scalaculation? twines several different threads on the topic together into a fine noose. Update 2: Kevin says the biggest problems he sees with startups is they need to scale their backend (no, the other one). Update: My bad. It's hard to sell scalability so just forget it. The premise of Startups and The Problem Of Premature Scalaculation and Don’t scale: 99.999% uptime is for Wal-Mart is that you shouldn't spend precious limited resources worrying about scaling before you've first implemented the functionality that will make you successful enough to have scaling problems in the first place. It's kind of an embodied life force model of system creation. Energy is scarce so any parasites siphoning off energy must be hunted down and destroyed so the body has its best chance of survival. Is this really how it works? If I ever believed this I certainly don't believe it anymore. The world has c
5 0.62175286 622 high scalability-2009-06-08-Distribution of queries per second
Introduction: We need to measure the number of queries-per-second our site gets for capacity planning purposes. Obviously, we need to provision the site based on the peak QPS, not average QPS. There will always be some spikes in traffic, though, where for one particular second we get a really huge number of queries. It's ok if site performance slightly degrades during that time. So what I'd really like to do is estimate the *near* peak QPS based on average or median QPS. Near peak might be defined as the QPS that I get at the 95th percentile of the busiest seconds during the day. My guess is that this is similar to what ISPs do when they measure your bandwidth usage and then charge for usage over the 95th percentile. What we've done is analyzed our logs, counted the queries executed during each second during the day, sorted from the busiest seconds to the least busy ones, and graphed it. What you get is a histogram that steeply declines and flattens out near zero. Does anyone know if there is a
6 0.57604563 973 high scalability-2011-01-14-Stuff The Internet Says On Scalability For January 14, 2011
7 0.55877846 684 high scalability-2009-08-18-Real World Web: Performance & Scalability
8 0.54897428 130 high scalability-2007-10-24-Scaling Operations Saves Money and Scales Faster
10 0.53253484 1242 high scalability-2012-05-09-Cell Architectures
11 0.53216779 1337 high scalability-2012-10-10-Antirez: You Need to Think in Terms of Organizing Your Data for Fetching
12 0.53087157 1018 high scalability-2011-04-07-Paper: A Co-Relational Model of Data for Large Shared Data Banks
13 0.52998465 526 high scalability-2009-03-05-Strategy: In Cloud Computing Systematically Drive Load to the CPU
14 0.52970493 780 high scalability-2010-02-19-Twitter’s Plan to Analyze 100 Billion Tweets
15 0.52524608 862 high scalability-2010-07-20-Strategy: Consider When a Service Starts Billing in Your Algorithm Cost
16 0.52501357 383 high scalability-2008-09-10-Shard servers -- go big or small?
18 0.52305013 1183 high scalability-2012-01-30-37signals Still Happily Scaling on Moore RAM and SSDs
19 0.52144569 687 high scalability-2009-08-24-How Google Serves Data from Multiple Datacenters
20 0.52025867 1165 high scalability-2011-12-28-Strategy: Guaranteed Availability Requires Reserving Instances in Specific Zones