high_scalability high_scalability-2008 high_scalability-2008-361 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: At least in the articles on Plenty of Fish and Slashdot it was mentioned that one can achieve higher performance by creating read-only and write-only databases where possible. I have read the comments and tried unsuccessfully to find more information on the net about this. I still do not understand the concept. Can someone explain it in more detail, as well as recommend resources for further investigation? (Are there books written specifically about this technique?) I think it is a very important issue, because databases are oftentimes the bottleneck.
sentIndex sentText sentNum sentScore
1 At least in the articles on Plenty of Fish and Slashdot it was mentioned that one can achieve higher performance by creating read-only and write-only databases where possible. [sent-1, score-1.037]
2 I have read the comments and tried unsuccessfully to find more information on the net about this. [sent-2, score-0.774]
3 Can someone explain it in more detail, as well as recommend resources for further investigation? [sent-4, score-0.673]
4 ) I think it is a very important issue, because databases are oftentimes the bottleneck. [sent-6, score-0.69]
wordName wordTfidf (topN-words)
[('oftentimes', 0.366), ('slashdot', 0.298), ('investigation', 0.298), ('fish', 0.25), ('net', 0.236), ('recommend', 0.219), ('books', 0.21), ('plenty', 0.188), ('technique', 0.185), ('mentioned', 0.184), ('explain', 0.177), ('databases', 0.171), ('specifically', 0.169), ('tried', 0.168), ('bottleneck', 0.16), ('detail', 0.16), ('issue', 0.151), ('comments', 0.15), ('achieve', 0.14), ('articles', 0.133), ('least', 0.118), ('understand', 0.118), ('someone', 0.116), ('higher', 0.11), ('creating', 0.106), ('resources', 0.099), ('written', 0.098), ('important', 0.088), ('find', 0.078), ('information', 0.075), ('read', 0.067), ('still', 0.067), ('think', 0.065), ('well', 0.062), ('performance', 0.042), ('one', 0.033)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 361 high scalability-2008-08-08-Separation into read-write only databases
Introduction: At least in the articles on Plenty of Fish and Slashdot it was mentioned that one can achieve higher performance by creating read-only and write-only databases where possible. I have read the comments and tried unsuccessfully to find more information on the net about this. I still do not understand the concept. Can someone explain it in more detail, as well as recommend resources for further investigation? (Are there books written specifically about this technique?) I think it is a very important issue, because databases are oftentimes the bottleneck.
2 0.2076755 150 high scalability-2007-11-12-Slashdot Architecture - How the Old Man of the Internet Learned to Scale
Introduction: Slashdot effect : overwhelming unprepared sites with an avalanche of reader's clicks after being mentioned on Slashdot. Sure, we now have the "Digg effect" and other hot new stars, but Slashdot was the original. And like many stars from generations past, Slashdot plays the elder statesman's role with with class, dignity, and restraint. Yet with millions and millions of users Slashdot is still box office gold and more than keeps up with the young'ins. And with age comes the wisdom of learning how to handle all those users. Just how does Slashdot scale and what can you learn by going old school? Site: http://slashdot.org Information Sources Slashdot's Setup, Part 1- Hardware Slashdot's Setup, Part 2- Software History of Slashdot Part 3- Going Corporate The History of Slashdot Part 4 - Yesterday, Today, Tomorrow The Platform MySQL Linux (CentOS/RHEL) Pound Apache Perl Memcached LVS The Stats Started building the system in 1999
3 0.116207 621 high scalability-2009-06-06-Graph server
Introduction: I've seen mentioned in few times sites like Digg or LinkedIn using graph servers to hold their social graphs. But the only sort of open source graph server I've found is http://neo4j.org/ . Can anyone recommend an open source graph server? Thanks Aaron
4 0.070364311 917 high scalability-2010-10-08-4 Scalability Themes from Surgecon
Introduction: Robert Haas in his SURGE Recap of the Surge conference, reflected a bit, and came up with an interesting checklist of general themes from what he was seeing. I'm directly quoting his post, so please see the post for a full discussion. He uses this framework to think about the larger picture and where PostgreSQL stands in its progression. Make use of the academic literature . Inventing your own way to do something is fine, but at least consider the possibility that someone smarter than you has thought about this problem before. Failures are inevitable, so plan for them . Try to minimize the possibility of cascading failures, and plan in advance how you can operate in degraded mode if disaster (or the Slashdot effect) strikes. Disk technology matters . Drive firmware bugs are common and nightmarish, and you can expect very limited help from the manufacturer, especially if the drive is billed as consumer-grade rather than enterprise-grade. SSDs can save you a lot of m
5 0.068456374 442 high scalability-2008-11-13-Plenty of Fish Says Scaling for Free Doesn't Pay
Introduction: Plenty of Fish CEO Markus Frind, famous nerd hero for making over $10 million a year from Google ads on a free dating site he made and ran all by himself, now sees a problem with the free model : The problem with free is that every time you double the size of your database the cost of maintaining the site grows 6 fold. I really underestimated how much resources it would take, I have one database table now that exceeds 3 billion records. The bigger you get as a free site the less money you make per visit and the more it costs to service a visit...There is really no money in being free and we have to start experimenting with other models now or we won’t be able to compete in 3 or 4 years. As one commenter succinctly put it: the “golden time” of AdSense is over . Time to look at costs. The POF architecture is to run scarily huge tables on single machines. They also buy and maintain their own SAN. So it seems scaling up is what is increasing costs and decreasing profits. I wo
6 0.065575272 182 high scalability-2007-12-12-Oracle Can Do Read-Write Splitting Too
7 0.065369107 1369 high scalability-2012-12-10-Switch your databases to Flash storage. Now. Or you're doing it wrong.
8 0.063915849 220 high scalability-2008-01-22-The high scalability community
9 0.0634095 252 high scalability-2008-02-18-limit on the number of databases open
10 0.062813908 446 high scalability-2008-11-18-Scalability Perspectives #2: Van Jacobson – Content-Centric Networking
11 0.061178979 1190 high scalability-2012-02-10-Stuff The Internet Says On Scalability For February 10, 2012
12 0.060355268 1514 high scalability-2013-09-09-Need Help with Database Scalability? Understand I-O
13 0.059744168 704 high scalability-2009-09-13-How is Berkely DB fare against other Key-Value Database
14 0.057795797 847 high scalability-2010-06-23-Product: dbShards - Share Nothing. Shard Everything.
15 0.057674218 539 high scalability-2009-03-16-Books: Web 2.0 Architectures and Cloud Application Architectures
16 0.057519011 502 high scalability-2009-01-26-Paper: Scalability by Design - Coding for Systems With Large CPU Counts
17 0.056743782 162 high scalability-2007-11-20-what is j2ee stack
18 0.055589199 672 high scalability-2009-08-06-An Unorthodox Approach to Database Design : The Coming of the Shard
19 0.055444088 384 high scalability-2008-09-16-EE-Appserver Clustering OR Terracota OR Coherence OR something else?
20 0.054692339 884 high scalability-2010-08-23-6 Ways to Kill Your Servers - Learning How to Scale the Hard Way
topicId topicWeight
[(0, 0.071), (1, 0.05), (2, -0.004), (3, -0.006), (4, -0.0), (5, 0.013), (6, -0.025), (7, -0.015), (8, -0.001), (9, -0.001), (10, -0.017), (11, 0.0), (12, -0.036), (13, 0.001), (14, 0.023), (15, -0.029), (16, 0.016), (17, 0.031), (18, -0.017), (19, 0.028), (20, -0.032), (21, -0.012), (22, -0.015), (23, 0.029), (24, -0.005), (25, -0.003), (26, -0.007), (27, -0.029), (28, -0.025), (29, -0.016), (30, 0.016), (31, -0.005), (32, 0.01), (33, 0.019), (34, -0.022), (35, 0.017), (36, 0.033), (37, 0.007), (38, 0.018), (39, 0.0), (40, 0.016), (41, 0.017), (42, -0.002), (43, -0.047), (44, 0.028), (45, 0.055), (46, -0.009), (47, -0.05), (48, 0.001), (49, 0.018)]
simIndex simValue blogId blogTitle
same-blog 1 0.95974451 361 high scalability-2008-08-08-Separation into read-write only databases
Introduction: At least in the articles on Plenty of Fish and Slashdot it was mentioned that one can achieve higher performance by creating read-only and write-only databases where possible. I have read the comments and tried unsuccessfully to find more information on the net about this. I still do not understand the concept. Can someone explain it in more detail, as well as recommend resources for further investigation? (Are there books written specifically about this technique?) I think it is a very important issue, because databases are oftentimes the bottleneck.
2 0.61962342 917 high scalability-2010-10-08-4 Scalability Themes from Surgecon
Introduction: Robert Haas in his SURGE Recap of the Surge conference, reflected a bit, and came up with an interesting checklist of general themes from what he was seeing. I'm directly quoting his post, so please see the post for a full discussion. He uses this framework to think about the larger picture and where PostgreSQL stands in its progression. Make use of the academic literature . Inventing your own way to do something is fine, but at least consider the possibility that someone smarter than you has thought about this problem before. Failures are inevitable, so plan for them . Try to minimize the possibility of cascading failures, and plan in advance how you can operate in degraded mode if disaster (or the Slashdot effect) strikes. Disk technology matters . Drive firmware bugs are common and nightmarish, and you can expect very limited help from the manufacturer, especially if the drive is billed as consumer-grade rather than enterprise-grade. SSDs can save you a lot of m
3 0.61124039 370 high scalability-2008-08-18-Forum sort order
Introduction: G'day, I noticed the default sort order for the forum is to show the posts with the most replies first. That seems a bit odd for a forum. Would it not make sense to show the posts with the most recently replies first? It is possible to re-sort the forum threads that way by clicking on the "Last post" header (twice). It would seem like a more sensible default. I've checked and I see the same behaviour as both a registered (logged in) and anonymous user. Cheers - Callum .
4 0.61091375 719 high scalability-2009-10-09-Have you collectl'd yet? If not, maybe collectl-utils will make it easier to do so
Introduction: I'm not sure how many people who follow this have even tried collectl but I wanted to let you all know that I just released a set of utilities called strangely enough collectl-utils, which you can get at http://collectl-utils.sourceforge.net . One web-based utility called colplot gives you the ability to very easily plot data from multiple systems in a way that makes correlating them over time very easy. Another utility called colmux lets you look at multiple systems in real time. In fact if you go the page that describes it in more detail you'll see a photo which shows the CPU loads on 192 systems one a second, one set of data/line! in fact the display so wide it takes 3 large monitors side-by-side to see it all and even though you can't actually read the displays you can easily see which systems are loaded and which aren't. Anyhow give it a look and let me know what you think. -mark
5 0.60429597 451 high scalability-2008-11-30-Creating a high-performing online database
Introduction: Hi there, I have an idea for an online database that services a large number of people. I've been studying it for a while and it seems feasible to me to create it and get people to populate it. It will need time to grow but eventually it will get there. The model I'm looking at is IMDB, the depth of information is fascinating, yet it's fast, not so easy to use though, but it's pretty usable! What do you think I need to create a database an online database like IMDB. I know that IMDB power comes from it's information, not the design of the site. This is something I kind of figured out. But what I need to know is the best tools to publish database contents on the web, retrieve it in that fast way like IMDB. I'm sure that I will need to create data entry logs for my users to populate the database. What programming languages you suggest? development environment? approaches? your contribution is highly appreciated. Regards, Jalil
6 0.5959484 222 high scalability-2008-01-25-Application Database and DAL Architecture
7 0.59059227 748 high scalability-2009-11-30-Why Existing Databases (RAC) are So Breakable!
8 0.58605146 210 high scalability-2008-01-13-A Note on How to Create Teasers When Posting
9 0.58469981 747 high scalability-2009-11-26-What I'm Thankful For on Thanksgiving
10 0.57634807 1506 high scalability-2013-08-23-Stuff The Internet Says On Scalability For August 23, 2013
11 0.57408792 654 high scalability-2009-07-09-No to SQL? Anti-database movement gains steam – My Take
12 0.57281941 677 high scalability-2009-08-09-NoSQL: If Only It Was That Easy
13 0.5722 1199 high scalability-2012-02-27-Zen and the Art of Scaling - A Koan and Epigram Approach
14 0.57174158 330 high scalability-2008-05-27-Should Twitter be an All-You-Can-Eat Buffet or a Vending Machine?
15 0.57129133 1453 high scalability-2013-05-07-Not Invented Here: A Comical Series on Scalability
16 0.56918633 1490 high scalability-2013-07-12-Stuff The Internet Says On Scalability For July 12, 2013
17 0.56614292 1288 high scalability-2012-07-23-Ask HighScalability: How Do I Build My MegaUpload + Itunes + YouTube Startup?
18 0.56369156 481 high scalability-2009-01-02-Strategy: Understanding Your Data Leads to the Best Scalability Solutions
19 0.5635463 1458 high scalability-2013-05-15-Lesson from Airbnb: Give Yourself Permission to Experiment with Non-scalable Changes
20 0.55912304 351 high scalability-2008-07-16-The Mother of All Database Normalization Debates on Coding Horror
topicId topicWeight
[(1, 0.082), (2, 0.221), (10, 0.08), (30, 0.184), (55, 0.059), (61, 0.128), (85, 0.099)]
simIndex simValue blogId blogTitle
same-blog 1 0.94230026 361 high scalability-2008-08-08-Separation into read-write only databases
Introduction: At least in the articles on Plenty of Fish and Slashdot it was mentioned that one can achieve higher performance by creating read-only and write-only databases where possible. I have read the comments and tried unsuccessfully to find more information on the net about this. I still do not understand the concept. Can someone explain it in more detail, as well as recommend resources for further investigation? (Are there books written specifically about this technique?) I think it is a very important issue, because databases are oftentimes the bottleneck.
2 0.90128976 261 high scalability-2008-02-25-Make Your Site Run 10 Times Faster
Introduction: This is what Mike Peters says he can do : make your site run 10 times faster. His test bed is "half a dozen servers parsing 200,000 pages per hour over 40 IP addresses, 24 hours a day." Before optimization CPU spiked to 90% with 50 concurrent connections. After optimization each machine "was effectively handling 500 concurrent connections per second with CPU at 8% and no degradation in performance." Mike identifies six major bottlenecks: Database write access (read is cheaper) Database read access PHP, ASP, JSP and any other server side scripting Client side JavaScript Multiple/Fat Images, scripts or css files from different domains on your page Slow keep-alive client connections, clogging your available sockets Mike's solutions: Switch all database writes to offline processing Minimize number of database read access to the bare minimum. No more than two queries per page. Denormalize your database and Optimize MySQL tables Implement MemCached and cha
3 0.87545753 917 high scalability-2010-10-08-4 Scalability Themes from Surgecon
Introduction: Robert Haas in his SURGE Recap of the Surge conference, reflected a bit, and came up with an interesting checklist of general themes from what he was seeing. I'm directly quoting his post, so please see the post for a full discussion. He uses this framework to think about the larger picture and where PostgreSQL stands in its progression. Make use of the academic literature . Inventing your own way to do something is fine, but at least consider the possibility that someone smarter than you has thought about this problem before. Failures are inevitable, so plan for them . Try to minimize the possibility of cascading failures, and plan in advance how you can operate in degraded mode if disaster (or the Slashdot effect) strikes. Disk technology matters . Drive firmware bugs are common and nightmarish, and you can expect very limited help from the manufacturer, especially if the drive is billed as consumer-grade rather than enterprise-grade. SSDs can save you a lot of m
4 0.85916519 263 high scalability-2008-02-27-Product: System Imager - Automate Deployment and Installs
Introduction: From their website: SystemImager is software that makes the installation of Linux to masses of similar machines relatively easy. It makes software distribution, configuration, and operating system updates easy, and can also be used for content distribution. SystemImager makes it easy to do automated installs (clones), software distribution, content or data distribution, configuration changes, and operating system updates to your network of Linux machines. You can even update from one Linux release version to another! It can also be used to ensure safe production deployments. By saving your current production image before updating to your new production image, you have a highly reliable contingency mechanism. If the new production enviroment is found to be flawed, simply roll-back to the last production image with a simple update command! Some typical environments include: Internet server farms, database server farms, high performance clusters, computer labs, and corporate
5 0.85444313 831 high scalability-2010-05-26-End-To-End Performance Study of Cloud Services
Introduction: Cloud computing promises a number of advantages for the deployment of data-intensive applications. Most prominently, these include reducing cost with a pay-as-you-go pricing model and (virtually) unlimited throughput by adding servers if the workload increases. At the Systems Group , ETH Zurich, we did an extensive end-to-end performance study to compare the major cloud offerings regarding their ability to fulfill these promises and their implied cost. The focus of the work is on transaction processing (i.e., read and update work-loads), rather than analytics workloads. We used the TPC-W , a standardized benchmark simulating a Web-shop, as the baseline for our comparison. The TPC-W defines that users are simulated through emulated browsers (EB) and issue page requests, called web-interactions (WI), against the system. As a major modification to the benchmark, we constantly increase the load from 1 to 9000 simultaneous users to measure the scalability and cost variance of the syst
6 0.84904975 783 high scalability-2010-02-24-Hot Scalability Links for February 24, 2010
7 0.84647739 703 high scalability-2009-09-12-How Google Taught Me to Cache and Cash-In
8 0.8347795 1459 high scalability-2013-05-16-Paper: Warp: Multi-Key Transactions for Key-Value Stores
9 0.82916337 464 high scalability-2008-12-13-Strategy: Facebook Tweaks to Handle 6 Time as Many Memcached Requests
10 0.8289631 1284 high scalability-2012-07-16-Cinchcast Architecture - Producing 1,500 Hours of Audio Every Day
11 0.82859808 269 high scalability-2008-03-08-Audiogalaxy.com Architecture
12 0.82837713 800 high scalability-2010-03-26-Strategy: Caching 404s Saved the Onion 66% on Server Time
13 0.82654881 16 high scalability-2007-07-16-Book: High Performance MySQL
14 0.82389909 312 high scalability-2008-04-30-Rather small site architecture.
15 0.82269251 52 high scalability-2007-08-01-Product: Memcached
16 0.81777012 991 high scalability-2011-02-16-Paper: An Experimental Investigation of the Akamai Adaptive Video Streaming
17 0.81750762 1016 high scalability-2011-04-04-Scaling Social Ecommerce Architecture Case study
18 0.81653237 342 high scalability-2008-06-08-Search fast in million rows
19 0.81574535 1291 high scalability-2012-07-25-Vertical Scaling Ascendant - How are SSDs Changing Architectures?
20 0.81485784 998 high scalability-2011-03-03-Stack Overflow Architecture Update - Now at 95 Million Page Views a Month