high_scalability high_scalability-2011 high_scalability-2011-1099 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Sean Hall with some really good MySQL advice : Tune those queries . Biggest bang for your buck. Enable the slow query log and watch it. Once you've found a heavy resource intensive query, optimize it! Tune what receives real-world traffic. Employ Master-Master Replication . Immediately have a read-only slave for your application to hit as well. Use Your Memory . Set innodb_buffer_pool_size, key_buffer_size and other key options. RAID Your Disk I/O . Use RAID 10 mirroring and striping. On EC2 striping across a number of EBS volumes using the Linux md software raid. Tune Key Parameters . speeds up inserts & updates with innodb_flush_log_at_trx_commit=2; create a tablespace and underlying datafile for each table with innodb_file_per_table. Full details at the original article .
sentIndex sentText sentNum sentScore
1 Sean Hall with some really good MySQL advice : Tune those queries . [sent-1, score-0.195]
2 Once you've found a heavy resource intensive query, optimize it! [sent-4, score-0.554]
3 Immediately have a read-only slave for your application to hit as well. [sent-7, score-0.303]
4 On EC2 striping across a number of EBS volumes using the Linux md software raid. [sent-12, score-0.941]
5 speeds up inserts & updates with innodb_flush_log_at_trx_commit=2; create a tablespace and underlying datafile for each table with innodb_file_per_table. [sent-14, score-0.668]
wordName wordTfidf (topN-words)
[('md', 0.308), ('striping', 0.276), ('tune', 0.25), ('mirroring', 0.234), ('theoriginal', 0.226), ('employ', 0.212), ('sean', 0.212), ('bang', 0.207), ('inserts', 0.192), ('receives', 0.187), ('query', 0.176), ('speeds', 0.167), ('volumes', 0.167), ('slave', 0.155), ('ebs', 0.151), ('watch', 0.149), ('intensive', 0.146), ('raid', 0.144), ('advice', 0.144), ('underlying', 0.134), ('immediately', 0.131), ('key', 0.126), ('enable', 0.125), ('heavy', 0.121), ('biggest', 0.119), ('optimize', 0.116), ('updates', 0.111), ('table', 0.106), ('hit', 0.106), ('log', 0.102), ('slow', 0.095), ('linux', 0.094), ('details', 0.092), ('resource', 0.089), ('article', 0.085), ('found', 0.082), ('disk', 0.079), ('mysql', 0.067), ('create', 0.064), ('set', 0.06), ('number', 0.056), ('across', 0.055), ('really', 0.051), ('software', 0.05), ('good', 0.043), ('application', 0.042), ('using', 0.029), ('use', 0.027)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 1099 high scalability-2011-08-16-The 5 Biggest Ways to Boost MySQL Scalability
Introduction: Sean Hall with some really good MySQL advice : Tune those queries . Biggest bang for your buck. Enable the slow query log and watch it. Once you've found a heavy resource intensive query, optimize it! Tune what receives real-world traffic. Employ Master-Master Replication . Immediately have a read-only slave for your application to hit as well. Use Your Memory . Set innodb_buffer_pool_size, key_buffer_size and other key options. RAID Your Disk I/O . Use RAID 10 mirroring and striping. On EC2 striping across a number of EBS volumes using the Linux md software raid. Tune Key Parameters . speeds up inserts & updates with innodb_flush_log_at_trx_commit=2; create a tablespace and underlying datafile for each table with innodb_file_per_table. Full details at the original article .
2 0.15045346 1121 high scalability-2011-09-21-5 Scalability Poisons and 3 Cloud Scalability Antidotes
Introduction: Sean Hull with two helpful posts: 5 Things That are Toxic to Scalability : Object Relational Mappers. Create complex queries that hard to optimize and tweak. Synchronous, Serial, Coupled or Locking Processes. Locks are like stop signs, traffic circles keep the traffic flowing. Row level locking is better than table level locking. Use async replication. Use eventual consistency for clusters. One Copy of Your Database. A single database server is a choke point. Create parallel databases and let a driver select between them. Having No Metrics. Visualize what's happening to your system using one of the many monitoring packages. Lack of Feature Flags. Be able to turn off features via a flag so when a spike hits features can be turned off to reduce load. 3 Ways to Boost Cloud Scalability : Use Auto-scaling. Spin-up new instances when a threshold is passed and back down again when traffic drops. Horizontally Scale the Database Tier. MySQL in a master
Introduction: Amazon created a whole new class of service with their Provisioned IOPS for RDS, EBS, and DynamoDB. The idea is simple. If you want more performance, you turn a dial up. If you want less, you turn a dial down. A beautifully simple model. You pay for the performance you want, which is different than their previous cloud model, where performance varied, but you paid only for what you used. The question: Do these higher priced services really work better? Rodrigo Campos put this question to the test (only for EBS) by running a benchmark he describes in IOMelt Provisioned IOPS EBS Benchmark Results - December 2012 . The result? Yes, AWS Provisioned IOPS Volumes Really Deliver More Consistent and Higher Performance IO : It is clear that the provisioned IOPS EBS volumes offer a huge performance upgrade when compared to the non-optimized EBS volumes, but as data has to be spread among more underlying disks or systems, it seems that the volume is increasingly more susceptibl
4 0.11846897 829 high scalability-2010-05-20-Strategy: Scale Writes to 734 Million Records Per Day Using Time Partitioning
Introduction: In Scaling writes in MySQL ( slides ) Philip Tellis, while working for Yahoo, describes how using time based partitions they were able to increase their write capability from 2100 inserts per second (7 million a day) to a sustained 8500 inserts per second (734 million a day). This was capacity enough to handle the load during Michael Jackson's memorial service. In summary, the secrets to scalable writes are: Bulk inserts push up insert rate Partitioning lets you insert more records Partition based on incoming data for fast inserts Partitioning is a standard approach for handling high write loads because it means data can be written to different hard disks in parallel. In this example Phillip created a separate table for each day with each table having it's own database file. Each table is partitioned on time, 12 partitions per day, 2 hours of data per partition. Huge log streams are often handled this way. Other advantages of this approach: 1) fast drop table operations 2
5 0.11818278 881 high scalability-2010-08-16-Scaling an AWS infrastructure - Tools and Patterns
Introduction: This is a guest post by Frédéric Faure (architect at Ysance ), you can follow him on twitter . How do you scale an AWS (Amazon Web Services) infrastructure? This article will give you a detailed reply in two parts: the tools you can use to make the most of Amazon’s dynamic approach, and the architectural model you should adopt for a scalable infrastructure. I base my report on my experience gained in several AWS production projects in casual gaming (Facebook), e-commerce infrastructures and within the mainstream GIS (Geographic Information System). It’s true that my experience in gaming ( IsCool, The Game ) is currently the most representative in terms of scalability, due to the number of users (over 800 thousand DAU – daily active users – at peak usage and over 20 million page views every day), however my experiences in e-commerce and GIS (currently underway) provide a different view of scalability, taking into account the various problems of availability and da
6 0.10631812 1631 high scalability-2014-04-14-How do you even do anything without using EBS?
7 0.10112073 570 high scalability-2009-04-15-Implementing large scale web analytics
8 0.10090921 1114 high scalability-2011-09-13-Must see: 5 Steps to Scaling MongoDB (Or Any DB) in 8 Minutes
9 0.08918307 1529 high scalability-2013-10-08-F1 and Spanner Holistically Compared
10 0.088434875 1514 high scalability-2013-09-09-Need Help with Database Scalability? Understand I-O
11 0.08791668 1508 high scalability-2013-08-28-Sean Hull's 20 Biggest Bottlenecks that Reduce and Slow Down Scalability
12 0.087229893 1473 high scalability-2013-06-10-The 10 Deadly Sins Against Scalability
13 0.086752236 1348 high scalability-2012-10-26-Stuff The Internet Says On Scalability For October 26, 2012
14 0.078764699 986 high scalability-2011-02-10-Database Isolation Levels And Their Effects on Performance and Scalability
15 0.076570049 1279 high scalability-2012-07-09-Data Replication in NoSQL Databases
16 0.075661123 831 high scalability-2010-05-26-End-To-End Performance Study of Cloud Services
17 0.07339821 233 high scalability-2008-01-30-How Rackspace Now Uses MapReduce and Hadoop to Query Terabytes of Data
18 0.072671741 1640 high scalability-2014-04-30-10 Tips for Optimizing NGINX and PHP-fpm for High Traffic Sites
19 0.070313692 274 high scalability-2008-03-12-YouTube Architecture
20 0.069754243 53 high scalability-2007-08-01-Product: MogileFS
topicId topicWeight
[(0, 0.095), (1, 0.051), (2, -0.039), (3, -0.027), (4, -0.012), (5, 0.064), (6, 0.019), (7, -0.043), (8, 0.038), (9, -0.047), (10, -0.004), (11, -0.042), (12, 0.012), (13, -0.002), (14, -0.015), (15, 0.012), (16, -0.002), (17, -0.009), (18, -0.007), (19, 0.011), (20, 0.058), (21, -0.034), (22, -0.03), (23, 0.102), (24, 0.018), (25, 0.031), (26, -0.066), (27, -0.003), (28, 0.001), (29, -0.008), (30, -0.06), (31, -0.04), (32, 0.072), (33, 0.036), (34, -0.003), (35, 0.007), (36, 0.081), (37, -0.002), (38, -0.035), (39, -0.069), (40, 0.013), (41, 0.016), (42, -0.026), (43, -0.028), (44, 0.03), (45, -0.042), (46, 0.02), (47, 0.04), (48, 0.025), (49, 0.047)]
simIndex simValue blogId blogTitle
same-blog 1 0.9713679 1099 high scalability-2011-08-16-The 5 Biggest Ways to Boost MySQL Scalability
Introduction: Sean Hall with some really good MySQL advice : Tune those queries . Biggest bang for your buck. Enable the slow query log and watch it. Once you've found a heavy resource intensive query, optimize it! Tune what receives real-world traffic. Employ Master-Master Replication . Immediately have a read-only slave for your application to hit as well. Use Your Memory . Set innodb_buffer_pool_size, key_buffer_size and other key options. RAID Your Disk I/O . Use RAID 10 mirroring and striping. On EC2 striping across a number of EBS volumes using the Linux md software raid. Tune Key Parameters . speeds up inserts & updates with innodb_flush_log_at_trx_commit=2; create a tablespace and underlying datafile for each table with innodb_file_per_table. Full details at the original article .
Introduction: Amazon created a whole new class of service with their Provisioned IOPS for RDS, EBS, and DynamoDB. The idea is simple. If you want more performance, you turn a dial up. If you want less, you turn a dial down. A beautifully simple model. You pay for the performance you want, which is different than their previous cloud model, where performance varied, but you paid only for what you used. The question: Do these higher priced services really work better? Rodrigo Campos put this question to the test (only for EBS) by running a benchmark he describes in IOMelt Provisioned IOPS EBS Benchmark Results - December 2012 . The result? Yes, AWS Provisioned IOPS Volumes Really Deliver More Consistent and Higher Performance IO : It is clear that the provisioned IOPS EBS volumes offer a huge performance upgrade when compared to the non-optimized EBS volumes, but as data has to be spread among more underlying disks or systems, it seems that the volume is increasingly more susceptibl
3 0.58215946 986 high scalability-2011-02-10-Database Isolation Levels And Their Effects on Performance and Scalability
Introduction: Some of us are not aware of the tremendous job databases perform, particularly their efforts to maintain the Isolation aspect of ACID. For example, some people believe that transactions are only related to data manipulation and not to queries, which is an incorrect assumption. Transaction Isolation is all about queries, and the consistency and completeness of the data retrieved by queries. This is how it works: Isolation gives the querying user the feeling that he owns the database. It does not matter that hundreds or thousands of concurrent users work with the same database and the same schema (or even the same data). These other uses can generate new data, modify existing data or perform any other action. The querying user must be able to get a complete, consistent picture of the data, unaffected by other users’ actions. Let’s take the following scenario, which is based on an Orders table that has 1,000,000 rows, with a disk size of 20 GB: 8:00: UserA started a query “SELECT
4 0.54671198 881 high scalability-2010-08-16-Scaling an AWS infrastructure - Tools and Patterns
Introduction: This is a guest post by Frédéric Faure (architect at Ysance ), you can follow him on twitter . How do you scale an AWS (Amazon Web Services) infrastructure? This article will give you a detailed reply in two parts: the tools you can use to make the most of Amazon’s dynamic approach, and the architectural model you should adopt for a scalable infrastructure. I base my report on my experience gained in several AWS production projects in casual gaming (Facebook), e-commerce infrastructures and within the mainstream GIS (Geographic Information System). It’s true that my experience in gaming ( IsCool, The Game ) is currently the most representative in terms of scalability, due to the number of users (over 800 thousand DAU – daily active users – at peak usage and over 20 million page views every day), however my experiences in e-commerce and GIS (currently underway) provide a different view of scalability, taking into account the various problems of availability and da
5 0.53983408 1121 high scalability-2011-09-21-5 Scalability Poisons and 3 Cloud Scalability Antidotes
Introduction: Sean Hull with two helpful posts: 5 Things That are Toxic to Scalability : Object Relational Mappers. Create complex queries that hard to optimize and tweak. Synchronous, Serial, Coupled or Locking Processes. Locks are like stop signs, traffic circles keep the traffic flowing. Row level locking is better than table level locking. Use async replication. Use eventual consistency for clusters. One Copy of Your Database. A single database server is a choke point. Create parallel databases and let a driver select between them. Having No Metrics. Visualize what's happening to your system using one of the many monitoring packages. Lack of Feature Flags. Be able to turn off features via a flag so when a spike hits features can be turned off to reduce load. 3 Ways to Boost Cloud Scalability : Use Auto-scaling. Spin-up new instances when a threshold is passed and back down again when traffic drops. Horizontally Scale the Database Tier. MySQL in a master
6 0.53606331 1281 high scalability-2012-07-11-FictionPress: Publishing 6 Million Works of Fiction on the Web
8 0.52739358 261 high scalability-2008-02-25-Make Your Site Run 10 Times Faster
9 0.52672064 1304 high scalability-2012-08-14-MemSQL Architecture - The Fast (MVCC, InMem, LockFree, CodeGen) and Familiar (SQL)
11 0.52432019 995 high scalability-2011-02-24-Strategy: Eliminate Unnecessary SQL
12 0.52307904 729 high scalability-2009-10-28-And the winner is: MySQL or Memcached or Tokyo Tyrant?
13 0.51846254 1348 high scalability-2012-10-26-Stuff The Internet Says On Scalability For October 26, 2012
14 0.51715505 1114 high scalability-2011-09-13-Must see: 5 Steps to Scaling MongoDB (Or Any DB) in 8 Minutes
15 0.51452953 1631 high scalability-2014-04-14-How do you even do anything without using EBS?
16 0.51077604 1650 high scalability-2014-05-19-A Short On How the Wayback Machine Stores More Pages than Stars in the Milky Way
17 0.50608093 1364 high scalability-2012-11-29-Performance data for LevelDB, Berkley DB and BangDB for Random Operations
18 0.50379962 1306 high scalability-2012-08-16-Stuff The Internet Says On Scalability For August 17, 2012
19 0.50284356 1508 high scalability-2013-08-28-Sean Hull's 20 Biggest Bottlenecks that Reduce and Slow Down Scalability
20 0.50269431 257 high scalability-2008-02-22-Kevin's Great Adventures in SSDland
topicId topicWeight
[(1, 0.194), (2, 0.208), (10, 0.036), (45, 0.261), (79, 0.082), (85, 0.087)]
simIndex simValue blogId blogTitle
1 0.90514523 600 high scalability-2009-05-15-Wolfram|Alpha Architecture
Introduction: Making the world's knowledge computable Today's Wolfram|Alpha is the first step in an ambitious, long-term project to make all systematic knowledge immediately computable by anyone. You enter your question or calculation, and Wolfram|Alpha uses its built-in algorithms and growing collection of data to compute the answer. Answer Engine vs Search Engine When Wolfram|Alpha launches later today, it will be one of the most computationally intensive websites on the internet . The Wolfram|Alpha computational knowledge engine is an "answer engine" that is able to produce answers to various questions such as What is the GDP of France? Weather is Springfield when David Ortiz was born 33 g of gold LDL vs. serum potassium 150 smoker male age 40 life expectancy male age 40 finland highschool teacher median wage Wolfram|Alpha excels at different areas like mathematics, statistics, physics, engineering, astronomy, chemistry, life sciences, geology, busine
same-blog 2 0.88637066 1099 high scalability-2011-08-16-The 5 Biggest Ways to Boost MySQL Scalability
Introduction: Sean Hall with some really good MySQL advice : Tune those queries . Biggest bang for your buck. Enable the slow query log and watch it. Once you've found a heavy resource intensive query, optimize it! Tune what receives real-world traffic. Employ Master-Master Replication . Immediately have a read-only slave for your application to hit as well. Use Your Memory . Set innodb_buffer_pool_size, key_buffer_size and other key options. RAID Your Disk I/O . Use RAID 10 mirroring and striping. On EC2 striping across a number of EBS volumes using the Linux md software raid. Tune Key Parameters . speeds up inserts & updates with innodb_flush_log_at_trx_commit=2; create a tablespace and underlying datafile for each table with innodb_file_per_table. Full details at the original article .
3 0.87826288 267 high scalability-2008-03-05-Oprah is the Real Social Network
Introduction: A lot of new internet TV station startups are in the wind these days and there's a question about how they can scale their broadcasts. Today's state of the art shows you can't yet mimic the reach of broadcast TV with internet tech. But as Oprah proves, you can still capture a lot of eyeballs, if you are Oprah... Oprah drew a stunning 500,000 simultaneous viewers for an Eckhart Tolle webcast. Move Networks and Limelight Networks hosted the "broadcast" where traffic peaked at 242Gbps. A variable bitrate scheme was used so depending on their connection, a viewer could have seen 150Kbps or as high as 750Kbps. Dan Rayburn thinks The big take away from this webcast is that it shows proof that the Internet is not built to handle TV like distribution and those who think that live TV shows will be broadcast on the Internet with millions and millions of people watching, it's just not going to happen. To handle more users comments suggested capping the bitrate at 300K, using P2P
4 0.82175624 24 high scalability-2007-07-24-Product: Hibernate Shards
Introduction: If you want to adopt a shard architecture, but don't want to start from scratch, you may want to consider Hibernate's sharding system. Hibernate Shards is a framework that is designed to encapsulate and minimize this complexity by adding support for horizontal partitioning to Hibernate Core. Hibernate Shards key features: Standard Hibernate programming model - Hibernate Shards allows you to continue using the Hibernate APIs you know and love: SessionFactory, Session, Criteria, Query. If you already know how to use Hibernate, you already know how to use Hibernate Shards. Flexible sharding strategies - Distribute data across your shards any way you want. Use one of the default strategies we provide or plug in your own application-specific logic. Support for virtual shards - Think your sharding strategy is never going to change? Think again. Adding new shards and redistributing your data is one of the toughest operational challenges you will face once you've deployed your
5 0.80065477 819 high scalability-2010-04-30-Hot Scalability Links for April 30, 2010
Introduction: I Want a New Data Store . Jeremy Zawodny of Craigslist wants a new database, one that can do what it should: perform alter table operations faster, has efficient queries when most of the data is on disk and not in RAM, and matches their data that now looks more document oriented than relational. A lot of people willing to help. Computer Science Unplugged . An extensive collection of free resources that teach principles of Computer Science such as binary numbers , algorithms and data compression through engaging games and puzzles that use cards, string, crayons and lots of running around . And it's free! Fascinating Interview with Tim Bell on teaching complex computing concepts, creating makers not just users, and how to change schools. From O'Reilly Radar . Akamai’s Network Now Pushes Terabits of Data Every Second . Akamai handles 12 million requests per second, logs more than 500 billion requests for content per day, and sends 3.45 terabits per secon
6 0.7804628 729 high scalability-2009-10-28-And the winner is: MySQL or Memcached or Tokyo Tyrant?
7 0.7791273 527 high scalability-2009-03-06-Cloud Programming Directly Feeds Cost Allocation Back into Software Design
8 0.76009113 260 high scalability-2008-02-25-Architecture Template Advice Needed
9 0.76009077 259 high scalability-2008-02-25-Any Suggestions for the Architecture Template?
10 0.75570053 1354 high scalability-2012-11-05-Are we seeing the renaissance of enterprises in the cloud?
11 0.75050306 603 high scalability-2009-05-19-Scaling Memcached: 500,000+ Operations-Second with a Single-Socket UltraSPARC T2
12 0.74504155 231 high scalability-2008-01-29-Too many databases
13 0.74110526 1428 high scalability-2013-03-22-Stuff The Internet Says On Scalability For March 22, 2013
14 0.74011278 1472 high scalability-2013-06-07-Stuff The Internet Says On Scalability For June 7, 2013
15 0.73947901 502 high scalability-2009-01-26-Paper: Scalability by Design - Coding for Systems With Large CPU Counts
16 0.73680931 558 high scalability-2009-04-06-How do you monitor the performance of your cluster?
17 0.73633969 904 high scalability-2010-09-21-Playfish's Social Gaming Architecture - 50 Million Monthly Users and Growing
18 0.73577106 511 high scalability-2009-02-12-MySpace Architecture
19 0.73526853 118 high scalability-2007-10-09-High Load on production Webservers after Sourcecode sync
20 0.73415178 865 high scalability-2010-07-27-A Metric A$$-Ton of Joe Stump: The Cloud is Cheaper than Bare Metal