high_scalability high_scalability-2009 high_scalability-2009-487 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: The upshot of the paper is Oracle rules and MySQL sucks for sharding. Which is technically probable, if you don't throw in minor points like cost and ease of use. The points where they think Oracle wins: online schema changes, more robust replication, higher availability, better corruption handling, better use of large RAM and multiple cores, better and better tested partitioning features, better monitoring, and better gas mileage.
sentIndex sentText sentNum sentScore
1 The upshot of the paper is Oracle rules and MySQL sucks for sharding. [sent-1, score-0.685]
2 Which is technically probable, if you don't throw in minor points like cost and ease of use. [sent-2, score-1.044]
3 The points where they think Oracle wins: online schema changes, more robust replication, higher availability, better corruption handling, better use of large RAM and multiple cores, better and better tested partitioning features, better monitoring, and better gas mileage. [sent-3, score-3.683]
wordName wordTfidf (topN-words)
[('better', 0.339), ('mileage', 0.32), ('probable', 0.301), ('upshot', 0.254), ('gas', 0.248), ('corruption', 0.224), ('points', 0.218), ('minor', 0.215), ('oracle', 0.21), ('technically', 0.196), ('sucks', 0.188), ('wins', 0.182), ('throw', 0.163), ('tested', 0.157), ('ease', 0.156), ('schema', 0.145), ('rules', 0.144), ('robust', 0.142), ('partitioning', 0.128), ('cores', 0.122), ('handling', 0.106), ('paper', 0.099), ('online', 0.098), ('higher', 0.096), ('replication', 0.093), ('ram', 0.087), ('changes', 0.085), ('monitoring', 0.083), ('availability', 0.078), ('features', 0.07), ('mysql', 0.07), ('cost', 0.069), ('multiple', 0.06), ('think', 0.056), ('large', 0.049), ('use', 0.028), ('like', 0.027)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 487 high scalability-2009-01-08-Paper: Sharding with Oracle Database
2 0.084170073 208 high scalability-2008-01-11-FTP Sanity: Redundancy, archiving, consolidation.
Introduction: Easy FTP redundancy and consolidation with the Open Source project Generic-FTP. Works with probably any Linux FTP Server (ProFTPD only one tested). Get rid of some single points of failure. A very easy to set up solution using scripts written in PHP. Tested thoroughly in a production environment.
3 0.07805346 1564 high scalability-2013-12-13-Stuff The Internet Says On Scalability For December 13th, 2013
Introduction: Hey, it's HighScalability time: Test your sense of scale. Is this image of something microscopic or macroscopic? Find out. 80 billion: Netflix logging events per day; 10 petabytes: Ancestry.com data; six million: Foursquare checkins per day; Quotable Quotes: George Lakoff: Why can't all your thoughts be conscious? Because consciousness is linear and your brain is parallel. The linear structure of consciousness could never keep up. @peakscale: "Engineers like to solve problems. If there are no problems handily available, they will create their own problems" - Scott Adams @kiwipom: “Immutability is magic pixie dust that makes distributed systems work” - Adrian Cockcroft @LachM: Netflix: SPEED at SCALE = breaks EVERYTHING. #yow13 Joe Landman: … you get really annoyed at the performance of grep on file IO (seriously folks? 32k or page size sized IO? What is this … 1992?) so you rewrite it in 20 minu
4 0.075144619 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
Introduction: We are on the edge of two potent technological changes: Clouds and Memory Based Architectures. This evolution will rip open a chasm where new players can enter and prosper. Google is the master of disk. You can't beat them at a game they perfected. Disk based databases like SimpleDB and BigTable are complicated beasts, typical last gasp products of any aging technology before a change. The next era is the age of Memory and Cloud which will allow for new players to succeed. The tipping point will be soon. Let's take a short trip down web architecture lane: It's 1993: Yahoo runs on FreeBSD, Apache, Perl scripts and a SQL database. It's 1995: Scale up the database. It's 1998: LAMP. It's 1999: Stateless + Load Balanced + Database + SAN. It's 2001: In-memory data-grid. It's 2003: Add a caching layer. It's 2004: Add scale-out and partitioning. It's 2005: Add asynchronous job scheduling and maybe a distributed file system. It's 2007: Move it all into the cloud. It's 2008: C
5 0.073993765 182 high scalability-2007-12-12-Oracle Can Do Read-Write Splitting Too
Introduction: People sometimes wonder why Oracle isn't mentioned on this site more. Maybe it will be now, as Michael Nygard reports that Oracle 11g does read/write splitting with their Active Data Guard product. Average replication latency was 1 second and it's accomplished with standard Oracle JDBC drivers. They see a 250% increase in transactions per second for read-write service, and a 110% improvement in tps for read-only service. You see a change in hardware architecture with the new setup. They now recommend using a primary and multiple standby servers, a single controller per server, and a single set of disks in RAID1. Previously the recommendation was to have a primary and secondary server with two controllers per server and a set of mirrored disks per controller. The changes increase performance, availability, and hardware utilization. They also have a useful-looking best practices document for High Availability called Maximum Availability Architecture (MAA).
6 0.070482217 767 high scalability-2010-01-27-Hot Scalability Links for January 28 2010
7 0.070240192 925 high scalability-2010-10-22-Paper: Netflix’s Transition to High-Availability Storage Systems
9 0.064874418 1190 high scalability-2012-02-10-Stuff The Internet Says On Scalability For February 10, 2012
11 0.06218601 1134 high scalability-2011-10-28-Stuff The Internet Says On Scalability For October 28, 2011
12 0.06184867 392 high scalability-2008-09-24-Building a Scalable Architecture for Web Apps
14 0.060078494 1445 high scalability-2013-04-24-Strategy: Using Lots of RAM Often Cheaper than Using a Hadoop Cluster
17 0.058540605 1565 high scalability-2013-12-16-22 Recommendations for Building Effective High Traffic Web Software
19 0.05568463 1472 high scalability-2013-06-07-Stuff The Internet Says On Scalability For June 7, 2013
20 0.055237122 459 high scalability-2008-12-03-Java World Interview on Scalability and Other Java Scalability Secrets
topicId topicWeight
[(0, 0.09), (1, 0.018), (2, -0.0), (3, -0.002), (4, -0.02), (5, 0.063), (6, -0.009), (7, -0.025), (8, -0.007), (9, -0.041), (10, -0.019), (11, -0.019), (12, 0.015), (13, 0.028), (14, -0.005), (15, 0.016), (16, 0.01), (17, 0.009), (18, -0.01), (19, 0.012), (20, 0.046), (21, 0.022), (22, -0.051), (23, -0.013), (24, -0.03), (25, 0.036), (26, 0.004), (27, -0.017), (28, 0.042), (29, 0.034), (30, 0.011), (31, 0.032), (32, -0.01), (33, 0.0), (34, -0.014), (35, -0.023), (36, -0.016), (37, 0.008), (38, -0.006), (39, 0.008), (40, 0.02), (41, -0.021), (42, 0.012), (43, -0.004), (44, -0.02), (45, -0.005), (46, -0.011), (47, -0.019), (48, 0.022), (49, 0.009)]
simIndex simValue blogId blogTitle
same-blog 1 0.97790593 487 high scalability-2009-01-08-Paper: Sharding with Oracle Database
2 0.72666061 303 high scalability-2008-04-18-Scaling Mania at MySQL Conference 2008
Introduction: The 2008 MySQL Conference & Expo has now closed, but what is still open for viewing is all the MySQL scaling knowledge that was shared. Planet MySQL is a great source of the goings on: Scaling out MySQL: Hardware today and tomorrow by Jeremy Cole and Eric Bergen of Proven Scaling. In it are answered all the big questions of life: What about 64-bit? How many cores? How much memory? Shared storage? Finally we learn the secrets of true happiness. Panel Video: Scaling MySQL? Up or Out? Don't have time? Take a look at the Diamond Note excellent game day summary. Companies like MySQL, Sun, Flickr, Fotolog, Wikipedia, Facebook and YouTube share intel on how many web servers they have, how they handle failure, and how they scale. Kevin Burton in Scaling MySQL and Java in High Write Throughput Environments - How we built Spinn3r shows how they crawl and index 500k posts per hour using MySQL and 40 servers. Venu Anuganti channels Dathan Pattishall's talk on scaling heavy con
3 0.70799303 16 high scalability-2007-07-16-Book: High Performance MySQL
Introduction: As users come to depend on MySQL, they find that they have to deal with issues of reliability, scalability, and performance--issues that are not well documented but are critical to a smoothly functioning site. This book is an insider's guide to these little understood topics. Author Jeremy Zawodny has managed large numbers of MySQL servers for mission-critical work at Yahoo!, maintained years of contacts with the MySQL AB team, and presents regularly at conferences. Jeremy and Derek have spent months experimenting, interviewing major users of MySQL, talking to MySQL AB, benchmarking, and writing some of their own tools in order to produce the information in this book. In High Performance MySQL you will learn about MySQL indexing and optimization in depth so you can make better use of these key features. You will learn practical replication, backup, and load-balancing strategies with information that goes beyond available tools to discuss their effects in real-life environments. And you
4 0.69134086 455 high scalability-2008-12-01-MySQL Database Scale-out and Replication for High Growth Businesses
Introduction: It is widely recognized that MySQL is the most popular database software in the world. Since its inception in 1995, there have been 11 million product installations around the world in a wide variety of markets. There are more installations of MySQL in use today than any other database architecture. From startup companies hoping to be the next Web2.0 poster child to large global enterprises, the MySQL database architecture has proven to be flexible, extendable, scalable, and more than capable of filling high-capacity database roles in very different venues.
5 0.68753386 1281 high scalability-2012-07-11-FictionPress: Publishing 6 Million Works of Fiction on the Web
Introduction: FictionPress operates both FictionPress.com and FanFiction.net and is home to over 6 million works of fiction, with millions of writers/readers participating from around the world in over 30 languages. Issues addressed: Support complex and efficient indexes at 100+ million rows. Predictable and consistent performance regardless of data size growth. Fast recovery. Ensuring Predictable Performance at Scale The Challenge: FictionPress offers a number of interactive features to its large user base. These include discussion forums, in-site messaging and user reviews. FictionPress made the decision to build its own discussion forums to meet its strict security and performance requirements. Xing Li, CTO of FictionPress, noted that the site “needs to host hundreds of thousands of forums. Existing forum software doesn’t do this while meeting our performance and security targets.” To ensure the real-time responsiveness of the forums, FictionPress needs the ability to creat
6 0.67371118 17 high scalability-2007-07-16-Paper: Guide to Cost-effective Database Scale-Out using MySQL
8 0.66886735 454 high scalability-2008-12-01-Deploying MySQL Database in Solaris Cluster Environments
9 0.63838911 4 high scalability-2007-07-10-Webcast: Advanced Database High Availability and Scalability Solutions
10 0.632155 1527 high scalability-2013-10-04-Stuff The Internet Says On Scalability For October 4th, 2013
11 0.62566042 847 high scalability-2010-06-23-Product: dbShards - Share Nothing. Shard Everything.
12 0.62448299 793 high scalability-2010-03-10-Saying Yes to NoSQL; Going Steady with Cassandra at Digg
13 0.62421155 465 high scalability-2008-12-14-Scaling MySQL on a 256-way T5440 server using Solaris ZFS and Java 1.7
14 0.62044013 1322 high scalability-2012-09-14-Stuff The Internet Says On Scalability For September 14, 2012
15 0.61909354 967 high scalability-2011-01-03-Stuff The Internet Says On Scalability For January 3, 2010
16 0.61722857 831 high scalability-2010-05-26-End-To-End Performance Study of Cloud Services
17 0.6167258 1231 high scalability-2012-04-20-Stuff The Internet Says On Scalability For April 20, 2012
18 0.61133927 1463 high scalability-2013-05-23-Paper: Calvin: Fast Distributed Transactions for Partitioned Database Systems
19 0.6071955 586 high scalability-2009-04-29-Presentations: MySQL Conference & Expo 2009
20 0.59780288 1227 high scalability-2012-04-13-Stuff The Internet Says On Scalability For April 13, 2012
topicId topicWeight
[(1, 0.079), (2, 0.235), (14, 0.376), (61, 0.158)]
simIndex simValue blogId blogTitle
1 0.82277125 441 high scalability-2008-11-13-CloudCamp London 2: private clouds and standardisation
Introduction: CloudCamp returned to London yesterday, organised with the help of Skills Matter at the Crypt on Clerkenwell Green. The main topics of this cloud/grid computing community meeting were service-level agreements, connecting private and public clouds, and standardisation issues.
2 0.81828612 599 high scalability-2009-05-14-Who Has the Most Web Servers?
Introduction: An interesting post on DataCenterKnowledge! 1&1 Internet: 55,000 servers; Rackspace: 50,038 servers; The Planet: 48,500 servers; Akamai Technologies: 48,000 servers; OVH: 40,000 servers; SBC Communications: 29,193 servers; Verizon: 25,788 servers; Time Warner Cable: 24,817 servers; SoftLayer: 21,000 servers; AT&T: 20,268 servers; iWeb: 10,000 servers. How about Google, Microsoft, Amazon, eBay, Yahoo, GoDaddy, Facebook? Check out the post on DataCenterKnowledge and of course here on highscalability.com!
3 0.77391237 495 high scalability-2009-01-17-Intro to Caching,Caching algorithms and caching frameworks part 1
Introduction: Informative and well-organized post on caching. Talks about: Why do we need cache?, What is Cache?, Cache Hit, Cache Miss, Storage Cost, Retrieval Cost, Invalidation, Replacement Policy, Optimal Replacement Policy, Caching Algorithms, Least Frequently Used (LFU), Least Recently Used (LRU), Least Recently Used 2 (LRU2), Two Queues, Adaptive Replacement Cache (ARC), Most Recently Used (MRU), First in First out (FIFO), Distributed caching, Measuring Cache.
same-blog 4 0.75527292 487 high scalability-2009-01-08-Paper: Sharding with Oracle Database
5 0.75328505 405 high scalability-2008-10-07-Help a Scoble out. What should Robert ask in his scalability interview?
Introduction: One of the cool things about Mr. Scoble is he doesn't pretend to know everything, which can be a deadly boring affliction in this field. In this case Robert is asking for help in an upcoming interview. Maybe we can help? Here's Robert's plight: I’m really freaked out. I have one of the biggest interviews of my life coming up and I’m way underqualified to host it. It’s on Thursday and it’s about Scalability and Performance of Web Services. Look at who will be on. Matt Mullenweg, founder of Automattic, the company behind WordPress (and behind this blog). Paul Buchheit, one of the founders of FriendFeed and the creator of Gmail (he’s also the guy who gave Google the “don’t be evil” admonition). Nat Brown, CTO of iLike, which got six million users on Facebook in about 10 days. What would you ask?
6 0.73395306 981 high scalability-2011-02-01-Google Strategy: Tree Distribution of Requests and Responses
7 0.72179401 537 high scalability-2009-03-12-QCon London 2009: Database projects to watch closely
8 0.71665609 1253 high scalability-2012-05-28-The Anatomy of Search Technology: Crawling using Combinators
9 0.68241471 725 high scalability-2009-10-21-Manage virtualized sprawl with VRMs
10 0.66239399 694 high scalability-2009-09-04-Hot Links for 2009-9-4
11 0.62850523 744 high scalability-2009-11-24-Hot Scalability Links for Nov 24 2009
12 0.62607116 1278 high scalability-2012-07-06-Stuff The Internet Says On Scalability For July 6, 2012
13 0.60572153 685 high scalability-2009-08-20-Dependency Injection and AOP frameworks for .NET
14 0.60123718 877 high scalability-2010-08-12-Designing Web Applications for Scalability
15 0.59303391 564 high scalability-2009-04-10-counting # of views, calculating most-least viewed
16 0.58695006 1135 high scalability-2011-10-31-15 Ways to Make Your Application Feel More Responsive under Google App Engine
17 0.58691841 637 high scalability-2009-06-24-Habits of Highly Scalable Web Applications
18 0.5854829 128 high scalability-2007-10-21-Paper: Standardizing Storage Clusters (with pNFS)
19 0.58512396 1538 high scalability-2013-10-28-Design Decisions for Scaling Your High Traffic Feeds
20 0.58355308 80 high scalability-2007-09-06-Product: Perdition Mail Retrieval Proxy