high_scalability high_scalability-2010 high_scalability-2010-902 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: This strategy is from the Large Hadron Collider project: Improvements in performance per Watt have caused CERN to no longer sign hardware support contracts longer than three years. Machines run until they die. They have a very high utilization of equipment (‘duty cycle’, 7 x 24 x 365). Replacing hardware makes more sense because of the lower cost and the power savings of new hardware.
sentIndex sentText sentNum sentScore
1 This strategy is from the Large Hadron Collider project: Improvements in performance per Watt have caused CERN to no longer sign hardware support contracts longer than three years. [sent-1, score-1.671]
2 They have a very high utilization of equipment (‘duty cycle’, 7 x 24 x 365). [sent-3, score-0.439]
3 Replacing hardware makes more sense because of the lower cost and the power savings of new hardware. [sent-4, score-0.976]
wordName wordTfidf (topN-words)
[('thelarge', 0.373), ('watt', 0.334), ('hadron', 0.322), ('cern', 0.312), ('hardware', 0.243), ('longer', 0.242), ('contracts', 0.237), ('replacing', 0.208), ('equipment', 0.204), ('savings', 0.2), ('cycle', 0.184), ('utilization', 0.183), ('improvements', 0.173), ('caused', 0.168), ('sign', 0.137), ('lower', 0.133), ('sense', 0.123), ('strategy', 0.114), ('three', 0.105), ('machines', 0.1), ('power', 0.082), ('cost', 0.08), ('makes', 0.078), ('support', 0.076), ('run', 0.064), ('per', 0.064), ('high', 0.052), ('performance', 0.043), ('new', 0.037)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 902 high scalability-2010-09-16-Strategy: Buy New, Don't Fix the Old
Introduction: This strategy is from the Large Hadron Collider project: Improvements in performance per Watt have caused CERN to no longer sign hardware support contracts longer than three years. Machines run until they die. They have a very high utilization of equipment (‘duty cycle’, 7 x 24 x 365). Replacing hardware makes more sense because of the lower cost and the power savings of new hardware.
2 0.29750666 901 high scalability-2010-09-16-How Can the Large Hadron Collider Withstand One Petabyte of Data a Second?
Introduction: Why is there something rather than nothing? That's the kind of question the Large Hadron Collider in CERN is hopefully poised to answer. And what is the output of this beautiful 17-mile long, 6 billion dollar wabi-sabish proton smashing machine? Data. Great heaping torrents of Grand Canyon sized data. 15 million gigabytes every year. That's 1000 times the information printed in books every year. It's so much data 10,000 scientists will use a grid of 80,000+ computers , in 300 computer centers , in 50 different countries just to help make sense of it all. How will all this data be collected, transported, stored, and analyzed? It turns out, using what amounts to sort of Internet of Particles instead of an Internet of Things. Two good articles have recently shed some electro-magnetic energy in the human visible spectrum on the IT aspects of the collider: LHC computing grid pushes petabytes of data, beats expectations by John Timmer on Ars Technica and an overview of the Br
3 0.094594672 1286 high scalability-2012-07-18-Strategy: Kill Off Multi-tenant Instances with High CPU Stolen Time
Introduction: Are all instances created equal? Perhaps because under multi-tenancy multiple virtual machines run on the same physical host, not all applications will run equally well on every instance. In that case it makes sense to measure and move to a better performing instance. That's the interesting idea from @botchagalupe : Imagine something like a "performance monkey" where an infrastructure is so bound that it can kill lower performing instances automatically. @adrianco says Netflix has throught of doing the same: We've looked at killing off multi-tenant instances that have high CPU stolen time... Related Articles Host server CPU utilization in Amazon EC2 cloud
4 0.088571191 865 high scalability-2010-07-27-A Metric A$$-Ton of Joe Stump: The Cloud is Cheaper than Bare Metal
Introduction: Should you pay more in the cloud or pay less for bare metal in the datacenter? This is a crucial decision point facing startups today. Which way should you go? In this Webpulp.tv interview , Joe Stump, always a go-to guy when you need a metric ass-ton (a favorite expression of Joe’s) of good advice on cutting edge practices for the modern startup, laughs at conventional wisdom by saying the cloud is really not more expensive than bare metal. The argument for a cheaper cloud has a three main points: Raw Hardware Costs are Close . Buying hardware is cheaper than renting hardware from Amazon, but it’s close when you consider reserved instances . A reserved instance costs about $250 a year and a new server costs about $1500. Let’s say Amazon is 30% slower. That means for a cluster of 20 machines you would need a another 7 servers. This takes 10 minutes and 4 button clicks to provision on the cloud (the advantage of a scale-out architecture + automation + cloud aware tools).
5 0.078611463 1419 high scalability-2013-03-07-It's a VM Wasteland - A Near Optimal Packing of VMs to Machines Reduces TCO by 22%
Introduction: In Algorithm Design for Performance Aware VM Consolidation we learn some shocking facts (gambling in Casablanca?): Average server utilization in many data centers is low, estimated between 5% and 15%. This is wasteful because an idle server often consumes more than 50% of peak power. Surely that's just for old style datacenters? Nope. In Google data centers, workloads that are consolidated use only 50% of the processor cores. Every other processor core is left unused simply to ensure that performance does not degrade. It's a VM wasteland. The goal is to reduce waste by packing VMs onto machines without hurting performance or wasting resources. The idea is to select VMs that interfere the least with each other and places them together on the same server. It's a NP-Complete problem, but this paper describes a practical method that performs provably close to the optimal. Interestingly they can optimize for performance or power efficiency, so you can use different algorithm
6 0.075453624 1485 high scalability-2013-07-01-PRISM: The Amazingly Low Cost of Using BigData to Know More About You in Under a Minute
7 0.07325314 1460 high scalability-2013-05-17-Stuff The Internet Says On Scalability For May 17, 2013
8 0.071628265 274 high scalability-2008-03-12-YouTube Architecture
9 0.068394326 1027 high scalability-2011-04-20-Packet Pushers: How to Build a Low Cost Data Center
10 0.066546597 1343 high scalability-2012-10-18-Save up to 30% by Selecting Better Performing Amazon Instances
11 0.064126812 1291 high scalability-2012-07-25-Vertical Scaling Ascendant - How are SSDs Changing Architectures?
12 0.059035301 430 high scalability-2008-10-26-Should you use a SAN to scale your architecture?
13 0.055600431 1600 high scalability-2014-02-21-Stuff The Internet Says On Scalability For February 21st, 2014
14 0.054699831 1075 high scalability-2011-07-07-Myth: Google Uses Server Farms So You Should Too - Resurrection of the Big-Ass Machines
15 0.054441787 1654 high scalability-2014-06-05-Cloud Architecture Revolution
16 0.054303933 750 high scalability-2009-12-16-Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud
18 0.053131111 1297 high scalability-2012-08-03-Stuff The Internet Says On Scalability For August 3, 2012
20 0.051483843 1137 high scalability-2011-11-04-Stuff The Internet Says On Scalability For November 4, 2011
topicId topicWeight
[(0, 0.071), (1, 0.025), (2, 0.017), (3, 0.0), (4, -0.063), (5, -0.019), (6, 0.026), (7, 0.031), (8, -0.008), (9, -0.008), (10, 0.016), (11, -0.048), (12, 0.022), (13, 0.016), (14, -0.002), (15, 0.023), (16, 0.007), (17, 0.007), (18, -0.021), (19, 0.047), (20, -0.004), (21, 0.026), (22, -0.019), (23, -0.045), (24, 0.01), (25, -0.011), (26, -0.01), (27, -0.029), (28, -0.046), (29, 0.046), (30, -0.002), (31, 0.018), (32, 0.032), (33, 0.003), (34, -0.031), (35, 0.012), (36, -0.029), (37, 0.034), (38, 0.003), (39, 0.013), (40, -0.005), (41, -0.014), (42, -0.011), (43, -0.007), (44, 0.037), (45, -0.04), (46, -0.018), (47, -0.02), (48, -0.03), (49, -0.021)]
simIndex simValue blogId blogTitle
same-blog 1 0.96637142 902 high scalability-2010-09-16-Strategy: Buy New, Don't Fix the Old
Introduction: This strategy is from the Large Hadron Collider project: Improvements in performance per Watt have caused CERN to no longer sign hardware support contracts longer than three years. Machines run until they die. They have a very high utilization of equipment (‘duty cycle’, 7 x 24 x 365). Replacing hardware makes more sense because of the lower cost and the power savings of new hardware.
Introduction: “ Data is everywhere, never be at a single location. Not scalable, not maintainable. ” –Alex Szalay While Galileo played life and death doctrinal games over the mysteries revealed by the telescope, another revolution went unnoticed, the microscope gave up mystery after mystery and nobody yet understood how subversive would be what it revealed. For the first time these new tools of perceptual augmentation allowed humans to peek behind the veil of appearance. A new new eye driving human invention and discovery for hundreds of years. Data is another material that hides, revealing itself only when we look at different scales and investigate its underlying patterns. If the universe is truly made of information , then we are looking into truly primal stuff. A new eye is needed for Data and an ambitious project called Data-scope aims to be the lens. A detailed paper on the Data-Scope tells more about what it is: The Data-Scope is a new scientific instrum
3 0.62639624 430 high scalability-2008-10-26-Should you use a SAN to scale your architecture?
Introduction: This is a question everyone must struggle with when building out their datacenter. Storage choices are always the ones I have the least confidence in. David Marks in his blog You Can Change It Later! asks the question Should I get a SAN to scale my site architecture? and answers no. A better solution is to use commodity hardware, directly attach storage on servers, and partition across servers to scale and for greater availability. David's reasoning is interesting: A SAN creates a SPOF (single point of failure) that is dependent on a vendor to fly and fix when there's a problem. This can lead to long down times during this outage you have no access to your data at all. Using easily available commodity hardware minimizes risks to your company, it's not just about saving money. Zooming over to Fry's to buy emergency equipment provides the kind of agility startups need in order to respond quickly to ever changing situations. It's hard to beat the power and flexibility (backup
4 0.61971301 1213 high scalability-2012-03-22-Paper: Revisiting Network I-O APIs: The netmap Framework
Introduction: Here's a really good article in the Communications of the ACM on reducing network packet processing overhead by redesigning the network stack: Revisiting Network I/O APIs: The Netmap Framework by Luigi Rizzo . As commodity networking performance increases operating systems need to keep up or all those CPUs will go to waste. How do they make this happen? Abstract: Today 10-gigabit interfaces are used more and more in datacenters and servers. On these links, packets flow as fast as one every 67.2 nanoseconds, yet modern operating systems can take 10-20 times longer just to move one packet between the wire and the application. We can do much better, not with more powerful hardware but by revising architectural decisions made long ago regarding the design of device drivers and network stacks. The netmap framework is a promising step in this direction. Thanks to a careful design and the engineering of a new packet I/O API, netmap eliminates much unnecessary overhead and moves
5 0.61411804 1511 high scalability-2013-09-04-Wide Fast SATA: the Recipe for Hot Performance
Introduction: This is a guest post by Brian Bulkowski , CTO and co-founder of Aerospike , a leading clustered NoSQL database, has worked in the area of high performance commodity systems since 1989. This blog post will tell you exactly how to build a multi-terabyte high throughput datacenter server. A fast, reliable multi-terrabyte data tier can be used for recent behavior (messages, tweets, plays, actions), or anywhere that today you use Redis or Memcache. You need to know: Which SSDs work Which chassis work How to configure your RAID cards Intel’s SATA solutions – combined with a high capacity storage server like the Dell R720xd and a host bus adapter based on the LSI 2208, and a Flash optimized database like Aerospike , enables high throughput and low latency. In a wide configuration, with 12 to 20 drives per 2U server, individual servers can cost-effectively serve at high throughput with 16T at $2.50 per GB with the s3700, or $1.25 with the s3500. Other SSD of
7 0.60311979 1369 high scalability-2012-12-10-Switch your databases to Flash storage. Now. Or you're doing it wrong.
8 0.60295045 142 high scalability-2007-11-05-Strategy: Diagonal Scaling - Don't Forget to Scale Out AND Up
9 0.6022507 526 high scalability-2009-03-05-Strategy: In Cloud Computing Systematically Drive Load to the CPU
10 0.59307295 1316 high scalability-2012-09-04-Changing Architectures: New Datacenter Networks Will Set Your Code and Data Free
11 0.59158194 1066 high scalability-2011-06-22-It's the Fraking IOPS - 1 SSD is 44,000 IOPS, Hard Drive is 180
12 0.59151268 1254 high scalability-2012-05-30-Strategy: Get Servers for Free and Make Users Happy by Turning on Compression
13 0.58778113 1291 high scalability-2012-07-25-Vertical Scaling Ascendant - How are SSDs Changing Architectures?
14 0.58419967 1177 high scalability-2012-01-19-Is it time to get rid of the Linux OS model in the cloud?
15 0.58182555 1372 high scalability-2012-12-14-Stuff The Internet Says On Scalability For December 14, 2012
16 0.57526839 49 high scalability-2007-07-30-allowed contributed
17 0.57410508 823 high scalability-2010-05-05-How will memristors change everything?
18 0.56830096 371 high scalability-2008-08-24-A Scalable, Commodity Data Center Network Architecture
19 0.56513619 403 high scalability-2008-10-06-Paper: Scaling Genome Sequencing - Complete Genomics Technology Overview
20 0.55156803 1572 high scalability-2014-01-03-Stuff The Internet Says On Scalability For January 3rd, 2014
topicId topicWeight
[(1, 0.099), (61, 0.128), (79, 0.233), (90, 0.371)]
simIndex simValue blogId blogTitle
same-blog 1 0.82362288 902 high scalability-2010-09-16-Strategy: Buy New, Don't Fix the Old
Introduction: This strategy is from the Large Hadron Collider project: Improvements in performance per Watt have caused CERN to no longer sign hardware support contracts longer than three years. Machines run until they die. They have a very high utilization of equipment (‘duty cycle’, 7 x 24 x 365). Replacing hardware makes more sense because of the lower cost and the power savings of new hardware.
2 0.71389246 713 high scalability-2009-10-01-Private Data Cloud: 'Do It Yourself' with Eucalyptus
Introduction: Private Clouds provide many of the benefits of the Public Cloud, namely elastic scalability, faster time-to-market and reduced OpEX, all within the Enterprises own perimeter that complies to its governance. Leading commercial Private Cloud products include VMware, Univa UD, Unisys. Open source solutions include pro ducts like Globus Nimbus, Enomaly Elastic Computing Platform, RESERVOIR and Eucalyptus. Read more at: http://bigdatamatters.com/bigdatamatters/2009/09/private-cloud-eucalyptus.html
3 0.63041252 1546 high scalability-2013-11-11-Ask HS: What is a good OLAP database choice with node.js?
Introduction: This question was asked over email and I thought a larger audience might want to take a whack at it. With a business associate, I am trying to develop a financial software that handles financial reports of listed companies. We managed to create this database with all the data necessary to do financial analysis. My associate is a Business Intelligence specialist so he is keen to use OLAPs databases like Microsoft Analysis Services or Jedox Palo, which enables in-memory calculations and very fast aggregation, slicing and dicing of data or write-backs. At the same time I did an online course (MOOC) from Stanford CS184 called Startup Engineering which promoted/talked a lot about javascript and especially node.js as the language of the future for servers. As I am keen to use open-source technologies (would be keen to avoid MS SSAS) for the development of a website to access this financial data , and there are so many choices for databases out there (Postgre, MongoDB, MySQL etc..but d
4 0.55028898 784 high scalability-2010-02-25-Paper: High Performance Scalable Data Stores
Introduction: The world of scalable databases is not a simple one. They come in every race, creed, and color. Rick Cattell has brought some harmony to that world by publishing High Performance Scalable Data Stores , a nicely detailed one stop shop paper comparing scalable databases soley on the content of their character. Ironically, the first step in that evaluation is dividing the world into four groups: Key-value stores: Redis, Scalaris, Voldmort, and Riak. Document stores: Couch DB, MongoDB, and SimpleDB. Record stores: BigTable, HBase, HyperTable, and Cassandra. Scalable RDBMSs: MySQL Cluster, ScaleDB, Drizzle, and VoltDB. The paper describes each system and then compares them on the dimensions of Concurrency Control, Data Storage Replication, Transaction Model, General Comments, Maturity, K-hits, License Language. And the winner is: there are no winners. Yet. Rick concludes by pointing to a great convergence: I believe that a few of these systems will gain critical mass an
5 0.54622614 1277 high scalability-2012-07-05-10 Golden Principles For Building Successful Mobile-Web Applications
Introduction: Wildly popular VC blogger Fred Wilson defines in an excellent 27 minute video the ten most important criteria he uses when deciding to give the gold, that is, fund a web application. Note, this video is from 2010 , so no doubt the ideas are still valid, but the importance of mobile vs web apps has probably shifted to mobile, as Mr. Wilson says in a recent post: mobile is growing like a weed . Speed - speed is more than a feature, it's a requirement. Mainstream users are unforgiving. If something is slow they won't use it. Pingdom is used to track speed across their portfolio. A trend they've noticed is that as an application slows down they don't grow as quickly. Instant Utility - a service must be instantly useful to users. Lengthy setup and configuration is a killer. Tricks like crawling the web to populate information you expect to get from your users later makes the service initially useful. YouTube won, for example, with instant availability of uploaded video.
6 0.54569483 1169 high scalability-2012-01-05-Shutterfly Saw a Speedup of 500% With Flashcache
7 0.54191881 75 high scalability-2007-08-28-Google Utilities : An online google guide,tools and Utilities.
8 0.53913885 1100 high scalability-2011-08-18-Paper: The Akamai Network - 61,000 servers, 1,000 networks, 70 countries
9 0.53470206 650 high scalability-2009-07-02-Product: Hbase
10 0.53234833 372 high scalability-2008-08-27-Updating distributed web applications
11 0.52674681 107 high scalability-2007-10-02-Some Real Financial Numbers for Your Startup
12 0.52375579 8 high scalability-2007-07-12-Should I use LAMP or Windows?
13 0.52368295 743 high scalability-2009-11-23-Big Data on Grids or on Clouds?
14 0.52244109 401 high scalability-2008-10-04-Is MapReduce going mainstream?
15 0.52237666 443 high scalability-2008-11-14-Paper: Pig Latin: A Not-So-Foreign Language for Data Processing
16 0.52098852 40 high scalability-2007-07-30-Product: Amazon Elastic Compute Cloud
17 0.51570553 871 high scalability-2010-08-04-Dremel: Interactive Analysis of Web-Scale Datasets - Data as a Programming Paradigm
18 0.506778 1181 high scalability-2012-01-25-Google Goes MoreSQL with Tenzing - SQL Over MapReduce
19 0.50417197 224 high scalability-2008-01-27-Scalability vs Performance vs Availability vs Reliability.. Also scale up vs scale out ???
20 0.50344741 782 high scalability-2010-02-23-When to migrate your database?