high_scalability high_scalability-2013 high_scalability-2013-1419 knowledge-graph by maker-knowledge-mining

1419 high scalability-2013-03-07-It's a VM Wasteland - A Near Optimal Packing of VMs to Machines Reduces TCO by 22%


meta infos for this blog

Source: html

Introduction: In  Algorithm Design for Performance Aware VM Consolidation  we learn some shocking facts (gambling in Casablanca?): Average server utilization in many data centers is low, estimated between 5% and 15%. This is wasteful because an idle server often consumes more than 50% of peak power. Surely that's just for old style datacenters? Nope. In Google data centers, workloads that are consolidated use only 50% of the processor cores. Every other processor core is left unused simply to ensure that performance does not degrade. It's a VM wasteland. The goal is to reduce waste by packing VMs onto machines without hurting performance or wasting resources. The idea is to select VMs that interfere the least with each other and places them together on the same server. It's a NP-Complete problem, but this paper describes a practical method that performs provably close to the optimal. Interestingly they can optimize for performance or power efficiency, so you can use different algorithm


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 In  Algorithm Design for Performance Aware VM Consolidation  we learn some shocking facts (gambling in Casablanca? [sent-1, score-0.247]

2 ): Average server utilization in many data centers is low, estimated between 5% and 15%. [sent-2, score-0.411]

3 This is wasteful because an idle server often consumes more than 50% of peak power. [sent-3, score-0.356]

4 In Google data centers, workloads that are consolidated use only 50% of the processor cores. [sent-6, score-0.373]

5 Every other processor core is left unused simply to ensure that performance does not degrade. [sent-7, score-0.46]

6 The goal is to reduce waste by packing VMs onto machines without hurting performance or wasting resources. [sent-9, score-0.553]

7 The idea is to select VMs that interfere the least with each other and places them together on the same server. [sent-10, score-0.225]

8 It's a NP-Complete problem, but this paper describes a practical method that performs provably close to the optimal. [sent-11, score-0.305]

9 Interestingly they can optimize for performance or power efficiency, so you can use different algorithms for different workloads. [sent-12, score-0.116]

10 The result when optimizing for performance are utilizations between 75% and 80% compared to 50% from the naıve method. [sent-13, score-0.196]

11 This gives a 22% reduction in TCO, a significant savings at scale: Experimental evaluations showed  that the proposed system performed well on realistic VM performance degradations, yielding over 30% savings in energy and up to 52% reduction in degradation. [sent-14, score-1.402]

12 It's hard to get greater utilization from VMs because systems are structured in such large chunks. [sent-15, score-0.165]

13 If we could decompose systems into smaller more modular pieces they could be much more efficiently scheduled over a server farm. [sent-16, score-0.404]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('vms', 0.284), ('vm', 0.255), ('reduction', 0.186), ('savings', 0.18), ('utilizations', 0.168), ('utilization', 0.165), ('evaluations', 0.157), ('gambling', 0.157), ('hurting', 0.15), ('processor', 0.149), ('provably', 0.145), ('packing', 0.145), ('shocking', 0.14), ('consolidated', 0.14), ('interfere', 0.14), ('yielding', 0.136), ('ve', 0.136), ('centers', 0.136), ('decompose', 0.13), ('tco', 0.13), ('wasteful', 0.13), ('na', 0.125), ('consumes', 0.125), ('consolidation', 0.119), ('wasting', 0.116), ('performance', 0.116), ('proposed', 0.114), ('unused', 0.113), ('interestingly', 0.111), ('realistic', 0.111), ('estimated', 0.11), ('facts', 0.107), ('experimental', 0.104), ('modular', 0.102), ('idle', 0.101), ('showed', 0.094), ('scheduled', 0.091), ('waste', 0.089), ('performs', 0.086), ('aware', 0.085), ('places', 0.085), ('workloads', 0.084), ('onto', 0.082), ('left', 0.082), ('pieces', 0.081), ('optimizing', 0.08), ('describes', 0.08), ('method', 0.08), ('efficiency', 0.078), ('performed', 0.078)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 1419 high scalability-2013-03-07-It's a VM Wasteland - A Near Optimal Packing of VMs to Machines Reduces TCO by 22%

Introduction: In  Algorithm Design for Performance Aware VM Consolidation  we learn some shocking facts (gambling in Casablanca?): Average server utilization in many data centers is low, estimated between 5% and 15%. This is wasteful because an idle server often consumes more than 50% of peak power. Surely that's just for old style datacenters? Nope. In Google data centers, workloads that are consolidated use only 50% of the processor cores. Every other processor core is left unused simply to ensure that performance does not degrade. It's a VM wasteland. The goal is to reduce waste by packing VMs onto machines without hurting performance or wasting resources. The idea is to select VMs that interfere the least with each other and places them together on the same server. It's a NP-Complete problem, but this paper describes a practical method that performs provably close to the optimal. Interestingly they can optimize for performance or power efficiency, so you can use different algorithm

2 0.19172223 1238 high scalability-2012-05-03-Snooze - Open-source, Scalable, Autonomic, and Energy-efficient VM Management for Private Clouds

Introduction: Snooze is an open-source, scalable, autonomic, and energy-efficient virtual machine (VM) management framework for private clouds. Similarly to other VM management frameworks such as Nimbus, OpenNebula, Eucalyptus, and OpenStack it allows to build compute infrastructures from virtualized resources. Particularly, once installed and configured users can submit and control the life-cycle of a large number of VMs. However, contrary to existing frameworks for scalability and fault tolerance, Snooze employs a self-organizing and healing (based on Apache ZooKeeper) hierarchical architecture. Moreover, it performs distributed VM management and is designed to be energy efficient. Therefore, it implements features to monitor and estimate VM resource (CPU, memory, network Rx, network Tx) demands, detect and resolve overload/underload situations, perform dynamic VM consolidation through live migration, and finally power management to save energy. Last but not least, it integrates a g

3 0.17763831 533 high scalability-2009-03-11-The Implications of Punctuated Scalabilium for Website Architecture

Introduction: Update: How do you design and handle peak load on the Cloud? by Cloudiquity. Gives a formula to try and predict and plan for peak load and talks about how GigaSpaces XAP, Scalr, RightScale and FreedomOSS can be used to handle peak load within EC2. Theo Schlossnagle, with his usual insight, talks about in Dissecting today's surges how the nature of internet traffic has evolved over time. Traffic now spikes like a heart attack, larger and more quickly than ever from traffic inflow sources like Digg and The New York Times. Theo relates how At least eight times in the past month, we've experienced from 100% to 1000% sudden increases in traffic across many of our clients and those spike can happen as quickly as 60 seconds. To me this sounds a lot like Punctuated equilibrium in evolution, a force that accounts for much creative growth in species... VMs don't spin up in less than 60 seconds so your ability to respond to such massive quick spikes is limited. This as

4 0.1313204 349 high scalability-2008-07-10-Can cloud computing smite down evil zombie botnet armies?

Introduction: In the more cool stuff I've never heard of before department is something called Self Cleansing Intrusion Tolerance (SCIT). Botnets are created when vulnerable computers live long enough to become infected with the will to do the evil bidding of their evil masters. Security is almost always about removing vulnerabilities (a process which to outside observers often looks like a dog chasing its tail ). SCIT takes a different approach, it works on the availability angle. Something I never thought of before, but which makes a great deal of sense once I thought about it. With SCIT you stop and restart VM instances every minute (or whatever depending in your desired window vulnerability).... This short exposure window means worms and viri do not have long enough to fully infect a machine and carry out a coordinated attack. A machine is up for a while. Does work. And then is torn down again only to be reborn as a clean VM with no possibility of infection (unless of course the VM

5 0.12619856 299 high scalability-2008-04-07-Rumors of Signs and Portents Concerning Freeish Google Cloud

Introduction: Update 2: Rumor no more. Google Jumps Head First Into Web Services With Google App Engine . The quick and dirty of it: developers simply upload their Python code to Google, launch the application, and can monitor usage and other metrics via a multi-platform desktop application. There were 10,000 developer slots open and of course I was too late. More as the cobra strikes. Update: TechCrunch reports Google To Launch BigTable As Web Service next week. It competes with Amazon's SimpleDB. Though it won't be truly comparable until they also release an EC2 and S3 equivalent. An internet hit for each data access is a little painful. As Jimmy says in Goodfellas, "That's the way. You don't take no sh*t from nobody. " First Dave Winer hallucinates a pig on the mean streets of Walnut Creek that told him Google's long foretold cloud offering will be free for bloggers of "modest needs." GigaOM then says a free cloud service is how Google could eat Amazon's bacon for lunch .

6 0.12399238 147 high scalability-2007-11-09-Paper: Container-based Operating System Virtualization: A Scalable, High-performance Alternative to Hypervisors

7 0.12319579 1275 high scalability-2012-07-02-C is for Compute - Google Compute Engine (GCE)

8 0.094577529 1537 high scalability-2013-10-25-Stuff The Internet Says On Scalability For October 25th, 2013

9 0.093814977 458 high scalability-2008-12-01-Web Consolidation on the Sun Fire T1000 using Solaris Containers

10 0.092936426 1209 high scalability-2012-03-14-The Azure Outage: Time Is a SPOF, Leap Day Doubly So

11 0.090968043 1006 high scalability-2011-03-17-Are long VM instance spin-up times in the cloud costing you money?

12 0.090432681 1023 high scalability-2011-04-14-Strategy: Cache Application Start State to Reduce Spin-up Times

13 0.082669497 1305 high scalability-2012-08-16-Paper: A Provably Correct Scalable Concurrent Skip List

14 0.080741629 1046 high scalability-2011-05-23-Evernote Architecture - 9 Million Users and 150 Million Requests a Day

15 0.080074891 923 high scalability-2010-10-21-Machine VM + Cloud API - Rewriting the Cloud from Scratch

16 0.078611463 902 high scalability-2010-09-16-Strategy: Buy New, Don't Fix the Old

17 0.077706486 1404 high scalability-2013-02-11-At Scale Even Little Wins Pay Off Big - Google and Facebook Examples

18 0.077092804 387 high scalability-2008-09-22-Paper: On Delivering Embarrassingly Distributed Cloud Services

19 0.077050999 825 high scalability-2010-05-10-Sify.com Architecture - A Portal at 3900 Requests Per Second

20 0.07557153 1038 high scalability-2011-05-11-Troubleshooting response time problems – why you cannot trust your system metrics


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.112), (1, 0.055), (2, 0.014), (3, 0.029), (4, -0.056), (5, 0.007), (6, 0.037), (7, 0.04), (8, -0.042), (9, 0.021), (10, -0.019), (11, -0.042), (12, 0.014), (13, 0.0), (14, 0.013), (15, -0.004), (16, 0.012), (17, -0.003), (18, 0.023), (19, 0.002), (20, 0.023), (21, -0.006), (22, -0.018), (23, -0.058), (24, -0.017), (25, 0.048), (26, -0.049), (27, -0.022), (28, -0.021), (29, 0.006), (30, -0.003), (31, -0.006), (32, -0.019), (33, 0.054), (34, -0.017), (35, 0.036), (36, 0.007), (37, 0.04), (38, -0.002), (39, 0.047), (40, -0.018), (41, 0.006), (42, 0.006), (43, -0.006), (44, -0.025), (45, -0.013), (46, -0.027), (47, 0.001), (48, 0.079), (49, 0.023)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.92576241 1419 high scalability-2013-03-07-It's a VM Wasteland - A Near Optimal Packing of VMs to Machines Reduces TCO by 22%

Introduction: In  Algorithm Design for Performance Aware VM Consolidation  we learn some shocking facts (gambling in Casablanca?): Average server utilization in many data centers is low, estimated between 5% and 15%. This is wasteful because an idle server often consumes more than 50% of peak power. Surely that's just for old style datacenters? Nope. In Google data centers, workloads that are consolidated use only 50% of the processor cores. Every other processor core is left unused simply to ensure that performance does not degrade. It's a VM wasteland. The goal is to reduce waste by packing VMs onto machines without hurting performance or wasting resources. The idea is to select VMs that interfere the least with each other and places them together on the same server. It's a NP-Complete problem, but this paper describes a practical method that performs provably close to the optimal. Interestingly they can optimize for performance or power efficiency, so you can use different algorithm

2 0.74140424 1039 high scalability-2011-05-12-Paper: Mind the Gap: Reconnecting Architecture and OS Research

Introduction: Mind the Gap: Reconnecting Architecture and OS Research  is a paper presented at HotOS XIII , the place where researchers talk about making potential futures happen. For a great overview of the conference take a look at this article by Matt Welsh:  Conference report: HotOS 2011 in Napa . In the VM/cloud age I question the need of having an OS at all, programs can compile directly against "raw" hardware, but the paper does a good job of trying to figure out the new roll operating systems can play in the future. We've been in a long OS holding pattern, so long that we've seen the rise of PaaS vendors skipping the OS level abstraction completely, but there's room for a middle ground between legacy time sharing systems of the past and service level APIs that are but one possible future. Introduction: For too long, operating systems researchers and developers have pretty much taken whatever computer architects have dished out. With occasional exceptions (e.g., virtualization support)

3 0.71214318 1238 high scalability-2012-05-03-Snooze - Open-source, Scalable, Autonomic, and Energy-efficient VM Management for Private Clouds

Introduction: Snooze is an open-source, scalable, autonomic, and energy-efficient virtual machine (VM) management framework for private clouds. Similarly to other VM management frameworks such as Nimbus, OpenNebula, Eucalyptus, and OpenStack it allows to build compute infrastructures from virtualized resources. Particularly, once installed and configured users can submit and control the life-cycle of a large number of VMs. However, contrary to existing frameworks for scalability and fault tolerance, Snooze employs a self-organizing and healing (based on Apache ZooKeeper) hierarchical architecture. Moreover, it performs distributed VM management and is designed to be energy efficient. Therefore, it implements features to monitor and estimate VM resource (CPU, memory, network Rx, network Tx) demands, detect and resolve overload/underload situations, perform dynamic VM consolidation through live migration, and finally power management to save energy. Last but not least, it integrates a g

4 0.69896442 1594 high scalability-2014-02-12-Paper: Network Stack Specialization for Performance

Introduction: In the scalability is specialization department here is an interesting paper presented at HotNets '13 on high performance networking:  Network Stack Specialization for Performance . The idea is generalizing a service so it fits in the kernel comes at a high performance cost. So move TCP into user space.  The result is a web server with ~3.5x the throughput of Nginx "while experiencing low CPU utilization, linear scaling on multicore systems, and saturating current NIC hardware." Here's a good description of the paper published on Layer 9 : Traditionally, servers and OSes have been built to be general purpose. However now we have a high degree of specialization. In fact, in a big web service, you might have thousands of machines dedicated to one function. Therefore, there's scope for specialization. This paper looks at a specific opportunity in that space. Network stacks today are good for high throughput with large transfers, but not small files (which are common in web browsi

5 0.6933614 1275 high scalability-2012-07-02-C is for Compute - Google Compute Engine (GCE)

Introduction: After poking around the Google Compute Engine (GCE) documentation I had some trouble creating a mental model of how GCE works. Is it like AWS, GAE, Rackspace, just what is it? After watching Google I/O 2012 - Introducing Google Compute Engine and Google Compute Engine -- Technical Details , it turns out my initial impression, that GCE is disarmingly straightforward, turns out to be the point. The focus of GCE is on the C, which stands for Compute, and that’s what GCE is all about: deploying lots of servers to solve computationally hard problems. What you get with GCE is a Super Datacenter on Google Steroids. If you are wondering how you will run the next Instagram on GCE then that would be missing the point. GAE is targeted at applications. GCE is targeted at: Delivering a proven, pure, high performance, high scale compute infrastructure using a utility pricing model, on top of an open, secure, extensible Infrastructure-as-a-Service. Delivering an expe

6 0.6913439 1213 high scalability-2012-03-22-Paper: Revisiting Network I-O APIs: The netmap Framework

7 0.68517685 1177 high scalability-2012-01-19-Is it time to get rid of the Linux OS model in the cloud?

8 0.67949533 147 high scalability-2007-11-09-Paper: Container-based Operating System Virtualization: A Scalable, High-performance Alternative to Hypervisors

9 0.66004336 1207 high scalability-2012-03-12-Google: Taming the Long Latency Tail - When More Machines Equals Worse Results

10 0.65826523 1234 high scalability-2012-04-26-Akaros - an open source operating system for manycore architectures

11 0.63762569 1116 high scalability-2011-09-15-Paper: It's Time for Low Latency - Inventing the 1 Microsecond Datacenter

12 0.6329447 1505 high scalability-2013-08-22-The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second edition

13 0.61824536 535 high scalability-2009-03-12-Paper: Understanding and Designing New Server Architectures for Emerging Warehouse-Computing Environments

14 0.61676347 1107 high scalability-2011-08-29-The Three Ages of Google - Batch, Warehouse, Instant

15 0.61669862 1540 high scalability-2013-10-30-Strategy: Use Your Quantum Computer Lab to Tell Intentional Blinks from Involuntary Blinks

16 0.61536837 1548 high scalability-2013-11-13-Google: Multiplex Multiple Works Loads on Computers to Increase Machine Utilization and Save Money

17 0.61323667 987 high scalability-2011-02-10-Dispelling the New SSL Myth

18 0.61048907 1537 high scalability-2013-10-25-Stuff The Internet Says On Scalability For October 25th, 2013

19 0.60795337 1437 high scalability-2013-04-08-NuoDB's First Experience: Google Compute Engine - 1.8 Million Transactions Per Second

20 0.60417539 1428 high scalability-2013-03-22-Stuff The Internet Says On Scalability For March 22, 2013


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(1, 0.065), (2, 0.191), (10, 0.035), (40, 0.428), (61, 0.056), (79, 0.101), (94, 0.023)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.92335439 402 high scalability-2008-10-05-Paper: Scalability Design Patterns

Introduction: I have introduced pattern languages in my earlier post on The Pattern Bible for Distributed Computing . Achieving highest possible scalability is a complex combination of many factors. This PLoP 2007 paper presents a pattern language that can be used to make a system highly scalable. The Scalability Pattern Language introduced by Kanwardeep Singh Ahluwalia includes patterns to: Introduce Scalability Optimize Algorithm Add Hardware Add Parallelism Add Intra-Process Parallelism Add Inter-Porcess Parallelism Add Hybrid Parallelism Optimize Decentralization Control Shared Resources Automate Scalability

2 0.90115285 338 high scalability-2008-06-02-Total Cost of Ownership for different web development frameworks

Introduction: I would like to compile a comparison matrix on the total cost of ownership for .Net, Java, Lamp & Rails. Where should I start? Has anyone seen or know of a recent study on this subject?

same-blog 3 0.89014494 1419 high scalability-2013-03-07-It's a VM Wasteland - A Near Optimal Packing of VMs to Machines Reduces TCO by 22%

Introduction: In  Algorithm Design for Performance Aware VM Consolidation  we learn some shocking facts (gambling in Casablanca?): Average server utilization in many data centers is low, estimated between 5% and 15%. This is wasteful because an idle server often consumes more than 50% of peak power. Surely that's just for old style datacenters? Nope. In Google data centers, workloads that are consolidated use only 50% of the processor cores. Every other processor core is left unused simply to ensure that performance does not degrade. It's a VM wasteland. The goal is to reduce waste by packing VMs onto machines without hurting performance or wasting resources. The idea is to select VMs that interfere the least with each other and places them together on the same server. It's a NP-Complete problem, but this paper describes a practical method that performs provably close to the optimal. Interestingly they can optimize for performance or power efficiency, so you can use different algorithm

4 0.8577897 860 high scalability-2010-07-17-Hot Scalability Links for July 17, 2010

Introduction: And by hot I also mean temperature. Summer has arrived. It's sizzling here in Silicon Valley. Thank you air conditioning! Scale the web by appointing a Crawler Czar? Tom Foremski has the idea that Google should open up their index so sites wouldn't have to endure the constant pounding by ravenous crawler bots. Don MacAskill of SmugMug estimates 50% of our web server CPU resources are spent serving crawlers. What a waste. How this would all work with real-time feeds, paid  feeds (Twitter, movies, ...), etc. is unknown, but does it make sense for all that money to be spent on extracting the same data over and over again? Tweets of Gold: jamesurquhart : Key to applications is architecture. Key for infrastructure supporting archs is configurability. Configurability==features . tjake :   People who choose their datastore based oh hearsay and not their own evaluation are doomed . b6n : No global lock ever goes unpunished. MichaelSurt

5 0.84149069 1466 high scalability-2013-05-29-Amazon: Creating a Customer Utopia One Culture Hack at a Time

Introduction: If you don’t cannibalize yourself, someone else will. -- Steve Jobs     America as the New World has a long history of inspiring  Utopian communities . Early experiments were famously religious. But there have been many others as new waves of thought have inspired people to organize and try something different. In the 1840s Transcendentalists, believing the true path lay in the perfection of the individual, created intentional communities like Brook Farm . We've also seen socialist ,  anarchist , hippy , and virtually every other kind of Utopia in-between. Psychologist B.F. Skinner wrote an infamous book, Walden Two , with a more "scientific" take on creating a Utopian community and Ayn Rand in Atlas Shrugged  had her free market version. I believe in startup organizations we see the modern version of Utopian energy in action. We now call it by names like "culture hacking", but the goal is the same: to create a new form of human community to achieve a profound go

6 0.82724196 330 high scalability-2008-05-27-Should Twitter be an All-You-Can-Eat Buffet or a Vending Machine?

7 0.82281637 27 high scalability-2007-07-25-Product: 3 PAR REMOTE COPY

8 0.81987697 1471 high scalability-2013-06-06-Paper: Memory Barriers: a Hardware View for Software Hackers

9 0.81877649 482 high scalability-2009-01-04-Alternative Memcache Usage: A Highly Scalable, Highly Available, In-Memory Shard Index

10 0.80483598 778 high scalability-2010-02-15-The Amazing Collective Compute Power of the Ambient Cloud

11 0.75185174 879 high scalability-2010-08-12-Think of Latency as a Pseudo-permanent Network Partition

12 0.74576825 1414 high scalability-2013-03-01-Stuff The Internet Says On Scalability For February 29, 2013

13 0.74491411 280 high scalability-2008-03-17-Paper: Consistent Hashing and Random Trees: Distributed Caching Protocols for Relieving Hot Spots on the World Wide Web

14 0.74444407 848 high scalability-2010-06-25-Hot Scalability Links for June 25, 2010

15 0.73064315 300 high scalability-2008-04-07-Scalr - Open Source Auto-scaling Hosting on Amazon EC2

16 0.71767163 768 high scalability-2010-02-01-What Will Kill the Cloud?

17 0.68706715 757 high scalability-2010-01-04-11 Strategies to Rock Your Startup’s Scalability in 2010

18 0.68110192 1492 high scalability-2013-07-17-How do you create a 100th Monkey software development culture?

19 0.64134479 97 high scalability-2007-09-18-Session management in highly scalable web sites

20 0.62308216 564 high scalability-2009-04-10-counting # of views, calculating most-least viewed