high_scalability high_scalability-2012 high_scalability-2012-1216 knowledge-graph by maker-knowledge-mining

1216 high scalability-2012-03-27-Big Data In the Cloud Using Cloudify


meta infos for this blog

Source: html

Introduction: Edd Dumbill  wrote an interesting article on O’Reilly Radar covering the current solutions for running  Big Data in the Cloud Big data and cloud technology go hand-in-hand. Big data needs clusters of servers for processing, which clouds can readily provide. Big PaaS Edd touched briefly on the role of PaaS for delivering Big Data applications in the cloud Beyond IaaS, several cloud services provide application layer support for big data work. Sometimes referred to as managed solutions, or  platform as a service  (PaaS), these services remove the need to ucale things such as databases or MapReduce, reducing your workload and maintenance burden. Additionally, PaaS providers can realize great efficiencies by hosting at the application level, and pass those savings on to the customer. To put it simply, managing data clusters is one thing. Being able to process the data is yet another challenge that we need to think about when we’re dealing with application platforms, as I no


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 Edd Dumbill  wrote an interesting article on O’Reilly Radar covering the current solutions for running  Big Data in the Cloud Big data and cloud technology go hand-in-hand. [sent-1, score-0.402]

2 Big data needs clusters of servers for processing, which clouds can readily provide. [sent-2, score-0.367]

3 Big PaaS Edd touched briefly on the role of PaaS for delivering Big Data applications in the cloud Beyond IaaS, several cloud services provide application layer support for big data work. [sent-3, score-0.97]

4 Sometimes referred to as managed solutions, or  platform as a service  (PaaS), these services remove the need to ucale things such as databases or MapReduce, reducing your workload and maintenance burden. [sent-4, score-0.283]

5 Additionally, PaaS providers can realize great efficiencies by hosting at the application level, and pass those savings on to the customer. [sent-5, score-0.571]

6 To put it simply, managing data clusters is one thing. [sent-6, score-0.379]

7 Being able to process the data is yet another challenge that we need to think about when we’re dealing with application platforms, as I noted in one of my  earlier posts  and this is where PaaS plays an important role. [sent-7, score-0.901]

8 The main challenge is that quite often the management of the data processing logic is built on completely different scaling, availability and monitoring tools than the one used for managing our Big Data deployment. [sent-8, score-0.713]

9 It turns out, that this silo thinking leads to a whole set of complexities starting from the inconsistency in having multiple managers, each determined in a different way when there is a failure or scaling event, and that quite often end up conflicting with one another. [sent-9, score-0.897]

10 Having lots of moving parts is yet another challenge that makes the entire deployment pretty much a complete mess. [sent-10, score-0.292]

11 In this post, I wanted to cover more specifically how I see the evolution of cloud application platforms (PaaS) to support Big Data. [sent-11, score-0.739]

12 I’ll refer specifically to  Cloudify  which was designed primarily to support Big Data applications. [sent-12, score-0.457]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('paas', 0.405), ('big', 0.208), ('challenge', 0.193), ('edd', 0.171), ('specifically', 0.158), ('silo', 0.153), ('touched', 0.143), ('storyhere', 0.139), ('platforms', 0.135), ('efficiencies', 0.13), ('clusters', 0.129), ('data', 0.128), ('radar', 0.128), ('conflicting', 0.125), ('briefly', 0.125), ('managing', 0.122), ('complexities', 0.122), ('inconsistency', 0.122), ('noted', 0.122), ('referred', 0.118), ('cloudify', 0.117), ('additionally', 0.115), ('refer', 0.115), ('determined', 0.111), ('readily', 0.11), ('support', 0.104), ('iaas', 0.103), ('solutions', 0.099), ('yet', 0.099), ('quite', 0.098), ('plays', 0.095), ('application', 0.094), ('pass', 0.093), ('savings', 0.092), ('earlier', 0.091), ('covering', 0.091), ('processing', 0.087), ('often', 0.085), ('managers', 0.084), ('cover', 0.084), ('cloud', 0.084), ('maintenance', 0.083), ('workload', 0.082), ('providers', 0.081), ('realize', 0.081), ('wrote', 0.081), ('leads', 0.081), ('primarily', 0.08), ('evolution', 0.08), ('dealing', 0.079)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.99999994 1216 high scalability-2012-03-27-Big Data In the Cloud Using Cloudify

Introduction: Edd Dumbill  wrote an interesting article on O’Reilly Radar covering the current solutions for running  Big Data in the Cloud Big data and cloud technology go hand-in-hand. Big data needs clusters of servers for processing, which clouds can readily provide. Big PaaS Edd touched briefly on the role of PaaS for delivering Big Data applications in the cloud Beyond IaaS, several cloud services provide application layer support for big data work. Sometimes referred to as managed solutions, or  platform as a service  (PaaS), these services remove the need to ucale things such as databases or MapReduce, reducing your workload and maintenance burden. Additionally, PaaS providers can realize great efficiencies by hosting at the application level, and pass those savings on to the customer. To put it simply, managing data clusters is one thing. Being able to process the data is yet another challenge that we need to think about when we’re dealing with application platforms, as I no

2 0.26114249 1002 high scalability-2011-03-09-Productivity vs. Control tradeoffs in PaaS

Introduction: Gartner published recently an interesting paper:  Productivity vs. Control: Cloud Application Platforms Must Split to Win . (The paper requires registration.) The paper does a pretty good job covering the evolution that is taking place in the PaaS market toward a more open platform and compares between the two main categories: aPaaS (essentially a PaaS running as a service) and CEAP (Cloud Enabled Application Platform) which is the  *P* out of PaaS that gives you the platform to build your own PaaS in private or public cloud. While I was reading through the paper I felt that something continued to bother me with this definition, even though I tend to agree with the overall observation. If I follow the logic of this paper than I have to give away productivity to gain control, hmm…  that’s a hard choice. The issue seem to be with the way we define productivity. Read the full detailes here

3 0.2318038 1031 high scalability-2011-04-28-PaaS on OpenStack - Run Applications on Any Cloud, Any Time Using Any Thing

Introduction: Yesterday, I had a session during the  OpenStack Summit  where I tried to present a more general view on how we should be thinking about PaaS in the context of OpenStack. The key takeaway : The main goal of PaaS is to drive productivity into the  process  by which we can deliver new applications. Most of the existing PaaS solutions take a fairly extreme approach with their abstraction of the underlying infrastructure and therefore fit a fairly small number of extremely simple applications and thus miss the real promise of PaaS. Amazon's Elastic Beanstalk took a more bottom up approach giving us better set of tradeoffs between the abstraction and control which makes it more broadly applicable to a larger set of applications. The fact that OpenStack is opensource allows us to think differently on the things we can do at the platform layer.  We can create a tighter integration between the PaaS and IaaS layers and thus come up with better set of tradeoffs into the way we drive

4 0.21751107 1110 high scalability-2011-09-06-Big Data Application Platform

Introduction: It's time to think of the architecture and application platforms surrounding "Big Data" databases. Big Data is often centered  around new database technologies mostly from the emerging NoSQL world.  The main challenge that these databases solve is how to handle massive amount of data at a reasonable cost and without poor performanc - distributed databases emerged to address this challenge and today we're seeing  high adoption rate  and quite impressive success stories such as the  Netflix use of Cassandra/DataStax solution .  All that indicate the speed in which this market evolves. The need for a Big Data Application Platform Application platforms provide a framework for making the development of applications simpler. They do this by carving out the generic parts of applications such as security, scalability, and reliability (which  are  attributes of a 'good' application) from the parts of the applications that are specific to our business domain. Most of the existing app

5 0.16098034 1240 high scalability-2012-05-07-Startups are Creating a New System of the World for IT

Introduction: It remains that, from the same principles, I now demonstrate the frame of the System of the World. -- Isaac Newton The practice of IT reminds me a lot of the practice of science before Isaac Newton. Aristotelianism was dead, but there was nothing to replace it. Then Newton came along, created a scientific revolution with his System of the World . And everything changed. That was New System of the World number one. New System of the World number two was written about by the incomparable Neal Stephenson in his incredible  Baroque Cycle  series. It explores the singular creation of a new way of organizing society grounded in new modes of thought in business, religion, politics, and science. Our modern world emerged Enlightened as it could from this roiling cauldron of forces. In IT we may have had a Leonardo da Vinci or even a Galileo, but we’ve never had our Newton. Maybe we don't need a towering genius to make everything clear? For years startups, like the frenetically inventive

6 0.15058003 891 high scalability-2010-09-01-Scale-out vs Scale-up

7 0.123803 292 high scalability-2008-03-30-Scaling Out MySQL

8 0.12334289 1313 high scalability-2012-08-28-Making Hadoop Run Faster

9 0.1183252 977 high scalability-2011-01-21-PaaS shouldn’t be built in Silos

10 0.11825781 450 high scalability-2008-11-24-Scalability Perspectives #3: Marc Andreessen – Internet Platforms

11 0.11570038 1176 high scalability-2012-01-17-Sponsored Post: Next Big Sound, ElasticHosts, 1&1, Red 5 Studios, SingleHop, Spokeo, Callfire, Attribution Modeling, Logic Monitor, New Relic, ScaleOut, AppDynamics, CloudSigma, ManageEngine, Site24x7

12 0.11319808 1354 high scalability-2012-11-05-Are we seeing the renaissance of enterprises in the cloud?

13 0.11277437 1355 high scalability-2012-11-05-Gone Fishin': Building Super Scalable Systems: Blade Runner Meets Autonomic Computing In The Ambient Cloud

14 0.11277021 750 high scalability-2009-12-16-Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud

15 0.11089686 906 high scalability-2010-09-22-Applying Scalability Patterns to Infrastructure Architecture

16 0.1085894 1383 high scalability-2013-01-08-Sponsored Post: Flurry, Rumble Games, Booking, aiCache, Teradata Aster, Aerospike, Percona, ScaleOut, New Relic, NetDNA, GigaSpaces, Logic Monitor, AppDynamics, ManageEngine, Site24x7

17 0.10785462 1370 high scalability-2012-12-11-Sponsored Post: Rumble Games, Duolingo, Booking, aiCache, Teradata Aster, Hadapt, Aerospike, Percona, ScaleOut, New Relic, NetDNA, GigaSpaces, Logic Monitor, AppDynamics, ManageEngine, Site24x7

18 0.10785462 1376 high scalability-2012-12-25-Sponsored Post: Flurry, Rumble Games, Duolingo, Booking, aiCache, Teradata Aster, Hadapt, Aerospike, Percona, ScaleOut, New Relic, NetDNA, GigaSpaces, Logic Monitor, AppDynamics, ManageEngine, Site24x7

19 0.10752441 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?

20 0.10699441 1514 high scalability-2013-09-09-Need Help with Database Scalability? Understand I-O


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.201), (1, 0.024), (2, 0.035), (3, 0.078), (4, -0.045), (5, 0.0), (6, 0.007), (7, -0.087), (8, 0.006), (9, 0.014), (10, 0.023), (11, 0.087), (12, -0.03), (13, 0.007), (14, 0.011), (15, -0.01), (16, 0.025), (17, -0.072), (18, 0.012), (19, 0.011), (20, -0.013), (21, -0.014), (22, 0.034), (23, 0.051), (24, 0.091), (25, 0.033), (26, -0.023), (27, -0.025), (28, -0.062), (29, 0.067), (30, 0.017), (31, 0.051), (32, -0.009), (33, 0.021), (34, -0.027), (35, 0.115), (36, 0.006), (37, -0.064), (38, 0.037), (39, 0.047), (40, -0.065), (41, 0.001), (42, -0.042), (43, 0.008), (44, 0.083), (45, 0.009), (46, 0.035), (47, -0.026), (48, -0.049), (49, -0.096)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.95531118 1216 high scalability-2012-03-27-Big Data In the Cloud Using Cloudify

Introduction: Edd Dumbill  wrote an interesting article on O’Reilly Radar covering the current solutions for running  Big Data in the Cloud Big data and cloud technology go hand-in-hand. Big data needs clusters of servers for processing, which clouds can readily provide. Big PaaS Edd touched briefly on the role of PaaS for delivering Big Data applications in the cloud Beyond IaaS, several cloud services provide application layer support for big data work. Sometimes referred to as managed solutions, or  platform as a service  (PaaS), these services remove the need to ucale things such as databases or MapReduce, reducing your workload and maintenance burden. Additionally, PaaS providers can realize great efficiencies by hosting at the application level, and pass those savings on to the customer. To put it simply, managing data clusters is one thing. Being able to process the data is yet another challenge that we need to think about when we’re dealing with application platforms, as I no

2 0.82793719 444 high scalability-2008-11-14-Private-Public Cloud

Introduction: Data centers are reshaping themselves by taking ideas from public cloud providers, such as Amazon and Google. The idea is to make the data center more cost-effective by enabling on-demand utility-based computing rather than dedicated machines. At the same time, it is clear that to make IT operations more effective, it doesn't make sense to run all the applications that are currently hosted in a company's data center in the private cloud. This calls for an integration between private and public cloud. In this post i discuss some of the challenges involved in making that happen: 1. How do we design applications to be cloud-agnostic? 2. How do we enable seamless fail-over to a public cloud? 3. Future-proofing: There are many cases in which we can't make a clear decision as to where our application should be running at the time of writing or developing the application. We would like to be in a position to change the decision as to where our application will be running even after o

3 0.79986703 1002 high scalability-2011-03-09-Productivity vs. Control tradeoffs in PaaS

Introduction: Gartner published recently an interesting paper:  Productivity vs. Control: Cloud Application Platforms Must Split to Win . (The paper requires registration.) The paper does a pretty good job covering the evolution that is taking place in the PaaS market toward a more open platform and compares between the two main categories: aPaaS (essentially a PaaS running as a service) and CEAP (Cloud Enabled Application Platform) which is the  *P* out of PaaS that gives you the platform to build your own PaaS in private or public cloud. While I was reading through the paper I felt that something continued to bother me with this definition, even though I tend to agree with the overall observation. If I follow the logic of this paper than I have to give away productivity to gain control, hmm…  that’s a hard choice. The issue seem to be with the way we define productivity. Read the full detailes here

4 0.77968603 250 high scalability-2008-02-17-Web Accelerators - snake oil or miracle remedy?

Introduction: Perhaps this question is borderline off-topic but since high scalability solutions often have a global aspect I will give it a try... Have anybody had any experience with different techniques for speeding up their application to places that have a problem with poor ping response time? Ideally I would love to be running only one data center world-wide but one day I know that our sales department will sign up a customer with an unacceptable response time... Could installing a web-accelerator in front of our application extend the reach of our current data center or will we just add complexity and another source of potential errors?

5 0.76677763 1031 high scalability-2011-04-28-PaaS on OpenStack - Run Applications on Any Cloud, Any Time Using Any Thing

Introduction: Yesterday, I had a session during the  OpenStack Summit  where I tried to present a more general view on how we should be thinking about PaaS in the context of OpenStack. The key takeaway : The main goal of PaaS is to drive productivity into the  process  by which we can deliver new applications. Most of the existing PaaS solutions take a fairly extreme approach with their abstraction of the underlying infrastructure and therefore fit a fairly small number of extremely simple applications and thus miss the real promise of PaaS. Amazon's Elastic Beanstalk took a more bottom up approach giving us better set of tradeoffs between the abstraction and control which makes it more broadly applicable to a larger set of applications. The fact that OpenStack is opensource allows us to think differently on the things we can do at the platform layer.  We can create a tighter integration between the PaaS and IaaS layers and thus come up with better set of tradeoffs into the way we drive

6 0.74217403 1110 high scalability-2011-09-06-Big Data Application Platform

7 0.74010402 702 high scalability-2009-09-11-The interactive cloud

8 0.72707933 1354 high scalability-2012-11-05-Are we seeing the renaissance of enterprises in the cloud?

9 0.71676999 727 high scalability-2009-10-25-Is Your Data Really Secured?

10 0.71386272 762 high scalability-2010-01-18-The Missing Piece in the Virtualization Stack (Part 1)

11 0.7087096 1056 high scalability-2011-06-09-Retrospect on recent AWS outage and Resilient Cloud-Based Architecture

12 0.70739037 1212 high scalability-2012-03-21-The Conspecific Hybrid Cloud

13 0.69614816 1161 high scalability-2011-12-22-Architecting Massively-Scalable Near-Real-Time Risk Analysis Solutions

14 0.68387938 1157 high scalability-2011-12-14-Virtualization and Cloud Computing is Changing the Network to East-West Routing

15 0.6775918 977 high scalability-2011-01-21-PaaS shouldn’t be built in Silos

16 0.67226404 567 high scalability-2009-04-14-Challanges for Developing Enterprise Application on the Cloud

17 0.66232699 266 high scalability-2008-03-04-Manage Downtime Risk by Connecting Multiple Data Centers into a Secure Virtual LAN

18 0.65758568 1184 high scalability-2012-01-31-Performance in the Cloud: Business Jitter is Bad

19 0.65148664 126 high scalability-2007-10-20-Should you build your next website using 3tera's grid OS?

20 0.6482113 924 high scalability-2010-10-21-What is Network-based Application Virtualization and Why Do You Need It?


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(1, 0.187), (2, 0.168), (10, 0.029), (30, 0.015), (36, 0.148), (61, 0.142), (79, 0.124), (94, 0.092)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.93604177 1540 high scalability-2013-10-30-Strategy: Use Your Quantum Computer Lab to Tell Intentional Blinks from Involuntary Blinks

Introduction: Oh, you don't have a Quantum Computer Lab staffed with researchers? Well, Google does. Here they are on G+ . To learn what they are up to the Verge has A first look inside Google's futuristic quantum lab . The lab is partnership between NASA, Google, and a 512-qubit D-Wave Two quantum computer.   One result from the lab is: The first practical application has been on Google Glass, as engineers put the quantum chips to work on Glass's blink detector, helping it to better distinguish between intentional winks and involuntary blinks. For engineering reasons, the quantum processor can never be installed in Glass, but together with Google's conventional server centers, it can point the way to a better blink-detecting algorithm. That would allow the Glass processor to detect blinks with better accuracy and using significantly less power. If successful, it could be an important breakthrough for wink-triggered apps, which have struggled with the task so far. Google thinks quantum

2 0.93380606 1310 high scalability-2012-08-23-Economies of Scale in the Datacenter: Gmail is 100x Cheaper to Run than Your Own Server

Introduction: Urs Hoelzle , infrastructure guru and SVP at Google, made a really interesting statement about the economics of scale in the datacenter: We’ve shown that when you run a large application in the datacenter, like Gmail, you can, compared to a small organization running their own email server, you can save nearly a factor of 100 in terms of compute and energy, when you run it at scale. My first thought was shock at the magnitude of the difference. 100x is a chasm crosser. Then I thought about Gmail, it's horizontally scalable using technologies that are following Moore's Law (storage and compute), latency requirements are lax, a commodity network is sufficient, and it can be highly automated so management costs scale slower than users. After that it's a simple matter of software :-) Oh, and developing a market where it's "cheaper to run a large thing than a small thing."

same-blog 3 0.92267931 1216 high scalability-2012-03-27-Big Data In the Cloud Using Cloudify

Introduction: Edd Dumbill  wrote an interesting article on O’Reilly Radar covering the current solutions for running  Big Data in the Cloud Big data and cloud technology go hand-in-hand. Big data needs clusters of servers for processing, which clouds can readily provide. Big PaaS Edd touched briefly on the role of PaaS for delivering Big Data applications in the cloud Beyond IaaS, several cloud services provide application layer support for big data work. Sometimes referred to as managed solutions, or  platform as a service  (PaaS), these services remove the need to ucale things such as databases or MapReduce, reducing your workload and maintenance burden. Additionally, PaaS providers can realize great efficiencies by hosting at the application level, and pass those savings on to the customer. To put it simply, managing data clusters is one thing. Being able to process the data is yet another challenge that we need to think about when we’re dealing with application platforms, as I no

4 0.92143363 984 high scalability-2011-02-04-Stuff The Internet Says On Scalability For February 4, 2011

Introduction: Submitted for your reading pleasure... Super Bowl Prediction : Pittsburgh 27, Green Bay 24. I'll be rooting for Green Bay, but the Pittsburgh defense will eventually win the day, beating back the fleet footed, quick tossing, and sharp shooting Aaron Rodgers. Roethlisberger will make exactly 3 plays that matter, but they'll be the right 3 plays. Reddit is now at 1 billion page views a month . Congratulations! Amazon S3 Cloud Stores 262 Billion Objects.   My god, it's full of stars... Quora’s Technology Examined  by Phil Whelan. Excellent detective work answering the question: How Does Quora Work? Quotable Quotes: @timoreilly : When hardware became commoditized, software was valuable. Now that software being commoditized, data is valuable. #strataconf @coldfusionPaul : "Write someone a query, they'll go away for a day. Teach someone to query, they'll just go away." so, I use #NoSQL 555 @squarecog:  To go *really* fast, you want to get rid of spokes in

5 0.9194954 608 high scalability-2009-05-27-The Future of the Parallelism and its Challenges

Introduction: The Future of the Parallelism and its Challenges Research and education in Parallel computing technologies is more important than ever. Here I present a perspective on the past contributions, current status, and future direction of the parallelism technologies. While machine power will grow impressively, increased parallelism, rather than clock rate, will be driving force in computing in the foreseeable future. This ongoing shift toward parallel architectural paradigms is one of the greatest challenges for the microprocessor and software industries. In 2005, Justin Ratter, chief technology officer of Intel Corporation, said ‘We are at the cusp of a transition to multicore, multithreaded architectures, and we still have not demonstrated the ease of programming the move will require…’ Key points: A Little history Parallelism Challenges Under the hood, Parallelism Challenges Synchronization problems CAS problems The future of the parallelism

6 0.91902566 1254 high scalability-2012-05-30-Strategy: Get Servers for Free and Make Users Happy by Turning on Compression

7 0.91050851 4 high scalability-2007-07-10-Webcast: Advanced Database High Availability and Scalability Solutions

8 0.90405416 264 high scalability-2008-03-03-Read This Site and Ace Your Next Interview!

9 0.90135145 451 high scalability-2008-11-30-Creating a high-performing online database

10 0.89899659 598 high scalability-2009-05-12-P2P server technology?

11 0.8895244 1180 high scalability-2012-01-24-The State of NoSQL in 2012

12 0.88769346 576 high scalability-2009-04-21-What CDN would you recommend?

13 0.88532007 1289 high scalability-2012-07-23-State of the CDN: More Traffic, Stable Prices, More Products, Profits - Not So Much

14 0.88468707 1002 high scalability-2011-03-09-Productivity vs. Control tradeoffs in PaaS

15 0.884426 1448 high scalability-2013-04-29-AWS v GCE Face-off and Why Innovation Needs Lower Cost Infrastructures

16 0.8824358 853 high scalability-2010-07-08-Cloud AWS Infrastructure vs. Physical Infrastructure

17 0.88066542 435 high scalability-2008-10-30-The case for functional decomposition

18 0.88060242 931 high scalability-2010-10-28-Notes from A NOSQL Evening in Palo Alto

19 0.87962669 649 high scalability-2009-07-02-Product: Facebook's Cassandra - A Massive Distributed Store

20 0.8792274 1040 high scalability-2011-05-13-Stuff The Internet Says On Scalability For May 13, 2011