high_scalability high_scalability-2009 high_scalability-2009-590 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Art of Distributed Part 1: Rethinking about distributed computing models I ‘m getting a lot of questions lately about the distributed computing, especially distributed computing model, and MapReduce, such as: What is MapReduce? Can MapReduce fit in all situations? How we can compares it with other technologies such as Grid Computing? And what is the best solution to our situation? So I decide to write about the distributed computing article in two parts. First one about the distributed computing model and what is the difference between them. In the second part I will discuss the reliability, and distributed storage systems. Download the article in PDF format. Download the article in MS Word format. I wait for your comments, and questions, and I will answer it in part two.
sentIndex sentText sentNum sentScore
1 Art of Distributed Part 1: Rethinking about distributed computing models I ‘m getting a lot of questions lately about the distributed computing, especially distributed computing model, and MapReduce, such as: What is MapReduce? [sent-1, score-2.302]
2 How we can compares it with other technologies such as Grid Computing? [sent-3, score-0.295]
3 So I decide to write about the distributed computing article in two parts. [sent-5, score-1.25]
4 First one about the distributed computing model and what is the difference between them. [sent-6, score-0.983]
5 In the second part I will discuss the reliability, and distributed storage systems. [sent-7, score-0.658]
6 I wait for your comments, and questions, and I will answer it in part two. [sent-10, score-0.345]
wordName wordTfidf (topN-words)
[('computing', 0.423), ('mapreduce', 0.332), ('distributed', 0.274), ('rethinking', 0.25), ('article', 0.248), ('pdf', 0.213), ('questions', 0.212), ('lately', 0.21), ('compares', 0.204), ('situations', 0.165), ('ms', 0.154), ('situation', 0.152), ('art', 0.151), ('word', 0.142), ('decide', 0.139), ('discuss', 0.139), ('model', 0.134), ('part', 0.126), ('difference', 0.125), ('comments', 0.122), ('wait', 0.116), ('reliability', 0.11), ('fit', 0.11), ('grid', 0.105), ('answer', 0.103), ('two', 0.1), ('especially', 0.093), ('technologies', 0.091), ('getting', 0.074), ('second', 0.07), ('write', 0.066), ('solution', 0.066), ('best', 0.061), ('first', 0.049), ('storage', 0.049), ('lot', 0.045), ('one', 0.027)]
simIndex simValue blogId blogTitle
same-blog 1 1.0000001 590 high scalability-2009-05-06-Art of Distributed
Introduction: Art of Distributed Part 1: Rethinking about distributed computing models I ‘m getting a lot of questions lately about the distributed computing, especially distributed computing model, and MapReduce, such as: What is MapReduce? Can MapReduce fit in all situations? How we can compares it with other technologies such as Grid Computing? And what is the best solution to our situation? So I decide to write about the distributed computing article in two parts. First one about the distributed computing model and what is the difference between them. In the second part I will discuss the reliability, and distributed storage systems. Download the article in PDF format. Download the article in MS Word format. I wait for your comments, and questions, and I will answer it in part two.
2 0.24384883 445 high scalability-2008-11-14-Useful Cloud Computing Blogs
Introduction: Update 2: Overcast: Conversations on Cloud Computing . Listened to the first two podcasts and they're doing a great job. Worth a look. The singing and dance routines are way over the top however :-) Update: 9 Sources of Cloud Computing News You May Not Know About by James Urquhart. I folded in these recommendations. Can't get enough cloud computing? Then you must really be a glutton for punishment! But just in case, here are some cloud computing resources, collected from various sources, that will help you transform into a Tesla silently flying solo down the diamond lane. Meta Sources Cloud Computing Email List : An often lively email list discussing cloud computing. Cloud Computing Blogs & Resources . An excellent and big list of cloud resources. Cloud Computing Portal : A community edited database for making the vendor selection process easier. List of Cloud Platforms, Providers, and Enablers . datacenterknowledge.com's Recap: More than 70 Industry Blogs : A
3 0.20182429 386 high scalability-2008-09-22-Cloud computing, grid computing, utility computing - list of top providers
Introduction: You want to have a scalable website. You want a website which can handle traffic spikes (think if you are getting on Digg, Slahsdot, Reddit, Techcrunch or other very popular websites frontpage). Regular hosting companies (especially shared hosting) can offer only so much. The servers usually get crushed under the load in short time. But there is hope. A new breed of hosting companies emerged recently. A new breed which can offer you the scalability you need at a fraction of the cost. Welcome to the world of “cloud computing!” (or “grid computing” or “utility computing”, which are terms for the same thing). Here's a website which compiled a list of cloud computing hosting companies (with short descriptions, prices and customer lists for each of them). Read the entire article about Cloud computing, grid computing, utility computing list at MyTestBox.com - web software reviews, news, tips & tricks .
4 0.18827817 325 high scalability-2008-05-25-How do you explain cloud computing to your grandma?
Introduction: Update 2: Nice introductory New York Time's article Cloud Computing: So You Don’t Have to Stand Still . Good example of how Animoto used RightScale and Amazon to meet a Facebook driven demand of 25,000 test drives an hour. Update: Peter Laird in Understanding the Cloud Computing/SaaS/PaaS markets: a Map of the Players in the Industry paints a very cool visual map of all the cloud service players. It's a larger industry than you might think. Once upon a time I worked at an Asynchronous Transfer Mode (ATM) switch startup. Over a delicious Christmas punch my grandma asked me what I did for a living that I could afford such extravagantly inexpensive gifts. Always so subtle. I explained I worked on an ATM switch. Mistake. She sniffed, said that's nice, and asked me why the Automated Teller Machine ate her bank card that morning. No matter how hard I tried I couldn't convince her I didn't work on bank ATMs. To all future job interrogations I waxed off, protesting I do boring soft
5 0.18602523 401 high scalability-2008-10-04-Is MapReduce going mainstream?
Introduction: Compares MapReduce to other parallel processing approaches and suggests new paradigm for clouds and grids
6 0.158796 362 high scalability-2008-08-11-Distributed Computing & Google Infrastructure
7 0.15266576 483 high scalability-2009-01-04-Paper: MapReduce: Simplified Data Processing on Large Clusters
8 0.13449576 223 high scalability-2008-01-25-Google: Introduction to Distributed System Design
9 0.12358823 22 high scalability-2007-07-23-Weblink Template
10 0.12317432 585 high scalability-2009-04-29-How to choice and build perfect server
11 0.11488102 891 high scalability-2010-09-01-Scale-out vs Scale-up
12 0.11336521 46 high scalability-2007-07-30-Product: Sun Utility Computing
13 0.10937231 309 high scalability-2008-04-23-Behind The Scenes of Google Scalability
14 0.10783605 414 high scalability-2008-10-15-Hadoop - A Primer
15 0.10489602 803 high scalability-2010-04-05-Intercloud: How Will We Scale Across Multiple Clouds?
16 0.10127774 448 high scalability-2008-11-22-Google Architecture
17 0.10015594 1049 high scalability-2011-05-31-Awesome List of Advanced Distributed Systems Papers
18 0.099911608 69 high scalability-2007-08-21-What does the next generation data center look like?
19 0.099758804 479 high scalability-2008-12-29-Platform virtualization - top 25 providers (software, hardware, combined)
20 0.099644981 355 high scalability-2008-07-21-Eucalyptus - Build Your Own Private EC2 Cloud
topicId topicWeight
[(0, 0.117), (1, 0.045), (2, 0.074), (3, 0.119), (4, -0.074), (5, 0.049), (6, -0.021), (7, -0.066), (8, 0.027), (9, 0.176), (10, -0.001), (11, -0.005), (12, 0.006), (13, 0.036), (14, 0.014), (15, -0.022), (16, -0.038), (17, -0.046), (18, 0.025), (19, -0.017), (20, -0.012), (21, 0.016), (22, -0.037), (23, 0.021), (24, 0.041), (25, -0.092), (26, 0.096), (27, 0.16), (28, -0.066), (29, 0.062), (30, 0.021), (31, 0.021), (32, -0.015), (33, 0.112), (34, 0.006), (35, -0.101), (36, 0.189), (37, 0.034), (38, 0.001), (39, 0.033), (40, -0.051), (41, 0.006), (42, 0.011), (43, 0.038), (44, -0.027), (45, -0.114), (46, 0.06), (47, 0.103), (48, 0.002), (49, -0.042)]
simIndex simValue blogId blogTitle
same-blog 1 0.99225235 590 high scalability-2009-05-06-Art of Distributed
Introduction: Art of Distributed Part 1: Rethinking about distributed computing models I ‘m getting a lot of questions lately about the distributed computing, especially distributed computing model, and MapReduce, such as: What is MapReduce? Can MapReduce fit in all situations? How we can compares it with other technologies such as Grid Computing? And what is the best solution to our situation? So I decide to write about the distributed computing article in two parts. First one about the distributed computing model and what is the difference between them. In the second part I will discuss the reliability, and distributed storage systems. Download the article in PDF format. Download the article in MS Word format. I wait for your comments, and questions, and I will answer it in part two.
2 0.7999537 891 high scalability-2010-09-01-Scale-out vs Scale-up
Introduction: In this post I'll cover the difference between multi-core concurrency that is often referred to as Scale-Up and distributed computing that is often referred to as Scale-Out mode. more ..
3 0.78864723 386 high scalability-2008-09-22-Cloud computing, grid computing, utility computing - list of top providers
Introduction: You want to have a scalable website. You want a website which can handle traffic spikes (think if you are getting on Digg, Slahsdot, Reddit, Techcrunch or other very popular websites frontpage). Regular hosting companies (especially shared hosting) can offer only so much. The servers usually get crushed under the load in short time. But there is hope. A new breed of hosting companies emerged recently. A new breed which can offer you the scalability you need at a fraction of the cost. Welcome to the world of “cloud computing!” (or “grid computing” or “utility computing”, which are terms for the same thing). Here's a website which compiled a list of cloud computing hosting companies (with short descriptions, prices and customer lists for each of them). Read the entire article about Cloud computing, grid computing, utility computing list at MyTestBox.com - web software reviews, news, tips & tricks .
4 0.74919343 1049 high scalability-2011-05-31-Awesome List of Advanced Distributed Systems Papers
Introduction: As part of Dr. Indranil Gupta 's CS 525 Spring 2011 Advanced Distributed Systems class, he has collected an incredible list of resources on distributed systems . His research group is also doing some interesting work. The various topics include: Before there Were Clouds, Cloud Computing, P2P Systems, Basic Distributed Computing Concepts, Sensor Networks, Overlays and DHTs, Cloud Programming, Cloud Scheduling, Key-Value Stores, Storage, Sensor Net Routing, Geo-Distribution, P2P Apps, In-network processing, Epidemics, Probabilistic Membership Protocols, Distributed Monitoring and Management, Publish-Subscribe/CDNs, Measurement Studies, Old Wine: Stale or Vintage?, In Byzantium, Cloud Pricing, Other Industrial Systems, Structure of Networks, Completing the Circle, Green Clouds, Distributed Debugging, Flash!, The Middle or the End?, Availability-Aware Systems, Design Methodologies, Handling Stress, Sources of unreliability in networks, Handling Stress, Selfish algorithms, Securi
5 0.74608231 362 high scalability-2008-08-11-Distributed Computing & Google Infrastructure
Introduction: A couple of videos about distributed computing with direct reference on Google infrastructure. You will get acquainted with: --MapReduce the software framework implemented by Google to support parallel computations over large (greater than 100 terabyte) data sets on commodity hardware --GFS and the way it stores it's data into 64mb chunks --Bigtable which is the simple implementation of a non-relational database at Google Cluster Computing and MapReduce Lectures 1-5 .
6 0.74531573 401 high scalability-2008-10-04-Is MapReduce going mainstream?
7 0.74452996 743 high scalability-2009-11-23-Big Data on Grids or on Clouds?
8 0.70390445 592 high scalability-2009-05-06-DyradLINQ
9 0.69566011 612 high scalability-2009-05-31-Parallel Programming for real-world
10 0.6863665 445 high scalability-2008-11-14-Useful Cloud Computing Blogs
11 0.67692137 325 high scalability-2008-05-25-How do you explain cloud computing to your grandma?
12 0.6580593 652 high scalability-2009-07-08-Art of Parallelism presentation
13 0.64755505 479 high scalability-2008-12-29-Platform virtualization - top 25 providers (software, hardware, combined)
14 0.64358181 69 high scalability-2007-08-21-What does the next generation data center look like?
15 0.63274664 470 high scalability-2008-12-18-Risk Analysis on the Cloud (Using Excel and GigaSpaces)
16 0.62849396 223 high scalability-2008-01-25-Google: Introduction to Distributed System Design
17 0.62264121 309 high scalability-2008-04-23-Behind The Scenes of Google Scalability
18 0.62233794 46 high scalability-2007-07-30-Product: Sun Utility Computing
19 0.61222684 608 high scalability-2009-05-27-The Future of the Parallelism and its Challenges
20 0.60646605 580 high scalability-2009-04-24-INFOSCALE 2009 in June in Hong Kong
topicId topicWeight
[(1, 0.142), (2, 0.189), (61, 0.125), (63, 0.21), (79, 0.182)]
simIndex simValue blogId blogTitle
1 0.95870781 193 high scalability-2007-12-26-Finding an excellent LAMP developer
Introduction: Hi... I have this idea to start a really great and scalable website, and I am building it! So far I'm doing everything myself - coding, networking, architecture planning, everything. I haven't even gotten into the legal aspects yet....... It would be MUCH easier if I had a technical person to handle that end of the operation. I'm a good coder, but like Bill Gates at Harvard for Math, I'm not the very best. I'd like to FIND that very best person available, to handle the technical aspects. For worse or better, I don't presently know somebody who fits this bill. I've posted a bazillion ads on Craig's List, with no really qualified responses. I've put out feelers among my own network, same result. Not sure what else I can do. Shoestring budget, so it's sweat equity in the beginning. That can actually be a plus, as it forces people to focus. Any ideas about what else I can do, to attract the right person? Thanks Jason
2 0.94849372 216 high scalability-2008-01-17-Database People Hating on MapReduce
Introduction: Update: Typical Programmer tackles the technical issues in Relational Database Experts Jump The MapReduce Shark . The culture clash is still what fascinates me. David DeWitt writes in the Database Column that MapReduce is a major step backwards: A giant step backward in the programming paradigm for large-scale data intensive applications A sub-optimal implementation, in that it uses brute force instead of indexing Not novel at all -- it represents a specific implementation of well known techniques developed nearly 25 years ago Missing most of the features that are routinely included in current DBMS Incompatible with all of the tools DBMS users have come to depend on Listening to databasers and map reducers talk is like eavesdropping on your average family holiday mashup. Every holiday people who have virtually nothing in common are thrown together because they incidentally share a little DNA or are married to the shared DNA. In desperation everyone gravitates to som
3 0.91598749 647 high scalability-2009-07-02-Hypertable is a New BigTable Clone that Runs on HDFS or KFS
Introduction: Update 3 : Presentation from the NoSQL conference : slides , video 1 , video 2 . Update 2 : The folks at Hypertable would like you to know that Hypertable is now officially sponsored by Baidu , China’s Leading Search Engine. As a sponsor of Hypertable, Baidu has committed an industrious team of engineers, numerous servers, and support resources to improve the quality and development of the open source technology. Update : InfoQ interview on Hypertable Lead Discusses Hadoop and Distributed Databases . Hypertable differs from HBase in that it is a higher performance implementation of Bigtable. Skrentablog gives the heads up on Hypertable , Zvents' open-source BigTable clone. It's written in C++ and can run on top of either HDFS or KFS. Performance looks encouraging at 28M rows of data inserted at a per-node write rate of 7mb/sec .
4 0.91067749 940 high scalability-2010-11-12-Stuff the Internet Says on Scalability For November 12th, 2010
Introduction: Google – A Study In Scalability And A Little Systems Horse Sense . A nice summary by Krishna Sankar of a version of Jeff Dean's classic talk on Google Scalability given to Stanford's EE380 class . Quotable Quotes: @jkalucki : Getting just 100 servers to work together for the first time is so ridiculously complicated. Horizontal scaling doesn't scale. @simeons : Yahoo's scalability is drivem by lots of asynchronous processing. "You learn to love it." -- @rstata Yahoo's CTO The Economics of the Cloud: Dissecting a Must-Read White Paper by Bernard Golden. I love the depiction of the unseen and unfelt forces that nevertheless organize everything around them: After a brief introduction, the authors lay out a central thesis: despite initial concerns about shortcomings in new technology offerings, "historically, underlying economics have a much stronger impact on the direction and speed of disruptions, as technological challenges are resolved or overcome thro
5 0.88289684 999 high scalability-2011-03-04-Stuff The Internet Says On Scalability For March 4, 2011
Introduction: Submitted for your reading pleasure on this beautifully blue and sunny Friday... @Werner : Each day #AWS adds enough computing muscle to power one whole Amazon.com circa 2000, when it was a $2.8 billion business http://wv.ly/gMr8LQ Building servers to rule in hell. Datacenters spend a lot of energy on cooling down processors. Why can't they operate at higher temperatures? This is the proposition addressed by James Hamilton in Exploring the Limits of Datacenter Temprature and Datacenter Knowledge in What’s Next? Hotter Servers with ‘Gas Pedals’ . Quotable Quotes for 200 Watson: @jreichhold : One thing working at Twitter teaches me daily is that all scale is relative. What seemed impossible last year is now the daily case. @dannycast0nguay : If you’re concerned about scalability, any algorithm that forces you to run agreement will eventually become your bottleneck.—Werner Vogels @rael : No shortcut ever goes undetected by scale. @sr
6 0.87736124 1613 high scalability-2014-03-17-Intuitively Showing How To Scale a Web Application Using a Coffee Shop as an Example
same-blog 8 0.83825779 590 high scalability-2009-05-06-Art of Distributed
9 0.82743847 711 high scalability-2009-09-22-How Ravelry Scales to 10 Million Requests Using Rails
10 0.81239629 687 high scalability-2009-08-24-How Google Serves Data from Multiple Datacenters
11 0.81235713 1242 high scalability-2012-05-09-Cell Architectures
12 0.81194574 702 high scalability-2009-09-11-The interactive cloud
13 0.81167853 129 high scalability-2007-10-23-Hire Facebook, Ning, and Salesforce to Scale for You
14 0.81151903 972 high scalability-2011-01-11-Google Megastore - 3 Billion Writes and 20 Billion Read Transactions Daily
15 0.81134474 1240 high scalability-2012-05-07-Startups are Creating a New System of the World for IT
16 0.81106085 733 high scalability-2009-10-29-Paper: No Relation: The Mixed Blessings of Non-Relational Databases
17 0.81078637 1535 high scalability-2013-10-21-Google's Sanjay Ghemawat on What Made Google Google and Great Big Data Career Advice
18 0.80952531 803 high scalability-2010-04-05-Intercloud: How Will We Scale Across Multiple Clouds?
19 0.80932778 1485 high scalability-2013-07-01-PRISM: The Amazingly Low Cost of Using BigData to Know More About You in Under a Minute