high_scalability high_scalability-2009 high_scalability-2009-591 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: The Dryad Project is investigating programming models for writing parallel and distributed programs to scale from a small cluster to a large data-center.
sentIndex sentText sentNum sentScore
1 The Dryad Project is investigating programming models for writing parallel and distributed programs to scale from a small cluster to a large data-center. [sent-1, score-2.278]
wordName wordTfidf (topN-words)
[('dryad', 0.582), ('investigating', 0.512), ('programs', 0.287), ('models', 0.266), ('writing', 0.236), ('project', 0.211), ('parallel', 0.19), ('programming', 0.178), ('cluster', 0.162), ('small', 0.145), ('large', 0.104), ('distributed', 0.103), ('scale', 0.095)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 591 high scalability-2009-05-06-Dyrad
Introduction: The Dryad Project is investigating programming models for writing parallel and distributed programs to scale from a small cluster to a large data-center.
2 0.33258381 155 high scalability-2007-11-15-Video: Dryad: A general-purpose distributed execution platform
Introduction: Dryad is Microsoft's answer to Google's map-reduce . What's the question: How do you process really large amounts of data? My initial impression of Dryad is it's like a giant Unix command line filter on steroids. There are lots of inputs, outputs, tees, queues, and merge sorts all connected together by a master exec program. What else does Dryad have to offer the scalable infrastructure wars? Dryad models programs as the execution of a directed acyclic graph. Each vertex is a program and edges are typed communication channels (files, TCP pipes, and shared memory channels within a process). Map-reduce uses a different model. It's more like a large distributed sort where the programmer defines functions for mapping, partitioning, and reducing. Each approach seems to borrow from the spirit of its creating organization. The graph approach seems a bit too complicated and map-reduce seems a bit too simple. How ironic, in the Alanis Morissette sense. Dryad is a middleware layer that ex
3 0.20311794 199 high scalability-2008-01-01-S3 for image storing
Introduction: Hi all, Has anyone got any experience with using Amazon S3 as an uploaded photo store? I'm writing a website that I need to keep as low budget as possible, and I'm investigating solutions for storing uploaded photos from users - not too many, probably in the low thousands. The site is commercial so I'm straying away from the Flickrs of the world. S3 seems to offer a solution but I'd like to hear from those who have used it before. Thanks Andy
4 0.19747131 592 high scalability-2009-05-06-DyradLINQ
Introduction: The goal of DryadLINQ is to make distributed computing on large compute cluster simple enough for ordinary programmers. DryadLINQ combines two important pieces of Microsoft technology: the Dryad distributed execution engine and the .NET Language Integrated Query (LINQ).
5 0.1135193 1273 high scalability-2012-06-27-Paper: Logic and Lattices for Distributed Programming
Introduction: Neil Conway from Berkeley CS is giving an advanced level talk at a meetup today in San Francisco on a new paper: Logic and Lattices for Distributed Programming - extending set logic to support CRDT-style lattices. The description of the meetup is probably the clearest introduction to the paper: Developers are increasingly choosing datastores that sacrifice strong consistency guarantees in exchange for improved performance and availability. Unfortunately, writing reliable distributed programs without the benefit of strong consistency can be very challenging. In this talk, I'll discuss work from our group at UC Berkeley that aims to make it easier to write distributed programs without relying on strong consistency. Bloom is a declarative programming language for distributed computing, while CALM is an analysis technique that identifies programs that are guaranteed to be eventually consistent. I'll then discuss our recent work on extending CALM to support a broader range of
6 0.095278174 595 high scalability-2009-05-08-Publish-subscribe model does not scale?
7 0.088400856 16 high scalability-2007-07-16-Book: High Performance MySQL
8 0.085781962 801 high scalability-2010-03-30-Running Large Graph Algorithms - Evaluation of Current State-of-the-Art and Lessons Learned
9 0.08559262 397 high scalability-2008-09-28-Product: Happy = Hadoop + Python
10 0.083613195 254 high scalability-2008-02-19-Hadoop Getting Closer to 1.0 Release
11 0.080337457 658 high scalability-2009-07-17-Against all the odds
12 0.079664961 612 high scalability-2009-05-31-Parallel Programming for real-world
13 0.078279801 1625 high scalability-2014-04-03-Leslie Lamport to Programmers: You're Doing it Wrong
14 0.074144699 218 high scalability-2008-01-17-Moving old to new. Do not be afraid of the re-write -- but take some help
15 0.070792899 1075 high scalability-2011-07-07-Myth: Google Uses Server Farms So You Should Too - Resurrection of the Big-Ass Machines
16 0.068321407 666 high scalability-2009-07-30-Learn How to Think at Scale
17 0.064256087 850 high scalability-2010-06-30-Paper: GraphLab: A New Framework For Parallel Machine Learning
18 0.063622378 1204 high scalability-2012-03-06-Ask For Forgiveness Programming - Or How We'll Program 1000 Cores
19 0.062027019 448 high scalability-2008-11-22-Google Architecture
20 0.059735108 1641 high scalability-2014-05-01-Paper: Can Programming Be Liberated From The Von Neumann Style?
topicId topicWeight
[(0, 0.055), (1, 0.036), (2, 0.019), (3, 0.046), (4, 0.005), (5, 0.041), (6, 0.007), (7, 0.011), (8, -0.012), (9, 0.053), (10, 0.015), (11, -0.009), (12, 0.012), (13, -0.045), (14, 0.035), (15, -0.041), (16, -0.039), (17, 0.006), (18, 0.003), (19, 0.045), (20, -0.004), (21, 0.001), (22, -0.065), (23, 0.006), (24, -0.019), (25, -0.007), (26, 0.068), (27, -0.013), (28, 0.029), (29, -0.001), (30, 0.008), (31, 0.053), (32, -0.033), (33, 0.024), (34, 0.003), (35, -0.041), (36, 0.057), (37, -0.069), (38, 0.001), (39, 0.014), (40, 0.008), (41, 0.042), (42, -0.024), (43, -0.024), (44, -0.005), (45, 0.004), (46, -0.014), (47, -0.036), (48, 0.067), (49, -0.031)]
simIndex simValue blogId blogTitle
same-blog 1 0.96160936 591 high scalability-2009-05-06-Dyrad
Introduction: The Dryad Project is investigating programming models for writing parallel and distributed programs to scale from a small cluster to a large data-center.
2 0.67937893 850 high scalability-2010-06-30-Paper: GraphLab: A New Framework For Parallel Machine Learning
Introduction: In the never ending quest to figure out how to do something useful with never ending streams of data, GraphLab: A New Framework For Parallel Machine Learning wants to go beyond low-level programming, MapReduce, and dataflow languages with a new parallel framework for ML (machine learning) which exploits the sparse structure and common computational patterns of ML algorithms. GraphLab enables ML experts to easily design and implement efficient scalable parallel algorithms by composing problem specific computation, data-dependencies, and scheduling . Our main contributions include: A graph-based data model which simultaneously represents data and computational dependencies. A set of concurrent access models which provide a range of sequential-consistency guarantees. A sophisticated modular scheduling mechanism. An aggregation framework to manage global state. From the abstract: Designing and implementing efficient, provably correct parallel machine lear
3 0.6749292 581 high scalability-2009-04-26-Map-Reduce for Machine Learning on Multicore
Introduction: We are at the beginning of the multicore era. Computers will have increasingly many cores (processors), but there is still no good programming framework for these architectures, and thus no simple and unified way for machine learning to take advantage of the potential speed up. In this paper, we develop a broadly applicable parallel programming method, one that is easily applied to many different learning algorithms. Our work is in distinct contrast to the tradition in machine learning of designing (often ingenious) ways to speed up a single algorithm at a time. Specifically, we show that algorithms that fit the Statistical Query model can be written in a certain “summation form,” which allows them to be easily parallelized on multicore computers. We adapt Google’s map-reduce paradigm to demonstrate this parallel speed up technique on a variety of learning algorithms including locally weighted linear regression (LWLR), k-means, logistic regression (LR), naive Bayes (NB), SVM
4 0.61684656 401 high scalability-2008-10-04-Is MapReduce going mainstream?
Introduction: Compares MapReduce to other parallel processing approaches and suggests new paradigm for clouds and grids
5 0.59218347 612 high scalability-2009-05-31-Parallel Programming for real-world
Introduction: Multicore computers shift the burden of software performance from chip designers and architects to software developers. What is the parallel Computing ? and what the different between Multi-Threading and Concurrency and Parallelism ? and what is differences between task and data parallel ? and how we can use it ? Fundamental article into Parallel Programming...
6 0.59032595 844 high scalability-2010-06-18-Paper: The Declarative Imperative: Experiences and Conjectures in Distributed Logic
7 0.58465767 592 high scalability-2009-05-06-DyradLINQ
8 0.57943255 608 high scalability-2009-05-27-The Future of the Parallelism and its Challenges
9 0.54789591 839 high scalability-2010-06-09-Paper: Propagation Networks: A Flexible and Expressive Substrate for Computation
10 0.5429979 155 high scalability-2007-11-15-Video: Dryad: A general-purpose distributed execution platform
11 0.53727418 1625 high scalability-2014-04-03-Leslie Lamport to Programmers: You're Doing it Wrong
12 0.53131354 810 high scalability-2010-04-14-Parallel Information Retrieval and Other Search Engine Goodness
13 0.53109407 1273 high scalability-2012-06-27-Paper: Logic and Lattices for Distributed Programming
14 0.53020108 983 high scalability-2011-02-02-Piccolo - Building Distributed Programs that are 11x Faster than Hadoop
15 0.5300402 1641 high scalability-2014-05-01-Paper: Can Programming Be Liberated From The Von Neumann Style?
16 0.52976924 534 high scalability-2009-03-12-Google TechTalk: Amdahl's Law in the Multicore Era
17 0.52732223 1127 high scalability-2011-09-28-Pursue robust indefinite scalability with the Movable Feast Machine
18 0.52134025 376 high scalability-2008-09-03-MapReduce framework Disco
19 0.51187229 1446 high scalability-2013-04-25-Paper: Making reliable distributed systems in the presence of software errors
20 0.50444251 660 high scalability-2009-07-21-Paper: Parallelizing the Web Browser
topicId topicWeight
[(1, 0.262), (2, 0.176), (42, 0.323)]
simIndex simValue blogId blogTitle
1 0.87050408 1608 high scalability-2014-03-10-Let's Play a Game of Take It or Leave It - Game 1
Introduction: The way this game is played is you read a few statements on some hot topics below. If you agree with a statement then you “take it”; if you disagree then you “leave it.” And if you are so moved please write a convincing comment as to why. Got it? Snowden vs. the State. Snowden represents true the spirit of freedom and is not a threat to all we hold dear. Walled Garden vs. Federated Freedom. The Walled Garden has won the last decade. The cycle of life will return the balance and federated services will once again win the day. Mobile + messaging vs. Le Web. Mobile + messaging is eating search and the web, changing the way things are found, discovered, and bought. Fiat vs. Cryptocurrency. BitCoin has had its 400 million dollars of fame, it’s on the way out, a tulip gone out of bloom. True Detective vs. The Field. True Detective is the best show on TV, ever. Wired and Breaking Bad need not apply.
2 0.85665977 814 high scalability-2010-04-20-Sponsored Post: Event - Social Developer Summit
Introduction: Social Developer Summit - June 29, 2010 - San Franciso, CA A meeting of the technically social - Building, scaling, and profiting in a social age Whether it's social games, social news, social discovery, social search, or other forms of social solutions , developers today are facing new hurdles in building instantly scalable products. As new technologies emerge to address the challenges faced by social application developers, it's increasingly important to come together for knowledge sharing purposes. The first Social Developer Summit will bring together social application developers to discuss the challenges, solutions, and best practices for building applications in the rapidly expanding social web economy . At the Social Developer Summit, industry experts will share tips and case studies for building high performance social web products. For more information please take a look at Social Developer Summit . If you are interested in a sponsored pos
3 0.85665715 804 high scalability-2010-04-06-Sponsored Post: Event - Social Developer Summit
Introduction: Social Developer Summit - June 29, 2010 - San Franciso, CA A meeting of the technically social - Building, scaling, and profiting in a social age Whether it's social games, social news, social discovery, social search, or other forms of social solutions , developers today are facing new hurdles in building instantly scalable products. As new technologies emerge to address the challenges faced by social application developers, it's increasingly important to come together for knowledge sharing purposes. The first Social Developer Summit will bring together social application developers to discuss the challenges, solutions, and best practices for building applications in the rapidly expanding social web economy . At the Social Developer Summit, industry experts will share tips and case studies for building high performance social web products. For more information please take a look at Social Developer Summit . If you are interested in a sponsored pos
4 0.85509634 524 high scalability-2009-03-04-Its time for auto scaling – avoid peak load provisioning for web applications
Introduction: Many web applications, including eBanking, Trading, eCommerce and Online Gaming, face large, fluctuating loads. In this post will describe how to achieve Right Sizing using virtualization and cloud computing. Will use a standard JEE web application to demonstrate how auto-scaling works on AWS Cloud without changing your application code.
same-blog 5 0.82072473 591 high scalability-2009-05-06-Dyrad
Introduction: The Dryad Project is investigating programming models for writing parallel and distributed programs to scale from a small cluster to a large data-center.
6 0.72529888 1227 high scalability-2012-04-13-Stuff The Internet Says On Scalability For April 13, 2012
7 0.70228827 1427 high scalability-2013-03-20-Dart - Is it the Future of the Web?
8 0.69516248 286 high scalability-2008-03-20-Paper: Asynchronous HTTP and Comet architectures
9 0.69172382 1641 high scalability-2014-05-01-Paper: Can Programming Be Liberated From The Von Neumann Style?
10 0.68961746 473 high scalability-2008-12-20-Second Life Architecture - The Grid
11 0.68897879 887 high scalability-2010-08-24-Sponsored Post: deviantART, Okta, EzRez, Cloud Sigma, ManageEngine, Site24x7
12 0.68874192 876 high scalability-2010-08-10-Sponsored Post: Okta, EzRez, VoltDB, Digg, Cloud Sigma, Applications Manager, Site24x7
13 0.68864465 866 high scalability-2010-07-27-Sponsored Post: Okta, EzRez, VoltDB, Digg, Cloud Sigma, Applications Manager, Site24x7
14 0.68785757 625 high scalability-2009-06-10-Managing cross partition transactions in a distributed KV system
15 0.68086302 424 high scalability-2008-10-22-EVE Online Architecture
16 0.67942673 762 high scalability-2010-01-18-The Missing Piece in the Virtualization Stack (Part 1)
17 0.67741966 1342 high scalability-2012-10-17-World of Warcraft's Lead designer Rob Pardo on the Role of the Cloud in Games
18 0.6762383 1205 high scalability-2012-03-07-Scale Indefinitely on S3 With These Secrets of the S3 Masters
19 0.67521572 799 high scalability-2010-03-23-Digg: 4000% Performance Increase by Sorting in PHP Rather than MySQL
20 0.67364514 69 high scalability-2007-08-21-What does the next generation data center look like?