high_scalability high_scalability-2009 high_scalability-2009-635 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Distributed systems are not typically a place domain driven design is applied. Distributed processing projects often start with an overall architecture vision and the idea about a processing model which basically drives the whole thing, including object design if it exists at all. Elaborate object designs are thought of as something that just gets in the way of distribution and performance, so the idea of spending time to apply DDD principles gets rejected in favour of raw throughput and processing power. However, from my experience, some more advanced DDD concepts can significantly improve the performance, scalability and throughput of distributed systems when applied correctly. This article a summary of the presentation titled "DDD in a distributed world" from the DDD Exchange 09 in London.
sentIndex sentText sentNum sentScore
1 Distributed systems are not typically a place domain driven design is applied. [sent-1, score-0.404]
2 Distributed processing projects often start with an overall architecture vision and the idea about a processing model which basically drives the whole thing, including object design if it exists at all. [sent-2, score-1.286]
3 Elaborate object designs are thought of as something that just gets in the way of distribution and performance, so the idea of spending time to apply DDD principles gets rejected in favour of raw throughput and processing power. [sent-3, score-1.6]
4 However, from my experience, some more advanced DDD concepts can significantly improve the performance, scalability and throughput of distributed systems when applied correctly. [sent-4, score-0.652]
5 This article a summary of the presentation titled "DDD in a distributed world" from the DDD Exchange 09 in London. [sent-5, score-0.399]
wordName wordTfidf (topN-words)
[('ddd', 0.795), ('favour', 0.18), ('titled', 0.143), ('rejected', 0.14), ('processing', 0.138), ('elaborate', 0.13), ('object', 0.13), ('gets', 0.115), ('throughput', 0.112), ('distributed', 0.11), ('exists', 0.101), ('concepts', 0.1), ('spending', 0.099), ('london', 0.095), ('designs', 0.093), ('exchange', 0.089), ('raw', 0.089), ('vision', 0.088), ('principles', 0.085), ('basically', 0.083), ('idea', 0.082), ('drives', 0.08), ('domain', 0.078), ('applied', 0.078), ('typically', 0.078), ('significantly', 0.074), ('presentation', 0.074), ('advanced', 0.073), ('summary', 0.072), ('design', 0.072), ('overall', 0.069), ('driven', 0.068), ('projects', 0.067), ('distribution', 0.066), ('apply', 0.065), ('however', 0.062), ('place', 0.057), ('thought', 0.056), ('improve', 0.054), ('whole', 0.054), ('including', 0.053), ('systems', 0.051), ('thing', 0.05), ('start', 0.046), ('often', 0.045), ('performance', 0.041), ('experience', 0.041), ('model', 0.04), ('world', 0.035), ('something', 0.035)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 635 high scalability-2009-06-22-Improving performance and scalability with DDD
Introduction: Distributed systems are not typically a place domain driven design is applied. Distributed processing projects often start with an overall architecture vision and the idea about a processing model which basically drives the whole thing, including object design if it exists at all. Elaborate object designs are thought of as something that just gets in the way of distribution and performance, so the idea of spending time to apply DDD principles gets rejected in favour of raw throughput and processing power. However, from my experience, some more advanced DDD concepts can significantly improve the performance, scalability and throughput of distributed systems when applied correctly. This article a summary of the presentation titled "DDD in a distributed world" from the DDD Exchange 09 in London.
2 0.09011218 1313 high scalability-2012-08-28-Making Hadoop Run Faster
Introduction: Making Hadoop Run Faster One of the challenges in processing data is that the speed at which we can input data is quite often much faster than the speed at which we can process it. This problem becomes even more pronounced in the context of Big Data, where the volume of data keeps on growing, along with a corresponding need for more insights, and thus the need for more complex processing also increases. Batch Processing to the Rescue Hadoop was designed to deal with this challenge in the following ways: 1. Use a distributed file system: This enables us to spread the load and grow our system as needed. 2. Optimize for write speed: To enable fast writes the Hadoop architecture was designed so that writes are first logged, and then processed. This enables fairly fast write speeds. 3. Use batch processing (Map/Reduce) to balance the speed for the data feeds with the processing speed. Batch Processing Challenges The challenge with batch-processing is that it assumes
3 0.061765231 544 high scalability-2009-03-18-QCon London 2009: Upgrading Twitter without service disruptions
Introduction: Evan Weaver from Twitter presented a talk on Twitter software upgrades, titled Improving running components as part of the Systems that never stop track at QCon London 2009 conference last Friday. The talk focused on several upgrades performed since last May, while Twitter was experiencing serious performance problems.
4 0.057490192 906 high scalability-2010-09-22-Applying Scalability Patterns to Infrastructure Architecture
Introduction: Too often software design patterns are overlooked by network and application delivery network architects but these patterns are often equally applicable to addressing a broad range of architectural challenges in the application delivery tier of the data center. By Lori Mac Vittie, F5 Networks The “ High Scalability ” blog is fast becoming one of my favorite reads. Last week did not disappoint with a post highlighting a set of scalability design patterns that was, apparently, inspired by yet another High Scalability post on “ 6 Ways to Kill Your Servers: Learning to Scale the Hard Way. ” Credit:Michael Chow/azcentral.com This particular post caught my attention primarily because although I’ve touched on many of these patterns in the past, I’ve never thought to call them what they are: scalability patterns. That’s probably a side-effect of forgetting that building an architecture of any kind is at its core computer science and thus
5 0.055824354 666 high scalability-2009-07-30-Learn How to Think at Scale
Introduction: Aaron Kimball of Cloudera gives a wonderful 23 minute presentation titled Cloudera Hadoop Training: Thinking at Scale Cloudera which talks about "common challenges and general best practices for scaling with your data." As a company Cloudera offers "enterprise-level support to users of Apache Hadoop." Part of that offering is a really useful series of tutorial videos on the Hadoop ecosystem . Like TV lawyer Perry Mason (or is it Harmon Rabb?), Aaron gradually builds his case. He opens with the problem of storing lots of data. Then a blistering cross examination of the problem of building distributed systems to analyze that data sets up a powerful closing argument. With so much testimony behind him, on closing Aaron really brings it home with why shared nothing systems like map-reduce are the right solution on how to query lots of data. They jury loved it. Here's the video Thinking at Scale . And here's a summary of some of the lessons learned from the talk: Lessons Learned
6 0.05561287 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
7 0.055552389 537 high scalability-2009-03-12-QCon London 2009: Database projects to watch closely
8 0.055152472 373 high scalability-2008-08-29-Product: ScaleOut StateServer is Memcached on Steroids
9 0.050763115 1258 high scalability-2012-06-05-Thesis: Concurrent Programming for Scalable Web Architectures
10 0.049109858 649 high scalability-2009-07-02-Product: Facebook's Cassandra - A Massive Distributed Store
11 0.048061322 623 high scalability-2009-06-10-Dealing with multi-partition transactions in a distributed KV solution
12 0.047893636 1369 high scalability-2012-12-10-Switch your databases to Flash storage. Now. Or you're doing it wrong.
13 0.047421195 1068 high scalability-2011-06-27-TripAdvisor Architecture - 40M Visitors, 200M Dynamic Page Views, 30TB Data
14 0.047317158 1240 high scalability-2012-05-07-Startups are Creating a New System of the World for IT
15 0.044382077 1177 high scalability-2012-01-19-Is it time to get rid of the Linux OS model in the cloud?
16 0.043595977 1049 high scalability-2011-05-31-Awesome List of Advanced Distributed Systems Papers
17 0.043328486 674 high scalability-2009-08-07-The Canonical Cloud Architecture
18 0.043243572 1460 high scalability-2013-05-17-Stuff The Internet Says On Scalability For May 17, 2013
19 0.042950813 1567 high scalability-2013-12-20-Stuff The Internet Says On Scalability For December 20th, 2013
20 0.042854287 851 high scalability-2010-07-02-Hot Scalability Links for July 2, 2010
topicId topicWeight
[(0, 0.074), (1, 0.029), (2, 0.003), (3, 0.046), (4, 0.005), (5, 0.025), (6, 0.008), (7, 0.01), (8, -0.032), (9, 0.036), (10, -0.012), (11, -0.006), (12, -0.009), (13, -0.019), (14, 0.007), (15, -0.008), (16, 0.012), (17, -0.023), (18, 0.003), (19, 0.002), (20, 0.013), (21, 0.02), (22, -0.006), (23, 0.001), (24, -0.016), (25, -0.03), (26, 0.011), (27, 0.012), (28, -0.008), (29, 0.031), (30, 0.034), (31, 0.015), (32, -0.01), (33, 0.023), (34, -0.006), (35, -0.007), (36, 0.004), (37, -0.013), (38, 0.039), (39, -0.022), (40, -0.005), (41, -0.002), (42, 0.007), (43, 0.018), (44, 0.007), (45, -0.005), (46, -0.017), (47, -0.017), (48, 0.017), (49, 0.016)]
simIndex simValue blogId blogTitle
same-blog 1 0.96778703 635 high scalability-2009-06-22-Improving performance and scalability with DDD
Introduction: Distributed systems are not typically a place domain driven design is applied. Distributed processing projects often start with an overall architecture vision and the idea about a processing model which basically drives the whole thing, including object design if it exists at all. Elaborate object designs are thought of as something that just gets in the way of distribution and performance, so the idea of spending time to apply DDD principles gets rejected in favour of raw throughput and processing power. However, from my experience, some more advanced DDD concepts can significantly improve the performance, scalability and throughput of distributed systems when applied correctly. This article a summary of the presentation titled "DDD in a distributed world" from the DDD Exchange 09 in London.
2 0.75539333 400 high scalability-2008-10-01-The Pattern Bible for Distributed Computing
Introduction: Software design patterns are an emerging tool for guiding and documenting system design. Patterns usually describe software abstractions used by advanced designers and programmers in their software. Patterns can provide guidance for designing highly scalable distributed systems. Let's see how! Patterns are in essence solutions to problems. Most of them are expressed in a format called Alexandrian form which draws on constructs used by Christopher Alexander. There are variants but most look like this: The pattern name The problem the pattern is trying to solve Context Solution Examples Design rationale: This tells where the pattern came from, why it works, and why experts use it Patterns rarely stand alone. Each pattern works on a context, and transforms the system in that context to produce a new system in a new context. New problems arise in the new system and context, and the next ‘‘layer’’ of patterns can be applied. A pattern language is a structured col
3 0.72195458 1127 high scalability-2011-09-28-Pursue robust indefinite scalability with the Movable Feast Machine
Introduction: And now for something completely different, brought to you by David Ackley and Daniel Cannon in their playfully thought provoking paper: Pursue robust indefinite scalability , wherein they try to take a fresh look at neural networks, starting from scratch. What is this strange thing called indefinite scalability ? They sound like words that don't really go together: Indefinite scalability is the property that the design can support open-ended computational growth without substantial re-engineering, in as strict as sense as can be managed. By comparison, many computer, algorithm, and network designs -- even those that address scalability -- are only finitely scalable because their scalability occurs within some finite space. For example, an in-core sorting algorithm for a 32 bit machine can only scale to billions of numbers before address space is exhausted and then that algorithm must be re-engineered. Our idea is to expose indefinitely scalable computational power to program
4 0.71707171 604 high scalability-2009-05-20-Paper: Flux: An Adaptive Partitioning Operator for Continuous Query Systems
Introduction: At the core of the new real-time web, which is really really old, are continuous queries. I like how this paper proposed to handle dynamic demand and dynamic resource availability by making the underlying system adaptable, which seems like a very cloudy kind of thing to do. Abstract: The long-running nature of continuous queries poses new scalability challenges for dataflow processing. CQ systems execute pipelined dataflows that may be shared across multiple queries. The scalability of these dataflows is limited by their constituent, stateful operators – e.g. windowed joins or grouping operators. To scale such operators, a natural solution is to partition them across a shared-nothing platform. But in the CQ context, traditional, static techniques for partitioned parallelism can exhibit detrimental imbalances as workload and runtime conditions evolve. Longrunning CQ dataflows must continue to function robustly in the face of these imbalances. To address this challenge, we introduce
5 0.71482259 844 high scalability-2010-06-18-Paper: The Declarative Imperative: Experiences and Conjectures in Distributed Logic
Introduction: The Declarative Imperative: Experiences and Conjectures in Distributed Logic is written by UC Berkeley's Joseph Hellerstein for a keynote speech he gave at PODS . The video version of the talk is here . You may have heard about Mr. Hellerstein through the Berkeley Orders Of Magnitude project ( BOOM ), whose purpose is to help people build systems that are OOM (orders of magnitude) bigger than are building today, with OOM less effort than traditional programming methodologies . A noble goal which may be why BOOM was rated as a top 10 emerging technology for 2010 by MIT Technology Review . Quite an honor. The motivation for the talk is a familiar one: it's a dark period for computer programming and if we don't learn how to write parallel programs the children of Moore's law will destroy us all. We have more and more processors, yet we are stuck on figuring out how the average programmer can exploit them. The BOOM solution is the Bloom language which is based on Dedalus:
6 0.7023735 1611 high scalability-2014-03-12-Paper: Scalable Eventually Consistent Counters over Unreliable Networks
7 0.69504791 850 high scalability-2010-06-30-Paper: GraphLab: A New Framework For Parallel Machine Learning
8 0.69033808 1234 high scalability-2012-04-26-Akaros - an open source operating system for manycore architectures
9 0.68497169 1039 high scalability-2011-05-12-Paper: Mind the Gap: Reconnecting Architecture and OS Research
10 0.66470808 580 high scalability-2009-04-24-INFOSCALE 2009 in June in Hong Kong
11 0.65520167 1047 high scalability-2011-05-25-Stuff to Watch from Surge 2010
12 0.65443873 309 high scalability-2008-04-23-Behind The Scenes of Google Scalability
13 0.64938253 705 high scalability-2009-09-16-Paper: A practical scalable distributed B-tree
14 0.64890617 474 high scalability-2008-12-21-The I.H.S.D.F. Theorem: A Proposed Theorem for the Trade-offs in Horizontally Scalable Systems
15 0.64837599 1447 high scalability-2013-04-26-Stuff The Internet Says On Scalability For April 26, 2013
16 0.64626771 484 high scalability-2009-01-05-Lessons Learned at 208K: Towards Debugging Millions of Cores
17 0.64480531 1330 high scalability-2012-09-28-Stuff The Internet Says On Scalability For September 28, 2012
18 0.63884163 497 high scalability-2009-01-19-Papers: Readings in Distributed Systems
19 0.63851684 839 high scalability-2010-06-09-Paper: Propagation Networks: A Flexible and Expressive Substrate for Computation
20 0.63709915 983 high scalability-2011-02-02-Piccolo - Building Distributed Programs that are 11x Faster than Hadoop
topicId topicWeight
[(1, 0.093), (2, 0.185), (10, 0.05), (26, 0.307), (61, 0.091), (85, 0.113)]
simIndex simValue blogId blogTitle
Introduction: Why You Need To Attend THIS CONFERENCE • Understand the multi-dimensional view of business-technology alignment • A sense of urgency for aggressively pursuing Enterprise Architecture • A "language" (ie., a Framework) for improving enterprise communications about architecture issues • An understanding of the cultural changes implied by process evolution. How to effectively use the framework to anchor processes and procedures for delivering service and support for applications • An understanding of basic Enterprise physics • Recommendations for the Sr. Managers to understand the political realities and organizational resistance in realizing EA vision and some excellent advices for overcoming these barriers • Number of practical examples of how to work with people who affect decisions on EA implementation • How to create value for your organization by systematically recording assets, processes, connectivity, people, timing and motivation, through a simple framework
2 0.84812796 751 high scalability-2009-12-16-The most common flaw in software performance testing
Introduction: How many times have we all run across a situation where the performance tests on a piece of software pass with flying colors on the test systems only to see the software exhibit poor performance characteristics when the software is deployed in production? Read More Here...
3 0.83919603 697 high scalability-2009-09-09-GridwiseTech revolutionizes data management
Introduction: GridwiseTech has developed AdHoc , an advanced framework for sharing geographically distributed data and compute resources. It simplifies the resource management and makes cooperation secure and effective. The premise of AdHoc is to enable each member of the associated institution to control access to his or her resources without an IT administrator’s help, and with high security level of any exposed data or applications assured. It takes 3 easy steps to establish cooperation within AdHoc: create a virtual organization, add resources and share them. The application can be implemented within any organization to exchange data and resources or between institutions to join forces for more efficient results. AdHoc was initially created for a consortium of hospitals and institutions to share medical data sets. As a technical partner in that project, GridwiseTech implemented the Security Framework to provide access to that data and designed a graphical tool to facilitate the administration
4 0.81158811 1410 high scalability-2013-02-20-Smart Companies Fail Because they Do Everything Right - Staying Alive to Scale
Introduction: Wired has a wonderful interview with Clayton Christensen , author of the tech ninja's bible, Innovator's Dilemma . Innovation is the name of the game in Silicon Valley and if you want to understand the rules of the game this article is a quick and clear way of learning. Everything is simply explained with compelling examples by the man himself. Just as every empire has fallen, every organization is open to disruption. It's the human condition to become comfortable and discount potential dangers. It takes a great deal of mindfulness to outwit and outlast the human condition. If you want to be the disruptor and avoid being the disruptee, this is good stuff. He also talks about his new book, The Capitalist's Dilemma , which addresses this puzzle: if corporations are doing so well why are individuals doing so bad? If someone can help you see a deep meaningful pattern in life then they haven't brought you a fish, they've taught you how to fish. That's what Christensen does. Here'
5 0.8062892 73 high scalability-2007-08-23-Postgresql on high availability websites?
Introduction: I was looking at the pingdom infrastructure matrix (http://royal.pingdom.com/royalfiles/0702_infrastructure_matrix.pdf) and I saw that no sites are using Postgresql, and then I searched through highscalability.com and saw very few mentions of postgresql. Are there any examples of high-traffic sites that use postgresql? Does anyone have any experience with it? I'm having trouble finding good, recent studies of postgres (and postgres compared w/ mysql) online.
6 0.78497106 715 high scalability-2009-10-06-10 Ways to Take your Site from One to One Million Users by Kevin Rose
7 0.77772641 1570 high scalability-2014-01-01-Paper: Nanocubes: Nanocubes for Real-Time Exploration of Spatiotemporal Datasets
8 0.77410096 1356 high scalability-2012-11-07-Gone Fishin': 10 Ways to Take your Site from One to One Million Users by Kevin Rose
same-blog 9 0.77315921 635 high scalability-2009-06-22-Improving performance and scalability with DDD
10 0.75120306 148 high scalability-2007-11-11-Linkedin architecture
11 0.69519073 1650 high scalability-2014-05-19-A Short On How the Wayback Machine Stores More Pages than Stars in the Milky Way
12 0.68369341 381 high scalability-2008-09-08-Guerrilla Capacity Planning and the Law of Universal Scalability
13 0.66288084 92 high scalability-2007-09-15-The Role of Memory within Web 2.0 Architectures and Deployments
14 0.65082717 425 high scalability-2008-10-22-Scalability Best Practices: Lessons from eBay
15 0.64899939 137 high scalability-2007-10-30-Database parallelism choices greatly impact scalability
16 0.63612682 339 high scalability-2008-06-04-LinkedIn Architecture
17 0.63065243 1568 high scalability-2013-12-23-What Happens While Your Brain Sleeps is Surprisingly Like How Computers Stay Sane
18 0.61347669 423 high scalability-2008-10-19-Alternatives to Google App Engine
19 0.60720515 1239 high scalability-2012-05-04-Stuff The Internet Says On Scalability For May 4, 2012
20 0.60651976 943 high scalability-2010-11-16-Facebook's New Real-time Messaging System: HBase to Store 135+ Billion Messages a Month