high_scalability high_scalability-2009 high_scalability-2009-542 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: IBM WebSphere eXtreme Scale is IBMs in memory data grid product (IMDG). It can be used as a key-value store which partitions the keys (using a form of consistent hashing) over a set of servers such that each server is responsible for a subset of the keys. It automatically handles replication which can be either synchronous of asynchronous and handles advanced placement so that replicas can be placed in different physical zones when compared to the placement of the primary. Think buildings, racks, floor, data centers. It is fully elastic in that servers can be added and removed and it automatically redistributes the partition primaries and backups. It can be scaled from one server to hundreds if not thousands of JVMs in a single grid. Each additional server provides more CPU, memory capacity and network and it scales linearly with grid growth. It also has a key-graph mode where a graph of objects can be associated with a single key and it allows fine grained modification of that
sentIndex sentText sentNum sentScore
1 IBM WebSphere eXtreme Scale is IBMs in memory data grid product (IMDG). [sent-1, score-0.175]
2 It can be used as a key-value store which partitions the keys (using a form of consistent hashing) over a set of servers such that each server is responsible for a subset of the keys. [sent-2, score-0.372]
3 It automatically handles replication which can be either synchronous of asynchronous and handles advanced placement so that replicas can be placed in different physical zones when compared to the placement of the primary. [sent-3, score-1.189]
4 It is fully elastic in that servers can be added and removed and it automatically redistributes the partition primaries and backups. [sent-5, score-0.423]
5 It can be scaled from one server to hundreds if not thousands of JVMs in a single grid. [sent-6, score-0.08]
6 Each additional server provides more CPU, memory capacity and network and it scales linearly with grid growth. [sent-7, score-0.269]
7 It also has a key-graph mode where a graph of objects can be associated with a single key and it allows fine grained modification of that graph. [sent-8, score-0.679]
8 The object graph and key is stored in tuple form in this mode. [sent-9, score-0.585]
9 This allows clients using different object representations of some subset of the IMDG schema to share data stored in the IMDG. [sent-10, score-0.778]
10 It comes with automatic integration with databases so that values are automatically pulled from a database if not present and are written to the database when they change. [sent-11, score-0.623]
11 Write behind logic allows writes to the database to be much more efficient and allows the grid to run with the database down. [sent-12, score-0.677]
12 It comes with a HTTP Session filter to provide HTTP Session management for servlet containers. [sent-13, score-0.32]
13 It have a flexible deployment model allowing a lot of customization by customers. [sent-14, score-0.132]
14 We do a weekly video podcast on iTunes (search for extreme scale in iTunes) and make it available on YouTube also for customer education. [sent-15, score-0.285]
15 We answer customer questions and forum topics from the week in a casual two person chat forum. [sent-16, score-0.662]
wordName wordTfidf (topN-words)
[('imdg', 0.264), ('itunes', 0.23), ('placement', 0.202), ('forum', 0.188), ('automatically', 0.178), ('grid', 0.175), ('extreme', 0.174), ('subset', 0.171), ('allows', 0.17), ('redistributes', 0.156), ('representations', 0.144), ('jvms', 0.139), ('tuple', 0.135), ('websphere', 0.132), ('customization', 0.132), ('casual', 0.127), ('handles', 0.125), ('modification', 0.124), ('servlet', 0.122), ('session', 0.121), ('object', 0.12), ('graph', 0.117), ('buildings', 0.117), ('form', 0.115), ('customer', 0.111), ('floor', 0.109), ('racks', 0.107), ('grained', 0.107), ('comes', 0.1), ('filter', 0.098), ('pulled', 0.098), ('stored', 0.098), ('linearly', 0.094), ('hashing', 0.092), ('placed', 0.092), ('zones', 0.09), ('synchronous', 0.09), ('removed', 0.089), ('chat', 0.087), ('partitions', 0.086), ('replicas', 0.085), ('automatic', 0.085), ('mode', 0.082), ('database', 0.081), ('scaled', 0.08), ('associated', 0.079), ('schema', 0.075), ('http', 0.075), ('person', 0.075), ('topics', 0.074)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 542 high scalability-2009-03-17-IBM WebSphere eXtreme Scale (IMDG)
Introduction: IBM WebSphere eXtreme Scale is IBMs in memory data grid product (IMDG). It can be used as a key-value store which partitions the keys (using a form of consistent hashing) over a set of servers such that each server is responsible for a subset of the keys. It automatically handles replication which can be either synchronous of asynchronous and handles advanced placement so that replicas can be placed in different physical zones when compared to the placement of the primary. Think buildings, racks, floor, data centers. It is fully elastic in that servers can be added and removed and it automatically redistributes the partition primaries and backups. It can be scaled from one server to hundreds if not thousands of JVMs in a single grid. Each additional server provides more CPU, memory capacity and network and it scales linearly with grid growth. It also has a key-graph mode where a graph of objects can be associated with a single key and it allows fine grained modification of that
2 0.15947059 1160 high scalability-2011-12-21-In Memory Data Grid Technologies
Introduction: After winning a CSC Leading Edge Forum (LEF) research grant, I (Paul Colmer) wanted to publish some of the highlights of my research to share with the wider technology community. What is an In Memory Data Grid? It is not an in-memory relational database, a NOSQL database or a relational database. It is a different breed of software datastore. In summary an IMDG is an ‘off the shelf’ software product that exhibits the following characteristics: The data model is distributed across many servers in a single location or across multiple locations. This distribution is known as a data fabric. This distributed model is known as a ‘shared nothing’ architecture. All servers can be active in each site. All data is stored in the RAM of the servers. Servers can be added or removed non-disruptively, to increase the amount of RAM available. The data model is non-relational and is object-based. Distributed applications written on the .NET and Java application platforms are s
3 0.14895046 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
Introduction: We are on the edge of two potent technological changes: Clouds and Memory Based Architectures. This evolution will rip open a chasm where new players can enter and prosper. Google is the master of disk. You can't beat them at a game they perfected. Disk based databases like SimpleDB and BigTable are complicated beasts, typical last gasp products of any aging technology before a change. The next era is the age of Memory and Cloud which will allow for new players to succeed. The tipping point will be soon. Let's take a short trip down web architecture lane: It's 1993: Yahoo runs on FreeBSD, Apache, Perl scripts and a SQL database It's 1995: Scale-up the database. It's 1998: LAMP It's 1999: Stateless + Load Balanced + Database + SAN It's 2001: In-memory data-grid. It's 2003: Add a caching layer. It's 2004: Add scale-out and partitioning. It's 2005: Add asynchronous job scheduling and maybe a distributed file system. It's 2007: Move it all into the cloud. It's 2008: C
4 0.14330058 1565 high scalability-2013-12-16-22 Recommendations for Building Effective High Traffic Web Software
Introduction: This is a guest post by Ashwanth Fernando, Software Engineer from the trenches at large scale internet companies. Inspired by the book "Effective Java" by Joshua Bloch, I wanted to share my holistic recommendations on building high traffic web software (i.e. web applications/services that serve high traffic loads). Some of these items may not be just about software design but also around surrounding areas such as the engineering organization, culture etc. Two disclaimers up front: 1) This is my opinion. 2) There will be real world situations where the below principles will be wrong as in all things "software". Please use common sense all the time. Consider using more than one datacenter There have been numerous horror stories about businesses, ahem going out of business because they just had a single datacenter. Its really important to have more than one data center if you want to protect yourself from natural disasters or electrical supply failures. Run all your datacen
5 0.14156158 623 high scalability-2009-06-10-Dealing with multi-partition transactions in a distributed KV solution
Introduction: I've been getting asked about this a lot lately so I figured I'd just blog about it. Products like WebSphere eXtreme Scale work by taking a dataset, partitioning it using a key and then assigning those partitions to a number of JVMs. Each partition usually has a primary and a replica. These 'shards' are assigned to JVMs. A transactional application typically interacts with the data on a single partition at a time. This means the transaction is executed in a single JVM. A server box will be able to do M of those transactions per second and it scales because N boxes does MN (M multiplied by N) transactions per second. Increase N, you get more transactions per second. Availability is very good because a transaction only depends on 1 of the N servers that are currently online. Any of the other (N-1) servers can go down or fail with no impact on the transaction. So, single partition transactions can scale indefinitely from a throughput point of view, offer very consistent response times and
6 0.12999696 691 high scalability-2009-08-31-Squarespace Architecture - A Grid Handles Hundreds of Millions of Requests a Month
7 0.12899077 373 high scalability-2008-08-29-Product: ScaleOut StateServer is Memcached on Steroids
10 0.12092508 1064 high scalability-2011-06-20-35+ Use Cases for Choosing Your Next NoSQL Database
11 0.11933441 240 high scalability-2008-02-05-Handling of Session for a site running from more than 1 data center
12 0.11798912 628 high scalability-2009-06-13-Neo4j - a Graph Database that Kicks Buttox
13 0.11779543 892 high scalability-2010-09-02-Distributed Hashing Algorithms by Example: Consistent Hashing
14 0.11757761 651 high scalability-2009-07-02-Product: Project Voldemort - A Distributed Database
15 0.11679672 97 high scalability-2007-09-18-Session management in highly scalable web sites
16 0.11510912 954 high scalability-2010-12-06-What the heck are you actually using NoSQL for?
17 0.11482569 19 high scalability-2007-07-16-Paper: Replication Under Scalable Hashing
18 0.11388058 589 high scalability-2009-05-05-Drop ACID and Think About Data
20 0.10334042 1205 high scalability-2012-03-07-Scale Indefinitely on S3 With These Secrets of the S3 Masters
topicId topicWeight
[(0, 0.165), (1, 0.059), (2, 0.009), (3, -0.054), (4, -0.034), (5, 0.112), (6, 0.023), (7, -0.083), (8, -0.044), (9, 0.037), (10, 0.055), (11, 0.048), (12, -0.057), (13, -0.009), (14, 0.014), (15, 0.032), (16, -0.014), (17, 0.089), (18, 0.036), (19, 0.013), (20, -0.068), (21, -0.041), (22, 0.031), (23, -0.028), (24, -0.05), (25, -0.057), (26, -0.008), (27, -0.006), (28, 0.02), (29, 0.031), (30, -0.005), (31, -0.05), (32, -0.051), (33, -0.035), (34, 0.006), (35, 0.004), (36, 0.023), (37, -0.038), (38, -0.007), (39, 0.034), (40, 0.04), (41, 0.006), (42, 0.143), (43, -0.023), (44, 0.032), (45, -0.012), (46, 0.079), (47, 0.063), (48, 0.023), (49, 0.022)]
simIndex simValue blogId blogTitle
same-blog 1 0.95877719 542 high scalability-2009-03-17-IBM WebSphere eXtreme Scale (IMDG)
Introduction: IBM WebSphere eXtreme Scale is IBMs in memory data grid product (IMDG). It can be used as a key-value store which partitions the keys (using a form of consistent hashing) over a set of servers such that each server is responsible for a subset of the keys. It automatically handles replication which can be either synchronous of asynchronous and handles advanced placement so that replicas can be placed in different physical zones when compared to the placement of the primary. Think buildings, racks, floor, data centers. It is fully elastic in that servers can be added and removed and it automatically redistributes the partition primaries and backups. It can be scaled from one server to hundreds if not thousands of JVMs in a single grid. Each additional server provides more CPU, memory capacity and network and it scales linearly with grid growth. It also has a key-graph mode where a graph of objects can be associated with a single key and it allows fine grained modification of that
2 0.72287643 1160 high scalability-2011-12-21-In Memory Data Grid Technologies
Introduction: After winning a CSC Leading Edge Forum (LEF) research grant, I (Paul Colmer) wanted to publish some of the highlights of my research to share with the wider technology community. What is an In Memory Data Grid? It is not an in-memory relational database, a NOSQL database or a relational database. It is a different breed of software datastore. In summary an IMDG is an ‘off the shelf’ software product that exhibits the following characteristics: The data model is distributed across many servers in a single location or across multiple locations. This distribution is known as a data fabric. This distributed model is known as a ‘shared nothing’ architecture. All servers can be active in each site. All data is stored in the RAM of the servers. Servers can be added or removed non-disruptively, to increase the amount of RAM available. The data model is non-relational and is object-based. Distributed applications written on the .NET and Java application platforms are s
3 0.67597693 623 high scalability-2009-06-10-Dealing with multi-partition transactions in a distributed KV solution
Introduction: I've been getting asked about this a lot lately so I figured I'd just blog about it. Products like WebSphere eXtreme Scale work by taking a dataset, partitioning it using a key and then assigning those partitions to a number of JVMs. Each partition usually has a primary and a replica. These 'shards' are assigned to JVMs. A transactional application typically interacts with the data on a single partition at a time. This means the transaction is executed in a single JVM. A server box will be able to do M of those transactions per second and it scales because N boxes does MN (M multiplied by N) transactions per second. Increase N, you get more transactions per second. Availability is very good because a transaction only depends on 1 of the N servers that are currently online. Any of the other (N-1) servers can go down or fail with no impact on the transaction. So, single partition transactions can scale indefinitely from a throughput point of view, offer very consistent response times and
4 0.65879703 19 high scalability-2007-07-16-Paper: Replication Under Scalable Hashing
Introduction: Replication Under Scalable Hashing: A Family of Algorithms for Scalable Decentralized Data Distribution From the abstract: Typical algorithms for decentralized data distribution work best in a system that is fully built before it first used; adding or removing components results in either extensive reorganization of data or load imbalance in the system. We have developed a family of decentralized algorithms, RUSH (Replication Under Scalable Hashing), that maps replicated objects to a scalable collection of storage servers or disks. RUSH algorithms distribute objects to servers according to user-specified server weighting. While all RUSH variants support addition of servers to the system, different variants have different characteristics with respect to lookup time in petabyte-scale systems, performance with mirroring (as opposed to redundancy codes), and storage server removal. All RUSH variants redistribute as few objects as possible when new servers are added or existing servers
5 0.63989019 393 high scalability-2008-09-25-GridGain: One Compute Grid, Many Data Grids
Introduction: GridGain was kind enough to present at the September 17th instance of the Silicon Valley Cloud Computing Group . I've been curious about GridGain so I was glad to see them there. In short GridGain is: an open source computational grid framework that enables Java developers to improve general performance of processing intensive applications by splitting and parallelizing the workload . GridGain can also be thought of as a set of middleware primitives for building applications. GridGain's peer group of competitors includes GigaSpaces, Terracotta, Coherence, and Hadoop. The speaker for GridGain was the President and Founder, Nikita Ivanov. He has a very pleasant down-to-earth way about him that contrasts nicely with a field given to religious discussions of complex taxomic definitions. Nikita first talked about cloud computing in general. He feels Java is the perfect gateway for cloud computing. Which is good because GridGain only works with Java . The Java centricity of G
6 0.63752562 696 high scalability-2009-09-07-Product: Infinispan - Open Source Data Grid
7 0.631782 628 high scalability-2009-06-13-Neo4j - a Graph Database that Kicks Buttox
8 0.63109195 766 high scalability-2010-01-26-Product: HyperGraphDB - A Graph Database
9 0.63105184 1205 high scalability-2012-03-07-Scale Indefinitely on S3 With These Secrets of the S3 Masters
10 0.62677675 268 high scalability-2008-03-06-Announce: First Meeting of Boston Scalability User Group
11 0.62649494 817 high scalability-2010-04-29-Product: SciDB - A Science-Oriented DBMS at 100 Petabytes
12 0.62634313 1565 high scalability-2013-12-16-22 Recommendations for Building Effective High Traffic Web Software
13 0.62559134 380 high scalability-2008-09-05-Product: Tungsten Replicator
14 0.62505352 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
15 0.61129296 651 high scalability-2009-07-02-Product: Project Voldemort - A Distributed Database
16 0.60998487 373 high scalability-2008-08-29-Product: ScaleOut StateServer is Memcached on Steroids
17 0.60962796 633 high scalability-2009-06-19-GemFire 6.0: New innovations in data management
18 0.59990662 933 high scalability-2010-11-01-Hot Trend: Move Behavior to Data for a New Interactive Application Architecture
19 0.59668612 1064 high scalability-2011-06-20-35+ Use Cases for Choosing Your Next NoSQL Database
20 0.58919549 1092 high scalability-2011-08-04-Jim Starkey is Creating a Brave New World by Rethinking Databases for the Cloud
topicId topicWeight
[(1, 0.2), (2, 0.245), (10, 0.062), (61, 0.079), (79, 0.05), (80, 0.167), (85, 0.07), (94, 0.03)]
simIndex simValue blogId blogTitle
1 0.96719158 1012 high scalability-2011-03-28-Aztec Empire Strategy: Use Dual Pipes in Your Aqueduct for High Availability
Introduction: With the Chapultepec aqueduct , also named the great aqueduct , the Aztecs built a novel uninterruptible water supply for providing fresh water to Tenochtitlan , their fast growing jewel of a capital city. A section of the aqueduct is still around today: It's fun to think about how even 600 years ago how it was built with high availability in mind. We find engineers being engineers , no matter the age: It consisted of a twin pipe distribution system made in part of compacted soil and in part of wood for the crossings of the aqueduct over the bridges built to allow the passage of the canoes. It was finished around 1466 AD, and the main purpose was to supply fresh water to Mexico-Tenochtitlan, to mitigate its thirst. The main source for the aqueduct was the spring of Chapultepec and the purpose of the twin pipes was to ease the maintenance of the system, because the water was conveyed through one pipe, and when it got dirty, the water was diverted to the other pipe
2 0.94940656 206 high scalability-2008-01-10-MONO ASP.NET. Will it make the web???
Introduction: I was wondering if it is already possible to scale a MONO's .NET website. I cannot see any real websites (with the term real I mean "a highly visited website") running mono. What do you think? Will MONO ASP.NET scale??? Is it worth planning a site to run with Mono asp.net? Or should we leave it to the future? What do you think?
same-blog 3 0.9449259 542 high scalability-2009-03-17-IBM WebSphere eXtreme Scale (IMDG)
Introduction: IBM WebSphere eXtreme Scale is IBMs in memory data grid product (IMDG). It can be used as a key-value store which partitions the keys (using a form of consistent hashing) over a set of servers such that each server is responsible for a subset of the keys. It automatically handles replication which can be either synchronous of asynchronous and handles advanced placement so that replicas can be placed in different physical zones when compared to the placement of the primary. Think buildings, racks, floor, data centers. It is fully elastic in that servers can be added and removed and it automatically redistributes the partition primaries and backups. It can be scaled from one server to hundreds if not thousands of JVMs in a single grid. Each additional server provides more CPU, memory capacity and network and it scales linearly with grid growth. It also has a key-graph mode where a graph of objects can be associated with a single key and it allows fine grained modification of that
4 0.93631411 1274 high scalability-2012-06-29-Stuff The Internet Says On Scalability For June 29, 2012 - The Velocity Edition
Introduction: Judging from the tweet flow, Velocity looked like a riotous good time. In this video on the main themes at Velocity, after a little microphone enhanced violence, John Allspaw and Steve Souders identify resilience and automation as two of the big ideas behind building a faster and stronger web. John says resiliency is the idea that we we don't live in a perfect world so trying to build perfect systems is counter productive. We have to accept failure as a baseline and think in terms of degrees of availability. All abstraction layers leak so every part of a system must be monitorable and open to introspection. A focus on resilience means the web is growing up. Resilience has long been a requirement for "real" systems, it's great to see the web thinking in terms of the complex systems they've always been. For the Alpha and Omega on resilience you'll want to watch Dr. Richard Cook's inspiring talk on How Complex Systems Fail . Here are some of the most enjoyable Quotable Q
5 0.93218338 1220 high scalability-2012-04-02-YouPorn - Targeting 200 Million Views a Day and Beyond
Introduction: Update : Here's the video of the talk . Erick Pickup , lead developer at YouPorn.com, presented their architecture in a talk titled Building a Website To Scale given at the ConFoo conference. As you might expect, YouPorn is a beast, streaming three full DVDs of video every second, handing 300K queries every second, and generating up to 15GBs of log data per hour. Unfortunately, all we have are the slides of the talk, so this article isn’t as technical as I might like, there’s no visibility at all on the video handling for example, but we do get some interesting details. The most interesting takeway is that YouPorn is a pretty conventional LAMP stack, with a NoSQL twist as Redis now replaces MySQL in the live datapath. Reminds me a little of YouTube in its simplicity. The second most interesting takeaway was the great switchover . Common wisdom says never rewrite, but in 2011 YouPorn rewrote their entire site to use PHP + Redis instead of a complex Perl + MySQL b
6 0.90212244 1028 high scalability-2011-04-22-Stuff The Internet Says On Scalability For April 22, 2011
7 0.89790761 1106 high scalability-2011-08-26-Stuff The Internet Says On Scalability For August 26, 2011
8 0.88697225 313 high scalability-2008-05-02-Friends for Sale Architecture - A 300 Million Page View-Month Facebook RoR App
9 0.88430882 1087 high scalability-2011-07-26-Web 2.0 Killed the Middleware Star
10 0.88401735 1508 high scalability-2013-08-28-Sean Hull's 20 Biggest Bottlenecks that Reduce and Slow Down Scalability
11 0.88398325 558 high scalability-2009-04-06-How do you monitor the performance of your cluster?
12 0.88116497 881 high scalability-2010-08-16-Scaling an AWS infrastructure - Tools and Patterns
13 0.88068706 821 high scalability-2010-05-03-MocoSpace Architecture - 3 Billion Mobile Page Views a Month
14 0.88044214 1129 high scalability-2011-09-30-Stuff The Internet Says On Scalability For September 30, 2011
15 0.88042682 774 high scalability-2010-02-08-How FarmVille Scales to Harvest 75 Million Players a Month
16 0.87986469 811 high scalability-2010-04-16-Hot Scalability Links for April 16, 2010
17 0.87919104 1123 high scalability-2011-09-23-The Real News is Not that Facebook Serves Up 1 Trillion Pages a Month…
18 0.87866497 1170 high scalability-2012-01-06-Stuff The Internet Says On Scalability For January 6, 2012
19 0.87848753 904 high scalability-2010-09-21-Playfish's Social Gaming Architecture - 50 Million Monthly Users and Growing
20 0.8784681 511 high scalability-2009-02-12-MySpace Architecture