high_scalability high_scalability-2010 high_scalability-2010-894 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: The need for IT consolidation is most evident in two types of organizations. In the first group, IT grew organically with business over the decades, and survived changes of strategy, management, staff and vendor orientation. The second group of businesses, capital groups, is characterized by rapid growth through acquisitions (followed by attempts to integrate radically different IT environments). In both groups, their IT infrastructures have typically been pieced together over the past 20 (or more) years. Read more on BigDataMatters.com
sentIndex sentText sentNum sentScore
1 The need for IT consolidation is most evident in two types of organizations. [sent-1, score-0.611]
2 In the first group, IT grew organically with business over the decades, and survived changes of strategy, management, staff and vendor orientation. [sent-2, score-1.199]
3 The second group of businesses, capital groups, is characterized by rapid growth through acquisitions (followed by attempts to integrate radically different IT environments). [sent-3, score-2.184]
4 In both groups, their IT infrastructures have typically been pieced together over the past 20 (or more) years. [sent-4, score-0.756]
wordName wordTfidf (topN-words)
[('organically', 0.293), ('pieced', 0.275), ('acquisitions', 0.275), ('characterized', 0.263), ('groups', 0.26), ('survived', 0.223), ('evident', 0.219), ('consolidation', 0.208), ('decades', 0.2), ('radically', 0.192), ('attempts', 0.188), ('group', 0.188), ('capital', 0.185), ('infrastructures', 0.169), ('grew', 0.168), ('staff', 0.166), ('followed', 0.159), ('businesses', 0.157), ('vendor', 0.152), ('integrate', 0.148), ('environments', 0.135), ('typically', 0.126), ('rapid', 0.121), ('past', 0.108), ('types', 0.101), ('growth', 0.096), ('strategy', 0.09), ('together', 0.078), ('changes', 0.077), ('business', 0.072), ('second', 0.068), ('management', 0.067), ('read', 0.054), ('two', 0.049), ('first', 0.048), ('different', 0.043), ('need', 0.034)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 894 high scalability-2010-09-03-Six guiding principles to Consolidate your IT
Introduction: The need for IT consolidation is most evident in two types of organizations. In the first group, IT grew organically with business over the decades, and survived changes of strategy, management, staff and vendor orientation. The second group of businesses, capital groups, is characterized by rapid growth through acquisitions (followed by attempts to integrate radically different IT environments). In both groups, their IT infrastructures have typically been pieced together over the past 20 (or more) years. Read more on BigDataMatters.com
2 0.15250207 222 high scalability-2008-01-25-Application Database and DAL Architecture
Introduction: Hi gurus, I'm totally new to this high scalability thing. I'm trying to create a website with scalability in mind (personal project). In my application I'll have forums for different groups of people (each group will have their own forums, members of groups can still post in other groups' forums but each group will mainly be using their forums most of the time). Now, I'm going to start with about 2000 groups with the potential of reaching up to 10000 groups (this is the maximum due to the nature of my application). I was thinking that having all posts in one table will be way too much for one table (esp. that some groups are expected to post hundreds or even thousands times per day, let's say about 500 of the groups, the rest of the groups won't be that active though) as I'll have to index the PostID, ParentPostID, GroupID and PostDate which can produce large indexes (consequentially causing slow inserts) if having everything in one table. So, I'm thinking of a way to divide the posts
3 0.09378498 531 high scalability-2009-03-11-Classifying XTP systems and how cloud changes which type startups will use
Introduction: I try to group XTP into two main groups, type 1 and 2, and then subdivide type 2 into 2a and 2b. I describe how I do this grouping and then amplify it a little in the context of cloud services.
4 0.090432338 701 high scalability-2009-09-10-When optimizing - don't forget the Java Virtual Machine (JVM)
Introduction: Recently, I was working on a project that was coming to a close. It was related to optimizing a database using a Java based in-memory cache to reduce the load. The application had to process up to a million objects per day and was characterized by its heavy use of memory and the high number of read, write and update operations. These operations were found to be the most costly, which meant that optimization efforts were concentrated here. The project had already achieved impressive performance increases, but one question remained unanswered - would changing the JVM increase performance? Read more at: http://bigdatamatters.com/bigdatamatters/2009/08/jvm-performance.html
5 0.079424523 972 high scalability-2011-01-11-Google Megastore - 3 Billion Writes and 20 Billion Read Transactions Daily
Introduction: A giant step into the fully distributed future has been taken by the Google App Engine team with the release of their High Replication Datastore. The HRD is targeted at mission critical applications that require data replicated to at least three datacenters, full ACID semantics for entity groups, and lower consistency guarantees across entity groups. This is a major accomplishment. Few organizations can implement a true multi-datacenter datastore. Other than SimpleDB, how many other publicly accessible database services can operate out of multiple datacenters? Now that capability can be had by anyone. But there is a price, literally and otherwise. Because the HRD uses three times the resources as Google App Engine's Master/Slave datastore, it will cost three times as much. And because it is a distributed database, with all that implies in the CAP sense, developers will have to be very careful in how they architect their applications because as costs increased, reliability incre
6 0.077979952 472 high scalability-2008-12-19-How to measure memory required for a user session
7 0.0761097 782 high scalability-2010-02-23-When to migrate your database?
8 0.068099901 383 high scalability-2008-09-10-Shard servers -- go big or small?
9 0.064936958 1346 high scalability-2012-10-24-Saving Cash Using Less Cache - 90% Savings in the Caching Tier
10 0.06492582 1033 high scalability-2011-05-02-The Updated Big List of Articles on the Amazon Outage
11 0.064385101 777 high scalability-2010-02-15-Scaling Ambition at StackOverflow
12 0.060658224 717 high scalability-2009-10-07-How to Avoid the Top 5 Scale-Out Pitfalls
13 0.059569165 918 high scalability-2010-10-12-The CIO’s Problem: Cloud “Mess” or Cloud “Mash”
14 0.054483257 203 high scalability-2008-01-07-How Ruby on Rails Survived a 550k Pageview Digging
15 0.054380529 208 high scalability-2008-01-11-FTP Sanity: Redundancy, archiving, consolidation.
16 0.051681578 576 high scalability-2009-04-21-What CDN would you recommend?
17 0.051484898 941 high scalability-2010-11-15-How Google's Instant Previews Reduces HTTP Requests
18 0.051211216 1548 high scalability-2013-11-13-Google: Multiplex Multiple Works Loads on Computers to Increase Machine Utilization and Save Money
19 0.050492879 757 high scalability-2010-01-04-11 Strategies to Rock Your Startup’s Scalability in 2010
20 0.049141809 457 high scalability-2008-12-01-Sun FireTM X4540 Server as Backup Server for Zmanda's Amanda Enterprise 2.6 Software
topicId topicWeight
[(0, 0.054), (1, 0.013), (2, 0.015), (3, 0.007), (4, -0.005), (5, -0.009), (6, -0.004), (7, -0.032), (8, -0.001), (9, -0.022), (10, -0.022), (11, 0.016), (12, 0.004), (13, 0.021), (14, 0.022), (15, 0.022), (16, 0.019), (17, -0.029), (18, 0.017), (19, 0.026), (20, -0.001), (21, -0.007), (22, 0.009), (23, -0.001), (24, -0.048), (25, 0.021), (26, -0.015), (27, -0.017), (28, -0.01), (29, 0.004), (30, 0.003), (31, 0.013), (32, 0.021), (33, -0.038), (34, -0.02), (35, 0.013), (36, 0.013), (37, 0.032), (38, 0.029), (39, 0.011), (40, -0.021), (41, 0.002), (42, 0.006), (43, -0.029), (44, 0.015), (45, -0.038), (46, -0.008), (47, -0.023), (48, 0.009), (49, 0.002)]
simIndex simValue blogId blogTitle
same-blog 1 0.96031499 894 high scalability-2010-09-03-Six guiding principles to Consolidate your IT
Introduction: The need for IT consolidation is most evident in two types of organizations. In the first group, IT grew organically with business over the decades, and survived changes of strategy, management, staff and vendor orientation. The second group of businesses, capital groups, is characterized by rapid growth through acquisitions (followed by attempts to integrate radically different IT environments). In both groups, their IT infrastructures have typically been pieced together over the past 20 (or more) years. Read more on BigDataMatters.com
2 0.61333245 813 high scalability-2010-04-19-The cost of High Availability (HA) with Oracle
Introduction: What's the cost of downtime to your business? $100,000 per hour, $1,000,000 or more? The recent Volcanic ash that has grounded European flights is estimated to be costing the airlines $200M a day. In the IT world, High Availability (HA) architectures allow for disaster recovery as well as uninterrupted business continuity during system failure. This post focuses on a customer’s backend, comprised of a business application stack supported by a dozen Oracle databases. They wish to equip this infrastructure with HA features and ensure that outages do not cost business. How do we address the challenge of pricing the complete solution, with hardware, software, services and annual support? Read more on BigDataMatters.com
Introduction: Many enterprises' high-availability architecture is based on the assumption that you can prevent failure from happening by putting all your critical data in a centralized database, back it up with expensive storage, and replicate it somehow between the sites. As I argued in one of my previous posts ( Why Existing Databases (RAC) are So Breakable! ) many of those assumptions are broken at their core, as storage is doomed to failure just like any other device, expensive hardware doesn’t make things any better and database replication is often not enough. One of the main lessons that we can take from the likes of Amazon and Google is that the right way to ensure continuous high availability is by designing our system to cope with failure. We need to assume that what we tend to think of as unthinkable will probably happen, as that’s the nature of failure. So rather than trying to prevent failures, we need to build a system that will tolerate them. As we can learn from a recent outage
4 0.59186292 822 high scalability-2010-05-04-Business continuity with real-time data integration
Introduction: Enterprises want to protect their data. As the appetite for data volumes grows, storage technology becomes a critical business asset on which business continuity relies. My recent survey in the medium-size enterprise segment shows the five dominant investment directions at the level of data management architecture: disaster recovery (DR), high availability (HA), backup, data processing performance and migration to more advanced databases. This suggests that corporations generally have sufficiently structured data collections but are concerned with business continuity and continuous availability of data. What infrastructures can provide these assurances? In this post I want to focus on yet another option, and that is the Real-Time Data Integration model. As an example I am going to discuss Oracle GoldenGate, which permits you to manage the data critical to your business in safety, ensuring business continuity without disruption even if the data is distributed among multiple, h
5 0.57694381 681 high scalability-2009-08-16-TechDev Stages
Introduction: Tech Dev Stages explains the basic steps involved in product development for given business problems. A must-read for newcomers to architecture development.
6 0.56767088 1589 high scalability-2014-02-03-How Google Backs Up the Internet Along With Exabytes of Other Data
7 0.56515723 500 high scalability-2009-01-22-Heterogeneous vs. Homogeneous System Architectures
8 0.55837971 731 high scalability-2009-10-28-Need for change in your IT infrastructure
9 0.54706997 1014 high scalability-2011-03-31-8 Lessons We Can Learn from the MySpace Incident - Balance, Vision, Fearlessness
10 0.54329741 96 high scalability-2007-09-18-Amazon Architecture
11 0.53842294 23 high scalability-2007-07-24-Major Websites Down: Or Why You Want to Run in Two or More Data Centers.
12 0.53299701 1628 high scalability-2014-04-08-Microservices - Not a free lunch!
13 0.52624661 559 high scalability-2009-04-07-Six Lessons Learned Deploying a Large-scale Infrastructure in Amazon EC2
14 0.52185786 1240 high scalability-2012-05-07-Startups are Creating a New System of the World for IT
15 0.52082634 28 high scalability-2007-07-25-Product: NetApp MetroCluster Software
16 0.51292223 139 high scalability-2007-10-30-Paper: Dynamo: Amazon’s Highly Available Key-value Store
17 0.50904894 757 high scalability-2010-01-04-11 Strategies to Rock Your Startup’s Scalability in 2010
18 0.5020591 1012 high scalability-2011-03-28-Aztec Empire Strategy: Use Dual Pipes in Your Aqueduct for High Availability
19 0.50072813 1098 high scalability-2011-08-15-Should any cloud be considered one availability zone? The Amazon experience says yes.
20 0.49742681 288 high scalability-2008-03-25-Paper: On Designing and Deploying Internet-Scale Services
topicId topicWeight
[(2, 0.015), (10, 0.142), (46, 0.442), (61, 0.252)]
simIndex simValue blogId blogTitle
same-blog 1 0.88593972 894 high scalability-2010-09-03-Six guiding principles to Consolidate your IT
Introduction: The need for IT consolidation is most evident in two types of organizations. In the first group, IT grew organically with business over the decades, and survived changes of strategy, management, staff and vendor orientation. The second group of businesses, capital groups, is characterized by rapid growth through acquisitions (followed by attempts to integrate radically different IT environments). In both groups, their IT infrastructures have typically been pieced together over the past 20 (or more) years. Read more on BigDataMatters.com
2 0.80274606 95 high scalability-2007-09-17-Scalable CMS?
Introduction: What do you guys think/know about the scalability of the popular CMSs (like Joomla, Drupal or Typo3)? Any experience/suggestions there? I'm not sure which to pick yet... Thanks, Stephan
3 0.53794551 145 high scalability-2007-11-08-ID generator
Introduction: Hi, I would like feedback on an ID generator I just made. What positive and negative effects do you see with this? It's programmed in Java, but could just as easily be programmed in any other typical language. It's thread safe and does not use any synchronization. When testing it on my laptop, I was able to generate 10 million IDs within about 15 seconds, so it should be more than fast enough. Take a look at the attachment (had to rename it from IdGen.java to IdGen.txt to attach it): IdGen.java
4 0.51578748 335 high scalability-2008-05-30-Is "Scaling Engineer" a new job title?
Introduction: Justin.tv is looking to hire a Scaling Engineer to help scale their video cluster, IRC server, web app, monitoring and search services. I've never seen this job title before. A quick search showed only a few previous instances of it being used. Has anyone else seen Scaling Engineer as a job title before? It's a great idea. Scaling is certainly a worthy specialty of its own. Why, there's a difficult lingo, obscure tools, endlessly subtle concepts, a massive body of knowledge to master, and many competing religious factions. All a good start. Next I see a chain of Scalability Universities. Maybe use all those Starbucks that are closing down. Contact me for franchise opportunities :-)
5 0.51338947 634 high scalability-2009-06-20-Building a data cycle at LinkedIn with Hadoop and Project Voldemort
Introduction: Update: Building Voldemort read-only stores with Hadoop. A write-up on what LinkedIn is doing to integrate large offline Hadoop data processing jobs with a fast, distributed online key-value storage system, Project Voldemort.
6 0.47658819 226 high scalability-2008-01-28-DR-BC for web-DB servers
7 0.47640842 268 high scalability-2008-03-06-Announce: First Meeting of Boston Scalability User Group
8 0.4707422 1201 high scalability-2012-02-29-Strategy: Put Mobile Video Into Cold Storage After 30 Days
9 0.46769658 324 high scalability-2008-05-19-UK Based CDN
10 0.46726069 549 high scalability-2009-03-26-Performance - When do I start worrying?
11 0.46711603 1303 high scalability-2012-08-13-Ask HighScalability: Facing scaling issues with news feeds on Redis. Any advice?
12 0.4644447 746 high scalability-2009-11-26-Kngine Snippet Search New Indexing Technology
13 0.45436251 580 high scalability-2009-04-24-INFOSCALE 2009 in June in Hong Kong
14 0.4507347 41 high scalability-2007-07-30-Product: Flickr
15 0.44569874 493 high scalability-2009-01-16-Just-In-Time Scalability: Agile Methods to Support Massive Growth (IMVU case study)
16 0.44523856 347 high scalability-2008-07-07-Five Ways to Stop Framework Fixation from Crashing Your Scaling Strategy
17 0.43739095 793 high scalability-2010-03-10-Saying Yes to NoSQL; Going Steady with Cassandra at Digg
18 0.43655464 930 high scalability-2010-10-28-NoSQL Took Away the Relational Model and Gave Nothing Back
19 0.43275046 132 high scalability-2007-10-25-Who can answer or analyze the image store and visit solution about alibaba.com?Thanks
20 0.43190968 208 high scalability-2008-01-11-FTP Sanity: Redundancy, archiving, consolidation.