high_scalability high_scalability-2009 high_scalability-2009-677 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: In this blog post, BJ Clark overviews the available key-value stores that aren't SQL based. His work on some projects yields interesting experiences. Make sure to check out the comments as well. The post has spread throughout social tiny text services, so there is a chance you've already read it, but it's still worthwhile to put here.
sentIndex sentText sentNum sentScore
1 In this blog post BJ Clark overviews the available key-value stores that aren't SQL based. [sent-1, score-0.786]
2 The post has spread throughout social tiny text services so there is a chance you've already read it, but still worthwhile to put here. [sent-4, score-2.086]
wordName wordTfidf (topN-words)
[('overviews', 0.44), ('clark', 0.38), ('worthwhile', 0.38), ('yields', 0.28), ('tiny', 0.237), ('throughout', 0.221), ('text', 0.197), ('chance', 0.186), ('comments', 0.18), ('spread', 0.177), ('projects', 0.165), ('stores', 0.163), ('check', 0.153), ('social', 0.132), ('sure', 0.126), ('sql', 0.126), ('already', 0.111), ('put', 0.107), ('available', 0.094), ('post', 0.089), ('services', 0.087), ('interesting', 0.086), ('read', 0.081), ('still', 0.081), ('make', 0.054)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 677 high scalability-2009-08-09-NoSQL: If Only It Was That Easy
Introduction: In this blog post, BJ Clark overviews the available key-value stores that aren't SQL based. His work on some projects yields interesting experiences. Make sure to check out the comments as well. The post has spread throughout social tiny text services, so there is a chance you've already read it, but it's still worthwhile to put here.
2 0.096941561 122 high scalability-2007-10-14-Product: The Spread Toolkit
Introduction: Complex applications coordinating work across a lot of machines often need a highly performing fault tolerant message layer. Though a blast to write, it's probably a better use of your time to use an off the shelf solution. And that's where Spread comes in. Flickr, for example, uses Spread to create real-time event feeds from their web server logs. What exactly is Spread? From the Spread website: Spread is an open source toolkit that provides a high performance messaging service that is resilient to faults across local and wide area networks. Spread functions as a unified message bus for distributed applications, and provides highly tuned application-level multicast, group communication, and point to point support. Spread services range from reliable messaging to fully ordered messages with delivery guarantees. Spread can be used in many distributed applications that require high reliability, high performance, and robust communication among various subsets of members. The
3 0.088754706 804 high scalability-2010-04-06-Sponsored Post: Event - Social Developer Summit
Introduction: Social Developer Summit - June 29, 2010 - San Francisco, CA A meeting of the technically social - Building, scaling, and profiting in a social age Whether it's social games, social news, social discovery, social search, or other forms of social solutions, developers today are facing new hurdles in building instantly scalable products. As new technologies emerge to address the challenges faced by social application developers, it's increasingly important to come together for knowledge sharing purposes. The first Social Developer Summit will bring together social application developers to discuss the challenges, solutions, and best practices for building applications in the rapidly expanding social web economy. At the Social Developer Summit, industry experts will share tips and case studies for building high performance social web products. For more information please take a look at Social Developer Summit. If you are interested in a sponsored pos
4 0.088754706 814 high scalability-2010-04-20-Sponsored Post: Event - Social Developer Summit
Introduction: Social Developer Summit - June 29, 2010 - San Francisco, CA A meeting of the technically social - Building, scaling, and profiting in a social age Whether it's social games, social news, social discovery, social search, or other forms of social solutions, developers today are facing new hurdles in building instantly scalable products. As new technologies emerge to address the challenges faced by social application developers, it's increasingly important to come together for knowledge sharing purposes. The first Social Developer Summit will bring together social application developers to discuss the challenges, solutions, and best practices for building applications in the rapidly expanding social web economy. At the Social Developer Summit, industry experts will share tips and case studies for building high performance social web products. For more information please take a look at Social Developer Summit. If you are interested in a sponsored pos
5 0.078476951 396 high scalability-2008-09-26-Lucasfilm: The Real Magic is in the Data Center
Introduction: Kevin Clark, director of IT operations for Lucasfilm, discusses how their data center works: * Linux-based platform, SUSE (looking to change), and a lot of proprietary open source applications for content creation. * 4,500-processor render farm in the datacenter. Workstations are used off hours. * Developed their own proprietary scheduler to schedule their 5,500 available processors. * Render nodes, the blade racks (from Verari), run dual-core dual Opteron chips with 32GB of memory on board, but are expanding those to quad-core. Are an AMD shop. * 400TB of storage online for production. * Every night they write out 10-20TB of new data on a render. A project will use up to a hundred-plus terabytes of storage. * Incremental backups are a challenge because the data changes up to 50 percent over a week. * NetApps used for storage. They like the global namespace in the virtual file system. * Foundry Networks architecture shop. One of the larger 10-GbE-backbone facilities
6 0.067400746 670 high scalability-2009-08-05-Anti-RDBMS: A list of distributed key-value stores
7 0.065811127 183 high scalability-2007-12-12-Report from OpenSocial Meetup at Google
9 0.05688189 72 high scalability-2007-08-22-Wikimedia architecture
10 0.056554694 384 high scalability-2008-09-16-EE-Appserver Clustering OR Terracota OR Coherence OR something else?
11 0.05528098 682 high scalability-2009-08-16-ThePort Network Architecture
12 0.055177152 467 high scalability-2008-12-16-[ANN] New Open Source Cache System
13 0.054533295 827 high scalability-2010-05-14-Hot Scalability Links for May 14, 2010
14 0.054512132 410 high scalability-2008-10-13-SQL Server 2008 Database Performance and Scalability
15 0.054304525 148 high scalability-2007-11-11-Linkedin architecture
16 0.053458244 285 high scalability-2008-03-19-Serving JavaScript Fast
17 0.053088192 784 high scalability-2010-02-25-Paper: High Performance Scalable Data Stores
18 0.052695364 961 high scalability-2010-12-21-SQL + NoSQL = Yes !
19 0.051534969 995 high scalability-2011-02-24-Strategy: Eliminate Unnecessary SQL
20 0.051045462 928 high scalability-2010-10-26-Scaling DISQUS to 75 Million Comments and 17,000 RPS
topicId topicWeight
[(0, 0.062), (1, 0.03), (2, 0.008), (3, -0.005), (4, 0.048), (5, 0.0), (6, -0.027), (7, -0.027), (8, 0.012), (9, -0.001), (10, 0.01), (11, 0.015), (12, -0.013), (13, 0.0), (14, -0.015), (15, 0.017), (16, -0.017), (17, -0.003), (18, 0.015), (19, 0.021), (20, -0.041), (21, -0.018), (22, 0.007), (23, 0.03), (24, 0.008), (25, 0.003), (26, -0.003), (27, -0.025), (28, -0.044), (29, -0.047), (30, 0.016), (31, -0.013), (32, 0.018), (33, 0.014), (34, 0.028), (35, -0.017), (36, 0.013), (37, 0.024), (38, 0.034), (39, 0.035), (40, 0.04), (41, -0.021), (42, 0.003), (43, -0.019), (44, 0.023), (45, 0.043), (46, 0.021), (47, 0.021), (48, -0.018), (49, -0.014)]
simIndex simValue blogId blogTitle
same-blog 1 0.97485274 677 high scalability-2009-08-09-NoSQL: If Only It Was That Easy
Introduction: In this blog post, BJ Clark overviews the available key-value stores that aren't SQL based. His work on some projects yields interesting experiences. Make sure to check out the comments as well. The post has spread throughout social tiny text services, so there is a chance you've already read it, but it's still worthwhile to put here.
2 0.68385482 183 high scalability-2007-12-12-Report from OpenSocial Meetup at Google
Introduction: Update: Facebook pulls a Microsoft and embraces and extends by opening their platform to other social sites like Bebo. Very smart and unexpected. More info at Facebook to let other sites access platform code. This month's regular Facebook Meetup was held at Google and the topic of the day was OpenSocial. For those of you with real lives, OpenSocial "provides a common set of APIs for social applications across multiple websites." Over 200 excited people, hoping to do very exciting things, and dreaming of making an exciting pile of money, watched an OpenSocial presentation put on by a couple of appropriately knowledgeable evangelists. I could feel my social graph being more successfully monetized with each passing minute. Normally the meetings are much smaller, but Google puts on a very nice spread, so I think people may have showed up to dine :-) Or they could have showed up to learn why and how they should code to the new uber social API. By the looks of the full pl
3 0.6470589 804 high scalability-2010-04-06-Sponsored Post: Event - Social Developer Summit
Introduction: Social Developer Summit - June 29, 2010 - San Francisco, CA A meeting of the technically social - Building, scaling, and profiting in a social age Whether it's social games, social news, social discovery, social search, or other forms of social solutions, developers today are facing new hurdles in building instantly scalable products. As new technologies emerge to address the challenges faced by social application developers, it's increasingly important to come together for knowledge sharing purposes. The first Social Developer Summit will bring together social application developers to discuss the challenges, solutions, and best practices for building applications in the rapidly expanding social web economy. At the Social Developer Summit, industry experts will share tips and case studies for building high performance social web products. For more information please take a look at Social Developer Summit. If you are interested in a sponsored pos
4 0.6470589 814 high scalability-2010-04-20-Sponsored Post: Event - Social Developer Summit
Introduction: Social Developer Summit - June 29, 2010 - San Francisco, CA A meeting of the technically social - Building, scaling, and profiting in a social age Whether it's social games, social news, social discovery, social search, or other forms of social solutions, developers today are facing new hurdles in building instantly scalable products. As new technologies emerge to address the challenges faced by social application developers, it's increasingly important to come together for knowledge sharing purposes. The first Social Developer Summit will bring together social application developers to discuss the challenges, solutions, and best practices for building applications in the rapidly expanding social web economy. At the Social Developer Summit, industry experts will share tips and case studies for building high performance social web products. For more information please take a look at Social Developer Summit. If you are interested in a sponsored pos
5 0.59547061 811 high scalability-2010-04-16-Hot Scalability Links for April 16, 2010
Introduction: Twitter gets a total of 3 billion requests a day via its API; 105,779,710 registered users; 300,000 new registered users a day; 180 million unique visitors a month; 55 million tweets a day. Who has the most servers? Google 1 million+; Intel 100K; 1&1 Internet 70K; Facebook 30K; Akamai 61K; Rackspace 56k+. Cloud Computing Economies of Scale. James Hamilton gives a fabulous talk breaking down where the costs are in the cloud. It's not where you may think. Higher utilization is the key. More here. Erlang Factory: Andy Gross: Distributed Erlang Systems In Operation: Patterns and Pitfalls by Martin J. Logan. Great overview of architecting distributed systems in Erlang. Covers what you want and don't want in a distributed system and how to compromise those elements, what's common, system design, cluster membership, load balancing, upgrades, debugging, and more. Extreme Scale Computing by Irving Wladawsky-Berger. “An exascale supercomputer capable of a million tr
6 0.57908791 802 high scalability-2010-04-01-Hot Scalability Links for April 1, 2010
7 0.5733543 723 high scalability-2009-10-16-Paper: Scaling Online Social Networks without Pains
9 0.53520435 153 high scalability-2007-11-13-Friendster Lost Lead Because of a Failure to Scale
10 0.52862298 1495 high scalability-2013-07-22-We're on a Break
11 0.52782816 1303 high scalability-2012-08-13-Ask HighScalability: Facing scaling issues with news feeds on Redis. Any advice?
12 0.5243293 1228 high scalability-2012-04-16-Instagram Architecture Update: What’s new with Instagram?
13 0.52051425 1158 high scalability-2011-12-16-Stuff The Internet Says On Scalability For December 16, 2011
14 0.51900631 1294 high scalability-2012-08-01-Prismatic Update: Machine Learning on Documents and Users
16 0.51737082 755 high scalability-2009-12-28-Zynga Needs a Server-side Systems Engineer
17 0.51141405 682 high scalability-2009-08-16-ThePort Network Architecture
18 0.5021168 361 high scalability-2008-08-08-Separation into read-write only databases
20 0.49044621 846 high scalability-2010-06-22-Sponsored Post: Jobs: Etsy, Digg, Huffington Post Event: Velocity Conference
topicId topicWeight
[(1, 0.141), (2, 0.138), (62, 0.452), (79, 0.089)]
simIndex simValue blogId blogTitle
1 0.76378918 1416 high scalability-2013-03-04-NoSQL Style - A Gangnam Style Parody
Introduction: Listen up all you IT people...NoSQL, it's the rage now, so turn the page now and boost your stack...Hey, mighty people...Go, go, go, hey, hey, hey, hey, hey, hey...Go NoSQL style... I for one feel both edified and entertained...can't wait for the Harlem Shake version.
same-blog 2 0.75612146 677 high scalability-2009-08-09-NoSQL: If Only It Was That Easy
Introduction: In this blog post, BJ Clark overviews the available key-value stores that aren't SQL based. His work on some projects yields interesting experiences. Make sure to check out the comments as well. The post has spread throughout social tiny text services, so there is a chance you've already read it, but it's still worthwhile to put here.
3 0.5294109 127 high scalability-2007-10-20-Strategy: Send XHR Request on Lost Focus Instead of For Every Character
Introduction: Robert Stewart shared this useful Ajax related scalability strategy: We avoided XMLHttpRequests for individual keystrokes, choosing to go back to the server only when a field lost focus. Google can afford all the servers to handle the load for that, but we didn't want to. Do you have a scalability strategy to share? Then share it!
4 0.47109306 862 high scalability-2010-07-20-Strategy: Consider When a Service Starts Billing in Your Algorithm Cost
Introduction: At Monday's Cloud Computing Meetup, Paco Nathan gave an excellent Getting Started on Hadoop talk (slides). I found one of Paco's strategies particularly interesting: consider when a service starts charging in cost calculations. Depending on your use case it may be cheaper to go with a more expensive service that charges only for work accomplished rather than charging for both work + startup time. The example is comparing the cost of running Hadoop on AWS yourself versus using Amazon's prepackaged Hadoop service, Elastic MapReduce (EMR). The thought may have gone through your mind as it did mine that it doesn't necessarily make sense to use Amazon's Hadoop service. Why pay a premium for EMR when Hadoop will run directly on AWS? One reason is that Amazon has made significant changes to Hadoop to make it run more efficiently and easily on AWS. The other more surprising reason is cost. When starting a 500 node Hadoop cluster, for example, you have to wait for all the node
5 0.46801418 509 high scalability-2009-02-05-Product: HAProxy - The Reliable, High Performance TCP-HTTP Load Balancer
Introduction: Update: Load Balancing in Amazon EC2 with HAProxy. Grig Gheorghiu writes a nice post on HAProxy functionality and configuration: Emulating virtual servers, Logging, SSL, Load balancing algorithms, Session persistence with cookies, Server health checks, etc. Adapted From the website: HAProxy is a free, very fast and reliable solution offering high availability, load balancing, and proxying for TCP and HTTP-based applications. It is particularly suited for web sites crawling under very high loads while needing persistence or Layer7 processing. Supporting tens of thousands of connections is clearly realistic with today's hardware. Its mode of operation makes its integration into existing architectures very easy and riskless, while still offering the possibility not to expose fragile web servers to the Net. Currently, two major versions are supported: * version 1.1 - maintains critical sites online since 200 The most stable and reliable, has reached years of uptime. Receive
6 0.43686306 70 high scalability-2007-08-22-How many machines do you need to run your site?
7 0.42555901 1172 high scalability-2012-01-10-A Perfect Fifth of Notes on Scalability
8 0.42351937 679 high scalability-2009-08-11-13 Scalability Best Practices
9 0.4227446 865 high scalability-2010-07-27-A Metric A$$-Ton of Joe Stump: The Cloud is Cheaper than Bare Metal
10 0.42269534 79 high scalability-2007-09-01-On-Demand Infinitely Scalable Database Seed the Amazon EC2 Cloud
11 0.42118543 15 high scalability-2007-07-16-Blog: MySQL Performance Blog - Everything about MySQL Performance.
12 0.42038092 1469 high scalability-2013-06-03-GOV.UK - Not Your Father's Stack
13 0.41962892 309 high scalability-2008-04-23-Behind The Scenes of Google Scalability
14 0.41851512 1472 high scalability-2013-06-07-Stuff The Internet Says On Scalability For June 7, 2013
15 0.41830063 219 high scalability-2008-01-21-Product: Hyperic
17 0.41716617 664 high scalability-2009-07-29-Strategy: Devirtualize for More Vroom
18 0.4166874 1053 high scalability-2011-06-06-Apple iCloud: Syncing and Distributed Storage Over Streaming and Centralized Storage
19 0.41636333 1110 high scalability-2011-09-06-Big Data Application Platform
20 0.41609269 126 high scalability-2007-10-20-Should you build your next website using 3tera's grid OS?