high_scalability high_scalability-2008 high_scalability-2008-469 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: This article is a primer, intended to shine some much needed light on the logical, process oriented implementations of database scalability strategies in the form of a broad introduction. More specifically, the intent is to elaborate on the majority of these implementations by example.
sentIndex sentText sentNum sentScore
1 This article is a primer, intended to shine some much needed light on the logical, process oriented implementations of database scalability strategies in the form of a broad introduction. [sent-1, score-2.354]
2 More specifically, the intent is to elaborate on the majority of these implementations by example. [sent-2, score-1.255]
wordName wordTfidf (topN-words)
[('implementations', 0.422), ('shine', 0.342), ('intent', 0.342), ('primer', 0.342), ('elaborate', 0.286), ('broad', 0.236), ('intended', 0.224), ('logical', 0.221), ('majority', 0.205), ('light', 0.187), ('oriented', 0.183), ('specifically', 0.183), ('strategies', 0.156), ('form', 0.136), ('article', 0.109), ('needed', 0.109), ('example', 0.083), ('process', 0.08), ('scalability', 0.062), ('much', 0.06), ('database', 0.048)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 469 high scalability-2008-12-17-Scalability Strategies Primer: Database Sharding
Introduction: This article is a primer, intended to shine some much needed light on the logical, process oriented implementations of database scalability strategies in the form of a broad introduction. More specifically, the intent is to elaborate on the majority of these implementations by example.
2 0.088631429 572 high scalability-2009-04-16-Paper: The End of an Architectural Era (It’s Time for a Complete Rewrite)
Introduction: Update 3 : A Comparison of Approaches to Large-Scale Data Analysis: MapReduce vs. DBMS Benchmarks . Although the process to load data into and tune the execution of parallel DBMSs took much longer than the MR system, the observed performance of these DBMSs was strikingly better. Update 2 : H-Store: A Next Generation OLTP DBMS is the project implementing the ideas in this paper: The goal of the H-Store project is to investigate how these architectural and application shifts affect the performance of OLTP databases, and to study what performance benefits would be possible with a complete redesign of OLTP systems in light of these trends. Our early results show that a simple prototype built from scratch using modern assumptions can outperform current commercial DBMS offerings by around a factor of 80 on OLTP workloads. Update : interesting related thread on Lamda the Ultimate . A really fascinating paper bolstering many of the anti-RDBMS threads the have popped up on the intert
3 0.084244587 234 high scalability-2008-01-30-The AOL XMPP scalability challenge
Introduction: Large scale distributed instant messaging, presence based protocol are a real challenge. With big players adopting the standard, the XMPP (eXtensible Messaging and Presence Protocol) community is facing the need to validate protocol and implementations to even larger scale.
4 0.072744012 1533 high scalability-2013-10-16-Interview With Google's Ilya Grigorik On His New Book: High Performance Browser Networking
Introduction: If you are Google you don't just complain about performance on the web, you do something about it. Doing something about web performance is the job of one Ilya Grigorik , Developer Advocate, Make the Web Fast at Google, and author of a great new book: High Performance Browser Networking: What every web developer should know about networking and web performance . That's a big topic you might be saying to yourself. And it is. The book is 400 plus information packed pages. But never fear. Ilya writes in a very straightforward style. It’s like a man page for the web. Which is a good thing. In case you are not familiar with Ilya, he's the perfect choice for writing such an ambitious book. For years Ilya has been producing excellent content on his blog and if you search YouTube you'll find presentation after presentation on the topics found in the book. Authority established. Reading the book I was struck by what a complicated beast or little World Wide Web has become. That's
5 0.072484344 885 high scalability-2010-08-23-Building a Scalable Key-Value Database: Project Hydracus
Introduction: The world of NoSQL and alternative database implementations (i.e. non-relational) is deeply fascinating to me. I can’t help but be swept up in the whirl of planet-scale web development scalability techniques and the evolution of how developers think about building their applications knowing that with success comes the inevitable need to scale to levels almost unimaginable just five or ten years ago. I’m going to make a prediction: Developers will be expected to understand the fundamentals of how different database systems can be applied within a singular application; their strengths and weaknesses, and when it is appropriate to leverage them. There’s a lot out there about scaling relational databases. Partitioning your data over application-managed shards is a topic that has seen its fair share of attention. But I think there’s a whole slew over NoSQL databases that have been built and designed by some very smart people that are new enough that their internals aren’t we
6 0.066548191 483 high scalability-2009-01-04-Paper: MapReduce: Simplified Data Processing on Large Clusters
7 0.063900664 440 high scalability-2008-11-11-Arhcitecture for content management
8 0.063754134 374 high scalability-2008-08-30-Paper: GargantuanComputing—GRIDs and P2P
9 0.063435249 92 high scalability-2007-09-15-The Role of Memory within Web 2.0 Architectures and Deployments
10 0.061003808 421 high scalability-2008-10-17-A High Performance Memory Database for Web Application Caches
12 0.056458846 434 high scalability-2008-10-30-Olio Web2.0 Toolkit - Evaluate Web Technologies and Tools
13 0.053184897 429 high scalability-2008-10-25-Product: Puppet the Automated Administration System
14 0.053126194 1228 high scalability-2012-04-16-Instagram Architecture Update: What’s new with Instagram?
15 0.051187642 1352 high scalability-2012-10-31-Gone Fishin': LiveJournal Architecture
16 0.048854057 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
17 0.048077948 494 high scalability-2009-01-16-Reducing Your Website's Bandwidth Usage - how to
18 0.047918227 52 high scalability-2007-08-01-Product: Memcached
19 0.047114953 658 high scalability-2009-07-17-Against all the odds
20 0.046108101 939 high scalability-2010-11-09-The Tera-Scale Effect
topicId topicWeight
[(0, 0.04), (1, 0.034), (2, -0.003), (3, 0.011), (4, 0.013), (5, 0.02), (6, -0.006), (7, -0.003), (8, -0.017), (9, 0.01), (10, -0.009), (11, 0.008), (12, -0.007), (13, 0.005), (14, -0.002), (15, -0.007), (16, 0.015), (17, -0.011), (18, 0.019), (19, -0.003), (20, -0.01), (21, -0.015), (22, 0.0), (23, 0.026), (24, 0.013), (25, 0.003), (26, -0.023), (27, -0.011), (28, -0.008), (29, 0.025), (30, -0.002), (31, 0.039), (32, 0.007), (33, 0.019), (34, -0.007), (35, -0.017), (36, 0.008), (37, 0.002), (38, 0.009), (39, 0.008), (40, 0.002), (41, 0.035), (42, 0.012), (43, 0.03), (44, 0.008), (45, -0.018), (46, 0.031), (47, 0.027), (48, -0.021), (49, -0.02)]
simIndex simValue blogId blogTitle
same-blog 1 0.89631385 469 high scalability-2008-12-17-Scalability Strategies Primer: Database Sharding
Introduction: This article is a primer, intended to shine some much needed light on the logical, process oriented implementations of database scalability strategies in the form of a broad introduction. More specifically, the intent is to elaborate on the majority of these implementations by example.
2 0.61268681 462 high scalability-2008-12-06-Paper: Real-world Concurrency
Introduction: An excellent article by Bryan Cantrill and Jeff Bonwick on how to write multi-threaded code. With more processors and no magic bullet solution for how to use them, knowing how to write multiprocessor code that doesn't screw up your system is still a valuable skill. Some topics: Know your cold paths from your hot paths. Intuition is frequently wrong—be data intensive. Know when—and when not—to break up a lock. Be wary of readers/writer locks. Consider per-CPU locking. Know when to broadcast—and when to signal. Learn to debug postmortem. Design your systems to be composable. Don't use a semaphore where a mutex would suffice. Consider memory retiring to implement per-chain hash-table locks. Be aware of false sharing. Consider using nonblocking synchronization routines to monitor contention. When reacquiring locks, consider using generation counts to detect state change. Use wait- and lock-free structures only if you absolutely must. Prepare for the th
3 0.60446656 186 high scalability-2007-12-13-un-article: the setup behind microsoft.com
Introduction: On the blogs.technet.com article on microsoft.com's infrastructure: The article reads like a blatant ad for it's own products, and is light on the technical side. The juicy bits are here, so you know what the fuss is about: Cytrix Netscaler (= loadbalancer with various optimizations) W2K8 + IIS7 and antivirus software on the webservers 650GB/day ISS log files 8-9GBit/s (unknown if CDN's are included) Simple network filtering: stateless access lists blocking unwanted ports on the routers/switches (hence the debated "no firewalls" claim). Note that this information may not reflect present reality very well; the spokesman appears to be reciting others words.
4 0.60053957 235 high scalability-2008-02-02-The case against ORM Frameworks in High Scalability Architectures
Introduction: Let me begin by saying that I have used and continue to use various ORM frameworks such as hibernate, ibatis, propel and activerecord in applications and websites that have a user base ranging from a couple hundred to 500k users. Especially for projects that have to be up and running in a short duration of time, ORM frameworks significantly reduce the effort required to manipulate and persist OOP objects by providing time saving facilities such as automatically generated model objects, integrated unit testing, secure variable substitution, etc. Hibernate even supports horizontal data partitioning via Hibernate Shards. However, the lay of the land is significantly different in the rarefied space occupied by applications needing to support millions of users. Profiling an application at this level and paying particular attention to the operations needed to move data to and from the database, it becomes evident that a significant portion of the operations are API related, whereby t
5 0.59089696 678 high scalability-2009-08-09-Writing about cisco loadbalancer?
Introduction: Guys, At one of my jobs I have to administer a CISCO ACE (application control engine) hardware load-balancer. I don't particularly love this beast, but it's very very powerful. There appears to be little real-world info out there, so it could be interesting writing an article on that. But I don't have other HW LB's to compare it to and I don't want to rehash the product page. What would interest you in a 'product review' of a loadbalancer? No replies means it's not an interesting topic, so no article then ;-)
6 0.5606789 656 high scalability-2009-07-16-Scalable Web Architectures and Application State
7 0.55622411 188 high scalability-2007-12-19-How can I learn to scale my project?
8 0.52662736 1560 high scalability-2013-12-09-In Memory: Grace Hopper to Programmers: Mind Your Nanoseconds!
9 0.52431482 1509 high scalability-2013-08-30-Stuff The Internet Says On Scalability For August 30, 2013
10 0.51924759 474 high scalability-2008-12-21-The I.H.S.D.F. Theorem: A Proposed Theorem for the Trade-offs in Horizontally Scalable Systems
11 0.50859833 1546 high scalability-2013-11-11-Ask HS: What is a good OLAP database choice with node.js?
12 0.50387424 97 high scalability-2007-09-18-Session management in highly scalable web sites
13 0.50166321 1436 high scalability-2013-04-05-Stuff The Internet Says On Scalability For April 5, 2013
15 0.50135267 250 high scalability-2008-02-17-Web Accelerators - snake oil or miracle remedy?
16 0.49801356 1231 high scalability-2012-04-20-Stuff The Internet Says On Scalability For April 20, 2012
17 0.49434936 1202 high scalability-2012-03-01-Grace Hopper to Programmers: Mind Your Nanoseconds!
18 0.49351567 1462 high scalability-2013-05-22-Strategy: Stop Using Linked-Lists
19 0.49289706 733 high scalability-2009-10-29-Paper: No Relation: The Mixed Blessings of Non-Relational Databases
20 0.49224678 1336 high scalability-2012-10-09-Batoo JPA - The new JPA Implementation that runs over 15 times faster...
topicId topicWeight
[(2, 0.148), (4, 0.553), (79, 0.106)]
simIndex simValue blogId blogTitle
same-blog 1 0.86185229 469 high scalability-2008-12-17-Scalability Strategies Primer: Database Sharding
Introduction: This article is a primer, intended to shine some much needed light on the logical, process oriented implementations of database scalability strategies in the form of a broad introduction. More specifically, the intent is to elaborate on the majority of these implementations by example.
2 0.67306542 282 high scalability-2008-03-18-Database War Stories #3: Flickr
Introduction: [Tim O'Reilly] Continuing my series of queries about how "Web 2.0" companies used databases, I asked Cal Henderson of Flickr to tell me "how the folksonomy model intersects with the traditional database. How do you manage a tag cloud?"
3 0.54850894 1164 high scalability-2011-12-27-PlentyOfFish Update - 6 Billion Pageviews and 32 Billion Images a Month
Introduction: Markus has a short update on their PlentyOfFish Architecture . Impressive November statistics: 6 billion pageviews served 32 billion images served 6 million logins i n one day IM servers handle about 30 billion pageviews 11 webservers (5 of which could be dropped) Hired first DBA in July . They currently have a handful of employees . All hosting/cdn costs combined are under $70k/month. Lesson : small organization, simple architecture, on raw hardware is still plenty profitable for PlentyOfFish. Related Articles On HackerNews 32 Billion images a month by Markus Frind.
4 0.52706617 12 high scalability-2007-07-15-Isilon Clustred Storage System
Introduction: The Isilon IQ family of clustered storage systems was designed from the ground up to meet the needs of data-intensive enterprises and high-performance computing environments. By combining Isilon's OneFS® operating system software with the latest advances in industry-standard hardware, Isilon delivers modular, pay-as-you-grow, enterprise-class clustered storage systems. OneFS, with TrueScale™ technology, powers the industry's first and only storage system that enables linear or independent scaling of performance and capacity. This new flexible and tunable system, featuring a robust suite of clustered storage software applications, provides customers with an "out of the box" solution that is fully optimized for the widest range of applications and workflow needs. * Scales from 4 TB ti 1 PB * Throughput of up to 10 GB per seond * Linear scaling * Easy to manage Related Articles Inside Skinny On Isilon by StorageMojo
5 0.50759125 1213 high scalability-2012-03-22-Paper: Revisiting Network I-O APIs: The netmap Framework
Introduction: Here's a really good article in the Communications of the ACM on reducing network packet processing overhead by redesigning the network stack: Revisiting Network I/O APIs: The Netmap Framework by Luigi Rizzo . As commodity networking performance increases operating systems need to keep up or all those CPUs will go to waste. How do they make this happen? Abstract: Today 10-gigabit interfaces are used more and more in datacenters and servers. On these links, packets flow as fast as one every 67.2 nanoseconds, yet modern operating systems can take 10-20 times longer just to move one packet between the wire and the application. We can do much better, not with more powerful hardware but by revising architectural decisions made long ago regarding the design of device drivers and network stacks. The netmap framework is a promising step in this direction. Thanks to a careful design and the engineering of a new packet I/O API, netmap eliminates much unnecessary overhead and moves
6 0.47341841 1343 high scalability-2012-10-18-Save up to 30% by Selecting Better Performing Amazon Instances
8 0.42409945 1157 high scalability-2011-12-14-Virtualization and Cloud Computing is Changing the Network to East-West Routing
9 0.42069596 919 high scalability-2010-10-14-I, Cloud
10 0.40973744 309 high scalability-2008-04-23-Behind The Scenes of Google Scalability
11 0.40783355 670 high scalability-2009-08-05-Anti-RDBMS: A list of distributed key-value stores
12 0.40646148 916 high scalability-2010-10-07-Hot Scalability Links For Oct 8, 2010
13 0.39205647 1620 high scalability-2014-03-27-Strategy: Cache Stored Procedure Results
14 0.38081464 79 high scalability-2007-09-01-On-Demand Infinitely Scalable Database Seed the Amazon EC2 Cloud
15 0.34652522 1436 high scalability-2013-04-05-Stuff The Internet Says On Scalability For April 5, 2013
16 0.33166879 1589 high scalability-2014-02-03-How Google Backs Up the Internet Along With Exabytes of Other Data
17 0.32973197 1094 high scalability-2011-08-08-Tagged Architecture - Scaling to 100 Million Users, 1000 Servers, and 5 Billion Page Views
18 0.3199439 188 high scalability-2007-12-19-How can I learn to scale my project?
19 0.30853415 202 high scalability-2008-01-06-Email Architecture
20 0.30073732 1385 high scalability-2013-01-11-Stuff The Internet Says On Scalability For January 11, 2013