high_scalability high_scalability-2007 high_scalability-2007-53 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: MogileFS is an open source distributed filesystem. Its properties and features include: Application level, No single point of failure, Automatic file replication, Better than RAID, Flat Namespace, Shared-Nothing, No RAID required, Local filesystem agnostic.
sentIndex sentText sentNum sentScore
1 Its properties and features include: Application level, No single point of failure, Automatic file replication, Better than RAID, Flat Namespace, Shared-Nothing, No RAID required, Local filesystem agnostic. [sent-2, score-0.935]
wordName wordTfidf (topN-words)
[('raid', 0.464), ('namespace', 0.384), ('agnostic', 0.376), ('flat', 0.293), ('filesystem', 0.284), ('automatic', 0.252), ('properties', 0.238), ('include', 0.16), ('local', 0.158), ('failure', 0.154), ('replication', 0.144), ('required', 0.144), ('file', 0.122), ('level', 0.11), ('features', 0.109), ('source', 0.102), ('point', 0.1), ('open', 0.097), ('better', 0.087), ('single', 0.082), ('distributed', 0.075), ('application', 0.068)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 53 high scalability-2007-08-01-Product: MogileFS
Introduction: MogileFS is an open source distributed filesystem. Its properties and features include: Application level, No single point of failure, Automatic file replication, Better than RAID, Flat Namespace, Shared-Nothing, No RAID required, Local filesystem agnostic.
2 0.24054761 143 high scalability-2007-11-06-Product: ChironFS
Introduction: If you are trying to create highly available file systems, especially across data centers, then ChironFS is one potential solution. It's relatively new, so there aren't lots of experience reports, but it looks worth considering. What is ChironFS and how does it work? Adapted from the ChironFS website: The Chiron Filesystem is a Fuse based filesystem that frees you from single points of failure. It's main purpose is to guarantee filesystem availability using replication. But it isn't a RAID implementation. RAID replicates DEVICES not FILESYSTEMS. Why not just use RAID over some network block device? Because it is a block device and if one server mounts that device in RW mode, no other server will be able to mount it in RW mode. Any real network may have many servers and offer a variety of services. Keeping everything running can become a real nightmare!
3 0.14208977 1279 high scalability-2012-07-09-Data Replication in NoSQL Databases
Introduction: This is the third guest post ( part 1 , part 2 ) of a series by Greg Lindahl, CTO of blekko, the spam free search engine. Previously, Greg was Founder and Distinguished Engineer at PathScale, at which he was the architect of the InfiniPath low-latency InfiniBand HCA, used to build tightly-coupled supercomputing clusters. blekko's home-grown NoSQL database was designed from the start to support a web-scale search engine, with 1,000s of servers and petabytes of disk. Data replication is a very important part of keeping the database up and serving queries. Like many NoSQL database authors, we decided to keep R=3 copies of each piece of data in the database, and not use RAID to improve reliability. The key goal we were shooting for was a database which degrades gracefully when there are many small failures over time, without needing human intervention. Why don't we like RAID for big NoSQL databases? Most big storage systems use RAID levels like 3, 4, 5, or 10 to improve relia
4 0.13020517 1473 high scalability-2013-06-10-The 10 Deadly Sins Against Scalability
Introduction: In the moral realm there may be 7 deadly sins, but scalability maven Sean Hull has come up Five More Things Deadly to Scalability that when added to his earlier 5 Things That are Toxic to Scalability , make for a numerologically satisfying 10 sins again scalability: Slow Disk I/O – RAID 5 – Multi-tenant EBS . Use RAID 10, it provides good protection along with good read and write performance. The design of RAID 5 means poor performance and long repair times on failure. On AWS consider Provisioned IOPS as a way around IO bottlenecks. Using the database for Queuing. The database may seem like the perfect place to keep work queues, but under load locking and scanning overhead kills performance. Use specialized products like RabbitMQ and SQS to remove this bottleneck. Using Database for full-text searching. Search seems like another perfect database feature. At scale search doesn't perform well. Use specialized technologies like Solr or Sphinx. Insufficient Caching
5 0.1089603 585 high scalability-2009-04-29-How to choice and build perfect server
Introduction: There are a lot of questions about the server components, and how to choice and/or build perfect server with consider the power consumption. So I decide to write about this topic . Key Points: What kind of components the servers needs The Green Computing and the Servers components. How much power the server consume. Choice the right components: Processors, HDD, RAID, Memory Build Server, or buy?
6 0.10617195 1114 high scalability-2011-09-13-Must see: 5 Steps to Scaling MongoDB (Or Any DB) in 8 Minutes
7 0.095486812 283 high scalability-2008-03-18-Shared filesystem on EC2
8 0.094244406 1511 high scalability-2013-09-04-Wide Fast SATA: the Recipe for Hot Performance
9 0.08525829 98 high scalability-2007-09-18-Sync data on all servers
10 0.084762283 1508 high scalability-2013-08-28-Sean Hull's 20 Biggest Bottlenecks that Reduce and Slow Down Scalability
11 0.083337627 889 high scalability-2010-08-30-Pomegranate - Storing Billions and Billions of Tiny Little Files
12 0.079447366 1121 high scalability-2011-09-21-5 Scalability Poisons and 3 Cloud Scalability Antidotes
13 0.078821197 68 high scalability-2007-08-20-TypePad Architecture
14 0.077586874 229 high scalability-2008-01-29-Building scalable storage into application - Instead of MogileFS OpenAFS etc.
15 0.075997636 1162 high scalability-2011-12-23-Funny: A Cautionary Tale About Storage and Backup
16 0.073353723 1163 high scalability-2011-12-23-Stuff The Internet Says On Scalability For December 23, 2011
17 0.07066378 490 high scalability-2009-01-12-Getting ready for the cloud
18 0.069851518 1316 high scalability-2012-09-04-Changing Architectures: New Datacenter Networks Will Set Your Code and Data Free
19 0.069754243 1099 high scalability-2011-08-16-The 5 Biggest Ways to Boost MySQL Scalability
20 0.065637223 396 high scalability-2008-09-26-Lucasfilm: The Real Magic is in the Data Center
topicId topicWeight
[(0, 0.072), (1, 0.032), (2, -0.009), (3, -0.01), (4, -0.012), (5, 0.047), (6, 0.037), (7, -0.061), (8, -0.002), (9, 0.004), (10, 0.001), (11, -0.029), (12, -0.016), (13, -0.04), (14, -0.003), (15, 0.048), (16, -0.012), (17, 0.031), (18, -0.027), (19, 0.012), (20, 0.035), (21, 0.04), (22, -0.037), (23, 0.054), (24, -0.052), (25, -0.003), (26, 0.069), (27, -0.0), (28, -0.064), (29, 0.003), (30, -0.051), (31, -0.028), (32, 0.048), (33, -0.084), (34, -0.022), (35, -0.012), (36, 0.002), (37, -0.021), (38, -0.038), (39, -0.049), (40, -0.049), (41, -0.07), (42, -0.061), (43, -0.005), (44, -0.035), (45, -0.005), (46, 0.01), (47, 0.06), (48, -0.038), (49, 0.035)]
simIndex simValue blogId blogTitle
same-blog 1 0.97097766 53 high scalability-2007-08-01-Product: MogileFS
Introduction: MogileFS is an open source distributed filesystem. Its properties and features include: Application level, No single point of failure, Automatic file replication, Better than RAID, Flat Namespace, Shared-Nothing, No RAID required, Local filesystem agnostic.
2 0.72325307 143 high scalability-2007-11-06-Product: ChironFS
Introduction: If you are trying to create highly available file systems, especially across data centers, then ChironFS is one potential solution. It's relatively new, so there aren't lots of experience reports, but it looks worth considering. What is ChironFS and how does it work? Adapted from the ChironFS website: The Chiron Filesystem is a Fuse based filesystem that frees you from single points of failure. It's main purpose is to guarantee filesystem availability using replication. But it isn't a RAID implementation. RAID replicates DEVICES not FILESYSTEMS. Why not just use RAID over some network block device? Because it is a block device and if one server mounts that device in RW mode, no other server will be able to mount it in RW mode. Any real network may have many servers and offer a variety of services. Keeping everything running can become a real nightmare!
3 0.6725139 125 high scalability-2007-10-18-another approach to replication
Introduction: File replication based on erasure codes can reduce total replicas size 2 times and more.
4 0.66146022 889 high scalability-2010-08-30-Pomegranate - Storing Billions and Billions of Tiny Little Files
Introduction: Pomegranate is a novel distributed file system built over distributed tabular storage that acts an awful lot like a NoSQL system. It's targeted at increasing the performance of tiny object access in order to support applications like online photo and micro-blog services, which require high concurrency, high throughput, and low latency. Their tests seem to indicate it works: We have demonstrate that file system over tabular storage performs well for highly concurrent access. In our test cluster, we observed linearly increased more than 100,000 aggregate read and write requests served per second ( RPS ). Rather than sitting atop the file system like almost every other K-V store, Pomegranate is baked into file system. The idea is that the file system API is common to every platform so it wouldn't require a separate API to use. Every application could use it out of the box. The features of Pomegranate are: It handles billions of small files efficiently, even in on
5 0.65183103 103 high scalability-2007-09-28-Kosmos File System (KFS) is a New High End Google File System Option
Introduction: There's a new clustered file system on the spindle: Kosmos File System (KFS) . Thanks to Rich Skrenta for turning me on to KFS and I think his blog post says it all. KFS is an open source project written in C++ by search startup Kosmix . The team members have a good pedigree so there's a better than average chance this software will be worth considering. After you stop trying to turn KFS into "Kentucky Fried File System" in your mind, take a look at KFS' intriguing feature set: Incremental scalability: New chunkserver nodes can be added as storage needs increase; the system automatically adapts to the new nodes. Availability: Replication is used to provide availability due to chunk server failures. Typically, files are replicated 3-way. Per file degree of replication: The degree of replication is configurable on a per file basis, with a max. limit of 64. Re-replication: Whenever the degree of replication for a file drops below the configured amount (
6 0.64537436 278 high scalability-2008-03-16-Product: GlusterFS
7 0.6168378 271 high scalability-2008-03-08-Product: DRBD - Distributed Replicated Block Device
9 0.59781897 283 high scalability-2008-03-18-Shared filesystem on EC2
10 0.59752429 112 high scalability-2007-10-04-You Can Now Store All Your Stuff on Your Own Google Like File System
11 0.57226348 98 high scalability-2007-09-18-Sync data on all servers
12 0.57156307 20 high scalability-2007-07-16-Paper: The Clustered Storage Revolution
13 0.56699342 1279 high scalability-2012-07-09-Data Replication in NoSQL Databases
14 0.55239248 229 high scalability-2008-01-29-Building scalable storage into application - Instead of MogileFS OpenAFS etc.
15 0.54483902 50 high scalability-2007-07-31-BerkeleyDB & other distributed high performance key-value databases
16 0.53183496 368 high scalability-2008-08-17-Wuala - P2P Online Storage Cloud
17 0.52624619 971 high scalability-2011-01-10-Riak's Bitcask - A Log-Structured Hash Table for Fast Key-Value Data
18 0.5136916 13 high scalability-2007-07-15-Lustre cluster file system
20 0.50432563 529 high scalability-2009-03-10-Paper: Consensus Protocols: Paxos
topicId topicWeight
[(1, 0.168), (2, 0.137), (46, 0.232), (85, 0.27)]
simIndex simValue blogId blogTitle
same-blog 1 0.85495788 53 high scalability-2007-08-01-Product: MogileFS
Introduction: MogileFS is an open source distributed filesystem. Its properties and features include: Application level, No single point of failure, Automatic file replication, Better than RAID, Flat Namespace, Shared-Nothing, No RAID required, Local filesystem agnostic.
2 0.81781268 191 high scalability-2007-12-23-Synchronizing Memcached application
Introduction: I have an application with couple of web servers that uses MemcacheD. How can i synchronize concurrent put to the cache? The value of the entry is list. Atomic append operation could have been helpful, but unfortunately memcahe doesn't support atomic append.
3 0.80372441 1049 high scalability-2011-05-31-Awesome List of Advanced Distributed Systems Papers
Introduction: As part of Dr. Indranil Gupta 's CS 525 Spring 2011 Advanced Distributed Systems class, he has collected an incredible list of resources on distributed systems . His research group is also doing some interesting work. The various topics include: Before there Were Clouds, Cloud Computing, P2P Systems, Basic Distributed Computing Concepts, Sensor Networks, Overlays and DHTs, Cloud Programming, Cloud Scheduling, Key-Value Stores, Storage, Sensor Net Routing, Geo-Distribution, P2P Apps, In-network processing, Epidemics, Probabilistic Membership Protocols, Distributed Monitoring and Management, Publish-Subscribe/CDNs, Measurement Studies, Old Wine: Stale or Vintage?, In Byzantium, Cloud Pricing, Other Industrial Systems, Structure of Networks, Completing the Circle, Green Clouds, Distributed Debugging, Flash!, The Middle or the End?, Availability-Aware Systems, Design Methodologies, Handling Stress, Sources of unreliability in networks, Handling Stress, Selfish algorithms, Securi
4 0.79317498 102 high scalability-2007-09-27-Product: Sequoia Database Clustering Technology
Introduction: Sequoia is a transparent middleware solution offering clustering, load balancing and failover services for any database. Sequoia is the continuation of the C-JDBC project. The database is distributed and replicated among several nodes and Sequoia balances the queries among these nodes. Sequoia handles node and network failures with transparent failover. It also provides support for hot recovery, online maintenance operations and online upgrades. Features in a nutshell No modification of existing applications or databases. Operational with any database providing a JDBC driver. High availability provided by advanced RAIDb technology. Transparent failover and recovery capabilities. Performance scalability with unique load balancing and query result caching features. Integrated JMX-based administration and monitoring. 100% Java implementation allowing portability across platforms with a JRE 1.4 or greater. Open source licensed under Apache v2 license. Professi
5 0.78043544 646 high scalability-2009-07-01-Podcast about Facebook's Cassandra Project and the New Wave of Distributed Databases
Introduction: In this podcast , we interview Jonathan Ellis about how Facebook's open sourced Cassandra Project took lessons learned from Amazon's Dynamo and Google's BigTable to tackle the difficult problem of building a highly scalable, always available, distributed data store.
6 0.77462548 1039 high scalability-2011-05-12-Paper: Mind the Gap: Reconnecting Architecture and OS Research
7 0.76622701 820 high scalability-2010-05-03-100 Node Hazelcast cluster on Amazon EC2
8 0.75758135 492 high scalability-2009-01-16-Database Sharding for startups
9 0.75661182 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy
10 0.75295734 143 high scalability-2007-11-06-Product: ChironFS
11 0.72222447 1500 high scalability-2013-08-12-100 Curse Free Lessons from Gordon Ramsay on Building Great Software
12 0.71703559 1032 high scalability-2011-05-02-Stack Overflow Makes Slow Pages 100x Faster by Simple SQL Tuning
13 0.71445549 118 high scalability-2007-10-09-High Load on production Webservers after Sourcecode sync
14 0.71048063 597 high scalability-2009-05-12-GemStone Unveils GemFire Enterprise 6.0
15 0.70644861 1239 high scalability-2012-05-04-Stuff The Internet Says On Scalability For May 4, 2012
16 0.70539278 1577 high scalability-2014-01-13-NYTimes Architecture: No Head, No Master, No Single Point of Failure
17 0.69954181 447 high scalability-2008-11-19-High Definition Video Delivery on the Web?
18 0.69598281 1024 high scalability-2011-04-15-Stuff The Internet Says On Scalability For April 15, 2011
19 0.68565488 1592 high scalability-2014-02-07-Stuff The Internet Says On Scalability For February 7th, 2014
20 0.6825726 638 high scalability-2009-06-26-PlentyOfFish Architecture