high_scalability high_scalability-2010 high_scalability-2010-848 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Royans Tharakan is blogging like a mad man at the Velocity Conference . Read a summary of many of the presentations on his blog . Zuckerberg almost guarantees 1 billion Facebook users . And I almost believe him. Northscale introduces Membase , a new distributed key-value NoSQL competitor featuring a memcache compatible interface, yet is persistent like a database. Hopefully we'll have more on their internals later. Notable Tweets: Aaron Cordova - scalability means "can change size" and also "works at large sizes" - this conflates two orthogonal features of cloud computing. Jaime Garcia Reinoso - I t's the scalability, stupid! Alex Averbuch - when I read/hear "unlimited/inifinite scalability" I stop reading/listening and start thinking about cake. Dennis Clark - I used to smirk at developers whose main DB experience was in MUMPS or Pick, until I realized those are old-school #NoSQL engines. Hypertable vs. HBase Performance Evaluation .
sentIndex sentText sentNum sentScore
1 Royans Tharakan is blogging like a mad man at the Velocity Conference . [sent-1, score-0.175]
2 Northscale introduces Membase , a new distributed key-value NoSQL competitor featuring a memcache compatible interface, yet is persistent like a database. [sent-5, score-0.186]
3 Notable Tweets: Aaron Cordova - scalability means "can change size" and also "works at large sizes" - this conflates two orthogonal features of cloud computing. [sent-7, score-0.094]
4 Dennis Clark - I used to smirk at developers whose main DB experience was in MUMPS or Pick, until I realized those are old-school #NoSQL engines. [sent-10, score-0.118]
5 Hypertable's benchmarking shows Hypertable has up to 10 times better performance than HBase on some tests and they feel they are fine tuned for cloud hosted web analytics workloads. [sent-13, score-0.386]
6 GigaOM is putting up articles and video from their cloud focussed Structure 2010 Conference . [sent-14, score-0.099]
7 You may find the Structure 2010: The Quest for Exascale Computing Power panel talk hosted by Joyent's Jason Hoffman especially interesting. [sent-15, score-0.091]
8 To build an exascale computer build today would you would use 4 gigawatts of power and 125 million cores! [sent-16, score-0.187]
9 To go larger you'll have to horizontally partition across multiple databases and Wayne shows you how. [sent-19, score-0.124]
10 This is a subject he's knowledgeable and passionate about, so it's worth a look. [sent-21, score-0.099]
11 Theo is also putting on the Surge Conference , which is laser focussed on designing large scale systems. [sent-22, score-0.354]
12 A Chef recipe from ClusterChef that will help you create a scalable, efficient compute cluster in the cloud. [sent-23, score-0.081]
13 Eric Schadt has an informative video in Nature, Computational solutions to large-scale data management , on how to tame the avalanche of scientific data using High Performance Computing in the cloud. [sent-26, score-0.213]
14 A blast from the past with an overview of eToys. [sent-27, score-0.09]
15 Replace Perl with your favorite dynamic language of choice and this approach would still feel very comfortable today. [sent-30, score-0.09]
16 Rule one about any sort of benchmark on the Internet is it will be destroyed from every angle. [sent-33, score-0.193]
17 A useful benchmark has to include complex workloads so issues likes locks, memory management, latency, paging, fairness, drops, etc can be revealed, that's when a more complex infrastructure really shows its worth. [sent-35, score-0.232]
18 Abhishek Tiwari writes on how algorithms first popularized by Google in BigTable are now being used in large scale gene sequence analysis. [sent-37, score-0.196]
19 While a reverse migration back to MySQL from NoSQL doesn't seem to be imminent, we do see some early settlers going back to a more familiar home. [sent-38, score-0.269]
20 In this case Blue74 is going from MongoDB back to MySQL because of: Lack of Transactions, Missing Records, No Joins, Schemalessness, and Unstability. [sent-39, score-0.079]
wordName wordTfidf (topN-words)
[('wayne', 0.236), ('exascale', 0.187), ('haskell', 0.187), ('hypertable', 0.17), ('focussed', 0.159), ('azure', 0.125), ('shows', 0.124), ('erlang', 0.122), ('mumps', 0.118), ('smirk', 0.118), ('thatwill', 0.118), ('huffington', 0.118), ('avalanche', 0.111), ('settlers', 0.111), ('hbase', 0.111), ('benchmark', 0.108), ('popularized', 0.106), ('schemalessness', 0.106), ('administratorget', 0.106), ('gb', 0.103), ('tame', 0.102), ('tharakan', 0.099), ('knowledgeable', 0.099), ('putting', 0.099), ('nosql', 0.096), ('featuring', 0.096), ('hoffman', 0.096), ('laser', 0.096), ('fairness', 0.096), ('orthogonal', 0.094), ('aaron', 0.092), ('hosted', 0.091), ('blast', 0.09), ('mad', 0.09), ('competitor', 0.09), ('gene', 0.09), ('toadvertisea', 0.09), ('usfor', 0.09), ('feel', 0.09), ('royans', 0.088), ('blogging', 0.085), ('destroyed', 0.085), ('jason', 0.084), ('paging', 0.084), ('benchmarking', 0.081), ('recipe', 0.081), ('internals', 0.08), ('back', 0.079), ('quest', 0.079), ('pleasecontact', 0.079)]
simIndex simValue blogId blogTitle
same-blog 1 1.0000002 848 high scalability-2010-06-25-Hot Scalability Links for June 25, 2010
Introduction: Royans Tharakan is blogging like a mad man at the Velocity Conference . Read a summary of many of the presentations on his blog . Zuckerberg almost guarantees 1 billion Facebook users . And I almost believe him. Northscale introduces Membase , a new distributed key-value NoSQL competitor featuring a memcache compatible interface, yet is persistent like a database. Hopefully we'll have more on their internals later. Notable Tweets: Aaron Cordova - scalability means "can change size" and also "works at large sizes" - this conflates two orthogonal features of cloud computing. Jaime Garcia Reinoso - I t's the scalability, stupid! Alex Averbuch - when I read/hear "unlimited/inifinite scalability" I stop reading/listening and start thinking about cake. Dennis Clark - I used to smirk at developers whose main DB experience was in MUMPS or Pick, until I realized those are old-school #NoSQL engines. Hypertable vs. HBase Performance Evaluation .
Introduction: This is a guest post by Doug Judd , original creator of Hypertable and the CEO of Hypertable, Inc. Hypertable delivers 2X better throughput in most tests -- HBase fails 41 and 167 billion record insert tests, overwhelmed by garbage collection -- Both systems deliver similar results for random read uniform test We recently conducted a test comparing the performance of Hypertable ( @hypertable ) version 0.9.5.5 to that of HBase ( @HBase ) version 0.90.4 (CDH3u2) running Zookeeper 3.3.4. In this post, we summarize the results and offer explanations for the discrepancies. For the full test report, see Hypertable vs. HBase II . Introduction Hypertable and HBase are both open source, scalable databases modeled after Google's proprietary Bigtable database. The primary difference between the two systems is that Hypertable is written in C++, while HBase is written in Java. We modeled this test after the one described in section 7 of the Bigtable paper and tuned both systems fo
3 0.15946272 935 high scalability-2010-11-05-Hot Scalability Links For November 5th, 2010
Introduction: So much good stuff this week... Adrian Cockcroft Compares NoSQL Availability Models . Let's risk feeding the CAP trolls, and try to get some insight into the differences between the many NoSQL contenders . Adrian asks how each NoSQL product will add a movie to its favorites list, read it back, and how this works across availability zones. Much trickier than it sounds with multiple writers. Cassandra and MongoDB answer back. Stuff the Internet Says: @jerng : Reading up on scalability. WHY THE HELL FOR? Because I want to know the future. @freerangedata : The #nosql options are the micro brews/craft beers of data stores. So many good ones, so little time to try them all. @edward_ribeiro : Soon, Darwinism will start to play its role on #NoSQL systems. You know, only the fittest will survive. @connectionreq : I'm always wowed when I hear how Facebook abuses their MySQL databases in crazy ways @louismrose : This is the kind of scalability we should be wo
4 0.13732743 647 high scalability-2009-07-02-Hypertable is a New BigTable Clone that Runs on HDFS or KFS
Introduction: Update 3 : Presentation from the NoSQL conference : slides , video 1 , video 2 . Update 2 : The folks at Hypertable would like you to know that Hypertable is now officially sponsored by Baidu , China’s Leading Search Engine. As a sponsor of Hypertable, Baidu has committed an industrious team of engineers, numerous servers, and support resources to improve the quality and development of the open source technology. Update : InfoQ interview on Hypertable Lead Discusses Hadoop and Distributed Databases . Hypertable differs from HBase in that it is a higher performance implementation of Bigtable. Skrentablog gives the heads up on Hypertable , Zvents' open-source BigTable clone. It's written in C++ and can run on top of either HDFS or KFS. Performance looks encouraging at 28M rows of data inserted at a per-node write rate of 7mb/sec .
5 0.135216 1115 high scalability-2011-09-14-Big List of Scalabilty Conferences
Introduction: Which of the many conferences should you attend? I get this question a lot, so I compiled a list. The list isn't life, it's not a top 10, and it won't say if a conference is naughty or nice, but they are conferences I know about, have attended, or referenced in an article. By no means is this list exhaustive. If you know of a conference people should consider attending, please add them in the comments. If you have an opinion about a particular conference, please comment on that too. Some have a low opinion of conferences. I'm not one of them. Sure, some conferences can be a waste of time, and those should fade away. And sure, we see a lot more conference-as-monetization strategy these days. But it's conferences that help motivate people to do their best work. Would we see even half the slide decks, papers, or talks describing how people do what they do without conferences as motivation? No we would not. Taking all that wisdom and stuffing it into a presentations is surprisingly hard
6 0.13410082 954 high scalability-2010-12-06-What the heck are you actually using NoSQL for?
7 0.1284214 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
8 0.11776263 811 high scalability-2010-04-16-Hot Scalability Links for April 16, 2010
9 0.11393942 195 high scalability-2007-12-28-Amazon's EC2: Pay as You Grow Could Cut Your Costs in Half
10 0.11108536 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
11 0.1109881 517 high scalability-2009-02-21-Google AppEngine - A Second Look
12 0.11056703 851 high scalability-2010-07-02-Hot Scalability Links for July 2, 2010
13 0.1103373 1089 high scalability-2011-07-29-Stuff The Internet Says On Scalability For July 29, 2011
14 0.11022426 802 high scalability-2010-04-01-Hot Scalability Links for April 1, 2010
15 0.10928318 943 high scalability-2010-11-16-Facebook's New Real-time Messaging System: HBase to Store 135+ Billion Messages a Month
16 0.10771617 931 high scalability-2010-10-28-Notes from A NOSQL Evening in Palo Alto
17 0.10733907 399 high scalability-2008-10-01-Joyent - Cloud Computing Built on Accelerators
18 0.10692946 770 high scalability-2010-02-03-NoSQL Means Never Having to Store Blobs Again
20 0.10628431 750 high scalability-2009-12-16-Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud
topicId topicWeight
[(0, 0.214), (1, 0.067), (2, 0.023), (3, 0.062), (4, 0.033), (5, 0.057), (6, -0.095), (7, -0.028), (8, 0.035), (9, 0.015), (10, 0.0), (11, -0.059), (12, 0.039), (13, 0.025), (14, -0.009), (15, -0.029), (16, 0.016), (17, -0.005), (18, -0.052), (19, -0.026), (20, 0.014), (21, 0.028), (22, -0.02), (23, -0.019), (24, 0.086), (25, -0.035), (26, 0.056), (27, 0.033), (28, 0.004), (29, 0.029), (30, 0.079), (31, -0.005), (32, 0.03), (33, -0.003), (34, 0.036), (35, 0.036), (36, -0.035), (37, 0.008), (38, -0.003), (39, 0.032), (40, -0.005), (41, -0.01), (42, 0.053), (43, -0.025), (44, -0.052), (45, 0.016), (46, -0.011), (47, 0.055), (48, 0.057), (49, 0.043)]
simIndex simValue blogId blogTitle
same-blog 1 0.95897919 848 high scalability-2010-06-25-Hot Scalability Links for June 25, 2010
Introduction: Royans Tharakan is blogging like a mad man at the Velocity Conference . Read a summary of many of the presentations on his blog . Zuckerberg almost guarantees 1 billion Facebook users . And I almost believe him. Northscale introduces Membase , a new distributed key-value NoSQL competitor featuring a memcache compatible interface, yet is persistent like a database. Hopefully we'll have more on their internals later. Notable Tweets: Aaron Cordova - scalability means "can change size" and also "works at large sizes" - this conflates two orthogonal features of cloud computing. Jaime Garcia Reinoso - I t's the scalability, stupid! Alex Averbuch - when I read/hear "unlimited/inifinite scalability" I stop reading/listening and start thinking about cake. Dennis Clark - I used to smirk at developers whose main DB experience was in MUMPS or Pick, until I realized those are old-school #NoSQL engines. Hypertable vs. HBase Performance Evaluation .
2 0.80255693 935 high scalability-2010-11-05-Hot Scalability Links For November 5th, 2010
Introduction: So much good stuff this week... Adrian Cockcroft Compares NoSQL Availability Models . Let's risk feeding the CAP trolls, and try to get some insight into the differences between the many NoSQL contenders . Adrian asks how each NoSQL product will add a movie to its favorites list, read it back, and how this works across availability zones. Much trickier than it sounds with multiple writers. Cassandra and MongoDB answer back. Stuff the Internet Says: @jerng : Reading up on scalability. WHY THE HELL FOR? Because I want to know the future. @freerangedata : The #nosql options are the micro brews/craft beers of data stores. So many good ones, so little time to try them all. @edward_ribeiro : Soon, Darwinism will start to play its role on #NoSQL systems. You know, only the fittest will survive. @connectionreq : I'm always wowed when I hear how Facebook abuses their MySQL databases in crazy ways @louismrose : This is the kind of scalability we should be wo
3 0.78223246 940 high scalability-2010-11-12-Stuff the Internet Says on Scalability For November 12th, 2010
Introduction: Google – A Study In Scalability And A Little Systems Horse Sense . A nice summary by Krishna Sankar of a version of Jeff Dean's classic talk on Google Scalability given to Stanford's EE380 class . Quotable Quotes: @jkalucki : Getting just 100 servers to work together for the first time is so ridiculously complicated. Horizontal scaling doesn't scale. @simeons : Yahoo's scalability is drivem by lots of asynchronous processing. "You learn to love it." -- @rstata Yahoo's CTO The Economics of the Cloud: Dissecting a Must-Read White Paper by Bernard Golden. I love the depiction of the unseen and unfelt forces that nevertheless organize everything around them: After a brief introduction, the authors lay out a central thesis: despite initial concerns about shortcomings in new technology offerings, "historically, underlying economics have a much stronger impact on the direction and speed of disruptions, as technological challenges are resolved or overcome thro
4 0.78047055 1024 high scalability-2011-04-15-Stuff The Internet Says On Scalability For April 15, 2011
Introduction: Submitted for your reading pleasure... Luxury is an ancient notion. There was once a Chinese mandarin who had himself wakened three times every morning simply for the pleasure of being told it was not yet time to get up . ~Argosy We have a Qutoable Quote machine for you today: @kevinweil : Twitter monthly signups have increased more than 50% since December, and we're now doing well over 150 million Tweets per day. @ChrisShain : Prediction: Black art of query optimization will become black art of #nosql data modeling, for same reasons. Minimize IOs, query time. @ui_matters : Infrastructure as a Service = no hardware headaches. Platform as a Svc = no scalability headaches. SaaS = common dev platform #amchamtech @plcstpierre : Thinking about high scalability stuff... I never thought database stuff can be interesting... @webdz9r : mass scalability for dynamic web content. What took us 8 machines, now take us 1 web and 1 app. @joelvarty : CDN is always an aft
5 0.77772886 1019 high scalability-2011-04-08-Stuff The Internet Says On Scalability For April 8, 2011
Introduction: Submitted for your reading pleasure on this tomato killing frosty morn... Now we really know why vampires feed on blood...they are elastically acquiring more compute power. Your Next Computer May Be Made of...Blood! It's those memristors again. Ancient vamps are really just giant super computers. Twitter now at 155 million tweets a day , up from 55 million a year ago. 10,000-core Linux supercomputer built in Amazon cloud By Jon Brodkin. T he 10,000 cores were composed of 1,250 instances with eight cores each, as well as 8.75TB of RAM and 2PB disk space. The cluster ran for eight hours at a cost of $8,500 . Quotable Quotes for $273 Alex: @davidklemke : Holy balls Windows Azure Tables is awesome. Man am I regretting not getting into this cloud stuff sooner, it's scalability heaven. @nik : The volume of tweets we are flowing into HBase is truly staggering #bigdata #datasift @wattersjames : One of the key points I mentioned before: Scalability is being abl
6 0.76169622 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
7 0.75093353 1089 high scalability-2011-07-29-Stuff The Internet Says On Scalability For July 29, 2011
8 0.74941283 893 high scalability-2010-09-03-Hot Scalability Links For Sep 3, 2010
9 0.74785233 1007 high scalability-2011-03-18-Stuff The Internet Says On Scalability For March 18, 2011
10 0.74172133 916 high scalability-2010-10-07-Hot Scalability Links For Oct 8, 2010
11 0.73812276 1045 high scalability-2011-05-20-Stuff The Internet Says On Scalability For May 20, 2011
12 0.73498797 851 high scalability-2010-07-02-Hot Scalability Links for July 2, 2010
13 0.73229009 1015 high scalability-2011-04-01-Stuff The Internet Says On Scalability For April 1, 2011
14 0.73217702 1129 high scalability-2011-09-30-Stuff The Internet Says On Scalability For September 30, 2011
15 0.72890288 1147 high scalability-2011-11-25-Stuff The Internet Says On Scalability For November 25, 2011
16 0.72589028 1411 high scalability-2013-02-22-Stuff The Internet Says On Scalability For February 22, 2013
17 0.72568268 949 high scalability-2010-11-29-Stuff the Internet Says on Scalability For November 29th, 2010
18 0.72524953 1203 high scalability-2012-03-02-Stuff The Internet Says On Scalability For March 2, 2012
19 0.72505701 1067 high scalability-2011-06-24-Stuff The Internet Says On Scalability For June 24, 2011
20 0.72330576 1036 high scalability-2011-05-06-Stuff The Internet Says On Scalability For May 6th, 2011
topicId topicWeight
[(1, 0.159), (2, 0.15), (10, 0.05), (30, 0.032), (40, 0.285), (56, 0.05), (61, 0.093), (79, 0.09), (85, 0.016)]
simIndex simValue blogId blogTitle
1 0.94605881 860 high scalability-2010-07-17-Hot Scalability Links for July 17, 2010
Introduction: And by hot I also mean temperature. Summer has arrived. It's sizzling here in Silicon Valley. Thank you air conditioning! Scale the web by appointing a Crawler Czar? Tom Foremski has the idea that Google should open up their index so sites wouldn't have to endure the constant pounding by ravenous crawler bots. Don MacAskill of SmugMug estimates 50% of our web server CPU resources are spent serving crawlers. What a waste. How this would all work with real-time feeds, paid feeds (Twitter, movies, ...), etc. is unknown, but does it make sense for all that money to be spent on extracting the same data over and over again? Tweets of Gold: jamesurquhart : Key to applications is architecture. Key for infrastructure supporting archs is configurability. Configurability==features . tjake : People who choose their datastore based oh hearsay and not their own evaluation are doomed . b6n : No global lock ever goes unpunished. MichaelSurt
2 0.9344998 1466 high scalability-2013-05-29-Amazon: Creating a Customer Utopia One Culture Hack at a Time
Introduction: If you don’t cannibalize yourself, someone else will. -- Steve Jobs America as the New World has a long history of inspiring Utopian communities . Early experiments were famously religious. But there have been many others as new waves of thought have inspired people to organize and try something different. In the 1840s Transcendentalists, believing the true path lay in the perfection of the individual, created intentional communities like Brook Farm . We've also seen socialist , anarchist , hippy , and virtually every other kind of Utopia in-between. Psychologist B.F. Skinner wrote an infamous book, Walden Two , with a more "scientific" take on creating a Utopian community and Ayn Rand in Atlas Shrugged had her free market version. I believe in startup organizations we see the modern version of Utopian energy in action. We now call it by names like "culture hacking", but the goal is the same: to create a new form of human community to achieve a profound go
3 0.93076485 27 high scalability-2007-07-25-Product: 3 PAR REMOTE COPY
Introduction: 3PAR Remote Copy is a uniquely simple and efficient replication technology that allows customers to protect and share any application data affordably. Built upon 3PAR Thin Copy technology, Remote Copy lowers the total cost of storage by addressing the cost and complexity of remote replication. Common Uses of 3PAR Remote Copy: Affordable Disaster Recovery: Mirror data cost-effectively across town or across the world. Centralized Archive: Replicate data from multiple 3PAR InServs located in multiple data centers to a centralized data archive location. Resilient Pod Architecture: Mutually replicate tier 1 or 2 data to tier 3 capacity between two InServs (application pods). Remote Data Access: Replicate data to a remote location for sharing of data with remote users.
4 0.92266566 1419 high scalability-2013-03-07-It's a VM Wasteland - A Near Optimal Packing of VMs to Machines Reduces TCO by 22%
Introduction: In Algorithm Design for Performance Aware VM Consolidation we learn some shocking facts (gambling in Casablanca?): Average server utilization in many data centers is low, estimated between 5% and 15%. This is wasteful because an idle server often consumes more than 50% of peak power. Surely that's just for old style datacenters? Nope. In Google data centers, workloads that are consolidated use only 50% of the processor cores. Every other processor core is left unused simply to ensure that performance does not degrade. It's a VM wasteland. The goal is to reduce waste by packing VMs onto machines without hurting performance or wasting resources. The idea is to select VMs that interfere the least with each other and places them together on the same server. It's a NP-Complete problem, but this paper describes a practical method that performs provably close to the optimal. Interestingly they can optimize for performance or power efficiency, so you can use different algorithm
5 0.9066959 338 high scalability-2008-06-02-Total Cost of Ownership for different web development frameworks
Introduction: I would like to compile a comparison matrix on the total cost of ownership for .Net, Java, Lamp & Rails. Where should I start? Has anyone seen or know of a recent study on this subject?
same-blog 6 0.904037 848 high scalability-2010-06-25-Hot Scalability Links for June 25, 2010
8 0.89716256 778 high scalability-2010-02-15-The Amazing Collective Compute Power of the Ambient Cloud
9 0.88776588 1414 high scalability-2013-03-01-Stuff The Internet Says On Scalability For February 29, 2013
10 0.88109112 402 high scalability-2008-10-05-Paper: Scalability Design Patterns
11 0.87737161 330 high scalability-2008-05-27-Should Twitter be an All-You-Can-Eat Buffet or a Vending Machine?
12 0.86764258 300 high scalability-2008-04-07-Scalr - Open Source Auto-scaling Hosting on Amazon EC2
14 0.84989381 1471 high scalability-2013-06-06-Paper: Memory Barriers: a Hardware View for Software Hackers
15 0.84190553 757 high scalability-2010-01-04-11 Strategies to Rock Your Startup’s Scalability in 2010
16 0.83700985 768 high scalability-2010-02-01-What Will Kill the Cloud?
17 0.82483017 879 high scalability-2010-08-12-Think of Latency as a Pseudo-permanent Network Partition
18 0.79593509 1492 high scalability-2013-07-17-How do you create a 100th Monkey software development culture?
19 0.75550222 97 high scalability-2007-09-18-Session management in highly scalable web sites
20 0.75211138 1515 high scalability-2013-09-11-Ten Lessons from GitHub’s First Year in 2008