high_scalability high_scalability-2008 high_scalability-2008-323 knowledge-graph by maker-knowledge-mining

323 high scalability-2008-05-19-Twitter as a scalability case study


meta infos for this blog

Source: html

Introduction: A lot has been said already about Twitter's scalability issues. Many have given Twitter as an anti-pattern of how not to deal with scalability and have suggested different solutions for scaling it. As Twitter is famously a Ruby-on-Rails deployment, this case has also been used as a weapon in the language/platform wars between the RoR and Java camps, and to a lesser degree, also with the LAMP (PHP) camp


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 A lot has been said already about Twitter's scalability issues. [sent-1, score-0.416]

2 Many have given Twitter as an anti-pattern of how not to deal with scalability and have suggested different solutions for scaling it. [sent-2, score-0.831]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('twitter', 0.35), ('weapon', 0.326), ('camps', 0.316), ('camp', 0.307), ('famously', 0.307), ('lesser', 0.287), ('ror', 0.257), ('suggested', 0.245), ('wars', 0.243), ('degree', 0.193), ('said', 0.145), ('lamp', 0.14), ('deployment', 0.129), ('php', 0.127), ('deal', 0.126), ('scalability', 0.118), ('solutions', 0.109), ('given', 0.109), ('already', 0.096), ('java', 0.095), ('case', 0.094), ('also', 0.089), ('scaling', 0.069), ('lot', 0.057), ('different', 0.055), ('used', 0.053), ('many', 0.048)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 323 high scalability-2008-05-19-Twitter as a scalability case study

Introduction: A lot has been said already about Twitter's scalability issues. Many have given Twitter as an anti-pattern of how not to deal with scalability and have suggested different solutions for scaling it. As Twitter is famously a Ruby-on-Rails deployment, this case has also been used as a weapon in the language/platform wars between the RoR and Java camps, and to a lesser degree, also with the LAMP (PHP) camp

2 0.23684791 837 high scalability-2010-06-07-Six Ways Twitter May Reach its Big Hairy Audacious Goal of One Billion Users

Introduction: Twitter has a big hairy audacious goal of reaching one billion users  by 2013. Three forces stand against Twitter. The world will end in 2012 . But let's be optimistic and assume we'll make it. Next is Facebook. Currently Facebook is the user leader with over  400 million users . Will Facebook stumble or will they rocket to one billion users before Twitter? And lastly, there's Twitter's "low" starting point and "slow" growth rate. Twitter currently has 106 million registered users and adds about 300,000 new users a day. That doesn't add up to a billion in three years. Twitter needs to triple the number of registered users they add per day. How will Twitter reach its goal of over one billion users served? From recent infrastructure announcements and information gleaned at Chirp  ( videos ) and other talks, it has become a little clearer how they hope to reach their billion user goal: 1) Make a Big Hairy Audacious Goal 2) Hire Lots of Quality People 3) Hug Developers and Users 4) D

3 0.1552261 639 high scalability-2009-06-27-Scaling Twitter: Making Twitter 10000 Percent Faster

Introduction: Update 6: Some interesting changes from Twitter's Evan Weaver : everything in RAM now, database is a backup; peaks at 300 tweets/second; every tweet followed by average 126 people; vector cache of tweet IDs; row cache; fragment cache; page cache; keep separate caches; GC makes Ruby optimization resistant so went with Scala; Thrift and HTTP are used internally; 100s internal requests for every external request; rewrote MQ but kept interface the same; 3 queues are used to load balance requests; extensive A/B testing for backwards capability; switched to C memcached client for speed; optimize critical path; faster to get the cached results from the network memory than recompute them locally. Update 5: Twitter on Scala . A Conversation with Steve Jenson, Alex Payne, and Robey Pointer by Bill Venners. A fascinating discussion of why Twitter moved to the Java JVM for their server infrastructure (long lived processes) and why they moved to Scala to program against it (high level langu

4 0.14690745 568 high scalability-2009-04-14-Designing a Scalable Twitter

Introduction: There were many talks recently about twitter scalability and their specific choice of language such as Scala to address their existing Ruby based scalability. In this post i tried to provide a more methodical approach for handling twitter scalability challenges that is centered around the right choice of architecture patterns rather then the language itself. The architecture pattern are given in a generic fashion that is not specific to twitter itself and can serve anyone who is looking to build a scalable real time web application in the near future.

5 0.13081683 855 high scalability-2010-07-11-So, Why is Twitter Really Not Using Cassandra to Store Tweets?

Introduction: A firestorm of accusations circled around recently saying that Cassandra, the elected-by-major-adopters emperor of the NoSQL movement, has no clothes. It was said Twitter was dumping Cassandra;  Reddit outages  were linked to Cassandra; and even Facebook, Cassandra's cradle of birth, was said to have abandoned Cassandra. Shouts of NoSQL Fail! were heard in the streets. Much gloating followed. Is the emperor really naked? Casually dressed maybe, but not naked. (Note: after this point the article contains a flow chart that is NSFW. Some people are very sensitive about cussing, so if that's you, please go back, don't read on. Danger! There are no nude pictures or anything, just some strong language. But this is my most favorite flow chart of all time, so it's worth it :-) Is Twitter really abandoning Cassandra? Not according to Twitter, which came out with a post, Cassandra at Twitter Today , explaining that they are using Cassandra in production for geolocation and analytics. T

6 0.12037378 553 high scalability-2009-04-03-Collectl interface to Ganglia - any interest?

7 0.11430851 544 high scalability-2009-03-18-QCon London 2009: Upgrading Twitter without service disruptions

8 0.093912952 780 high scalability-2010-02-19-Twitter’s Plan to Analyze 100 Billion Tweets

9 0.089396708 1491 high scalability-2013-07-15-Ask HS: What's Wrong with Twitter, Why Isn't One Machine Enough?

10 0.08833269 207 high scalability-2008-01-10-Sharding with Cookie-Based Session Storage

11 0.086357266 872 high scalability-2010-08-05-Pairing NoSQL and Relational Data Storage: MySQL with MongoDB

12 0.084341824 1159 high scalability-2011-12-19-How Twitter Stores 250 Million Tweets a Day Using MySQL

13 0.082003079 1251 high scalability-2012-05-24-Build your own twitter like real time analytics - a step by step guide

14 0.078389429 1488 high scalability-2013-07-08-The Architecture Twitter Uses to Deal with 150M Active Users, 300K QPS, a 22 MB-S Firehose, and Send Tweets in Under 5 Seconds

15 0.076410308 783 high scalability-2010-02-24-Hot Scalability Links for February 24, 2010

16 0.070707783 1626 high scalability-2014-04-04-Stuff The Internet Says On Scalability For April 4th, 2014

17 0.070514657 1375 high scalability-2012-12-21-Stuff The Internet Says On Scalability For December 21, 2012

18 0.066527136 166 high scalability-2007-11-27-Solving the Client Side API Scalability Problem with a Little Game Theory

19 0.066255957 459 high scalability-2008-12-03-Java World Interview on Scalability and Other Java Scalability Secrets

20 0.065016583 554 high scalability-2009-04-04-Digg Architecture


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.074), (1, 0.02), (2, -0.008), (3, 0.005), (4, 0.062), (5, -0.019), (6, -0.072), (7, 0.022), (8, 0.012), (9, 0.023), (10, -0.006), (11, 0.064), (12, 0.041), (13, 0.013), (14, -0.038), (15, -0.039), (16, 0.054), (17, -0.049), (18, -0.03), (19, -0.001), (20, -0.035), (21, -0.033), (22, -0.003), (23, 0.007), (24, -0.013), (25, 0.002), (26, 0.037), (27, -0.094), (28, -0.003), (29, 0.034), (30, 0.055), (31, -0.069), (32, -0.084), (33, 0.043), (34, -0.114), (35, -0.011), (36, -0.037), (37, -0.005), (38, -0.142), (39, 0.009), (40, 0.053), (41, -0.03), (42, -0.034), (43, -0.004), (44, -0.006), (45, -0.038), (46, 0.028), (47, 0.011), (48, 0.06), (49, 0.008)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.98218608 323 high scalability-2008-05-19-Twitter as a scalability case study

Introduction: A lot has been said already about Twitter's scalability issues. Many have given Twitter as an anti-pattern of how not to deal with scalability and have suggested different solutions for scaling it. As Twitter is famously a Ruby-on-Rails deployment, this case has also been used as a weapon in the language/platform wars between the RoR and Java camps, and to a lesser degree, also with the LAMP (PHP) camp

2 0.85761636 568 high scalability-2009-04-14-Designing a Scalable Twitter

Introduction: There were many talks recently about twitter scalability and their specific choice of language such as Scala to address their existing Ruby based scalability. In this post i tried to provide a more methodical approach for handling twitter scalability challenges that is centered around the right choice of architecture patterns rather then the language itself. The architecture pattern are given in a generic fashion that is not specific to twitter itself and can serve anyone who is looking to build a scalable real time web application in the near future.

3 0.7809844 544 high scalability-2009-03-18-QCon London 2009: Upgrading Twitter without service disruptions

Introduction: Evan Weaver from Twitter presented a talk on Twitter software upgrades, titled Improving running components as part of the Systems that never stop track at QCon London 2009 conference last Friday. The talk focused on several upgrades performed since last May, while Twitter was experiencing serious performance problems.

4 0.76686013 556 high scalability-2009-04-05-At Some Point the Cost of Servers Outweighs the Cost of Programmers

Introduction: This is the intriguing quote by Bill Venners in an interview with Twitter's Alex Payne on Twitter's heretical switch from a pure Ruby stack to a Ruby on Rails stack on the front-end and JVM/Scala on the back-end: So performance was also one of the problems with JRuby, which I [Bill Venners] think helps explain better why they'd [Twitter] prefer Scala over Ruby or JRuby for some things. I have often heard Rubyists say that although Ruby is slower than Java, for many things it is plenty fast enough, and they are right. The logic goes further, saying that servers are cheap, and programmers expensive, so it makes sense to tradeoff some runtime performance for programmer productivity. And I think that's very often true too, but not always. If you have enough traffic, at some point the cost of servers outweighs the cost of programmers . I'm not sure whether Twitter is past that point, but they get a lot of traffic. And frankly this isn't an intrinsic tradeoff. Other dynamic languages

5 0.76129919 837 high scalability-2010-06-07-Six Ways Twitter May Reach its Big Hairy Audacious Goal of One Billion Users

Introduction: Twitter has a big hairy audacious goal of reaching one billion users  by 2013. Three forces stand against Twitter. The world will end in 2012 . But let's be optimistic and assume we'll make it. Next is Facebook. Currently Facebook is the user leader with over  400 million users . Will Facebook stumble or will they rocket to one billion users before Twitter? And lastly, there's Twitter's "low" starting point and "slow" growth rate. Twitter currently has 106 million registered users and adds about 300,000 new users a day. That doesn't add up to a billion in three years. Twitter needs to triple the number of registered users they add per day. How will Twitter reach its goal of over one billion users served? From recent infrastructure announcements and information gleaned at Chirp  ( videos ) and other talks, it has become a little clearer how they hope to reach their billion user goal: 1) Make a Big Hairy Audacious Goal 2) Hire Lots of Quality People 3) Hug Developers and Users 4) D

6 0.74895841 1491 high scalability-2013-07-15-Ask HS: What's Wrong with Twitter, Why Isn't One Machine Enough?

7 0.72570908 574 high scalability-2009-04-20-Some things about Memcached from a Twitter software developer

8 0.68478507 639 high scalability-2009-06-27-Scaling Twitter: Making Twitter 10000 Percent Faster

9 0.68164951 855 high scalability-2010-07-11-So, Why is Twitter Really Not Using Cassandra to Store Tweets?

10 0.66224861 1488 high scalability-2013-07-08-The Architecture Twitter Uses to Deal with 150M Active Users, 300K QPS, a 22 MB-S Firehose, and Send Tweets in Under 5 Seconds

11 0.63041186 1251 high scalability-2012-05-24-Build your own twitter like real time analytics - a step by step guide

12 0.60941595 970 high scalability-2011-01-06-BankSimple Mini-Architecture - Using a Next Generation Toolchain

13 0.58901191 1159 high scalability-2011-12-19-How Twitter Stores 250 Million Tweets a Day Using MySQL

14 0.52941835 402 high scalability-2008-10-05-Paper: Scalability Design Patterns

15 0.52513647 780 high scalability-2010-02-19-Twitter’s Plan to Analyze 100 Billion Tweets

16 0.50719267 166 high scalability-2007-11-27-Solving the Client Side API Scalability Problem with a Little Game Theory

17 0.50102907 1004 high scalability-2011-03-14-Twitter by the Numbers - 460,000 New Accounts and 140 Million Tweets Per Day

18 0.49843219 363 high scalability-2008-08-12-Strategy: Limit The New, Not The Old

19 0.4983843 1551 high scalability-2013-11-20-How Twitter Improved JVM Performance by Reducing GC and Faster Memory Allocation

20 0.49618313 783 high scalability-2010-02-24-Hot Scalability Links for February 24, 2010


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(79, 0.82)]

similar blogs list:

simIndex simValue blogId blogTitle

1 1.0 692 high scalability-2009-09-01-Cheap storage: how backblaze takes matters in hand

Introduction: Blackblaze blogs about how they built their own storage infrastructure on the cheap to run their cloud backup service. This episode: the hardware. Sorry, just a link this time.

2 1.0 1119 high scalability-2011-09-20-HighScalability is old news. Step your scaling game way up... (NSFW cartoon)

Introduction: Jeremy Raines tweeted a link to this cartoon  my new filing technique is unstoppable , showing how scotch tape can be used to create a new super-database. Very funny in a Dilbert sort of way, but definitely not NSFW...   For more on Twisted Tuesday, you may enjoy: Hilarious Video: Relational Database Vs NoSQL Fanbois NSFW: Hilarious Fault-Tolerance Cartoon

3 0.99006563 8 high scalability-2007-07-12-Should I use LAMP or Windows?

Introduction: Hi, I stumb l ed on your s i te and I am th i nking about start i ng a website. I haven't rece i ved a good answer about what I shou l d use to bui l d i t, so I thought I wou l d give it a shot. I am a w i ndows guy. I know .Net and ASP and how to bu i ld web s i tes using that stack. But I not i ce most sites use LAMP and that's what most people ta l k about using. What's wrong w i th using Windows? .Net Programmer

4 0.98740095 743 high scalability-2009-11-23-Big Data on Grids or on Clouds?

Introduction: Contributed by Wolfgang Gentzsch: Now that we have a new computing paradigm, Cloud Computing, how can Clouds help our data? Replace our internal data vaults as we hoped Grids would? Are Grids dead now that we have Clouds? Despite all the promising developments in the Grid and Cloud computing space, and the avalanche of publications and talks on this subject, many people still seem to be confused about internal data and compute resources, versus Grids versus Clouds, and they are hesitant to take the next step. I think there are a number of issues driving this uncertainty. read more at: BigDataMatters.com

5 0.97452021 782 high scalability-2010-02-23-When to migrate your database?

Introduction: Why migrate your database? Efficiency and availability problems are harming your business – reports are out of date, your batch processing window is nearing its limits, outages (unplanned/planned) frequently halt work. Database consolidation – remove the costs that result from a heterogeneous database environment (DBAs time, database vendor pricing, database versions, hardware, OSs, patches, upgrades etc.). OK, so the driving forces for migration are clear,  what now? Read more on BigDataMatters.com

6 0.97386891 443 high scalability-2008-11-14-Paper: Pig Latin: A Not-So-Foreign Language for Data Processing

7 0.96811318 401 high scalability-2008-10-04-Is MapReduce going mainstream?

8 0.95433074 372 high scalability-2008-08-27-Updating distributed web applications

9 0.94508535 107 high scalability-2007-10-02-Some Real Financial Numbers for Your Startup

10 0.93161786 1100 high scalability-2011-08-18-Paper: The Akamai Network - 61,000 servers, 1,000 networks, 70 countries

11 0.90565997 1169 high scalability-2012-01-05-Shutterfly Saw a Speedup of 500% With Flashcache

12 0.90407425 1277 high scalability-2012-07-05-10 Golden Principles For Building Successful Mobile-Web Applications

same-blog 13 0.87977022 323 high scalability-2008-05-19-Twitter as a scalability case study

14 0.87364477 784 high scalability-2010-02-25-Paper: High Performance Scalable Data Stores

15 0.84553868 75 high scalability-2007-08-28-Google Utilities : An online google guide,tools and Utilities.

16 0.81657487 871 high scalability-2010-08-04-Dremel: Interactive Analysis of Web-Scale Datasets - Data as a Programming Paradigm

17 0.8137182 1162 high scalability-2011-12-23-Funny: A Cautionary Tale About Storage and Backup

18 0.78057694 1403 high scalability-2013-02-08-Stuff The Internet Says On Scalability For February 8, 2013

19 0.77858764 1181 high scalability-2012-01-25-Google Goes MoreSQL with Tenzing - SQL Over MapReduce

20 0.77651042 680 high scalability-2009-08-13-Reconnoiter - Large-Scale Trending and Fault-Detection