Source: html
Introduction: The questions were extracted from: http://highscalability.com/plentyoffish-architecture#comment-126 For a startup like Markus's, what is the best hosting option (with room to grow later)? Host your own server or use an ISP co-location option? He still has to pay huge money for bandwidth with that payload, right?
sentIndex sentText sentNum sentScore
1 com/plentyoffish-architecture#comment-126 For a startup like Markus's, what is the best hosting option (with room to grow later)? [sent-2, score-1.061]
2 Host your own server or use an ISP co-location option? [sent-3, score-0.283]
3 He still has to pay huge money for bandwidth with that payload, right? [sent-4, score-0.682]
wordName wordTfidf (topN-words)
[('option', 0.392), ('payload', 0.387), ('markus', 0.378), ('extracted', 0.357), ('isp', 0.324), ('hosting', 0.184), ('startup', 0.183), ('host', 0.175), ('questions', 0.172), ('grow', 0.162), ('money', 0.158), ('pay', 0.157), ('later', 0.152), ('huge', 0.143), ('bandwidth', 0.134), ('http', 0.11), ('right', 0.108), ('best', 0.099), ('still', 0.09), ('architecture', 0.083), ('server', 0.065), ('use', 0.043), ('like', 0.041)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 67 high scalability-2007-08-17-What is the best hosting option?
Introduction: The questions were extracted from: http://highscalability.com/plentyoffish-architecture#comment-126 For a startup like Markus's, what is the best hosting option (with room to grow later)? Host your own server or use an ISP co-location option? He still has to pay huge money for bandwidth with that payload, right?
2 0.18149479 1164 high scalability-2011-12-27-PlentyOfFish Update - 6 Billion Pageviews and 32 Billion Images a Month
Introduction: Markus has a short update on their PlentyOfFish Architecture. Impressive November statistics: 6 billion pageviews served; 32 billion images served; 6 million logins in one day; IM servers handle about 30 billion pageviews; 11 webservers (5 of which could be dropped); hired first DBA in July. They currently have a handful of employees. All hosting/CDN costs combined are under $70k/month. Lesson: a small organization with a simple architecture on raw hardware is still plenty profitable for PlentyOfFish. Related Articles: On HackerNews; 32 Billion images a month by Markus Frind.
3 0.16354755 638 high scalability-2009-06-26-PlentyOfFish Architecture
Introduction: Update 5: PlentyOfFish Update - 6 Billion Pageviews And 32 Billion Images A Month. Update 4: Jeff Atwood costs out Markus' scale-up approach against a scale-out approach and finds scale-up wanting. The discussion in the comments is as interesting as the article. My guess is Markus doesn't want to rewrite his software to work across a scale-out cluster, so even if it's more expensive, scale-up works better for his needs. Update 3: POF now has 200 million images and serves 10,000 images per second. They'll be moving to a 250,000 IOPS RamSan to handle the load. They also upgraded to a core database machine with 512 GB of RAM, 32 CPUs, SQL Server 2008 and Windows 2008. Update 2: This seems to be a POF Peer1 love fest infomercial. It's pretty content free, but the production values are high. Lots of quirky sounds and fish swimming on the screen. Update: By Facebook standards, Read/WriteWeb says POF is worth a cool one billion dollars. It helps to talk like Dr. Evil whe
4 0.16174394 1361 high scalability-2012-11-22-Gone Fishin': PlentyOfFish Architecture
Introduction: Other than StackOverflow, PlentyOfFish is perhaps the most spectacular example of scale-up architectures working for what your average sane person would consider a large system. It doesn't hurt that it's also a sexy story. Update 5: PlentyOfFish Update - 6 Billion Pageviews And 32 Billion Images A Month. Update 4: Jeff Atwood costs out Markus' scale-up approach against a scale-out approach and finds scale-up wanting. The discussion in the comments is as interesting as the article. My guess is Markus doesn't want to rewrite his software to work across a scale-out cluster, so even if it's more expensive, scale-up works better for his needs. Update 3: POF now has 200 million images and serves 10,000 images per second. They'll be moving to a 250,000 IOPS RamSan to handle the load. They also upgraded to a core database machine with 512 GB of RAM, 32 CPUs, SQL Server 2008 and Windows 2008. Update 2: This seems to be a POF Peer1 love fest infomercial. It's pretty cont
5 0.10365692 442 high scalability-2008-11-13-Plenty of Fish Says Scaling for Free Doesn't Pay
Introduction: Plenty of Fish CEO Markus Frind, famous nerd hero for making over $10 million a year from Google ads on a free dating site he made and ran all by himself, now sees a problem with the free model: The problem with free is that every time you double the size of your database the cost of maintaining the site grows 6 fold. I really underestimated how much resources it would take, I have one database table now that exceeds 3 billion records. The bigger you get as a free site the less money you make per visit and the more it costs to service a visit... There is really no money in being free and we have to start experimenting with other models now or we won’t be able to compete in 3 or 4 years. As one commenter succinctly put it: the “golden time” of AdSense is over. Time to look at costs. The POF architecture is to run scarily huge tables on single machines. They also buy and maintain their own SAN. So it seems scaling up is what is increasing costs and decreasing profits. I wo
6 0.099648654 307 high scalability-2008-04-21-Using Google AppEngine for a Little Micro-Scalability
7 0.097379662 181 high scalability-2007-12-11-Hosting and CDN for startup video sharing site
8 0.094426185 841 high scalability-2010-06-14-How scalable could be a cPanel Hosting service?
9 0.083232202 200 high scalability-2008-01-02-WEB hosting Select
10 0.081430107 98 high scalability-2007-09-18-Sync data on all servers
11 0.074782535 228 high scalability-2008-01-28-Product: ISPMan Centralized ISP Management System
12 0.071034148 794 high scalability-2010-03-11-What would you like to ask Justin.tv?
13 0.067621544 632 high scalability-2009-06-15-starting small with growth in mind
14 0.063622713 386 high scalability-2008-09-22-Cloud computing, grid computing, utility computing - list of top providers
15 0.063583709 313 high scalability-2008-05-02-Friends for Sale Architecture - A 300 Million Page View-Month Facebook RoR App
16 0.063449204 853 high scalability-2010-07-08-Cloud AWS Infrastructure vs. Physical Infrastructure
17 0.063363232 1423 high scalability-2013-03-13-Iron.io Moved From Ruby to Go: 28 Servers Cut and Colossal Clusterf**ks Prevented
18 0.06157396 148 high scalability-2007-11-11-Linkedin architecture
19 0.060975321 38 high scalability-2007-07-30-Build an Infinitely Scalable Infrastructure for $100 Using Amazon Services
20 0.059885148 585 high scalability-2009-04-29-How to choice and build perfect server
topicId topicWeight
[(0, 0.073), (1, 0.021), (2, 0.005), (3, -0.039), (4, -0.034), (5, -0.066), (6, -0.018), (7, -0.014), (8, 0.037), (9, 0.013), (10, -0.003), (11, -0.038), (12, -0.023), (13, -0.004), (14, 0.039), (15, 0.018), (16, 0.001), (17, 0.039), (18, -0.005), (19, -0.001), (20, -0.028), (21, -0.008), (22, -0.034), (23, -0.065), (24, 0.033), (25, -0.055), (26, 0.032), (27, -0.013), (28, 0.011), (29, 0.004), (30, -0.022), (31, 0.028), (32, -0.005), (33, 0.003), (34, -0.004), (35, -0.02), (36, 0.022), (37, 0.076), (38, 0.028), (39, -0.009), (40, 0.012), (41, 0.039), (42, 0.014), (43, 0.02), (44, -0.012), (45, 0.009), (46, 0.008), (47, 0.012), (48, 0.019), (49, 0.025)]
simIndex simValue blogId blogTitle
same-blog 1 0.93907398 67 high scalability-2007-08-17-What is the best hosting option?
Introduction: The questions were extracted from: http://highscalability.com/plentyoffish-architecture#comment-126 For a startup like Markus's, what is the best hosting option (with room to grow later)? Host your own server or use an ISP co-location option? He still has to pay huge money for bandwidth with that payload, right?
2 0.64204603 1164 high scalability-2011-12-27-PlentyOfFish Update - 6 Billion Pageviews and 32 Billion Images a Month
Introduction: Markus has a short update on their PlentyOfFish Architecture. Impressive November statistics: 6 billion pageviews served; 32 billion images served; 6 million logins in one day; IM servers handle about 30 billion pageviews; 11 webservers (5 of which could be dropped); hired first DBA in July. They currently have a handful of employees. All hosting/CDN costs combined are under $70k/month. Lesson: a small organization with a simple architecture on raw hardware is still plenty profitable for PlentyOfFish. Related Articles: On HackerNews; 32 Billion images a month by Markus Frind.
3 0.62387687 794 high scalability-2010-03-11-What would you like to ask Justin.tv?
Introduction: It looks like I'll have the chance to interview someone tomorrow from Justin.tv about their architecture, which is pretty exciting given their leadership role in live broadcasting. They get 30 million uniques a month, can handle 1 million simultaneous broadcasts and hope to grow another magnitude in the near future. That must take some doing. Here's your opportunity, especially if you think my questions suck, to ask your own sucky questions :-) What would you like to know about Justin.tv?
4 0.606915 290 high scalability-2008-03-28-How to Get DNS Names of a Web Server
Introduction: For some special reason, I'm trying to make a web server able to get all the DNS names mapped to its IP. Let me explain more: I'm creating a website that will run in a web farm, and every web server in the farm will have some subdomains mapped to its IP. What I want is that whenever my application starts on a web server, it can get all the subdomains mapped/assigned to that server, e.g. sub1.mydomain.com, sub2.mydomain.com. I understand that I have to use reverse DNS lookup (i.e. give the IP, get the domain name), but I also want to get all the subdomains, not just the first one that maps to that IP. I've been reading about DNS on the internet but I can't seem to find any information on how to achieve what I want; normally you use DNS to get the IP of a domain, but I'm not sure that all servers enable reverse lookup. The problem is that I'm still not sure whether I'll host my own DNS server or use the services of some company (many companies offer DNS hosting services), so, my qu
5 0.60133237 1361 high scalability-2012-11-22-Gone Fishin': PlentyOfFish Architecture
Introduction: Other than StackOverflow, PlentyOfFish is perhaps the most spectacular example of scale-up architectures working for what your average sane person would consider a large system. It doesn't hurt that it's also a sexy story. Update 5: PlentyOfFish Update - 6 Billion Pageviews And 32 Billion Images A Month. Update 4: Jeff Atwood costs out Markus' scale-up approach against a scale-out approach and finds scale-up wanting. The discussion in the comments is as interesting as the article. My guess is Markus doesn't want to rewrite his software to work across a scale-out cluster, so even if it's more expensive, scale-up works better for his needs. Update 3: POF now has 200 million images and serves 10,000 images per second. They'll be moving to a 250,000 IOPS RamSan to handle the load. They also upgraded to a core database machine with 512 GB of RAM, 32 CPUs, SQL Server 2008 and Windows 2008. Update 2: This seems to be a POF Peer1 love fest infomercial. It's pretty cont
6 0.59996468 638 high scalability-2009-06-26-PlentyOfFish Architecture
7 0.59606719 551 high scalability-2009-03-30-Lavabit Architecture - Creating a Scalable Email Service
8 0.59582978 598 high scalability-2009-05-12-P2P server technology?
9 0.5951162 181 high scalability-2007-12-11-Hosting and CDN for startup video sharing site
10 0.59255475 619 high scalability-2009-06-05-HotPads Shows the True Cost of Hosting on Amazon
11 0.5842216 253 high scalability-2008-02-19-Building a email communication system
12 0.57873553 176 high scalability-2007-12-07-Synchronizing databases in different geographic locations
13 0.56649858 200 high scalability-2008-01-02-WEB hosting Select
14 0.56641465 277 high scalability-2008-03-16-Do you have any questions for the Elastra CEO?
15 0.56376672 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy
16 0.56253093 158 high scalability-2007-11-17-Can How Bees Solve their Load Balancing Problems Help Build More Scalable Websites?
17 0.56225109 1268 high scalability-2012-06-20-Ask HighScalability: How do I organize millions of images?
18 0.55733377 259 high scalability-2008-02-25-Any Suggestions for the Architecture Template?
19 0.55733377 260 high scalability-2008-02-25-Architecture Template Advice Needed
20 0.55413008 70 high scalability-2007-08-22-How many machines do you need to run your site?
topicId topicWeight
[(1, 0.139), (2, 0.196), (56, 0.377), (61, 0.1)]
simIndex simValue blogId blogTitle
1 0.96842694 1394 high scalability-2013-01-25-Stuff The Internet Says On Scalability For January 25, 2013
Introduction: Sorry, Stuff the Internet Says has been called on account of a power outage. Gods of rain and tree have interfered with thee. Instead, how about watching a little Python? (That's Monty, not the language.)
2 0.866436 732 high scalability-2009-10-29-Digg - Looking to the Future with Cassandra
Introduction: Digg has been researching ways to scale our database infrastructure for some time now. We’ve adopted a traditional vertically partitioned master-slave configuration with MySQL, and also investigated sharding MySQL with IDDB. Ultimately, these solutions left us wanting. In the case of the traditional architecture, the lack of redundancy on the write masters is painful, and both approaches have significant management overhead to keep running. Since it was already necessary to abandon data normalization and consistency to make these approaches work, we felt comfortable looking at more exotic, non-relational data stores. After considering HBase, Hypertable, Cassandra, Tokyo Cabinet/Tyrant, Voldemort, and Dynomite, we settled on Cassandra. Each system has its own strengths and weaknesses, but Cassandra has a good blend of everything. It offers column-oriented data storage, so you have a bit more structure than plain key/value stores. It operates in a distributed, highly available,
3 0.86445284 779 high scalability-2010-02-16-Seven Signs You May Need a NoSQL Database
Introduction: While exploring deep into some dusty old library stacks, I dug up Nostradamus' long lost NoSQL codex. What are the chances? Strangely, it also gave the plot to the next Dan Brown novel, but I left that out for reasons of sanity. About NoSQL, here is what Nosty (his friends call him Nosty) predicted are the signs you may need a NoSQL database... You noticed a lot of your database fields are really serialized complex objects in disguise. Why bother with an RDBMS at all then? Storing serialized objects in a relational database is like being on the pill while trying to get pregnant, a bit counterproductive. Just use a schemaless database from the start. Using a standard query language has become too confining. You just want to be free. SQL is so easy, so convenient, and so standard, it's really not a challenge anymore. You need to be different. Then NoSQL is for you. Each has their own completely different query mechanism. Your toolbox only contains a hammer. Hammers wh
4 0.86393201 45 high scalability-2007-07-30-Product: SmarterStats
Introduction: SmarterStats provides a solid architecture businesses and individual end users can use to track growth and forecast internet trends. * Track your website's growth and forecast internet trends * Features over 130 report items, plus Geographic Reporting * Log comparison saving 90% of your disk space * Email Reports available in Enterprise Edition * Enhanced data mining available in both editions
same-blog 5 0.82998943 67 high scalability-2007-08-17-What is the best hosting option?
Introduction: The questions were extracted from: http://highscalability.com/plentyoffish-architecture#comment-126 For a startup like Markus's, what is the best hosting option (with room to grow later)? Host your own server or use an ISP co-location option? He still has to pay huge money for bandwidth with that payload, right?
6 0.78993368 941 high scalability-2010-11-15-How Google's Instant Previews Reduces HTTP Requests
7 0.77146596 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
8 0.75743747 1022 high scalability-2011-04-13-Paper: NoSQL Databases - NoSQL Introduction and Overview
9 0.74417108 759 high scalability-2010-01-11-Strategy: Don't Use Polling for Real-time Feeds
10 0.74294662 446 high scalability-2008-11-18-Scalability Perspectives #2: Van Jacobson – Content-Centric Networking
11 0.73101521 479 high scalability-2008-12-29-Platform virtualization - top 25 providers (software, hardware, combined)
12 0.72393614 659 high scalability-2009-07-20-A Scalability Lament
13 0.71487588 1322 high scalability-2012-09-14-Stuff The Internet Says On Scalability For September 14, 2012
14 0.68501228 815 high scalability-2010-04-27-Paper: Dapper, Google's Large-Scale Distributed Systems Tracing Infrastructure
15 0.64082962 1565 high scalability-2013-12-16-22 Recommendations for Building Effective High Traffic Web Software
16 0.60549128 1408 high scalability-2013-02-19-Puppet monitoring: how to monitor the success or failure of Puppet runs
17 0.59892732 245 high scalability-2008-02-12-Product: rPath - Creating and Managing Virtual Appliances
18 0.58623773 811 high scalability-2010-04-16-Hot Scalability Links for April 16, 2010
19 0.58555263 1236 high scalability-2012-04-30-Masstree - Much Faster than MongoDB, VoltDB, Redis, and Competitive with Memcached
20 0.58180541 461 high scalability-2008-12-05-Sprinkle - Provisioning Tool to Build Remote Servers