high_scalability high_scalability-2008 high_scalability-2008-264 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Paul Tyma published a massive and massively good 96 page insider's manual on How to Pass a Silicon Valley Software Engineering Interview . My eyes immediately latched on to one of his key example scenarios, which involves scaling Facebook: Facebook ● What was Facebook day 1? – A database with a PHP front-end ● In PHP, Java, C#, whatever – How long would it take you to reproduce Facebook's first incarnation? ● A single MySQL instance with some simple queries probably used to happily query the whole userbase. Facebook ● What is it today? ● Its not about “that stuff you learned in school” – Its about what a company with thousands of (possibly conflicting) queries per second operating on a directed-graph with 50 million nodes ● And of course a few Petabytes of data ● And 99.99% uptime ● Design decision? A Facebook user is (or recently was) currently limited to 5000 friends. If you've been reading all the wisdom contributed to and reference
sentIndex sentText sentNum sentScore
1 Paul Tyma published a massive and massively good 96 page insider's manual on How to Pass a Silicon Valley Software Engineering Interview . [sent-1, score-0.421]
2 My eyes immediately latched on to one of his key example scenarios, which involves scaling Facebook: Facebook ● What was Facebook day 1? [sent-2, score-0.389]
3 – A database with a PHP front-end ● In PHP, Java, C#, whatever – How long would it take you to reproduce Facebook's first incarnation? [sent-3, score-0.293]
4 ● A single MySQL instance with some simple queries probably used to happily query the whole userbase. [sent-4, score-0.385]
5 ● Its not about “that stuff you learned in school” – Its about what a company with thousands of (possibly conflicting) queries per second operating on a directed-graph with 50 million nodes ● And of course a few Petabytes of data ● And 99. [sent-6, score-0.506]
6 A Facebook user is (or recently was) currently limited to 5000 friends. [sent-8, score-0.248]
7 If you've been reading all the wisdom contributed to and referenced by this website you might just rock this interview and put a little more money in your pocket. [sent-9, score-0.892]
8 So this site isn't a total waste of time :-) Yet I wonder how we can have 96 pages on interviewing and still not talk about software development at all? [sent-10, score-0.692]
wordName wordTfidf (topN-words)
[('facebook', 0.252), ('tyma', 0.219), ('incarnation', 0.219), ('interviewing', 0.209), ('referenced', 0.19), ('happily', 0.185), ('reproduce', 0.185), ('insider', 0.181), ('conflicting', 0.171), ('contributed', 0.166), ('onhow', 0.166), ('eyes', 0.161), ('php', 0.158), ('school', 0.146), ('rock', 0.14), ('wisdom', 0.138), ('uptime', 0.137), ('paul', 0.136), ('valley', 0.134), ('silicon', 0.133), ('queries', 0.129), ('involves', 0.128), ('pass', 0.127), ('waste', 0.124), ('published', 0.124), ('petabytes', 0.119), ('wonder', 0.117), ('scenarios', 0.116), ('massively', 0.113), ('possibly', 0.112), ('whatever', 0.108), ('manual', 0.104), ('decision', 0.102), ('interview', 0.1), ('immediately', 0.1), ('recently', 0.089), ('limited', 0.085), ('total', 0.083), ('pages', 0.083), ('reading', 0.082), ('massive', 0.08), ('learned', 0.08), ('course', 0.078), ('money', 0.076), ('software', 0.076), ('stuff', 0.075), ('currently', 0.074), ('company', 0.072), ('thousands', 0.072), ('probably', 0.071)]
simIndex simValue blogId blogTitle
same-blog 1 1.0000001 264 high scalability-2008-03-03-Read This Site and Ace Your Next Interview!
Introduction: Paul Tyma published a massive and massively good 96 page insider's manual on How to Pass a Silicon Valley Software Engineering Interview . My eyes immediately latched on to one of his key example scenarios, which involves scaling Facebook: Facebook ● What was Facebook day 1? – A database with a PHP front-end ● In PHP, Java, C#, whatever – How long would it take you to reproduce Facebook's first incarnation? ● A single MySQL instance with some simple queries probably used to happily query the whole userbase. Facebook ● What is it today? ● Its not about “that stuff you learned in school” – Its about what a company with thousands of (possibly conflicting) queries per second operating on a directed-graph with 50 million nodes ● And of course a few Petabytes of data ● And 99.99% uptime ● Design decision? A Facebook user is (or recently was) currently limited to 5000 friends. If you've been reading all the wisdom contributed to and reference
2 0.17018542 720 high scalability-2009-10-12-High Performance at Massive Scale – Lessons learned at Facebook
Introduction: Jeff Rothschild, Vice President of Technology at Facebook gave a great presentation at UC San Diego on our favorite subject: " High Performance at Massive Scale – Lessons learned at Facebook ". The abstract for the talk is: Facebook has grown into one of the largest sites on the Internet today serving over 200 billion pages per month. The nature of social data makes engineering a site for this level of scale a particularly challenging proposition. In this presentation, I will discuss the aspects of social data that present challenges for scalability and will describe the the core architectural components and design principles that Facebook has used to address these challenges. In addition, I will discuss emerging technologies that offer new opportunities for building cost-effective high performance web architectures. There's a lot of interesting about this talk that we'll get into later, but I thought you might want a head start on learning how Facebook handles 30K+ machines,
3 0.14979306 1123 high scalability-2011-09-23-The Real News is Not that Facebook Serves Up 1 Trillion Pages a Month…
Introduction: There’s some amount of debate whether Facebook really crossed over the one trillion page view per month threshold. While one report says it did, another respected firm says it did not ; that its monthly page views are a mere 467 billion per month. In the big scheme of things, the discrepancy is somewhat irrelevant, as neither show the true load on Facebook’s infrastructure – which is far more impressive a set of numbers than its externally measured “page view” metric. Mashable reported in “ Facebook Surpasses 1 Trillion Pageviews per Month ” that the social networking giant saw “approximately 870 million unique visitors in June and 860 million in July” and followed up with some per visitor statistics, indicating “each visitor averaged approximately 1,160 page views in July and 40 per visit — enormous by any standard. Time spent on the site was around 25 minutes per user.” From an architectural standpoint it’s not just about the page views. It’s about requests and responses,
4 0.14722317 845 high scalability-2010-06-22-Exploring the software behind Facebook, the world’s largest site
Introduction: Peter Alguacil at Pingdom wrote a HighScalability worthy article on Facebook's architecture: Exploring the software behind Facebook, the world’s largest site . It covers the challenges Facebook faces, the software Facebook uses, and the techniques Facebook uses to keep on scaling. Definitely worth a look.
5 0.14066894 624 high scalability-2009-06-10-Hive - A Petabyte Scale Data Warehouse using Hadoop
Introduction: This post about using Hive and Hadoop for analytics comes straight from Facebook engineers. Scalable analysis on large data sets has been core to the functions of a number of teams at Facebook - both engineering and non-engineering. Apart from ad hoc analysis and business intelligence applications used by analysts across the company, a number of Facebook products are also based on analytics. These products range from simple reporting applications like Insights for the Facebook Ad Network, to more advanced kind such as Facebook's Lexicon product. As a result a flexible infrastructure that caters to the needs of these diverse applications and users and that also scales up in a cost effective manner with the ever increasing amounts of data being generated on Facebook, is critical. Hive and Hadoop are the technologies that we have used to address these requirements at Facebook. Read the rest of the article on Engineering @ Facebook's Notes page
6 0.12711143 721 high scalability-2009-10-13-Why are Facebook, Digg, and Twitter so hard to scale?
7 0.1148037 464 high scalability-2008-12-13-Strategy: Facebook Tweaks to Handle 6 Time as Many Memcached Requests
8 0.11095978 840 high scalability-2010-06-10-The Four Meta Secrets of Scaling at Facebook
9 0.11041636 562 high scalability-2009-04-10-Facebook's Aditya giving presentation on Facebook Architecture
11 0.09608233 1008 high scalability-2011-03-22-Facebook's New Realtime Analytics System: HBase to Process 20 Billion Events Per Day
12 0.095343351 1011 high scalability-2011-03-25-Did the Microsoft Stack Kill MySpace?
13 0.094931528 1081 high scalability-2011-07-18-Building your own Facebook Realtime Analytics System
14 0.094803177 1561 high scalability-2013-12-09-Site Moves from PHP to Facebook's HipHop, Now Pages Load in .6 Seconds Instead of Five
15 0.092154987 943 high scalability-2010-11-16-Facebook's New Real-time Messaging System: HBase to Store 135+ Billion Messages a Month
16 0.091320306 415 high scalability-2008-10-15-Need help with your Hadoop deployment? This company may help!
17 0.090797931 934 high scalability-2010-11-04-Facebook at 13 Million Queries Per Second Recommends: Minimize Request Variance
18 0.090040207 1444 high scalability-2013-04-23-Facebook Secrets of Web Performance
19 0.086309806 870 high scalability-2010-08-02-7 Scaling Strategies Facebook Used to Grow to 500 Million Users
20 0.08543843 1220 high scalability-2012-04-02-YouPorn - Targeting 200 Million Views a Day and Beyond
topicId topicWeight
[(0, 0.144), (1, 0.068), (2, -0.004), (3, -0.04), (4, 0.08), (5, -0.031), (6, -0.079), (7, 0.033), (8, 0.076), (9, 0.042), (10, 0.013), (11, 0.018), (12, 0.051), (13, 0.048), (14, -0.037), (15, 0.007), (16, -0.006), (17, -0.029), (18, 0.018), (19, 0.052), (20, 0.1), (21, 0.079), (22, 0.042), (23, 0.008), (24, 0.031), (25, -0.032), (26, 0.027), (27, -0.055), (28, 0.081), (29, -0.018), (30, -0.133), (31, 0.049), (32, 0.003), (33, 0.047), (34, -0.002), (35, 0.046), (36, 0.025), (37, -0.056), (38, -0.048), (39, 0.028), (40, -0.083), (41, -0.023), (42, 0.004), (43, 0.031), (44, 0.025), (45, -0.005), (46, 0.035), (47, 0.028), (48, 0.016), (49, -0.053)]
simIndex simValue blogId blogTitle
same-blog 1 0.98689616 264 high scalability-2008-03-03-Read This Site and Ace Your Next Interview!
Introduction: Paul Tyma published a massive and massively good 96 page insider's manual on How to Pass a Silicon Valley Software Engineering Interview . My eyes immediately latched on to one of his key example scenarios, which involves scaling Facebook: Facebook ● What was Facebook day 1? – A database with a PHP front-end ● In PHP, Java, C#, whatever – How long would it take you to reproduce Facebook's first incarnation? ● A single MySQL instance with some simple queries probably used to happily query the whole userbase. Facebook ● What is it today? ● Its not about “that stuff you learned in school” – Its about what a company with thousands of (possibly conflicting) queries per second operating on a directed-graph with 50 million nodes ● And of course a few Petabytes of data ● And 99.99% uptime ● Design decision? A Facebook user is (or recently was) currently limited to 5000 friends. If you've been reading all the wisdom contributed to and reference
2 0.84176928 1323 high scalability-2012-09-15-4 Reasons Facebook Dumped HTML5 and Went Native
Introduction: Facebook made quite a splash when they released their native iOS app , not because of their app per se, but because of their conclusion that their biggest mistake was betting on HTML5 , so they had to go native. As you might imagine this was a bit like telling a Great White Shark that its bark is worse than its bite. A common refrain was Facebook simply had made a bad HTML5 site, not that HTML5 itself is bad, as plenty of other vendors have made slick well performing mobile sites. An interesting and relevant conversation given the rising butt kickery of mobile. But we were lacking details. Now we aren't. If you were wondering just why Facebook ditched HTML5, Tobie Langel in Perf Feedback - What's slowing down Mobile Facebook , lists out the reasons: Tooling / Developer APIs . Most importantly, the lack of tooling to track down memory problems. Scrolling performance. Scrolling must be fast and smooth and full featured. It's not. GPU. A clunky API and black box ap
3 0.83771807 562 high scalability-2009-04-10-Facebook's Aditya giving presentation on Facebook Architecture
Introduction: Facebook's engg. director aditya talks about facebook architecture. How they use mysql, php and memcache. How they have modified the above to suit their requirements.
4 0.78855425 845 high scalability-2010-06-22-Exploring the software behind Facebook, the world’s largest site
Introduction: Peter Alguacil at Pingdom wrote a HighScalability worthy article on Facebook's architecture: Exploring the software behind Facebook, the world’s largest site . It covers the challenges Facebook faces, the software Facebook uses, and the techniques Facebook uses to keep on scaling. Definitely worth a look.
5 0.78490156 624 high scalability-2009-06-10-Hive - A Petabyte Scale Data Warehouse using Hadoop
Introduction: This post about using Hive and Hadoop for analytics comes straight from Facebook engineers. Scalable analysis on large data sets has been core to the functions of a number of teams at Facebook - both engineering and non-engineering. Apart from ad hoc analysis and business intelligence applications used by analysts across the company, a number of Facebook products are also based on analytics. These products range from simple reporting applications like Insights for the Facebook Ad Network, to more advanced kind such as Facebook's Lexicon product. As a result a flexible infrastructure that caters to the needs of these diverse applications and users and that also scales up in a cost effective manner with the ever increasing amounts of data being generated on Facebook, is critical. Hive and Hadoop are the technologies that we have used to address these requirements at Facebook. Read the rest of the article on Engineering @ Facebook's Notes page
6 0.78471118 720 high scalability-2009-10-12-High Performance at Massive Scale – Lessons learned at Facebook
7 0.78227937 966 high scalability-2010-12-31-Facebook in 20 Minutes: 2.7M Photos, 10.2M Comments, 4.6M Messages
10 0.73949176 1081 high scalability-2011-07-18-Building your own Facebook Realtime Analytics System
11 0.73430163 870 high scalability-2010-08-02-7 Scaling Strategies Facebook Used to Grow to 500 Million Users
12 0.70698977 840 high scalability-2010-06-10-The Four Meta Secrets of Scaling at Facebook
13 0.70098633 1123 high scalability-2011-09-23-The Real News is Not that Facebook Serves Up 1 Trillion Pages a Month…
14 0.69023389 378 high scalability-2008-09-03-Some Facebook Secrets to Better Operations
15 0.68066686 599 high scalability-2009-05-14-Who Has the Most Web Servers?
16 0.65580523 943 high scalability-2010-11-16-Facebook's New Real-time Messaging System: HBase to Store 135+ Billion Messages a Month
17 0.65570843 1444 high scalability-2013-04-23-Facebook Secrets of Web Performance
18 0.63499755 1595 high scalability-2014-02-13-Snabb Switch - Skip the OS and Get 40 million Requests Per Second in Lua
19 0.63308161 464 high scalability-2008-12-13-Strategy: Facebook Tweaks to Handle 6 Time as Many Memcached Requests
20 0.63245457 563 high scalability-2009-04-10-Facebook Chat Architecture
topicId topicWeight
[(1, 0.199), (2, 0.205), (30, 0.059), (36, 0.102), (61, 0.076), (79, 0.197), (94, 0.061)]
simIndex simValue blogId blogTitle
same-blog 1 0.96998173 264 high scalability-2008-03-03-Read This Site and Ace Your Next Interview!
Introduction: Paul Tyma published a massive and massively good 96 page insider's manual on How to Pass a Silicon Valley Software Engineering Interview . My eyes immediately latched on to one of his key example scenarios, which involves scaling Facebook: Facebook ● What was Facebook day 1? – A database with a PHP front-end ● In PHP, Java, C#, whatever – How long would it take you to reproduce Facebook's first incarnation? ● A single MySQL instance with some simple queries probably used to happily query the whole userbase. Facebook ● What is it today? ● Its not about “that stuff you learned in school” – Its about what a company with thousands of (possibly conflicting) queries per second operating on a directed-graph with 50 million nodes ● And of course a few Petabytes of data ● And 99.99% uptime ● Design decision? A Facebook user is (or recently was) currently limited to 5000 friends. If you've been reading all the wisdom contributed to and reference
Introduction: Oh, you don't have a Quantum Computer Lab staffed with researchers? Well, Google does. Here they are on G+ . To learn what they are up to the Verge has A first look inside Google's futuristic quantum lab . The lab is partnership between NASA, Google, and a 512-qubit D-Wave Two quantum computer. One result from the lab is: The first practical application has been on Google Glass, as engineers put the quantum chips to work on Glass's blink detector, helping it to better distinguish between intentional winks and involuntary blinks. For engineering reasons, the quantum processor can never be installed in Glass, but together with Google's conventional server centers, it can point the way to a better blink-detecting algorithm. That would allow the Glass processor to detect blinks with better accuracy and using significantly less power. If successful, it could be an important breakthrough for wink-triggered apps, which have struggled with the task so far. Google thinks quantum
Introduction: Urs Hoelzle , infrastructure guru and SVP at Google, made a really interesting statement about the economics of scale in the datacenter: We’ve shown that when you run a large application in the datacenter, like Gmail, you can, compared to a small organization running their own email server, you can save nearly a factor of 100 in terms of compute and energy, when you run it at scale. My first thought was shock at the magnitude of the difference. 100x is a chasm crosser. Then I thought about Gmail, it's horizontally scalable using technologies that are following Moore's Law (storage and compute), latency requirements are lax, a commodity network is sufficient, and it can be highly automated so management costs scale slower than users. After that it's a simple matter of software :-) Oh, and developing a market where it's "cheaper to run a large thing than a small thing."
4 0.94385856 1216 high scalability-2012-03-27-Big Data In the Cloud Using Cloudify
Introduction: Edd Dumbill wrote an interesting article on O’Reilly Radar covering the current solutions for running Big Data in the Cloud Big data and cloud technology go hand-in-hand. Big data needs clusters of servers for processing, which clouds can readily provide. Big PaaS Edd touched briefly on the role of PaaS for delivering Big Data applications in the cloud Beyond IaaS, several cloud services provide application layer support for big data work. Sometimes referred to as managed solutions, or platform as a service (PaaS), these services remove the need to ucale things such as databases or MapReduce, reducing your workload and maintenance burden. Additionally, PaaS providers can realize great efficiencies by hosting at the application level, and pass those savings on to the customer. To put it simply, managing data clusters is one thing. Being able to process the data is yet another challenge that we need to think about when we’re dealing with application platforms, as I no
5 0.94170135 1240 high scalability-2012-05-07-Startups are Creating a New System of the World for IT
Introduction: It remains that, from the same principles, I now demonstrate the frame of the System of the World. -- Isaac Newton The practice of IT reminds me a lot of the practice of science before Isaac Newton. Aristotelianism was dead, but there was nothing to replace it. Then Newton came along, created a scientific revolution with his System of the World . And everything changed. That was New System of the World number one. New System of the World number two was written about by the incomparable Neal Stephenson in his incredible Baroque Cycle series. It explores the singular creation of a new way of organizing society grounded in new modes of thought in business, religion, politics, and science. Our modern world emerged Enlightened as it could from this roiling cauldron of forces. In IT we may have had a Leonardo da Vinci or even a Galileo, but we’ve never had our Newton. Maybe we don't need a towering genius to make everything clear? For years startups, like the frenetically inventive
6 0.94120538 1275 high scalability-2012-07-02-C is for Compute - Google Compute Engine (GCE)
7 0.94068074 576 high scalability-2009-04-21-What CDN would you recommend?
8 0.93910545 608 high scalability-2009-05-27-The Future of the Parallelism and its Challenges
9 0.93781102 451 high scalability-2008-11-30-Creating a high-performing online database
10 0.93754888 1575 high scalability-2014-01-08-Under Snowden's Light Software Architecture Choices Become Murky
11 0.93744171 229 high scalability-2008-01-29-Building scalable storage into application - Instead of MogileFS OpenAFS etc.
12 0.93588853 803 high scalability-2010-04-05-Intercloud: How Will We Scale Across Multiple Clouds?
15 0.93554133 195 high scalability-2007-12-28-Amazon's EC2: Pay as You Grow Could Cut Your Costs in Half
16 0.93471366 972 high scalability-2011-01-11-Google Megastore - 3 Billion Writes and 20 Billion Read Transactions Daily
17 0.93308532 1654 high scalability-2014-06-05-Cloud Architecture Revolution
18 0.93283373 38 high scalability-2007-07-30-Build an Infinitely Scalable Infrastructure for $100 Using Amazon Services
19 0.93264991 984 high scalability-2011-02-04-Stuff The Internet Says On Scalability For February 4, 2011
20 0.93186587 129 high scalability-2007-10-23-Hire Facebook, Ning, and Salesforce to Scale for You