high_scalability high_scalability-2007 high_scalability-2007-82 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: From a reader: "Was reading through your very interesting/useful site. Most of the architectures are non-J2EE. Does that mean that there aren't enough websites that are scalable (with a YouTube-like userbase) built with J2EE tech? Would like to know if there are any, and their architecture as well." eBay uses Java, but in a very pragmatic way. They use servlets, an application server, the JDK, and they do the rest themselves. They skip JSP, entity beans, and JMS. When you need to scale, putting all your eggs in one basket is a risky strategy. Why use JSP when you can do better? Why use entity beans when you can do better? Use servlets because they are a very effective way of handling HTTP requests. Use Java because it is fast, runs everywhere, and has a boatload of libraries you can use to build your custom system. Probably the major reason J2EE is absent is simply LAMP. LAMP is just so incredibly functional for most 2-tier shared nothing site
sentIndex sentText sentNum sentScore
1 From a reader: Was reading through your very interesting/useful site. [sent-1, score-0.075]
2 Most of the architectures are non-J2EE. Does that mean that there aren't enough websites that are scalable (with a YouTube-like userbase) built with J2EE tech? Would like to know if there are any, and their architecture as well. [sent-2, score-0.481]
3 eBay uses Java, but in a very pragmatic way. [sent-3, score-0.113]
4 They use servlets, an application server, the JDK, and they do the rest themselves. [sent-4, score-0.244]
5 When you need to scale, putting all your eggs in one basket is a risky strategy. [sent-6, score-0.657]
6 Use servlets because they are a very effective way of handling HTTP requests. [sent-9, score-0.472]
7 Use Java because it is fast, runs everywhere, and has a boatload of libraries you can use to build your custom system. [sent-10, score-0.709]
8 Probably the major reason J2EE is absent is simply LAMP. [sent-11, score-0.202]
9 LAMP is just so incredibly functional for most 2-tier shared nothing sites they don't need a better infrastructure for writing an application tier. [sent-12, score-0.715]
10 Personally, I'm pretty excited about GWT, which uses Java and servlets. [sent-13, score-0.175]
11 We'll see if that starts to take off a little bit more. [sent-14, score-0.194]
wordName wordTfidf (topN-words)
[('beans', 0.382), ('jsp', 0.382), ('servlets', 0.312), ('entity', 0.234), ('boat', 0.2), ('eggs', 0.191), ('userbase', 0.191), ('basket', 0.184), ('jdk', 0.184), ('gwt', 0.169), ('java', 0.161), ('risky', 0.143), ('skip', 0.13), ('incredibly', 0.126), ('non', 0.116), ('reader', 0.114), ('use', 0.113), ('pragmatic', 0.113), ('better', 0.113), ('everywhere', 0.109), ('excited', 0.108), ('youtube', 0.105), ('libraries', 0.101), ('functional', 0.093), ('effective', 0.089), ('putting', 0.089), ('lamp', 0.079), ('custom', 0.078), ('reading', 0.075), ('writing', 0.075), ('reason', 0.074), ('build', 0.074), ('architectures', 0.072), ('starts', 0.072), ('rest', 0.072), ('mean', 0.071), ('handling', 0.071), ('bit', 0.071), ('nothing', 0.069), ('runs', 0.069), ('pretty', 0.067), ('websites', 0.067), ('shared', 0.065), ('sites', 0.065), ('simply', 0.064), ('major', 0.064), ('application', 0.059), ('little', 0.051), ('enough', 0.05), ('need', 0.05)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 82 high scalability-2007-09-06-Why doesn't anyone use j2ee?
2 0.13918437 459 high scalability-2008-12-03-Java World Interview on Scalability and Other Java Scalability Secrets
Introduction: OK, this interview is with me on Java scalability issues. I sound like a bigger idiot than I would like, but I suppose it could have been worse. The Java World folks were very nice and did a good job, so there’s no blame on them :-) The interview went in an interesting direction, but there’s more I’d like to add and I will do so here. Two major rules regarding Java and scalability have popped out at me: Java – It’s the platform, stupid. Java the language isn’t the big win. What is the big win is the ecosystem building up around the JVM, libraries, and toolsets. Java – It’s the community, stupid. A lot of creativity is being expended on leveraging the Java platform to meet scalability challenges. The amazing community that has built up around Java is pushing Java to the next level in almost every direction imaginable. The fecundity of the Java ecosystem can most readily be seen with the efforts to tame our multi-core future. There’s a multi-core crisis going on, in case you haven’t
3 0.11610661 327 high scalability-2008-05-27-How I Learned to Stop Worrying and Love Using a Lot of Disk Space to Scale
Introduction: Update 3: ReadWriteWeb says Google App Engine Announces New Pricing Plans, APIs, Open Access. Pricing is specified but I'm not sure what to make of it yet. An image manipulation library is added (thus the need to pay for more CPU :-) and memcached support has been added. Memcached will help resolve the "can't write for every read" problem that pops up when keeping counters. Update 2: onGWT.com threw a GAE load party and a lot of people came. The results are at Load test: Google App Engine = 1, Community = 0. GAE handled a peak of 35 requests/second and a sustained 10 requests/second. Some think performance was good, others not so good. My GMT watch broke and I was late to arrive. Maybe next time. Also added a few new design rules from the post. Update: Added a few new rules gleaned from the GAE Meetup: Design By Explicit Cost Model and Puts are Precious. How do you structure your database using a distributed hash table like BigTable? The answer isn't what you might expect. If
4 0.093492456 1143 high scalability-2011-11-16-Google+ Infrastructure Update - the JavaScript Story
Introduction: In Google+ Is Built Using Tools You Can Use Too: Closure, Java Servlets, JavaScript, BigTable, Colossus, Quick Turnaround we glimpsed inside Google's technology stack for building Google+. Mark Knichel, an engineer on the Google+ infrastructure team, has helped us look a little deeper at how JavaScript is handled in Google+. Here's a quick look: They love Closure for its library, templates, compiler, and strict type checking. Compilation is now required for good performance. I've wondered if GWT will be killed off, as other Google properties have been, but I've been told GWT is being used heavily inside Google, so thankfully that probably won't happen. Closure templates are used in both Java and JavaScript to render pages server-side and in the browser. Just-in-time JavaScript. Code is split into modules so the minimum amount of JavaScript is loaded asynchronously in the background as necessary. Navigation happens without loading the page. Page navigation happens without pag
5 0.090973772 972 high scalability-2011-01-11-Google Megastore - 3 Billion Writes and 20 Billion Read Transactions Daily
Introduction: A giant step into the fully distributed future has been taken by the Google App Engine team with the release of their High Replication Datastore. The HRD is targeted at mission-critical applications that require data replicated to at least three datacenters, full ACID semantics for entity groups, and lower consistency guarantees across entity groups. This is a major accomplishment. Few organizations can implement a true multi-datacenter datastore. Other than SimpleDB, how many other publicly accessible database services can operate out of multiple datacenters? Now that capability can be had by anyone. But there is a price, literally and otherwise. Because the HRD uses three times the resources of Google App Engine's Master/Slave datastore, it will cost three times as much. And because it is a distributed database, with all that implies in the CAP sense, developers will have to be very careful in how they architect their applications because as costs increased, reliability incre
6 0.087849617 331 high scalability-2008-05-27-eBay Architecture
7 0.085380904 1068 high scalability-2011-06-27-TripAdvisor Architecture - 40M Visitors, 200M Dynamic Page Views, 30TB Data
8 0.081902966 1215 high scalability-2012-03-26-7 Years of YouTube Scalability Lessons in 30 Minutes
9 0.079980716 1305 high scalability-2012-08-16-Paper: A Provably Correct Scalable Concurrent Skip List
11 0.077162057 563 high scalability-2009-04-10-Facebook Chat Architecture
12 0.071147218 96 high scalability-2007-09-18-Amazon Architecture
14 0.068416819 301 high scalability-2008-04-08-Google AppEngine - A First Look
16 0.067501687 1118 high scalability-2011-09-19-Big Iron Returns with BigMemory
17 0.066667281 121 high scalability-2007-10-14-Newbie in scalability design issues
19 0.065603413 985 high scalability-2011-02-08-Mollom Architecture - Killing Over 373 Million Spams at 100 Requests Per Second
20 0.065235361 194 high scalability-2007-12-26-Golden rule of web caching
topicId topicWeight
[(0, 0.111), (1, 0.013), (2, -0.015), (3, -0.021), (4, 0.017), (5, -0.023), (6, -0.008), (7, 0.003), (8, -0.028), (9, 0.023), (10, -0.032), (11, 0.01), (12, -0.001), (13, -0.016), (14, 0.021), (15, -0.063), (16, 0.009), (17, -0.03), (18, 0.019), (19, -0.028), (20, -0.012), (21, -0.017), (22, 0.01), (23, -0.055), (24, 0.007), (25, -0.019), (26, 0.044), (27, -0.064), (28, 0.037), (29, 0.04), (30, -0.027), (31, -0.019), (32, 0.021), (33, -0.062), (34, 0.003), (35, -0.038), (36, -0.022), (37, -0.022), (38, 0.003), (39, -0.011), (40, -0.005), (41, 0.023), (42, -0.03), (43, -0.012), (44, -0.004), (45, 0.009), (46, 0.059), (47, -0.047), (48, -0.025), (49, 0.026)]
simIndex simValue blogId blogTitle
same-blog 1 0.93693781 82 high scalability-2007-09-06-Why doesn't anyone use j2ee?
2 0.77469361 459 high scalability-2008-12-03-Java World Interview on Scalability and Other Java Scalability Secrets
Introduction: OK, this interview is with me on Java scalability issues. I sound like a bigger idiot than I would like, but I suppose it could have been worse. The Java World folks were very nice and did a good job, so there’s no blame on them :-) The interview went in an interesting direction, but there’s more I’d like to add and I will do so here. Two major rules regarding Java and scalability have popped out at me: Java – It’s the platform, stupid. Java the language isn’t the big win. What is the big win is the ecosystem building up around the JVM, libraries, and toolsets. Java – It’s the community, stupid. A lot of creativity is being expended on leveraging the Java platform to meet scalability challenges. The amazing community that has built up around Java is pushing Java to the next level in almost every direction imaginable. The fecundity of the Java ecosystem can most readily be seen with the efforts to tame our multi-core future. There’s a multi-core crisis going on, in case you haven’t
Introduction: PayPal gives yet another glowing report of an app rewritten in node.js experiencing substantial performance improvements. PayPal rewrote their account overview page, one of the most trafficked apps on the website, which was previously written in King Java. The benefits: Full-stack engineers. Using JavaScript on both the front-end and the back-end removed an artificial boundary between the browser and server, allowing engineers to code both. Built almost twice as fast with fewer people. Written in 33% fewer lines of code. Constructed with 40% fewer files. Double the requests per second vs. the Java application. 35% decrease in the average response time for the same page. A common pro-Java response is an argument like: clearly these people don't know how to program Java. Or rewriting an application usually makes it faster. Or the benchmark is faulty. And so on. Consider it noted. These are all potential factors. Baron Schwartz from VividCortex has a different tak
4 0.70502371 550 high scalability-2009-03-30-Ebay history and architecture
Introduction: eBay [1] started in 1995 under the initial name AuctionWeb (V1): - very simple architecture - based on Perl - no database; for data persistence they used plain files. Because of rapid growth they needed to improve their architecture, and so V2 (clever name) was born: - replaced Perl with C/C++ - started using a database in a master-slave configuration - C++ back-end - XSLT front-end. Any request leads to an XML file being created in C++, and the XSLT processor transforms that into HTML. *pretty sophisticated architecture for the 90s, XSLT was cutting-edge back then* That held out pretty well for a while, but in the late 90s eBay experienced exponential growth. They started having some trouble with outages and needed improvements, so V3 was developed: - based on Java - search engine still used C++ - proof that relational databases can scale (aggressive caching) - developed a messaging layer for making a lot of asynchronous calls, they a
5 0.68532801 346 high scalability-2008-06-28-ID generation schemes
Introduction: Hi, Generating unique IDs is a common requirement in many projects. Generally, this responsibility is given to the database layer, by using sequences or some other technique. This is a problem for horizontal scalability. What GUID generation schemes are generally used in highly scalable web sites? I have seen Java's SecureRandom class used to generate GUIDs. What other methods are generally used? Thanks, Unmesh
6 0.68527758 993 high scalability-2011-02-22-Is Node.js Becoming a Part of the Stack? SimpleGeo Says Yes.
7 0.6429643 434 high scalability-2008-10-30-Olio Web2.0 Toolkit - Evaluate Web Technologies and Tools
8 0.64187956 113 high scalability-2007-10-07-Paper: Architecture of a Highly Scalable NIO-Based Server
9 0.63845026 1277 high scalability-2012-07-05-10 Golden Principles For Building Successful Mobile-Web Applications
10 0.61772251 384 high scalability-2008-09-16-EE-Appserver Clustering OR Terracota OR Coherence OR something else?
11 0.61433327 1427 high scalability-2013-03-20-Dart - Is it the Future of the Web?
12 0.60835642 471 high scalability-2008-12-19-Gigaspaces curbs latency outliers with Java Real Time
13 0.59765226 1591 high scalability-2014-02-05-Little’s Law, Scalability and Fault Tolerance: The OS is your bottleneck. What you can do?
14 0.59141803 328 high scalability-2008-05-27-Scalable virus scanning for web-applications
15 0.58815354 124 high scalability-2007-10-16-How Scalable are Single Page Ajax Apps?
16 0.5863958 129 high scalability-2007-10-23-Hire Facebook, Ning, and Salesforce to Scale for You
17 0.5856083 301 high scalability-2008-04-08-Google AppEngine - A First Look
18 0.58541197 145 high scalability-2007-11-08-ID generator
19 0.58438402 338 high scalability-2008-06-02-Total Cost of Ownership for different web development frameworks
20 0.5829438 307 high scalability-2008-04-21-Using Google AppEngine for a Little Micro-Scalability
topicId topicWeight
[(1, 0.2), (2, 0.182), (12, 0.355), (40, 0.064), (79, 0.072)]
simIndex simValue blogId blogTitle
1 0.8646363 209 high scalability-2008-01-12-Gandi.net, french registrar launches in granular server resources.
Introduction: Gandi.net, a French domain registrar, has launched a very flexible VPS service with dynamically allocated resources.
same-blog 2 0.82873034 82 high scalability-2007-09-06-Why doesn't anyone use j2ee?
3 0.81699008 278 high scalability-2008-03-16-Product: GlusterFS
Introduction: Adapted from their website: GlusterFS is a clustered file system capable of scaling to several petabytes. It aggregates various storage bricks over Infiniband RDMA or TCP/IP interconnect into one large parallel network file system. Storage bricks can be made of any commodity hardware (such as an x86-64 server with SATA-II RAID and an Infiniband HBA). Cluster file systems are still not mature for the enterprise market. They are too complex to deploy and maintain, though they are extremely scalable and cheap, and can be built entirely out of commodity OS and hardware. GlusterFS hopes to solve this problem. GlusterFS achieved 35 GBps read throughput. The GlusterFS Aggregated I/O Benchmark was performed on a 64-brick clustered storage system over 10 Gbps Infiniband interconnect. A cluster of 220 clients pounded the storage system with multiple dd (disk-dump) instances, each reading / writing a 1 GB file with 1MB block size. GlusterFS was configured with unify translator and round-robin scheduler
4 0.7491799 285 high scalability-2008-03-19-Serving JavaScript Fast
Introduction: Cal Henderson writes at thinkvitamin.com: "With our so-called 'Web 2.0' applications and their rich content and interaction, we expect our applications to increasingly make use of CSS and JavaScript. To make sure these applications are nice and snappy to use, we need to optimize the size and nature of content required to render the page, making sure we’re delivering the optimum experience. In practice, this means a combination of making our content as small and fast to download as possible, while avoiding unnecessarily refetching unmodified resources." A lot of good comments too.
5 0.74219429 3 high scalability-2007-07-09-LiveJournal Architecture
Introduction: A fascinating and detailed story of how LiveJournal evolved their system to scale. LiveJournal was an early player in the free blog service race and faced issues from quickly adding a large number of users. Blog posts come fast and furious, which causes a lot of writes, and writes are particularly hard to scale. Understanding how LiveJournal faced their scaling problems will help any aspiring website builder. Site: http://www.livejournal.com/ Information Sources: LiveJournal - Behind The Scenes Scaling Storytime; Google Video; Tokyo Video; 2005 version. Platform: Linux, MySql, Perl, Memcached, MogileFS, Apache. What's Inside? Scaling from 1, 2, and 4 hosts to cluster of servers. Avoid single points of failure. Using MySQL replication only takes you so far. Becoming IO bound kills scaling. Spread out writes and reads for more parallelism. You can't keep adding read slaves and scale. Shard storage approach, using DRBD, for maxim
6 0.71938097 490 high scalability-2009-01-12-Getting ready for the cloud
7 0.71318334 1352 high scalability-2012-10-31-Gone Fishin': LiveJournal Architecture
8 0.70596677 161 high scalability-2007-11-20-Product: SmartFrog a Distributed Configuration and Deployment Framework
9 0.65445787 886 high scalability-2010-08-24-21 Quality Screencasts on Scaling Rails
10 0.61688673 415 high scalability-2008-10-15-Need help with your Hadoop deployment? This company may help!
11 0.60277963 983 high scalability-2011-02-02-Piccolo - Building Distributed Programs that are 11x Faster than Hadoop
12 0.60090327 1633 high scalability-2014-04-16-Six Lessons Learned the Hard Way About Scaling a Million User System
13 0.60077572 679 high scalability-2009-08-11-13 Scalability Best Practices
14 0.59988695 1405 high scalability-2013-02-13-7 Sensible and 1 Really Surprising Way EVE Online Scales to Play Huge Games
15 0.59984368 1517 high scalability-2013-09-16-The Hidden DNS Tax - Cascading Timeouts and Errors
16 0.5984118 1448 high scalability-2013-04-29-AWS v GCE Face-off and Why Innovation Needs Lower Cost Infrastructures
17 0.59765941 1472 high scalability-2013-06-07-Stuff The Internet Says On Scalability For June 7, 2013
18 0.59760088 373 high scalability-2008-08-29-Product: ScaleOut StateServer is Memcached on Steroids
19 0.59674567 1641 high scalability-2014-05-01-Paper: Can Programming Be Liberated From The Von Neumann Style?
20 0.59671545 219 high scalability-2008-01-21-Product: Hyperic