high_scalability high_scalability-2008 high_scalability-2008-204 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: All, What is the best way to scan the content being uploaded by the users? Is there any open source solution available to do that? How does YouTube, flickr and other user uploadable content sites handle this? Any insight would be greatly appreciated! Regards, Janakan Rajendran.
sentIndex sentText sentNum sentScore
1 All, What is the best way to scan the content being uploaded by the users? [sent-1, score-1.061]
2 Is there any open source solution available to do that? [sent-2, score-0.43]
3 How does YouTube, flickr and other user uploadable content sites handle this? [sent-3, score-0.904]
wordName wordTfidf (topN-words)
[('rajendran', 0.461), ('appreciated', 0.377), ('scan', 0.321), ('uploaded', 0.287), ('greatly', 0.285), ('content', 0.272), ('flickr', 0.269), ('youtube', 0.253), ('insight', 0.218), ('sites', 0.157), ('solution', 0.113), ('handle', 0.111), ('available', 0.11), ('source', 0.106), ('best', 0.105), ('open', 0.101), ('user', 0.095), ('users', 0.087), ('way', 0.076), ('would', 0.066)]
simIndex simValue blogId blogTitle
same-blog 1 0.99999994 204 high scalability-2008-01-08-Virus Scanning for Uploaded content
Introduction: All, What is the best way to scan the content being uploaded by the users? Is there any open source solution available to do that? How does YouTube, flickr and other user uploadable content sites handle this? Any insight would be greatly appreciated! Regards, Janakan Rajendran.
2 0.20923597 226 high scalability-2008-01-28-DR-BC for web-DB servers
Introduction: All, I'm looking for a faster/reliable solution for DR/BC as well as for sclability for my web/db servers. I came across VMWare Infrastructure and other products. The I/O performance concerns me to go with virtual servers. I'm also looking into imaging software such as Acrnois. Could anyone share their thoughts on how it's being done with bigger names such as google/youtube etc..? Thank you, Regards, Janakan Rajendran.
3 0.19300659 238 high scalability-2008-02-04-IPS-IDS for heavy content site
Introduction: All, My site would have heavy content (video/pictures). I'm looking for an efficient IPS/IDS solution which would not introduce much of latency. I'm more familiar with Cisco ASA and also familiar with Juniper, Foundry and others. I also came across snort but haven't used it before. I'm more of looking for an appliance (for the ease of configuration,support etc...) Could any one share their thoughts on performane of IPS/IDS from this vendors? Thanks! Janakan Rajendran
4 0.15464132 199 high scalability-2008-01-01-S3 for image storing
Introduction: Hi all, Has anyone got any experience with using Amazon S3 as an uploaded photo store? I'm writing a website that I need to keep as low budget as possible, and I'm investigating solutions for storing uploaded photos from users - not too many, probably in the low thousands. The site is commercial so I'm straying away from the Flickrs of the world. S3 seems to offer a solution but I'd like to hear from those who have used it before. Thanks Andy
5 0.14176612 41 high scalability-2007-07-30-Product: Flickr
Introduction: Flickr offers a free basic account with limited upload bandwidth and limited storage. Download bandwidth is unlimited. Upgrading to a paid Pro account for $25/year removes all upload and storage restrictions. Flickr's terms of use warn that "professional or corporate uses of Flickr are prohibited", and all external images require a link back to Flickr.
6 0.14130394 348 high scalability-2008-07-09-Federation at Flickr: Doing Billions of Queries Per Day
7 0.1332242 1215 high scalability-2012-03-26-7 Years of YouTube Scalability Lessons in 30 Minutes
8 0.12861875 198 high scalability-2008-01-01-HOW CDN works
9 0.12513138 241 high scalability-2008-02-05-SLA monitoring
10 0.11669297 1288 high scalability-2012-07-23-Ask HighScalability: How Do I Build My MegaUpload + Itunes + YouTube Startup?
11 0.10902143 274 high scalability-2008-03-12-YouTube Architecture
12 0.10768887 560 high scalability-2009-04-08-Learned lessons from the largest player (Flickr, YouTube, Google, etc)
13 0.10503747 856 high scalability-2010-07-12-Creating Scalable Digital Libraries
14 0.099110663 176 high scalability-2007-12-07-Synchronizing databases in different geographic locations
15 0.094869547 377 high scalability-2008-09-03-SMACKDOWN :: Who are the Open Source Content Management System (CMS) market leaders in 2008?
16 0.091040723 382 high scalability-2008-09-09-Content Delivery Networks (CDN) – a comprehensive list of providers
17 0.088205889 70 high scalability-2007-08-22-How many machines do you need to run your site?
18 0.083939776 576 high scalability-2009-04-21-What CDN would you recommend?
19 0.080411226 1399 high scalability-2013-02-05-Ask HighScalability: Memcached and Relations
20 0.077694274 1 high scalability-2007-07-06-Start Here
topicId topicWeight
[(0, 0.071), (1, 0.027), (2, -0.013), (3, -0.1), (4, 0.001), (5, -0.052), (6, -0.063), (7, -0.012), (8, 0.013), (9, 0.08), (10, -0.031), (11, -0.004), (12, -0.049), (13, -0.018), (14, 0.051), (15, 0.038), (16, 0.013), (17, 0.039), (18, 0.01), (19, -0.094), (20, -0.066), (21, 0.014), (22, 0.01), (23, 0.039), (24, -0.031), (25, -0.003), (26, 0.036), (27, -0.022), (28, 0.073), (29, 0.015), (30, 0.011), (31, 0.053), (32, 0.061), (33, -0.058), (34, -0.009), (35, -0.034), (36, 0.062), (37, -0.057), (38, -0.01), (39, 0.002), (40, 0.004), (41, 0.041), (42, -0.005), (43, 0.007), (44, -0.058), (45, 0.014), (46, -0.038), (47, -0.029), (48, -0.017), (49, 0.025)]
simIndex simValue blogId blogTitle
same-blog 1 0.9751519 204 high scalability-2008-01-08-Virus Scanning for Uploaded content
Introduction: All, What is the best way to scan the content being uploaded by the users? Is there any open source solution available to do that? How does YouTube, flickr and other user uploadable content sites handle this? Any insight would be greatly appreciated! Regards, Janakan Rajendran.
2 0.72901589 198 high scalability-2008-01-01-HOW CDN works
Introduction: All, I'm just new to this and have a basic understanding how CDN works? My questions are: 1. How does CDN sync data with web servers for video/images? If I have a user to upload a video to my site, will it get stored directly in CDN or it comes to my webserver first and then sync-ed with cache server? 2. How to have only the dynamic video/image delivered through CDN while the rest is served by a webserver? 3. How sync happens and who pays for the bandwidth for sync? I'd appreciate if someone could explain this. Regards, Janakan Rajendran
3 0.6796 135 high scalability-2007-10-27-.Net2 and AJAX scalability?
Introduction: Am I mad to cons i der using .Net2 and AJAX for a high-scalabi l ity app l ication? In case you wonder why, it's the legacy of a webs i te bui l t on IIS and .Net 1.1, and we're look i ng for ways to make the content more attractive and interact i ve. In this case, it's a medical image l i brary being shared by a few Wikis and on l ine coursework for medica l students ( < 15K users) and doctors ( < 150K users) But I'm worr i ed about the performance overhead. We a l ready have a performance prob l em because of personal i sing the content for users according to their type (student or doctor), and for doctors, their grade and special i ty.
4 0.66962624 39 high scalability-2007-07-30-Product: Akamai
Introduction: Akamai transparently mirrors content (usually media objects such as audio, graphics, animation, video) stored on customer servers. Though the domain name is the same, the IP address points to an Akamai server rather than the customer's server. In addition to image caching, Akamai provides services which accelerate dynamic and personalized content, J2EE-compliant applications, and streaming media.
5 0.64563084 856 high scalability-2010-07-12-Creating Scalable Digital Libraries
Introduction: Like many other media content providers, libraries and museums are increasingly moving their content onto the Web. While the move itself is no easy process (with digitization, web development, and training costs), being able to successfully deliver content to a wide audience is an ongoing concern, particularly for large libraries. Much of the concern is financial, as most libraries do not have the internal budget or outside investors that for-profit businesses enjoy. Even large university libraries will face serious budget constraints that even other university departments, such as science and technology would not face. Creating a scalable infrastructure and also distributing a large digital collection that can handle multiple requests, requires planning that many librarians have not even imagined. They must stop thinking in terms of "one-item-per-customer" and start thinking in terms of numerous users accessing the same information simultaneously. Content Delivery Network
6 0.63900864 285 high scalability-2008-03-19-Serving JavaScript Fast
7 0.63589019 100 high scalability-2007-09-26-Use a CDN to Instantly Improve Your Website's Performance by 20% or More
8 0.62308234 576 high scalability-2009-04-21-What CDN would you recommend?
9 0.61393058 382 high scalability-2008-09-09-Content Delivery Networks (CDN) – a comprehensive list of providers
10 0.59916806 1402 high scalability-2013-02-07-Ask HighScalability: Web asset server concept - 3rd party software available?
11 0.58212173 181 high scalability-2007-12-11-Hosting and CDN for startup video sharing site
12 0.57794219 199 high scalability-2008-01-01-S3 for image storing
13 0.56742901 60 high scalability-2007-08-07-Can you profit from the coming Content Delivery Network wars?
14 0.56489241 1102 high scalability-2011-08-22-Strategy: Run a Scalable, Available, and Cheap Static Site on S3 or GitHub
15 0.5495581 144 high scalability-2007-11-07-What CDN would you recommend?
16 0.54002053 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy
17 0.53674507 41 high scalability-2007-07-30-Product: Flickr
18 0.53129143 238 high scalability-2008-02-04-IPS-IDS for heavy content site
19 0.52723372 1289 high scalability-2012-07-23-State of the CDN: More Traffic, Stable Prices, More Products, Profits - Not So Much
20 0.52597523 291 high scalability-2008-03-29-20 New Rules for Faster Web Pages
topicId topicWeight
[(1, 0.759), (2, 0.04)]
simIndex simValue blogId blogTitle
1 0.99437582 574 high scalability-2009-04-20-Some things about Memcached from a Twitter software developer
Introduction: Memcached is generally treated as a black box. But what if you really need to know what's in there? Not for runtime purposes, but for optimization and capacity planning? Read more on Evan Weaver, a software developer working for Twitter (a contributor for Rails core and Mongrel).
2 0.99384409 438 high scalability-2008-11-05-Managing application on the cloud using a JMX Fabric
Introduction: This post describes how you can create a federated management model using JMX standard API. Applications that are already using a standard JMX interface can plug-in the new federated implementation without changing the application code and without introducing additional performance overhead.
Introduction: Who's Hiring? Flurry has built large-scale app measurement and advertising services that are used by more than 80,000 media companies and independent developers to monetize mobile and related platforms. If you're interested in joining a thriving, growing team, please check us out . Rumble Games is looking for a Senior Platform Engineer to build massively scalable and shared services for the next generation of online games. We have the best team this industry has seen, and we will transform the way people play together. Join us . We need awesome people @ Booking.com - We want YOU! Come design next generation interfaces, solve critical scalability problems, and hack on one of the largest Perl codebases. Apply: http://www.booking.com/jobs.en-us.html Teradata Aster is looking for Distributed Systems , Analytic Applications , and Performance Architects . As a member of the Architecture Group you will help define the technical roadmap for the product. The
Introduction: Who's Hiring? Apple is hiring for multiple positions. Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Senior Server Side Engineer . The Emerging Technology team is looking for a highly motivated, detail-oriented, energetic individual with experience in a variety of big data technologies. You will be part of a fast growing, cohesive team with many exciting responsibilities related to Big Data, including: Develop scalable, robust systems that will gather, process, store large amount of data Define/develop Big Data technologies for Apple internal and customer facing applications. Please apply here . Senior Server Side Engineer . The Emerging Technology team is looking for a highly motivated, detail-oriented, energetic individual with experience in a variety of big data technologies. You will be part of a fast growing, cohesive team with many exciting responsibilities related
Introduction: Who's Hiring? Flurry has built large-scale app measurement and advertising services that are used by more than 80,000 media companies and independent developers to monetize mobile and related platforms. If you're interested in joining a thriving, growing team, please check us out . Rumble Games is looking for a Senior Platform Engineer to build massively scalable and shared services for the next generation of online games. We have the best team this industry has seen, and we will transform the way people play together. Join us . Duolingo , a fast-growing (>11% per week), free (no ads, no fees, no subscriptions) language learning site is looking for an infrastructure engineer to scale Duolingo to millions of users, please apply here . We need awesome people @ Booking.com - We want YOU! Come design next generation interfaces, solve critical scalability problems, and hack on one of the largest Perl codebases. Apply: http://www.booking.com/jobs.en-us.html
same-blog 11 0.98974854 204 high scalability-2008-01-08-Virus Scanning for Uploaded content