high_scalability high_scalability-2008 high_scalability-2008-204 knowledge-graph by maker-knowledge-mining

204 high scalability-2008-01-08-Virus Scanning for Uploaded content


meta infos for this blog

Source: html

Introduction: All, What is the best way to scan the content being uploaded by the users? Is there any open source solution available to do that? How does YouTube, flickr and other user uploadable content sites handle this? Any insight would be greatly appreciated! Regards, Janakan Rajendran.


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 All, What is the best way to scan the content being uploaded by the users? [sent-1, score-1.061]

2 Is there any open source solution available to do that? [sent-2, score-0.43]

3 How does YouTube, flickr and other user uploadable content sites handle this? [sent-3, score-0.904]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('rajendran', 0.461), ('appreciated', 0.377), ('scan', 0.321), ('uploaded', 0.287), ('greatly', 0.285), ('content', 0.272), ('flickr', 0.269), ('youtube', 0.253), ('insight', 0.218), ('sites', 0.157), ('solution', 0.113), ('handle', 0.111), ('available', 0.11), ('source', 0.106), ('best', 0.105), ('open', 0.101), ('user', 0.095), ('users', 0.087), ('way', 0.076), ('would', 0.066)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.99999994 204 high scalability-2008-01-08-Virus Scanning for Uploaded content

Introduction: All, What is the best way to scan the content being uploaded by the users? Is there any open source solution available to do that? How does YouTube, flickr and other user uploadable content sites handle this? Any insight would be greatly appreciated! Regards, Janakan Rajendran.

2 0.20923597 226 high scalability-2008-01-28-DR-BC for web-DB servers

Introduction: All, I'm looking for a faster/reliable solution for DR/BC as well as for sclability for my web/db servers. I came across VMWare Infrastructure and other products. The I/O performance concerns me to go with virtual servers. I'm also looking into imaging software such as Acrnois. Could anyone share their thoughts on how it's being done with bigger names such as google/youtube etc..? Thank you, Regards, Janakan Rajendran.

3 0.19300659 238 high scalability-2008-02-04-IPS-IDS for heavy content site

Introduction: All, My site would have heavy content (video/pictures). I'm looking for an efficient IPS/IDS solution which would not introduce much of latency. I'm more familiar with Cisco ASA and also familiar with Juniper, Foundry and others. I also came across snort but haven't used it before. I'm more of looking for an appliance (for the ease of configuration,support etc...) Could any one share their thoughts on performane of IPS/IDS from this vendors? Thanks! Janakan Rajendran

4 0.15464132 199 high scalability-2008-01-01-S3 for image storing

Introduction: Hi all, Has anyone got any experience with using Amazon S3 as an uploaded photo store? I'm writing a website that I need to keep as low budget as possible, and I'm investigating solutions for storing uploaded photos from users - not too many, probably in the low thousands. The site is commercial so I'm straying away from the Flickrs of the world. S3 seems to offer a solution but I'd like to hear from those who have used it before. Thanks Andy

5 0.14176612 41 high scalability-2007-07-30-Product: Flickr

Introduction: Flickr offers a free basic account with limited upload bandwidth and limited storage. Download bandwidth is unlimited. Upgrading to a paid Pro account for $25/year removes all upload and storage restrictions. Flickr's terms of use warn that "professional or corporate uses of Flickr are prohibited", and all external images require a link back to Flickr.

6 0.14130394 348 high scalability-2008-07-09-Federation at Flickr: Doing Billions of Queries Per Day

7 0.1332242 1215 high scalability-2012-03-26-7 Years of YouTube Scalability Lessons in 30 Minutes

8 0.12861875 198 high scalability-2008-01-01-HOW CDN works

9 0.12513138 241 high scalability-2008-02-05-SLA monitoring

10 0.11669297 1288 high scalability-2012-07-23-Ask HighScalability: How Do I Build My MegaUpload + Itunes + YouTube Startup?

11 0.10902143 274 high scalability-2008-03-12-YouTube Architecture

12 0.10768887 560 high scalability-2009-04-08-Learned lessons from the largest player (Flickr, YouTube, Google, etc)

13 0.10503747 856 high scalability-2010-07-12-Creating Scalable Digital Libraries

14 0.099110663 176 high scalability-2007-12-07-Synchronizing databases in different geographic locations

15 0.094869547 377 high scalability-2008-09-03-SMACKDOWN :: Who are the Open Source Content Management System (CMS) market leaders in 2008?

16 0.091040723 382 high scalability-2008-09-09-Content Delivery Networks (CDN) – a comprehensive list of providers

17 0.088205889 70 high scalability-2007-08-22-How many machines do you need to run your site?

18 0.083939776 576 high scalability-2009-04-21-What CDN would you recommend?

19 0.080411226 1399 high scalability-2013-02-05-Ask HighScalability: Memcached and Relations

20 0.077694274 1 high scalability-2007-07-06-Start Here


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.071), (1, 0.027), (2, -0.013), (3, -0.1), (4, 0.001), (5, -0.052), (6, -0.063), (7, -0.012), (8, 0.013), (9, 0.08), (10, -0.031), (11, -0.004), (12, -0.049), (13, -0.018), (14, 0.051), (15, 0.038), (16, 0.013), (17, 0.039), (18, 0.01), (19, -0.094), (20, -0.066), (21, 0.014), (22, 0.01), (23, 0.039), (24, -0.031), (25, -0.003), (26, 0.036), (27, -0.022), (28, 0.073), (29, 0.015), (30, 0.011), (31, 0.053), (32, 0.061), (33, -0.058), (34, -0.009), (35, -0.034), (36, 0.062), (37, -0.057), (38, -0.01), (39, 0.002), (40, 0.004), (41, 0.041), (42, -0.005), (43, 0.007), (44, -0.058), (45, 0.014), (46, -0.038), (47, -0.029), (48, -0.017), (49, 0.025)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.9751519 204 high scalability-2008-01-08-Virus Scanning for Uploaded content

Introduction: All, What is the best way to scan the content being uploaded by the users? Is there any open source solution available to do that? How does YouTube, flickr and other user uploadable content sites handle this? Any insight would be greatly appreciated! Regards, Janakan Rajendran.

2 0.72901589 198 high scalability-2008-01-01-HOW CDN works

Introduction: All, I'm just new to this and have a basic understanding how CDN works? My questions are: 1. How does CDN sync data with web servers for video/images? If I have a user to upload a video to my site, will it get stored directly in CDN or it comes to my webserver first and then sync-ed with cache server? 2. How to have only the dynamic video/image delivered through CDN while the rest is served by a webserver? 3. How sync happens and who pays for the bandwidth for sync? I'd appreciate if someone could explain this. Regards, Janakan Rajendran

3 0.6796 135 high scalability-2007-10-27-.Net2 and AJAX scalability?

Introduction: Am I mad to cons i der using .Net2 and AJAX for a high-scalabi l ity app l ication? In case you wonder why, it's the legacy of a webs i te bui l t on IIS and .Net 1.1, and we're look i ng for ways to make the content more attractive and interact i ve. In this case, it's a medical image l i brary being shared by a few Wikis and on l ine coursework for medica l students ( < 15K users) and doctors ( < 150K users) But I'm worr i ed about the performance overhead. We a l ready have a performance prob l em because of personal i sing the content for users according to their type (student or doctor), and for doctors, their grade and special i ty.

4 0.66962624 39 high scalability-2007-07-30-Product: Akamai

Introduction: Akamai transparently mirrors content (usually media objects such as audio, graphics, animation, video) stored on customer servers. Though the domain name is the same, the IP address points to an Akamai server rather than the customer's server. In addition to image caching, Akamai provides services which accelerate dynamic and personalized content, J2EE-compliant applications, and streaming media.

5 0.64563084 856 high scalability-2010-07-12-Creating Scalable Digital Libraries

Introduction: Like many other media content providers, libraries and museums are increasingly moving their content onto the Web.  While the move itself is no easy process (with digitization, web development, and training costs), being able to successfully deliver content to a wide audience is an ongoing concern, particularly for large libraries. Much of the concern is financial, as most libraries do not have the internal budget or outside investors that for-profit businesses enjoy.  Even large university libraries will face serious budget constraints that even other university departments, such as science and technology would not face. Creating a scalable infrastructure and also distributing a large digital collection that can handle multiple requests, requires planning that many librarians have not even imagined.  They must stop thinking in terms of "one-item-per-customer" and start thinking in terms of numerous users accessing the same information simultaneously. Content Delivery Network

6 0.63900864 285 high scalability-2008-03-19-Serving JavaScript Fast

7 0.63589019 100 high scalability-2007-09-26-Use a CDN to Instantly Improve Your Website's Performance by 20% or More

8 0.62308234 576 high scalability-2009-04-21-What CDN would you recommend?

9 0.61393058 382 high scalability-2008-09-09-Content Delivery Networks (CDN) – a comprehensive list of providers

10 0.59916806 1402 high scalability-2013-02-07-Ask HighScalability: Web asset server concept - 3rd party software available?

11 0.58212173 181 high scalability-2007-12-11-Hosting and CDN for startup video sharing site

12 0.57794219 199 high scalability-2008-01-01-S3 for image storing

13 0.56742901 60 high scalability-2007-08-07-Can you profit from the coming Content Delivery Network wars?

14 0.56489241 1102 high scalability-2011-08-22-Strategy: Run a Scalable, Available, and Cheap Static Site on S3 or GitHub

15 0.5495581 144 high scalability-2007-11-07-What CDN would you recommend?

16 0.54002053 59 high scalability-2007-08-04-Try Squid as a Reverse Proxy

17 0.53674507 41 high scalability-2007-07-30-Product: Flickr

18 0.53129143 238 high scalability-2008-02-04-IPS-IDS for heavy content site

19 0.52723372 1289 high scalability-2012-07-23-State of the CDN: More Traffic, Stable Prices, More Products, Profits - Not So Much

20 0.52597523 291 high scalability-2008-03-29-20 New Rules for Faster Web Pages


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(1, 0.759), (2, 0.04)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.99437582 574 high scalability-2009-04-20-Some things about Memcached from a Twitter software developer

Introduction: Memcached is generally treated as a black box. But what if you really need to know what's in there? Not for runtime purposes, but for optimization and capacity planning? Read more on Evan Weaver, a software developer working for Twitter (a contributor for Rails core and Mongrel).

2 0.99384409 438 high scalability-2008-11-05-Managing application on the cloud using a JMX Fabric

Introduction: This post describes how you can create a federated management model using JMX standard API. Applications that are already using a standard JMX interface can plug-in the new federated implementation without changing the application code and without introducing additional performance overhead.

3 0.99361986 1383 high scalability-2013-01-08-Sponsored Post: Flurry, Rumble Games, Booking, aiCache, Teradata Aster, Aerospike, Percona, ScaleOut, New Relic, NetDNA, GigaSpaces, Logic Monitor, AppDynamics, ManageEngine, Site24x7

Introduction: Who's Hiring? Flurry  has   built large-scale app measurement and advertising services that are used by more than 80,000 media companies and independent developers to monetize mobile and related platforms. If you're interested in joining a thriving, growing team, please check us out . Rumble Games is looking for a Senior Platform Engineer to build massively scalable and shared services for the next generation of online games. We have the best team this industry has seen, and we will transform the way people play together. Join us . We need awesome people @ Booking.com - We want YOU! Come design next generation interfaces, solve critical scalability problems, and hack on one of the largest Perl codebases. Apply: http://www.booking.com/jobs.en-us.html Teradata Aster is looking for Distributed Systems , Analytic Applications ,  and Performance Architects . As a member of the Architecture Group you will help define the technical roadmap for the product. The

4 0.9935106 1590 high scalability-2014-02-04-Sponsored Post: Logentries, Booking, Apple, MongoDB, BlueStripe, AiScaler, Aerospike, LogicMonitor, AppDynamics, ManageEngine, Site24x7

Introduction: Who's Hiring? Apple is hiring for multiple positions. Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Senior Server Side Engineer . The Emerging Technology team is looking for a highly motivated, detail-oriented, energetic individual with experience in a variety of big data technologies.  You will be part of a fast growing, cohesive team with many exciting responsibilities related to Big Data, including: Develop scalable, robust systems that will gather, process, store large amount of data Define/develop Big Data technologies for Apple internal and customer facing applications. Please apply here . Senior Server Side Engineer . The Emerging Technology team is looking for a highly motivated, detail-oriented, energetic individual with experience in a variety of big data technologies.  You will be part of a fast growing, cohesive team with many exciting responsibilities related

5 0.99303132 1376 high scalability-2012-12-25-Sponsored Post: Flurry, Rumble Games, Duolingo, Booking, aiCache, Teradata Aster, Hadapt, Aerospike, Percona, ScaleOut, New Relic, NetDNA, GigaSpaces, Logic Monitor, AppDynamics, ManageEngine, Site24x7

Introduction: Who's Hiring? Flurry  has   built large-scale app measurement and advertising services that are used by more than 80,000 media companies and independent developers to monetize mobile and related platforms. If you're interested in joining a thriving, growing team, please check us out . Rumble Games is looking for a Senior Platform Engineer to build massively scalable and shared services for the next generation of online games. We have the best team this industry has seen, and we will transform the way people play together. Join us . Duolingo , a fast-growing (>11% per week), free (no ads, no fees, no subscriptions) language learning site is looking for an infrastructure engineer to scale Duolingo to millions of users, please apply here . We need awesome people @ Booking.com - We want YOU! Come design next generation interfaces, solve critical scalability problems, and hack on one of the largest Perl codebases. Apply: http://www.booking.com/jobs.en-us.html

6 0.9930312 1370 high scalability-2012-12-11-Sponsored Post: Rumble Games, Duolingo, Booking, aiCache, Teradata Aster, Hadapt, Aerospike, Percona, ScaleOut, New Relic, NetDNA, GigaSpaces, Logic Monitor, AppDynamics, ManageEngine, Site24x7

7 0.99206448 1598 high scalability-2014-02-18-Sponsored Post: Couchbase, Tokutek, Logentries, Booking, Apple, MongoDB, BlueStripe, AiScaler, Aerospike, LogicMonitor, AppDynamics, ManageEngine, Site24x7

8 0.99081236 1391 high scalability-2013-01-22-Sponsored Post: Amazon, Zoosk, Booking, aiCache, Teradata Aster, Aerospike, Percona, ScaleOut, New Relic, NetDNA, Logic Monitor, AppDynamics, ManageEngine, Site24x7

9 0.99042749 1400 high scalability-2013-02-05-Sponsored Post: Amazon, Zoosk, aiCache, Teradata Aster, Aerospike, Percona, ScaleOut, New Relic, NetDNA, Logic Monitor, AppDynamics, ManageEngine, Site24x7

10 0.99002868 1409 high scalability-2013-02-19-Sponsored Post: OLO, Amazon, Zoosk, aiCache, Teradata Aster, Aerospike, Percona, ScaleOut, New Relic, Logic Monitor, AppDynamics, ManageEngine, Site24x7

same-blog 11 0.98974854 204 high scalability-2008-01-08-Virus Scanning for Uploaded content

12 0.98938596 1525 high scalability-2013-10-01-Sponsored Post: Apple, Intechnica, Couchbase, MongoDB, Stackdriver, BlueStripe, Surge, Booking, Rackspace, AiCache, Aerospike, New Relic, LogicMonitor, AppDynamics, ManageEngine, Site24x7

13 0.98589331 1569 high scalability-2013-12-24-Sponsored Post: Netflix, Logentries, Host Color, Booking, Spokeo, Apple, ScaleOut, MongoDB, BlueStripe, AiScaler, Aerospike, New Relic, LogicMonitor, AppDynamics, ManageEngine, Site24x7

14 0.98563039 1532 high scalability-2013-10-15-Sponsored Post: Apple, ScaleOut, FreeAgent, CloudStats.me, Intechnica, Couchbase, MongoDB, Stackdriver, BlueStripe, Booking, Rackspace, AiCache, Aerospike, New Relic, LogicMonitor, AppDynamics, ManageEngine, Site24x7

15 0.9851101 1518 high scalability-2013-09-17-Sponsored Post: Apple, Couchbase, Evernote, MongoDB, Stackdriver, BlueStripe, Surge, Booking, Rackspace, AiCache, Aerospike, New Relic, LogicMonitor, AppDynamics, ManageEngine, Site24x7

16 0.98501337 1510 high scalability-2013-09-03-Sponsored Post: Apple, Couchbase, Evernote, 10gen, Stackdriver, BlueStripe, Surge, Booking, Rackspace, AiCache, Aerospike, New Relic, LogicMonitor, AppDynamics, ManageEngine, Site24x7

17 0.98399007 1426 high scalability-2013-03-19-Sponsored Post: Fitbit, OLO, Amazon, aiCache, Aerospike, Percona, ScaleOut, New Relic, Logic Monitor, AppDynamics, ManageEngine, Site24x7

18 0.983989 1417 high scalability-2013-03-05-Sponsored Post: Fitbit, OLO, Amazon, aiCache, Aerospike, Percona, ScaleOut, New Relic, Logic Monitor, AppDynamics, ManageEngine, Site24x7

19 0.98392624 1583 high scalability-2014-01-21-Sponsored Post: Netflix, Logentries, Host Color, Booking, Apple, MongoDB, BlueStripe, AiScaler, Aerospike, LogicMonitor, AppDynamics, ManageEngine, Site24x7

20 0.98243189 1457 high scalability-2013-05-14-Sponsored Post: Dow Jones, Spotify, Evernote, Surge, Rackspace, Amazon, Booking, aiCache, Aerospike, Percona, ScaleOut, New Relic, LogicMonitor, AppDynamics, ManageEngine, Site24x7