high_scalability high_scalability-2008 high_scalability-2008-451 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Hi there, I have an idea for an online database that services a large number of people. I've been studying it for a while and it seems feasible to me to create it and get people to populate it. It will need time to grow but eventually it will get there. The model I'm looking at is IMDB, the depth of information is fascinating, yet it's fast, not so easy to use though, but it's pretty usable! What do you think I need to create a database an online database like IMDB. I know that IMDB power comes from it's information, not the design of the site. This is something I kind of figured out. But what I need to know is the best tools to publish database contents on the web, retrieve it in that fast way like IMDB. I'm sure that I will need to create data entry logs for my users to populate the database. What programming languages you suggest? development environment? approaches? your contribution is highly appreciated. Regards, Jalil
sentIndex sentText sentNum sentScore
1 Hi there, I have an idea for an online database that services a large number of people. [sent-1, score-0.427]
2 I've been studying it for a while and it seems feasible to me to create it and get people to populate it. [sent-2, score-1.067]
3 It will need time to grow but eventually it will get there. [sent-3, score-0.355]
4 The model I'm looking at is IMDB, the depth of information is fascinating, yet it's fast, not so easy to use though, but it's pretty usable! [sent-4, score-0.591]
5 What do you think I need to create a database an online database like IMDB. [sent-5, score-0.669]
6 I know that IMDB power comes from it's information, not the design of the site. [sent-6, score-0.263]
7 But what I need to know is the best tools to publish database contents on the web, retrieve it in that fast way like IMDB. [sent-8, score-1.001]
8 I'm sure that I will need to create data entry logs for my users to populate the database. [sent-9, score-0.955]
wordName wordTfidf (topN-words)
[('imdb', 0.501), ('populate', 0.367), ('studying', 0.209), ('contribution', 0.204), ('feasible', 0.199), ('hi', 0.171), ('depth', 0.166), ('contents', 0.165), ('figured', 0.163), ('publish', 0.156), ('create', 0.155), ('online', 0.153), ('retrieve', 0.152), ('usable', 0.151), ('suggest', 0.148), ('entry', 0.135), ('database', 0.122), ('need', 0.117), ('fascinating', 0.11), ('logs', 0.109), ('approaches', 0.108), ('information', 0.102), ('languages', 0.101), ('eventually', 0.095), ('fast', 0.09), ('know', 0.083), ('grow', 0.083), ('pretty', 0.079), ('environment', 0.077), ('seems', 0.077), ('comes', 0.075), ('kind', 0.074), ('though', 0.074), ('yet', 0.072), ('sure', 0.072), ('programming', 0.066), ('tools', 0.065), ('highly', 0.06), ('get', 0.06), ('looking', 0.059), ('development', 0.058), ('easy', 0.057), ('idea', 0.057), ('model', 0.056), ('power', 0.055), ('best', 0.051), ('design', 0.05), ('services', 0.049), ('something', 0.049), ('number', 0.046)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 451 high scalability-2008-11-30-Creating a high-performing online database
Introduction: Hi there, I have an idea for an online database that services a large number of people. I've been studying it for a while and it seems feasible to me to create it and get people to populate it. It will need time to grow but eventually it will get there. The model I'm looking at is IMDB, the depth of information is fascinating, yet it's fast, not so easy to use though, but it's pretty usable! What do you think I need to create a database an online database like IMDB. I know that IMDB power comes from it's information, not the design of the site. This is something I kind of figured out. But what I need to know is the best tools to publish database contents on the web, retrieve it in that fast way like IMDB. I'm sure that I will need to create data entry logs for my users to populate the database. What programming languages you suggest? development environment? approaches? your contribution is highly appreciated. Regards, Jalil
2 0.21652175 752 high scalability-2009-12-17-Oracle and IBM databases: Disk-based vs In-memory databases
Introduction: Current disk based RDBMS can run out of steam when processing large data. Can these problems be solved by migrating from a disk based RDBMS to an IMDB? Any limitations? To find out, I tested one of each from the two leading vendors who together hold 70% of the market share - Oracle's 11g and TimesTen 11g , and IBM's DB2 v9.5 and solidDB 6.3 . read more at BigDataMatters.com
3 0.12817262 54 high scalability-2007-08-02-Multilanguage Website
Introduction: Hi , someone can point me to some good resurce about how to bulid a multilanguage website ? the only resource i have found is this http://www.indiawebdevelopers.com/technology/multilanguage_support.asp thanks! p.s. great site ;)
4 0.094293177 64 high scalability-2007-08-10-How do we make a large real-time search engine?
Introduction: We're implementing a website which should be oriented to content and with massive access by public and we would need a search engine to index and execute queries on the indexes of contents (stored in a database, most likely MySQL InnoDB or Oracle). The solution we found is to implement a separate service to make index constantly the contents of the database at regular intervals. Anyway, this is a complex and not optimal solution, since we would like it to index in real time and make it searchable. Could you point me to some examples or articles I could review to design a solution for such this context?
5 0.092370093 91 high scalability-2007-09-13-Design Preparations for Scaling
Introduction: Hi there, what do you think is crucial in the code designing of a scalable site? How does one prepare for webfarms and clusters (e.g. in PHP)? Thanks, Stephan
6 0.090314984 123 high scalability-2007-10-15-Olympic Site Architecture
7 0.083293587 1288 high scalability-2012-07-23-Ask HighScalability: How Do I Build My MegaUpload + Itunes + YouTube Startup?
8 0.075468808 1277 high scalability-2012-07-05-10 Golden Principles For Building Successful Mobile-Web Applications
9 0.07500077 611 high scalability-2009-05-31-Need help on Site loading & database optimization - URGENT
10 0.072314486 640 high scalability-2009-06-28-Google Voice Architecture
11 0.072046809 1240 high scalability-2012-05-07-Startups are Creating a New System of the World for IT
12 0.071921974 167 high scalability-2007-11-27-Starting a website from scratch - what technologies should I use?
13 0.070490941 199 high scalability-2008-01-01-S3 for image storing
14 0.068749331 1559 high scalability-2013-12-06-Stuff The Internet Says On Scalability For December 6th, 2013
15 0.065544046 187 high scalability-2007-12-14-The Current Pros and Cons List for SimpleDB
16 0.064892806 8 high scalability-2007-07-12-Should I use LAMP or Windows?
17 0.064796843 1258 high scalability-2012-06-05-Thesis: Concurrent Programming for Scalable Web Architectures
19 0.063093372 750 high scalability-2009-12-16-Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud
topicId topicWeight
[(0, 0.121), (1, 0.032), (2, 0.017), (3, -0.015), (4, 0.046), (5, -0.007), (6, -0.017), (7, -0.018), (8, 0.012), (9, -0.006), (10, -0.016), (11, -0.001), (12, -0.029), (13, 0.004), (14, 0.068), (15, -0.06), (16, -0.004), (17, -0.014), (18, 0.001), (19, -0.014), (20, -0.008), (21, -0.035), (22, -0.031), (23, 0.024), (24, -0.01), (25, -0.031), (26, 0.007), (27, -0.041), (28, 0.021), (29, 0.006), (30, -0.011), (31, 0.019), (32, 0.012), (33, -0.019), (34, -0.029), (35, 0.027), (36, 0.057), (37, -0.032), (38, 0.007), (39, 0.007), (40, 0.03), (41, 0.014), (42, -0.013), (43, 0.013), (44, 0.028), (45, 0.016), (46, -0.042), (47, 0.026), (48, 0.006), (49, 0.001)]
simIndex simValue blogId blogTitle
same-blog 1 0.89632523 451 high scalability-2008-11-30-Creating a high-performing online database
Introduction: Hi there, I have an idea for an online database that services a large number of people. I've been studying it for a while and it seems feasible to me to create it and get people to populate it. It will need time to grow but eventually it will get there. The model I'm looking at is IMDB, the depth of information is fascinating, yet it's fast, not so easy to use though, but it's pretty usable! What do you think I need to create a database an online database like IMDB. I know that IMDB power comes from it's information, not the design of the site. This is something I kind of figured out. But what I need to know is the best tools to publish database contents on the web, retrieve it in that fast way like IMDB. I'm sure that I will need to create data entry logs for my users to populate the database. What programming languages you suggest? development environment? approaches? your contribution is highly appreciated. Regards, Jalil
2 0.76059335 167 high scalability-2007-11-27-Starting a website from scratch - what technologies should I use?
Introduction: Hi, if you were to design your own highly scalable website from scratch, what technologies would you use? Based on Web 2.0 popularity, LAMP seems to be high in the running. But would you tack on CakePHP? Drupal? or build your framework/CMS from scratch? What version of Linux runs best for a scalable website? Would you consider Windows and .NET? Java? Or do you want to throw a brick at me for even suggesting such heresies? Would you prefer Postgres, Tomcat, Perl, Python, or any of that other *NIX fancy stuff...why or why not? Please forget for the moment, "use what you know" argument. I am pretty versatile, and can look for an expert in whatever platform I choose. So all skills being equal, I'm looking for the best community support, the fastest development time and most importantly, the best scaling approach. Let's say, for fun, that I'm planning for the website to have as many messages going back & forth as an eBay. Definitely building this on a
3 0.75433517 1288 high scalability-2012-07-23-Ask HighScalability: How Do I Build My MegaUpload + Itunes + YouTube Startup?
Introduction: This question was sent in by Val, who asking for a little help in creating the next big thing. Any ideas? I'm planning to run my own, first startup website and have been surfing the webs for relevant info to plan the technology I will use for it (the frontend and the backend, including the software and the hardware). The website will be something like a combination of: MegaUpload (users will upload their files) iTunes (users will be paid for their uploads) and YouTube (in the future I'm planning to let users watch/listen to the content online, without downloading). I don't have any investors yet, nor the budget - I'm still preparing the idea and I'm going to create first implementation (an "alpha version") before I show it to potential investors. Hence the initial technologies have to be extremely cheap *but* also highly scalable in the future so that I don't have to redo anything when the website grows. Unfortunately I don't have much experience in running
4 0.7298708 1399 high scalability-2013-02-05-Ask HighScalability: Memcached and Relations
Introduction: Hi everybody I'm wondering what you would do: I develop a webapp using Grails, Memcached and Mysql as persistence. Now, I have following domain classes (simplified): Product : Can be in one category Category : Can have nested children, and have multiple products. I need to access all product objects by id which led me to the idea to store all products in one big Memcached-entry with a key: PRODUCTMAP and as value, all product attributes as array, like: [productId1: [title: 'title'], productId2: [title: 'title']] If I browse to category 4, I simply get my map categoryMap with value [cateoryId: [productId1, productId2]] I also can list all products of a certain category by providing that id. The bad thing about this is that I always have to put back everything if I modify a single product. Who can give advice how to realize that? Any help will be appreciated! Thanks, Best Sullivan
5 0.72195077 277 high scalability-2008-03-16-Do you have any questions for the Elastra CEO?
Introduction: It looks like in the near future I'll have a chance to interview the Elastra CEO. Elastra provides standard databases--MySQL, EnterpriseDB and PostgreSQL-- on top of EC2 and S3. They are selling aggressive pricing, expandable and contactable database resource usage in response to demand, and a simple management and operations interface to well known databases deployed in a cloud. Elastra could be an important option for developers looking for a more traditional cloudy database. I was wondering if you guys had any suggestions for questions you would like answered? What would you like to know about their service? What are you looking for in a cloudy database? What would stop you from adopting it or what would make you decide to adopt it? Any ideas you have would help a lot and will probably be better than anything I have.
6 0.71835941 375 high scalability-2008-09-01-A Scalability checklist?
7 0.71516907 222 high scalability-2008-01-25-Application Database and DAL Architecture
8 0.71064371 675 high scalability-2009-08-08-1dbase vs. many and cloud hosting vs. dedicated server(s)?
9 0.70910728 238 high scalability-2008-02-04-IPS-IDS for heavy content site
10 0.70420986 435 high scalability-2008-10-30-The case for functional decomposition
12 0.68248993 1388 high scalability-2013-01-16-What if Cars Were Rented Like We Hire Programmers?
13 0.6792497 235 high scalability-2008-02-02-The case against ORM Frameworks in High Scalability Architectures
14 0.67798847 1 high scalability-2007-07-06-Start Here
15 0.67430729 685 high scalability-2009-08-20-Dependency Injection and AOP frameworks for .NET
16 0.67308646 338 high scalability-2008-06-02-Total Cost of Ownership for different web development frameworks
17 0.67266977 199 high scalability-2008-01-01-S3 for image storing
18 0.65891051 384 high scalability-2008-09-16-EE-Appserver Clustering OR Terracota OR Coherence OR something else?
19 0.65862721 311 high scalability-2008-04-29-Strategy: Sample to Reduce Data Set
20 0.64841807 206 high scalability-2008-01-10-MONO ASP.NET. Will it make the web???
topicId topicWeight
[(1, 0.147), (2, 0.195), (36, 0.245), (61, 0.103), (79, 0.123), (85, 0.031), (94, 0.025)]
simIndex simValue blogId blogTitle
1 0.90007412 608 high scalability-2009-05-27-The Future of the Parallelism and its Challenges
Introduction: The Future of the Parallelism and its Challenges Research and education in Parallel computing technologies is more important than ever. Here I present a perspective on the past contributions, current status, and future direction of the parallelism technologies. While machine power will grow impressively, increased parallelism, rather than clock rate, will be driving force in computing in the foreseeable future. This ongoing shift toward parallel architectural paradigms is one of the greatest challenges for the microprocessor and software industries. In 2005, Justin Ratter, chief technology officer of Intel Corporation, said ‘We are at the cusp of a transition to multicore, multithreaded architectures, and we still have not demonstrated the ease of programming the move will require…’ Key points: A Little history Parallelism Challenges Under the hood, Parallelism Challenges Synchronization problems CAS problems The future of the parallelism
2 0.89953279 1254 high scalability-2012-05-30-Strategy: Get Servers for Free and Make Users Happy by Turning on Compression
Introduction: Edward Capriolo has a really interesting article on his dramatic performance expanding experience of turning on compression for Cassandra . The idea: Enabling compression shrunk 71GB of data down to 31GB, which caused more data to fit in RAM, which reduced disk IO to nearly nothing. Compression means more data can be stored, which is like buying more machines without having to spend more money. Compression means serving more data out of RAM, which means clients are happier because of the performance improvements. The cost is higher CPU usage to perform the encrypt/decrypt. But disk IO is orders of magnitude slower than decompression and most servers have CPU to burn. Edward's article is well written, has the specifics on how to turn on compression for Cassandra, pretty graphs, and lots more details.
Introduction: Urs Hoelzle , infrastructure guru and SVP at Google, made a really interesting statement about the economics of scale in the datacenter: We’ve shown that when you run a large application in the datacenter, like Gmail, you can, compared to a small organization running their own email server, you can save nearly a factor of 100 in terms of compute and energy, when you run it at scale. My first thought was shock at the magnitude of the difference. 100x is a chasm crosser. Then I thought about Gmail, it's horizontally scalable using technologies that are following Moore's Law (storage and compute), latency requirements are lax, a commodity network is sufficient, and it can be highly automated so management costs scale slower than users. After that it's a simple matter of software :-) Oh, and developing a market where it's "cheaper to run a large thing than a small thing."
Introduction: Oh, you don't have a Quantum Computer Lab staffed with researchers? Well, Google does. Here they are on G+ . To learn what they are up to the Verge has A first look inside Google's futuristic quantum lab . The lab is partnership between NASA, Google, and a 512-qubit D-Wave Two quantum computer. One result from the lab is: The first practical application has been on Google Glass, as engineers put the quantum chips to work on Glass's blink detector, helping it to better distinguish between intentional winks and involuntary blinks. For engineering reasons, the quantum processor can never be installed in Glass, but together with Google's conventional server centers, it can point the way to a better blink-detecting algorithm. That would allow the Glass processor to detect blinks with better accuracy and using significantly less power. If successful, it could be an important breakthrough for wink-triggered apps, which have struggled with the task so far. Google thinks quantum
5 0.87156379 4 high scalability-2007-07-10-Webcast: Advanced Database High Availability and Scalability Solutions
Introduction: If MySQL, PostgreSQL or EnterpriseDB High-Availability and Scalability issues are on your plate, you'll find this webcast very informative. Highly recommended! Webcast starts on Thursday, July 12, 2007 at 10:00AM PDT (1:00PM EDT, 18:00GMT). Duration: 50 minutes, plus Q&A; Advanced Database High-Availability and Scalability Solutions ImageProgram Agenda Disk Based Replication • Overview, major features • Benefits, use cases • Limitations and challenges Master/Slave Asynchronous Replication • Overview, major features • Benefits, use cases • Limitations and challenges Synchronous Multi-Master Cluster: Continuent uni/cluster • Cluster overview, major features • Cluster benefits, use cases • Limitations and challenges Product Positioning: HA Continuum • Comparisons • Key differentiators • How to pick the right solution Continuent Professional Services • HA Quick Assessment Service • HA JumpStart Implementation Services Q&A;
same-blog 6 0.86605752 451 high scalability-2008-11-30-Creating a high-performing online database
7 0.86437899 984 high scalability-2011-02-04-Stuff The Internet Says On Scalability For February 4, 2011
8 0.80465478 264 high scalability-2008-03-03-Read This Site and Ace Your Next Interview!
9 0.78249592 1216 high scalability-2012-03-27-Big Data In the Cloud Using Cloudify
10 0.77386743 644 high scalability-2009-06-29-eHarmony.com describes how they use Amazon EC2 and MapReduce
11 0.76101041 1037 high scalability-2011-05-10-Viddler Architecture - 7 Million Embeds a Day and 1500 Req-Sec Peak
12 0.75682181 1559 high scalability-2013-12-06-Stuff The Internet Says On Scalability For December 6th, 2013
13 0.75660402 935 high scalability-2010-11-05-Hot Scalability Links For November 5th, 2010
14 0.75622576 1064 high scalability-2011-06-20-35+ Use Cases for Choosing Your Next NoSQL Database
15 0.75606531 152 high scalability-2007-11-13-Flickr Architecture
17 0.75496393 392 high scalability-2008-09-24-Building a Scalable Architecture for Web Apps
18 0.75485659 954 high scalability-2010-12-06-What the heck are you actually using NoSQL for?
19 0.7548089 509 high scalability-2009-02-05-Product: HAProxy - The Reliable, High Performance TCP-HTTP Load Balancer
20 0.75462741 1106 high scalability-2011-08-26-Stuff The Internet Says On Scalability For August 26, 2011