high_scalability high_scalability-2007 high_scalability-2007-62 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Is there any alternative to LIKE '%...%' OR LIKE '%...%' in MySQL if you have to offer partial string matching on a large dataset?
sentIndex sentText sentNum sentScore
1 %' in MySQL if you have to offer partial string matching on a large dataset? [sent-7, score-1.685]
wordName wordTfidf (topN-words)
[('string', 0.446), ('dataset', 0.443), ('matching', 0.443), ('partial', 0.424), ('alternative', 0.326), ('offer', 0.255), ('mysql', 0.166), ('like', 0.128), ('large', 0.117)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 62 high scalability-2007-08-08-Partial String Matching
Introduction: Is there any alternative to LIKE '%...%' OR LIKE '%...%' in MySQL if you have to offer partial string matching on a large dataset?
2 0.14448838 476 high scalability-2008-12-28-How to Organize a Database Table’s Keys for Scalability
Introduction: The key (no pun intended) to understanding how to organize your dataset’s data is to think of each shard not as an individual database, but as one large singular database. Just as in a normal single server database setup where you have a unique key for each row within a table, each row key within each individual shard must be unique to the whole dataset partitioned across all shards. There are a few different ways we can accomplish uniqueness of row keys across a shard cluster. Each has its pro’s and con’s and the one chosen should be specific to the problems you’re trying to solve.
3 0.13947389 1404 high scalability-2013-02-11-At Scale Even Little Wins Pay Off Big - Google and Facebook Examples
Introduction: There's a popular line of thought that says don't waste time on optimization because developing features is more important than saving money. True, you can always add resources, but at some point, especially in a more mature part of a product lifecycle: performance equals $$$. Two great examples of this evolution come from Facebook and Google. The upshot is that when you spend time and money on optimizing your tool chain you can get huge wins in performance, control, and costs. Certainly, don’t bother if you are just starting, but at some point you may want to switch to big development efforts in improving efficiency. Facebook and HipHop The Facebook example is quite well known: HipHop , a static PHP compiler released in 2010, after two years of development. PHP because Facebook implements their web tier in PHP . They've now developed a dynamic compiler, HipHop VM , using techniques like JIT, side exits, HipHop bytecode, type prediction, and parallel tracelet l
4 0.11232122 307 high scalability-2008-04-21-Using Google AppEngine for a Little Micro-Scalability
Introduction: Over the years I've accumulated quite a rag tag collection of personal systems scattered wide across a galaxy of different servers. For the past month I've been on a quest to rationalize this conglomeration by moving everything to a managed service of one kind or another. The goal: lift a load of worry from my mind. I like to do my own stuff my self so I learn something and have control. Control always comes with headaches and it was time for a little aspirin. As part of the process GAE came in handy as a host for a few Twitter related scripts I couldn't manage to run anywhere else. I recoded my simple little scripts into Python/GAE and learned a lot in the process. In the move I exported HighScalability from a VPS and imported it into a shared hosting service. I could never quite configure Apache and MySQL well enough that they wouldn't spike memory periodically and crash the VPS. And since a memory crash did not automatically restarted it was unacceptable. I also wrote a scrip
5 0.11165792 115 high scalability-2007-10-07-Using ThreadLocal to pass context information around in web applications
Introduction: Hi, In java web servers, each http request is handled by a thread in thread pool. So for a Servlet handling the request, a thread is assigned. It is tempting (and very convinient) to keep context information in the threadlocal variable. I recently had a requirement where we need to assign logged in user id and timestamp to request sent to web services. Because we already had the code in place, it was extremely difficult to change the method signatures to pass user id everywhere. The solution I thought is class ReferenceIdGenerator { public static setReferenceId(String login) { threadLocal.set(login + System.currentMillis()); } public static String getReferenceId() { return threadLocal.get(); } private static ThreadLocal threadLocal = new ThreadLocal(); } class MySevlet { void service(.....) { HttpSession session = request.getSession(false); String userId = session.get("userId"); ReferenceIdGenerator.setRefernceId(userId
6 0.10485667 196 high scalability-2007-12-30-MySQL clustering strategies and comparisions
7 0.10261094 1001 high scalability-2011-03-09-Google and Netflix Strategy: Use Partial Responses to Reduce Request Sizes
8 0.093187451 561 high scalability-2009-04-08-N+1+caching is ok?
9 0.091704898 956 high scalability-2010-12-08-How To Get Experience Working With Large Datasets
10 0.07899642 654 high scalability-2009-07-09-No to SQL? Anti-database movement gains steam – My Take
11 0.076582968 1048 high scalability-2011-05-27-Stuff The Internet Says On Scalability For May 27, 2011
12 0.076268107 279 high scalability-2008-03-17-Microsoft's New Database Cloud Ready to Rumble with Amazon
13 0.075667143 482 high scalability-2009-01-04-Alternative Memcache Usage: A Highly Scalable, Highly Available, In-Memory Shard Index
14 0.072728321 664 high scalability-2009-07-29-Strategy: Devirtualize for More Vroom
15 0.071271896 1407 high scalability-2013-02-15-Stuff The Internet Says On Scalability For February 15, 2013
16 0.069419175 17 high scalability-2007-07-16-Paper: Guide to Cost-effective Database Scale-Out using MySQL
17 0.068749465 1452 high scalability-2013-05-06-7 Not So Sexy Tips for Saving Money On Amazon
18 0.067223057 15 high scalability-2007-07-16-Blog: MySQL Performance Blog - Everything about MySQL Performance.
19 0.066818684 504 high scalability-2009-01-29-Event: MySQL Conference & Expo 2009
20 0.064827628 1633 high scalability-2014-04-16-Six Lessons Learned the Hard Way About Scaling a Million User System
topicId topicWeight
[(0, 0.043), (1, 0.013), (2, -0.018), (3, -0.015), (4, 0.011), (5, 0.046), (6, -0.035), (7, -0.041), (8, 0.01), (9, -0.025), (10, -0.002), (11, -0.009), (12, 0.046), (13, 0.026), (14, 0.013), (15, -0.005), (16, -0.019), (17, -0.002), (18, 0.002), (19, -0.026), (20, 0.021), (21, -0.012), (22, -0.057), (23, 0.033), (24, 0.021), (25, 0.065), (26, 0.026), (27, -0.014), (28, 0.048), (29, -0.02), (30, -0.037), (31, -0.017), (32, 0.007), (33, 0.041), (34, 0.017), (35, 0.032), (36, 0.011), (37, 0.008), (38, -0.022), (39, -0.032), (40, 0.068), (41, -0.016), (42, 0.028), (43, -0.01), (44, -0.015), (45, -0.041), (46, -0.053), (47, -0.047), (48, 0.034), (49, -0.04)]
simIndex simValue blogId blogTitle
same-blog 1 0.95359302 62 high scalability-2007-08-08-Partial String Matching
Introduction: Is there any alternative to LIKE '%...%' OR LIKE '%...%' in MySQL if you have to offer partial string matching on a large dataset?
2 0.7962786 16 high scalability-2007-07-16-Book: High Performance MySQL
Introduction: As users come to depend on MySQL, they find that they have to deal with issues of reliability, scalability, and performance--issues that are not well documented but are critical to a smoothly functioning site. This book is an insider's guide to these little understood topics. Author Jeremy Zawodny has managed large numbers of MySQL servers for mission-critical work at Yahoo!, maintained years of contacts with the MySQL AB team, and presents regularly at conferences. Jeremy and Derek have spent months experimenting, interviewing major users of MySQL, talking to MySQL AB, benchmarking, and writing some of their own tools in order to produce the information in this book. In High Performance MySQL you will learn about MySQL indexing and optimization in depth so you can make better use of these key features. You will learn practical replication, backup, and load-balancing strategies with information that goes beyond available tools to discuss their effects in real-life environments. And you
3 0.78605187 196 high scalability-2007-12-30-MySQL clustering strategies and comparisions
Introduction: Compare: 1. MySQL Clustering(ndb-cluster stogare) 2. MySQL / GFS-GNBD/ HA 3. MySQL / DRBD /HA 4. MySQL Write Master / Multiple MySQL Read Slaves 5. Standalone MySQL Servers(Functionally seperated)
4 0.74975425 586 high scalability-2009-04-29-Presentations: MySQL Conference & Expo 2009
Introduction: The Presentations of the MySQL Conference & Expo 2009 held April 20-23 in Santa Clara is available on the above link. They include: Beginner's Guide to Website Performance with MySQL and memcached by Adam Donnison Calpont: Open Source Columnar Storage Engine for Scalable MySQL DW by Jim Tommaney Creating Quick and Powerful Web Applications with MySQL, GlassFish, and NetBeans by Arun Gupta Deep-inspecting MySQL with DTrace by Domas Mituzas Distributed Innodb Caching with memcached by Matthew Yonkovit and Yves Trudeau Improving Performance by Running MySQL Multiple Times by MC Brown Introduction to Using DTrace with MySQL by Vince Carbone MySQL Cluster 7.0 - New Features by Johan Andersson Optimizing MySQL Performance with ZFS by Allan Packer SAN Performance on a Internal Disk Budget: The Coming Solid State Disk Revolution by Matthew Yonkovit This is Not a Web App: The Evolution of a MySQL Deployment at Google by Mark Callaghan
5 0.73206758 15 high scalability-2007-07-16-Blog: MySQL Performance Blog - Everything about MySQL Performance.
Introduction: Follow this blog and you'll learn a lot about MySQL and how to make it sing. A Quick Hit of What's Inside Working with large data sets in MySQL, PHP Large result sets and summary tables, Implementing efficient counters with MySQL. Site: http://www.mysqlperformanceblog.com/
6 0.71906692 303 high scalability-2008-04-18-Scaling Mania at MySQL Conference 2008
7 0.71551412 17 high scalability-2007-07-16-Paper: Guide to Cost-effective Database Scale-Out using MySQL
8 0.6815418 504 high scalability-2009-01-29-Event: MySQL Conference & Expo 2009
9 0.62025559 454 high scalability-2008-12-01-Deploying MySQL Database in Solaris Cluster Environments
10 0.60889888 252 high scalability-2008-02-18-limit on the number of databases open
11 0.60815287 455 high scalability-2008-12-01-MySQL Database Scale-out and Replication for High Growth Businesses
12 0.60734105 729 high scalability-2009-10-28-And the winner is: MySQL or Memcached or Tokyo Tyrant?
13 0.60528487 214 high scalability-2008-01-15-Sun to Acquire MySQL
14 0.59449315 213 high scalability-2008-01-15-Does Sun Buying MySQL Change Your Scaling Strategy?
15 0.59017771 770 high scalability-2010-02-03-NoSQL Means Never Having to Store Blobs Again
17 0.56761926 995 high scalability-2011-02-24-Strategy: Eliminate Unnecessary SQL
18 0.54315329 465 high scalability-2008-12-14-Scaling MySQL on a 256-way T5440 server using Solaris ZFS and Java 1.7
19 0.53218985 73 high scalability-2007-08-23-Postgresql on high availability websites?
20 0.52751237 487 high scalability-2009-01-08-Paper: Sharding with Oracle Database
topicId topicWeight
[(2, 0.112), (31, 0.626)]
simIndex simValue blogId blogTitle
same-blog 1 0.91682899 62 high scalability-2007-08-08-Partial String Matching
Introduction: Is there any alternative to LIKE '%...%' OR LIKE '%...%' in MySQL if you have to offer partial string matching on a large dataset?
2 0.68412662 207 high scalability-2008-01-10-Sharding with Cookie-Based Session Storage
Introduction: In a recent project, I utilized RoR's cookie-based session storage to shard geographically distinct user groups. My technique for doing so was unique and, although it was a premature optimization, it is none-the-less an idea worth exploring.
3 0.57591665 615 high scalability-2009-06-01-HotPads on AWS
Introduction: HotPads abandoned our managed hosting in December and took the leap over to EC2 and its siblings. The presentation has a lot of detail on costs and other things to watch out for, so if you're currently planning your "cloud" architecture, you'll find some of this really helpful.
4 0.41101182 1651 high scalability-2014-05-20-It's Networking. In Space! Or How E.T. Will Phone Home.
Introduction: What will the version of the Internet that follows us to the stars look like? Yes, people are really thinking seriously about this sort of thing. Specifically the InterPlanetary Networking Special Interest Group (IPNSIG). Ansible-like faster-than-light communication it isn't. There's no magical warp drive. Nor is a network of telepaths acting as a 'verse spanning telegraph system. It's more mundane than that. And in many ways more interesting as it's sort of like the old Internet on steroids, the one that was based on on UUCP and dial-up connections, but over vastly longer distances and with much longer delays : The Interplanetary Internet (based on IPN, also called InterPlaNet) is a conceived computer network in space, consisting of a set of network nodes which can communicate with each other.[1][2] Communication would be greatly delayed by the great interplanetary distances, so the IPN needs a new set of protocols and technology that are tolerant to large delays and
5 0.40289727 368 high scalability-2008-08-17-Wuala - P2P Online Storage Cloud
Introduction: How do you design a reliable distributed file system when the expected availability of the individual nodes are only ~1/5? That is the case for P2P systems. Dominik Grolimund, the founder of a Swiss startup Caleido will show you how! They have launched Wuala , the social online storage service which scales as new nodes join the P2P network. The goal of Wua.la is to provide distributed online storage that is: large scalable reliable secure by harnessing the idle resources of participating computers. This challenge is an old dream of computer science. In fact as Andrew Tanenbaum wrote in 1995: "The design of a world-wide, fully transparent distributed filesystem fot simultaneous use by millions of mobile and frequently disconnected users is left as an exercise for the reader" After three years of research and development at at ETH Zurich, the Swiss Federal Institute of Technology on a distributed storage system, Caleido is ready to unveil the resu
6 0.32612997 892 high scalability-2010-09-02-Distributed Hashing Algorithms by Example: Consistent Hashing
7 0.2873522 785 high scalability-2010-02-26-MySQL and Memcached: End of an Era?
8 0.28136063 1255 high scalability-2012-06-01-Stuff The Internet Says On Scalability For June 1, 2012
9 0.27340746 702 high scalability-2009-09-11-The interactive cloud
10 0.27161732 294 high scalability-2008-04-01-How to update video views count effectively?
11 0.20480308 909 high scalability-2010-09-28-Sponsored Post: Wiredrive, Joyent, DeviantART, CloudSigma, ManageEngine, Site24x7
12 0.20056298 719 high scalability-2009-10-09-Have you collectl'd yet? If not, maybe collectl-utils will make it easier to do so
13 0.1767163 56 high scalability-2007-08-03-Running Hadoop MapReduce on Amazon EC2 and Amazon S3
14 0.1767163 565 high scalability-2009-04-13-Benchmark for keeping data in browser in AJAX projects
15 0.17662832 436 high scalability-2008-11-02-Strategy: How to Manage Sessions Using Memcached
16 0.17651585 223 high scalability-2008-01-25-Google: Introduction to Distributed System Design
17 0.17592393 836 high scalability-2010-06-04-Strategy: Cache Larger Chunks - Cache Hit Rate is a Bad Indicator
18 0.17588454 878 high scalability-2010-08-12-Strategy: Terminate SSL Connections in Hardware and Reduce Server Count by 40%
19 0.17571527 911 high scalability-2010-09-30-More Troubles with Caching
20 0.17451946 594 high scalability-2009-05-08-Eight Best Practices for Building Scalable Systems