high_scalability high_scalability-2007 high_scalability-2007-45 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: SmarterStats provides a solid architecture businesses and individual end users can use to track growth and forecast internet trends. * Track your website's growth and forecast internet trends * Features over 130 report items, plus Geographic Reporting * Log comparison saving 90% of your disk space * Email Reports available in Enterprise Edition * Enhanced data mining available in both editions
sentIndex sentText sentNum sentScore
1 SmarterStats provides a solid architecture businesses and individual end users can use to track growth and forecast internet trends. [sent-1, score-1.786]
wordName wordTfidf (topN-words)
[('forecast', 0.509), ('editions', 0.321), ('enhanced', 0.254), ('track', 0.254), ('growth', 0.211), ('mining', 0.198), ('geographic', 0.191), ('internet', 0.177), ('businesses', 0.172), ('reporting', 0.171), ('saving', 0.171), ('comparison', 0.154), ('reports', 0.152), ('report', 0.149), ('solid', 0.142), ('trends', 0.141), ('plus', 0.139), ('available', 0.137), ('items', 0.131), ('edition', 0.128), ('email', 0.12), ('individual', 0.113), ('log', 0.106), ('enterprise', 0.1), ('space', 0.097), ('website', 0.083), ('disk', 0.082), ('end', 0.071), ('features', 0.07), ('architecture', 0.055), ('users', 0.054), ('use', 0.028), ('data', 0.024)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 45 high scalability-2007-07-30-Product: SmarterStats
Introduction: SmarterStats provides a solid architecture businesses and individual end users can use to track growth and forecast internet trends. * Track your website's growth and forecast internet trends * Features over 130 report items, plus Geographic Reporting * Log comparison saving 90% of your disk space * Email Reports available in Enterprise Edition * Enhanced data mining available in both editions
2 0.0907313 304 high scalability-2008-04-19-How to build a real-time analytics system?
Introduction: Hello everybody! I am a developer of a website with a lot of traffic. Right now we are managing the whole website using perl + postgresql + fastcgi + memcached + mogileFS + lighttpd + roundrobin DNS distributed over 5 servers and I must say it works like a charm, load is stable and everything works very fast and we are recording about 8 million pageviews per day. The only problem is with postgres database since we have it installed only on one server and if this server goes down, the whole "cluster" goes down. That's why we have a master2slave replication so we still have a backup database except that when the master goes down, all inserts/updates are disabled so the whole website is just read only. But this is not a problem since this configuration is working for us and we don't have any problems with it. Right now we are planning to build our own analytics service that would be customized for our needs. We tried various different software packages but were not satisfi
3 0.089204833 37 high scalability-2007-07-28-Product: Web Log Storming
Introduction: Web Log Storming is an interactive, desktop-based Web Log Analyzer for Windows. The whole new concept of log analysis makes it clearly different from any other web log analyzer. Browse through statistics to get into details - down to individual visitor's session. Check individual visitor behavior pattern and how it fits into your desired scenario. Web Log Storming does far more than just generate common reports - it displays detailed web site statistics with interactive graphs and reports. Very complete detailed log analysis of activity from every visitor to your web site is only a mouse-click away. In other words, analyze your web logs like never before! It's easy to track sessions, hits, page views, downloads, or whatever metric is most important to each user. You can look at referring pages and see which search engines and keywords were used to bring visitors to the site. Web site behavior, from the top entry and exit pages, to the paths that users follow, can be analyzed. You
4 0.087712713 105 high scalability-2007-10-01-Statistics Logging Scalability
Introduction: My company is developing a centralized web platform to service our clients. We currently use about 3Mb/s on our uplink at our ISP serving web pages for about 100 clients. We'd like to offer them statistics that mean something to their businesses and have been contemplating writing our own statistics code to handle the task. All statistics would be gathered at the page view level and we're implementing a HttpModule in ASP.Net 2.0 to handle the gather of the data. That said, I'm curious to hear comments on writing this data (~500 bytes of log data/page request). We need to write this data somewhere and then build a process to aggregate the data into a warehouse application used in our reporting system. Google Analytics is out of the question because we do not want our hosting infrastructure dependant upon a remote server. Web Trends et al. are too expensive for our clients. I'm thinking of a couple of options. 1) Writing log data directly to a SQL Server 2000 db and havin
5 0.085481711 46 high scalability-2007-07-30-Product: Sun Utility Computing
Introduction: The Sun Grid Compute Utility is a simple to use, simple to access data center-on-demand. Sun Grid delivers enterprise computing power and resources over the Internet, enabling developers, researchers, scientists and businesses to optimize performance, speed time to results, and accelerate innovation without investment in IT infrastructure. No matter the size of your business or the size of your job -- there is no barrier to entry and exit. This is the future of computing available today: IT as a service.
6 0.073283099 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
7 0.072838724 410 high scalability-2008-10-13-SQL Server 2008 Database Performance and Scalability
8 0.072119899 1390 high scalability-2013-01-21-Processing 100 Million Pixels a Day - Small Amounts of Contention Cause Big Problems at Scale
9 0.070585318 202 high scalability-2008-01-06-Email Architecture
10 0.069776133 1020 high scalability-2011-04-12-Caching and Processing 2TB Mozilla Crash Reports in memory with Hazelcast
11 0.069596462 407 high scalability-2008-10-10-The Art of Capacity Planning: Scaling Web Resources
16 0.060710497 221 high scalability-2008-01-24-Mailinator Architecture
18 0.059987564 453 high scalability-2008-12-01-Breakthrough Web-Tier Solutions with Record-Breaking Performance
19 0.059826173 731 high scalability-2009-10-28-Need for change in your IT infrastructure
20 0.059799377 30 high scalability-2007-07-26-Product: AWStats a Log Analyzer
topicId topicWeight
[(0, 0.075), (1, 0.004), (2, 0.015), (3, -0.014), (4, 0.014), (5, -0.008), (6, -0.0), (7, -0.023), (8, 0.008), (9, 0.033), (10, -0.031), (11, -0.015), (12, 0.024), (13, 0.006), (14, 0.058), (15, 0.042), (16, 0.015), (17, 0.003), (18, -0.005), (19, 0.019), (20, -0.027), (21, -0.027), (22, -0.024), (23, 0.065), (24, 0.031), (25, -0.004), (26, -0.049), (27, 0.005), (28, -0.03), (29, -0.018), (30, -0.022), (31, -0.037), (32, 0.013), (33, -0.045), (34, -0.036), (35, 0.035), (36, -0.027), (37, 0.006), (38, 0.028), (39, -0.01), (40, 0.007), (41, 0.031), (42, 0.021), (43, 0.029), (44, -0.026), (45, 0.001), (46, -0.016), (47, 0.011), (48, -0.085), (49, -0.016)]
simIndex simValue blogId blogTitle
same-blog 1 0.9696089 45 high scalability-2007-07-30-Product: SmarterStats
Introduction: SmarterStats provides a solid architecture businesses and individual end users can use to track growth and forecast internet trends. * Track your website's growth and forecast internet trends * Features over 130 report items, plus Geographic Reporting * Log comparison saving 90% of your disk space * Email Reports available in Enterprise Edition * Enhanced data mining available in both editions
2 0.67604989 541 high scalability-2009-03-16-Product: Smart Inspect
Introduction: Smart Inspect has added quite a few features specifically tailored to high scalability and high performance environments to our tool over the years. This includes the ability to log to memory and dump log files on demand (when a crash occurs for example), special backlog queue features, a log service application for central log storage and a lot more. Additionally, our SmartInspect Console (the viewer application) makes viewing, filtering and inspecting large amounts of logging data a lot easier/practical.
Introduction: This is a guest post by Gordon Worley , a Software Engineer at Korrelate , where they correlate (see what they did there) online purchases to offline purchases. Several weeks ago, we came into the office one morning to find every server alarm going off. Pixel log processing was behind by 8 hours and not making headway. Checking the logs, we discovered that a big client had come online during the night and was giving us 10 times more traffic than we were originally told to expect. I wouldn’t say we panicked, but the office was certainly more jittery than usual. Over the next several hours, though, thanks both to foresight and quick thinking, we were able to scale up to handle the added load and clear the backlog to return log processing to a steady state. At Korrelate, we deploy tracking pixels , also known beacons or web bugs, that our partners use to send us information about their users. These tiny web objects contain no visible content, but may include transparent 1 by 1 gif
4 0.64099211 35 high scalability-2007-07-28-Product: FastStats Log Analyzer
Introduction: FastStats Log Analyzer enables you to: Determine whether your CPC advertising is profitable: Are you spending $0.75 per click on Google or Overture, but only receiving $0.56 per click in revenue? Tune site traffic patterns: FastStats's Hyperlink Tree View feature lets you visually see how traffic flows through your web site. High-performance solution for even the busiest web sites: Our software has been clocked at over 1000 MB/min. Other popular log file analysis tools (we won't name names), run at 1/40th the speed. We've been in the business for over 6 years, delivering value, quality, and good customer service to our clients. Our products are used for data mining at some of the world's busiest web sites -- why not give FastStats a try at your web site? FastStats log file analysis supports a wide variety of web server log files, including Apache logs and Microsoft IIS logs.
5 0.63627982 30 high scalability-2007-07-26-Product: AWStats a Log Analyzer
Introduction: AWStats is a free powerful and featureful tool that generates advanced web, streaming, ftp or mail server statistics, graphically. This log analyzer works as a CGI or from command line and shows you all possible information your log contains, in few graphical web pages. It uses a partial information file to be able to process large log files, often and quickly. It can analyze log files from all major server tools like Apache log files (NCSA combined/XLF/ELF log format or common/CLF log format), WebStar, IIS (W3C log format) and a lot of other web, proxy, wap, streaming servers, mail servers and some ftp servers.
6 0.62483436 553 high scalability-2009-04-03-Collectl interface to Ganglia - any interest?
7 0.61380255 36 high scalability-2007-07-28-Product: Web Log Expert
8 0.61370713 37 high scalability-2007-07-28-Product: Web Log Storming
9 0.59688634 1196 high scalability-2012-02-20-Berkeley DB Architecture - NoSQL Before NoSQL was Cool
10 0.59178561 233 high scalability-2008-01-30-How Rackspace Now Uses MapReduce and Hadoop to Query Terabytes of Data
11 0.5898869 77 high scalability-2007-08-30-Log Everything All the Time
12 0.58984172 449 high scalability-2008-11-24-Product: Scribe - Facebook's Scalable Logging System
13 0.57549006 105 high scalability-2007-10-01-Statistics Logging Scalability
14 0.57059449 570 high scalability-2009-04-15-Implementing large scale web analytics
15 0.56844622 937 high scalability-2010-11-09-Paper: Hyder - Scaling Out without Partitioning
16 0.50956303 304 high scalability-2008-04-19-How to build a real-time analytics system?
17 0.50148869 488 high scalability-2009-01-08-file synchronization solutions
18 0.46726385 403 high scalability-2008-10-06-Paper: Scaling Genome Sequencing - Complete Genomics Technology Overview
19 0.46177518 28 high scalability-2007-07-25-Product: NetApp MetroCluster Software
20 0.45510948 1256 high scalability-2012-06-04-OpenFlow-SDN is Not a Silver Bullet for Network Scalability
topicId topicWeight
[(1, 0.221), (2, 0.099), (56, 0.516)]
simIndex simValue blogId blogTitle
1 0.93542945 1394 high scalability-2013-01-25-Stuff The Internet Says On Scalability For January 25, 2013
Introduction: Sorry, Stuff the Internet Says has been called on the account of a power outage. Gods of rain and tree have interfered with thee. Instead, how about watching a little Python? (that's Monty, not the language)
same-blog 2 0.8725425 45 high scalability-2007-07-30-Product: SmarterStats
Introduction: SmarterStats provides a solid architecture businesses and individual end users can use to track growth and forecast internet trends. * Track your website's growth and forecast internet trends * Features over 130 report items, plus Geographic Reporting * Log comparison saving 90% of your disk space * Email Reports available in Enterprise Edition * Enhanced data mining available in both editions
3 0.75933796 732 high scalability-2009-10-29-Digg - Looking to the Future with Cassandra
Introduction: Digg has been researching ways to scale our database infrastructure for some time now. We’ve adopted a traditional vertically partitioned master-slave configuration with MySQL, and also investigated sharding MySQL with IDDB . Ultimately, these solutions left us wanting. In the case of the traditional architecture, the lack of redundancy on the write masters is painful, and both approaches have significant management overhead to keep running. Since it was already necessary to abandon data normalization and consistency to make these approaches work, we felt comfortable looking at more exotic, non-relational data stores. After considering HBase, Hypertable, Cassandra, Tokyo Cabinet/Tyrant, Voldemort, and Dynomite, we settled on Cassandra . Each system has its own strengths and weaknesses, but Cassandra has a good blend of everything. It offers column-oriented data storage, so you have a bit more structure than plain key/value stores. It operates in a distributed, highly available,
4 0.6938976 779 high scalability-2010-02-16-Seven Signs You May Need a NoSQL Database
Introduction: While exploring deep into some dusty old library stacks, I dug up Nostradamus' long lost NoSQL codex. What are the chances? Strangely, it also gave the plot to the next Dan Brown novel, but I left that out for reasons of sanity. About NoSQL, here is what Nosty (his friends call him Nosty) predicted are the signs you may need a NoSQL database... You noticed a lot of your database fields are really serialized complex objects in disguise . Why bother with a RDBMS at all then? Storing serialized objects in a relational database is like being on the pill while trying to get pregnant, a bit counter productive. Just use a schemaless database from the start. Using a standard query language has become too confining . You just want to be free. SQL is so easy, so convenient, and so standard, it's really not a challenge anymore. You need to be different. Then NoSQL is for you. Each has their own completely different query mechanism . Your toolbox only contains a hammer . Hammers wh
5 0.67143166 479 high scalability-2008-12-29-Platform virtualization - top 25 providers (software, hardware, combined)
Introduction: In this article they present the companies which offers means (mainly, the software and hardware) which powers most of the cloud computing hosting providers, namely virtualization solutions. Read the entire article about Platform virtualization - top 25 providers (software, hardware, combined) at MyTestBox.com - web software reviews, news, tips & tricks .
6 0.64121807 67 high scalability-2007-08-17-What is the best hosting option?
7 0.62845886 941 high scalability-2010-11-15-How Google's Instant Previews Reduces HTTP Requests
8 0.61243516 446 high scalability-2008-11-18-Scalability Perspectives #2: Van Jacobson – Content-Centric Networking
9 0.58904207 1022 high scalability-2011-04-13-Paper: NoSQL Databases - NoSQL Introduction and Overview
10 0.57643032 854 high scalability-2010-07-09-Hot Scalability Links for July 9, 2010
11 0.55531788 1322 high scalability-2012-09-14-Stuff The Internet Says On Scalability For September 14, 2012
12 0.53666043 947 high scalability-2010-11-23-Sponsored Post: Imo, Undertone, Joyent, Appirio, Tuenti, CloudSigma, ManageEngine, Site24x7
13 0.52869779 659 high scalability-2009-07-20-A Scalability Lament
15 0.5171327 759 high scalability-2010-01-11-Strategy: Don't Use Polling for Real-time Feeds
16 0.5141269 245 high scalability-2008-02-12-Product: rPath - Creating and Managing Virtual Appliances
17 0.50253183 815 high scalability-2010-04-27-Paper: Dapper, Google's Large-Scale Distributed Systems Tracing Infrastructure
19 0.4712351 46 high scalability-2007-07-30-Product: Sun Utility Computing