high_scalability high_scalability-2009 high_scalability-2009-746 knowledge-graph by maker-knowledge-mining

746 high scalability-2009-11-26-Kngine Snippet Search New Indexing Technology


meta infos for this blog

Source: html

Introduction: While Kngine just announce some improvement and new features , I would like you take you in small trip in Snippet Search research project at Kngine.   What is Kngine? Kngine is startup company working in Searching technologies, We in Kngine aims to organize the human beings Systematic Knowledge and Experiences and make it accessible to everyone. We aim to collect and organize all objective data, and make it possible and easy to access. Our goal is to build Web 3.0 Web Search Engine on the advances of Web Search Engine, Semantic Web, Data Representation technologies a new form of Web Search Engine that will unleash a revolution of new possibilities.   Introduction to Snippet Search Today, The Web Search Engine’s is the Web getaway, especially to get specific information. But unfortunately the search engines didn’t changed mush as the Web changed from 90’s. Since the 90’s the Web search engine still provide the same kind of results: Links to documents. We i


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 While Kngine just announce some improvement and new features , I would like you take you in small trip in Snippet Search research project at Kngine. [sent-1, score-0.189]

2 Kngine is startup company working in Searching technologies, We in Kngine aims to organize the human beings Systematic Knowledge and Experiences and make it accessible to everyone. [sent-3, score-0.365]

3 We aim to collect and organize all objective data, and make it possible and easy to access. [sent-4, score-0.295]

4 0 Web Search Engine on the advances of Web Search Engine, Semantic Web, Data Representation technologies a new form of Web Search Engine that will unleash a revolution of new possibilities. [sent-6, score-0.265]

5 But unfortunately the search engines didn’t changed mush as the Web changed from 90’s. [sent-8, score-0.714]

6 Since the 90’s the Web search engine still provide the same kind of results: Links to documents. [sent-9, score-0.56]

7 We in Kngine think that it’s the time to rethink about the results contents; because a lot of times we need summary, structured information, list of things, or direct answer. [sent-10, score-0.426]

8 Snippet Search Snippet Search research started in September 2009. [sent-11, score-0.079]

9 Snippet Search is new indexing technology aims to provides more relevant search results (short-term) and answer the abstract questions (long term) that can't handled by the Question Answering Engines. [sent-12, score-0.943]

10 Snippet Search aims to change the Web results that we are familiar with. [sent-13, score-0.454]

11 Today when you search about something, you will get list of documents links with small description. [sent-15, score-0.676]

12 But if you focus on the description that appear bellow the document title, you will found that it not related to what you looking for, and even if it’s related it’s not completed, which means that you will still need to open more pages and search about what you looking for (i. [sent-16, score-1.005]

13 Snippet Search results will consist of collection of rich ranked paragraphs rather than collection of documents links. [sent-19, score-0.813]

14 Snippet Search paragraphs is semantically related to what you looking for (i. [sent-20, score-0.494]

15 content what you looking) so we will be able to get what he looking for directly without open other pages. [sent-22, score-0.121]


similar blogs computed by tfidf model

tfidf for this blog:

wordName wordTfidf (topN-words)

[('snippet', 0.499), ('search', 0.435), ('kngine', 0.384), ('results', 0.205), ('aims', 0.202), ('paragraphs', 0.192), ('web', 0.125), ('engine', 0.125), ('looking', 0.121), ('organize', 0.114), ('bellow', 0.102), ('semantically', 0.096), ('documents', 0.096), ('unleash', 0.088), ('consist', 0.085), ('related', 0.085), ('links', 0.085), ('changed', 0.084), ('collection', 0.081), ('research', 0.079), ('answering', 0.073), ('ranked', 0.073), ('semantic', 0.073), ('rethink', 0.068), ('contents', 0.067), ('objective', 0.064), ('representation', 0.064), ('september', 0.063), ('technologies', 0.062), ('title', 0.062), ('aim', 0.061), ('list', 0.06), ('completed', 0.059), ('advances', 0.058), ('trip', 0.058), ('engines', 0.057), ('revolution', 0.057), ('collect', 0.056), ('appear', 0.056), ('unfortunately', 0.054), ('improvement', 0.052), ('relevant', 0.051), ('searching', 0.051), ('abstract', 0.05), ('accessible', 0.049), ('introduction', 0.048), ('direct', 0.047), ('action', 0.047), ('familiar', 0.047), ('structured', 0.046)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 1.0 746 high scalability-2009-11-26-Kngine Snippet Search New Indexing Technology

Introduction: While Kngine just announce some improvement and new features , I would like you take you in small trip in Snippet Search research project at Kngine.   What is Kngine? Kngine is startup company working in Searching technologies, We in Kngine aims to organize the human beings Systematic Knowledge and Experiences and make it accessible to everyone. We aim to collect and organize all objective data, and make it possible and easy to access. Our goal is to build Web 3.0 Web Search Engine on the advances of Web Search Engine, Semantic Web, Data Representation technologies a new form of Web Search Engine that will unleash a revolution of new possibilities.   Introduction to Snippet Search Today, The Web Search Engine’s is the Web getaway, especially to get specific information. But unfortunately the search engines didn’t changed mush as the Web changed from 90’s. Since the 90’s the Web search engine still provide the same kind of results: Links to documents. We i

2 0.45525426 630 high scalability-2009-06-14-kngine 'Knowledge Engine' milestone 2

Introduction: Kngine is Knowledge Web search engine designed to provide meaningful search results, such as: semantic information about the keywords/concepts, answer the user’s questions, discover the relations between the keywords/concepts, and link the different kind of data together, such as: Movies, Subtitles, Photos, Price at sale store, User reviews, and Influenced story Goals Kngine long-term goal is to make all human beings systematic knowledge and experience accessible to everyone. I aim to collect and organize all objective data, and make it possible and easy to access. Our goal is to build on the advances of Web search engine, semantic web, data representation technologies a new form of Web search engine that will unleash a revolution of new possibilities. Kngine tries to combine the power of Web search engines with the power of Semantic search and the data representation to provide meaningful search results compromising user needs. Status Kngine starts as a research project in O

3 0.32043365 332 high scalability-2008-05-28-Job queue and search engine

Introduction: Hi, I want to implement a search engine with lucene. To be scalable, I would like to execute search jobs asynchronously (with a job queuing system). But i don't know if it is a good design... Why ? Search results can be large ! (eg: 100+ pages with 25 documents per page) With asynchronous sytem, I need to store results for each search job. I can set a short expiration time (~5 min) for each search result, but it's still large. What do you think about it ? Which design would you use for that ? Thanks Mat

4 0.16554686 269 high scalability-2008-03-08-Audiogalaxy.com Architecture

Introduction: Update 3: Always Refer to Your V1 As a Prototype . You really do have to plan to throw one away. Update 2: Lessons Learned Scaling the Audiogalaxy Search Engine . Things he should have done and fun things he couldn’t justify doing. Update: Design details of Audiogalaxy.com’s high performance MySQL search engine . At peak times, the search engine needed to handle 1500-2000 searches every second against a MySQL database with about 200 million rows. Search was one of most interesting problems at Audiogalaxy. It was one of the core functions of the site, and somewhere between 50 to 70 million searches were performed every day. At peak times, the search engine needed to handle 1500-2000 searches every second against a MySQL database with about 200 million rows.

5 0.15768191 1395 high scalability-2013-01-28-DuckDuckGo Architecture - 1 Million Deep Searches a Day and Growing

Introduction: This is an interview with  Gabriel Weinberg , founder of  Duck Duck Go  and general  all around startup guru , on what DDG’s architecture looks like in 2012. Innovative search engine upstart DuckDuckGo had 30 million searches in February 2012 and averages over 1 million searches a day. It’s being positioned by super investor Fred Wilson as a clean, private, impartial and fast search engine. After talking with Gabriel I like what Fred Wilson said earlier, it seems closer to the heart of the matter: We invested in DuckDuckGo for the Reddit, Hacker News anarchists .                    Choosing DuckDuckGo can be thought of as not just a technical choice, but a vote for revolution. In an age when knowing your essence is not about about love or friendship, but about more effectively selling you to advertisers, DDG is positioning themselves as the do not track alternative , keepers of the privacy flame . You will still be monetized of course, but in a more civilized and an

6 0.13950872 342 high scalability-2008-06-08-Search fast in million rows

7 0.13256411 658 high scalability-2009-07-17-Against all the odds

8 0.12779757 258 high scalability-2008-02-24-Yandex Architecture

9 0.12484357 899 high scalability-2010-09-09-How did Google Instant become Faster with 5-7X More Results Pages?

10 0.12321484 856 high scalability-2010-07-12-Creating Scalable Digital Libraries

11 0.11486196 912 high scalability-2010-10-01-Google Paper: Large-scale Incremental Processing Using Distributed Transactions and Notifications

12 0.11289527 775 high scalability-2010-02-10-ElasticSearch - Open Source, Distributed, RESTful Search Engine

13 0.10569023 702 high scalability-2009-09-11-The interactive cloud

14 0.10234004 810 high scalability-2010-04-14-Parallel Information Retrieval and Other Search Engine Goodness

15 0.098990202 834 high scalability-2010-06-01-Web Speed Can Push You Off of Google Search Rankings! What Can You Do?

16 0.093664363 335 high scalability-2008-05-30-Is "Scaling Engineer" a new job title?

17 0.092828102 1052 high scalability-2011-06-03-Stuff The Internet Says On Scalability For June 3, 2011

18 0.092549123 64 high scalability-2007-08-10-How do we make a large real-time search engine?

19 0.087144025 915 high scalability-2010-10-05-Sponsored Post: Box.net, Wiredrive, Joyent, DeviantART, CloudSigma, ManageEngine, Site24x7

20 0.0868086 689 high scalability-2009-08-28-Strategy: Solve Only 80 Percent of the Problem


similar blogs computed by lsi model

lsi for this blog:

topicId topicWeight

[(0, 0.118), (1, 0.036), (2, 0.014), (3, -0.009), (4, 0.035), (5, -0.001), (6, -0.019), (7, 0.024), (8, 0.029), (9, 0.063), (10, 0.012), (11, -0.058), (12, -0.039), (13, -0.037), (14, 0.046), (15, -0.004), (16, -0.079), (17, -0.009), (18, 0.115), (19, -0.045), (20, 0.073), (21, -0.094), (22, 0.016), (23, 0.044), (24, -0.069), (25, -0.06), (26, -0.128), (27, 0.009), (28, 0.012), (29, 0.133), (30, -0.098), (31, 0.046), (32, -0.068), (33, 0.045), (34, 0.168), (35, -0.011), (36, -0.025), (37, -0.009), (38, -0.079), (39, -0.084), (40, 0.183), (41, 0.014), (42, -0.01), (43, 0.058), (44, -0.066), (45, 0.088), (46, -0.038), (47, 0.063), (48, 0.103), (49, -0.095)]

similar blogs list:

simIndex simValue blogId blogTitle

same-blog 1 0.98288608 746 high scalability-2009-11-26-Kngine Snippet Search New Indexing Technology

Introduction: While Kngine just announce some improvement and new features , I would like you take you in small trip in Snippet Search research project at Kngine.   What is Kngine? Kngine is startup company working in Searching technologies, We in Kngine aims to organize the human beings Systematic Knowledge and Experiences and make it accessible to everyone. We aim to collect and organize all objective data, and make it possible and easy to access. Our goal is to build Web 3.0 Web Search Engine on the advances of Web Search Engine, Semantic Web, Data Representation technologies a new form of Web Search Engine that will unleash a revolution of new possibilities.   Introduction to Snippet Search Today, The Web Search Engine’s is the Web getaway, especially to get specific information. But unfortunately the search engines didn’t changed mush as the Web changed from 90’s. Since the 90’s the Web search engine still provide the same kind of results: Links to documents. We i

2 0.94153887 332 high scalability-2008-05-28-Job queue and search engine

Introduction: Hi, I want to implement a search engine with lucene. To be scalable, I would like to execute search jobs asynchronously (with a job queuing system). But i don't know if it is a good design... Why ? Search results can be large ! (eg: 100+ pages with 25 documents per page) With asynchronous sytem, I need to store results for each search job. I can set a short expiration time (~5 min) for each search result, but it's still large. What do you think about it ? Which design would you use for that ? Thanks Mat

3 0.88334543 630 high scalability-2009-06-14-kngine 'Knowledge Engine' milestone 2

Introduction: Kngine is Knowledge Web search engine designed to provide meaningful search results, such as: semantic information about the keywords/concepts, answer the user’s questions, discover the relations between the keywords/concepts, and link the different kind of data together, such as: Movies, Subtitles, Photos, Price at sale store, User reviews, and Influenced story Goals Kngine long-term goal is to make all human beings systematic knowledge and experience accessible to everyone. I aim to collect and organize all objective data, and make it possible and easy to access. Our goal is to build on the advances of Web search engine, semantic web, data representation technologies a new form of Web search engine that will unleash a revolution of new possibilities. Kngine tries to combine the power of Web search engines with the power of Semantic search and the data representation to provide meaningful search results compromising user needs. Status Kngine starts as a research project in O

4 0.86359286 342 high scalability-2008-06-08-Search fast in million rows

Introduction: I have a table .This table has many columns but search performed based on 1 columns ,this table can have more than million rows. The data in these columns is something like funny,new york,hollywood User can search with parameters as funny hollywood .I need to take this 2 words and then search on column whether that column contain this words and how many times .It is not possible to index here .If the results return say 1200 results then without comparing each and every column i can't determine no of results.I need to compare for each and every column.This query is very frequent .How can i approach for this problem.What type of architecture,tools is helpful. I just know that this can be accomplished with distributed system but how can i make this system. I also see in this website that LinkedIn uses Lucene for search .Is Lucene is helpful in my case.My table has also lots of insertion ,however updation in not very frequent.

5 0.76920187 246 high scalability-2008-02-12-Search the tags across all post

Introduction: Let suppose i have table which stored tags .Now user can enter keywords and i have to search through all the records in table and find post which contain tags entered by user .user can enter more than 1 keywords. What strategy ,technique i use to search fast .There maybe more than millions records and many users are firing same query. Thanks

6 0.74718249 1601 high scalability-2014-02-25-Peter Norvig's 9 Master Steps to Improving a Program

7 0.7444368 810 high scalability-2010-04-14-Parallel Information Retrieval and Other Search Engine Goodness

8 0.74425793 258 high scalability-2008-02-24-Yandex Architecture

9 0.71816814 899 high scalability-2010-09-09-How did Google Instant become Faster with 5-7X More Results Pages?

10 0.67687517 64 high scalability-2007-08-10-How do we make a large real-time search engine?

11 0.67335069 775 high scalability-2010-02-10-ElasticSearch - Open Source, Distributed, RESTful Search Engine

12 0.66459256 1395 high scalability-2013-01-28-DuckDuckGo Architecture - 1 Million Deep Searches a Day and Growing

13 0.65324026 269 high scalability-2008-03-08-Audiogalaxy.com Architecture

14 0.65213501 1295 high scalability-2012-08-02-Ask DuckDuckGo: Is there Anything you Want to Know About DDG?

15 0.62232339 1253 high scalability-2012-05-28-The Anatomy of Search Technology: Crawling using Combinators

16 0.59540582 335 high scalability-2008-05-30-Is "Scaling Engineer" a new job title?

17 0.54057437 1610 high scalability-2014-03-11-Douglas Adams - 3 Rules that Describe Our Reactions to Technologies

18 0.52316737 689 high scalability-2009-08-28-Strategy: Solve Only 80 Percent of the Problem

19 0.50591916 1233 high scalability-2012-04-25-The Anatomy of Search Technology: blekko’s NoSQL database

20 0.50411612 912 high scalability-2010-10-01-Google Paper: Large-scale Incremental Processing Using Distributed Transactions and Notifications


similar blogs computed by lda model

lda for this blog:

topicId topicWeight

[(2, 0.102), (10, 0.016), (28, 0.065), (61, 0.603), (79, 0.071), (94, 0.023)]

similar blogs list:

simIndex simValue blogId blogTitle

1 0.99207252 324 high scalability-2008-05-19-UK Based CDN

Introduction: Hi, I was wondering if I could borrow the collective minds of you all to draw up a list to the CDN's that you'd use/do use in the UK. If they're outside the UK but have decent support then also include. The service must be cheap and not require a huge setup fee, it's really only for a small time business; it shares video & high-res pics so mass cheap storage is a must and wondered whether you guys had any ideas, also costs? Mass storage isn't cheap in the UK compared to the states, for example, unless I go colo but as I say, it's a small setup but happens to require a fair bit of space. Would S3 be a good starting point? What is the service like? I hear mixed reviews about it. Many thanks, Jim

same-blog 2 0.99191093 746 high scalability-2009-11-26-Kngine Snippet Search New Indexing Technology

Introduction: While Kngine just announce some improvement and new features , I would like you take you in small trip in Snippet Search research project at Kngine.   What is Kngine? Kngine is startup company working in Searching technologies, We in Kngine aims to organize the human beings Systematic Knowledge and Experiences and make it accessible to everyone. We aim to collect and organize all objective data, and make it possible and easy to access. Our goal is to build Web 3.0 Web Search Engine on the advances of Web Search Engine, Semantic Web, Data Representation technologies a new form of Web Search Engine that will unleash a revolution of new possibilities.   Introduction to Snippet Search Today, The Web Search Engine’s is the Web getaway, especially to get specific information. But unfortunately the search engines didn’t changed mush as the Web changed from 90’s. Since the 90’s the Web search engine still provide the same kind of results: Links to documents. We i

3 0.98561174 226 high scalability-2008-01-28-DR-BC for web-DB servers

Introduction: All, I'm looking for a faster/reliable solution for DR/BC as well as for sclability for my web/db servers. I came across VMWare Infrastructure and other products. The I/O performance concerns me to go with virtual servers. I'm also looking into imaging software such as Acrnois. Could anyone share their thoughts on how it's being done with bigger names such as google/youtube etc..? Thank you, Regards, Janakan Rajendran.

4 0.98195863 549 high scalability-2009-03-26-Performance - When do I start worrying?

Introduction: A common problem of the application designers is to predict when they need to start worrying about the Architectural/System improvements on their application. Do I need to add more resources? If yes, then how long before I am compelled to do so? The question is not only when but also what. Should I plan to implement a true caching layer on top of my application or do I need to shard my database. Do I need to move to a distributed search infrastructure and if yes when ! Essentially we try to find out the functionalities of the application that will become critical over time.

5 0.97809052 1303 high scalability-2012-08-13-Ask HighScalability: Facing scaling issues with news feeds on Redis. Any advice?

Introduction: We just released a social section to our iOS app several days ago and we are already facing scaling issues with the users' news feeds. We're basically using a Fan-out-on-write (push) model for the users' news feeds (posts of people and topics they follow) and we're using Redis for this (backend is Rails on Heroku).  However, our current 60,000 news feeds is ballooning our Redis store to almost 1GB in a just a few days (it's growing way too fast for our budget). Currently we're storing the entire news feed for the user (post id, post text, author, icon url, etc) and we cap the entries to 300 per feed. I'm wondering if we need to just store the post IDs of each user feed in Redis and then store the rest of the post information somewhere else?  Would love some feedback here.  In this case, our iOS app would make an api call to our Rails app to retrieve a user's news feed.  Rails app would retrieve news feed list (just post IDs) from Redis, and then Rails app would need to query to g

6 0.97628403 1201 high scalability-2012-02-29-Strategy: Put Mobile Video Into Cold Storage After 30 Days

7 0.97295952 268 high scalability-2008-03-06-Announce: First Meeting of Boston Scalability User Group

8 0.95009315 580 high scalability-2009-04-24-INFOSCALE 2009 in June in Hong Kong

9 0.94392353 493 high scalability-2009-01-16-Just-In-Time Scalability: Agile Methods to Support Massive Growth (IMVU case study)

10 0.9399268 793 high scalability-2010-03-10-Saying Yes to NoSQL; Going Steady with Cassandra at Digg

11 0.93012255 208 high scalability-2008-01-11-FTP Sanity: Redundancy, archiving, consolidation.

12 0.92731786 930 high scalability-2010-10-28-NoSQL Took Away the Relational Model and Gave Nothing Back

13 0.92229503 347 high scalability-2008-07-07-Five Ways to Stop Framework Fixation from Crashing Your Scaling Strategy

14 0.91641885 173 high scalability-2007-12-05-Easier Production Releases

15 0.91160244 132 high scalability-2007-10-25-Who can answer or analyze the image store and visit solution about alibaba.com?Thanks

16 0.90472037 675 high scalability-2009-08-08-1dbase vs. many and cloud hosting vs. dedicated server(s)?

17 0.86993277 749 high scalability-2009-12-15-The Common Principles Behind the NOSQL Alternatives

18 0.86853814 238 high scalability-2008-02-04-IPS-IDS for heavy content site

19 0.85148042 1287 high scalability-2012-07-20-Stuff The Internet Says On Scalability For July 20, 2012

20 0.84866476 198 high scalability-2008-01-01-HOW CDN works