Research And Application Of Web Search Results Clustering Based On The Search Term

Posted on:2011-03-05

Degree:Master

Type:Thesis

Country:China

Candidate:Q C Ma

Full Text:PDF

GTID:2178360308964798

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

In recent years, with the rapid development of Internet, various information on the Internet expanded rapidly, how quickly and accurately find the information users need to become exceptionally important. With this demand, search engine technology has made great progress, and there were a number of very good search engines, but there have been a number of cluster-based search engines. With the traditional linear form of a list of search results returned for the user than the search engines, search engine based on clustering of the biggest advantages is that the user's search results are returned in the form of clustering, which further facilitates the user in the mountains of information quickly and accurately find the information they need.However, these existing clustering-based search engines are only based on the basic simple clustering of the Web content at the expense of the user's search terms and related information between pages. Our thesis is based on users search for words in Web page clustering algorithm, synonyms clustering CBC (Clustering By Committee) algorithm is applied to the web page clustering ideas. Vector space model, the weights were calculated from the characteristic value, the text vector to determine similarity computation and clustering center of the aspects of CBC clustering algorithm was improved. In particular, we have increased eigenvalue value in the search word in the text of the weight vector, by this way to reflect the user's search term on the web clustering results. Experiments show that the improved algorithm is feasible and effective. Finally, in the proposed clustering algorithm based on the design and implementation of a Chinese Web page clustering system. The system is modular in design, implementation of the cluster from a web page to process the entire web page clustering.

Keywords/Search Tags:

Web clustering, vector space model, the search term, CBC algorithm

PDF Full Text Request

Related items

1	Study And Applications Of Duplicate Web Page's Elimination And Clustering Algorithm In Search Engine System Of Colleges And Universities
2	Research And Application Of Video Search Result Analysis And Visualization Method
3	The Research Of Structured P2P Network Model Based On Semantic Search
4	Research On English Text Clustering Method Based On Vector Space
5	Research On Pivotal Technology Of Focused Search Engine
6	Research And Implementation On Chinese Information Retrieval System Based On Structured Vector Space Model
7	Research Of Text Categorization Based On Vector Space Model
8	Study Of An Information Retrieval Technology Based On Improved Vector Space Model
9	Research On Short-term Power Load Forecasting Based On Combined Model
10	Study Of Chinese Text Clustering On Improved K-means Algorithm