Research on a Subject-Oriented P2P Search Engine

Posted on: 2011-03-12
Degree: Master
Type: Thesis
Country: China
Candidate: Q L Yi
Full Text: PDF
GTID: 2178360308970907
Subject: Computer software and theory
Abstract/Summary:
The rapid development of the Internet has led to explosive growth in online information resources. While enjoying the convenience this brings to life and work, people have begun to consider how to obtain the most valuable information quickly and accurately to meet users' individual requirements. The emergence of search engines solves this problem to some extent. Search engines free users from remembering large numbers of complicated URLs and allow them to find the information they need quickly and accurately. Their speed and accuracy have made search engines one of the most popular applications on the Internet. However, as the Internet continues to develop, some problems still urgently need to be solved. First, how to quickly and accurately retrieve information related to a specialized subject has become a problem, for example how to retrieve commodity auction information from the Internet as a reference for users' individual requirements. Second, as the amount of information on the Internet increases rapidly, how to cope with the continuing expansion of current search engines and improve their overall efficiency has also become a problem. Solving these problems has become one of the main research directions for current search engines. Based on in-depth research on P2P technology and subject-oriented search engine technology, this paper introduces the P2P protocol Chord into a subject-oriented search engine and proposes a P2P-based design oriented to specific topics. A framework model for a topic-specific P2P search engine is designed. At the same time, the search performance of the theme spider is optimized through the spider's search strategy, DOM tree construction, an improved Chinese word dictionary design, and topic relevance calculation, so as to effectively improve the relevance of the web content fetched by the theme spider to the topic.
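
The abstract does not state the formula used for the topic relevance calculation. As a minimal sketch of the general idea only, the following assumes that relevance is measured as the cosine similarity between a page's term counts and a hand-chosen set of weighted topic terms. The function names, the whitespace-style tokenizer, the example weights, and the threshold are all hypothetical and are not taken from the thesis.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercased alphanumeric tokens. Assumption: a system for Chinese pages
    # would use a dictionary-based word segmenter here instead.
    return re.findall(r"[a-z0-9]+", text.lower())


def topic_relevance(page_text, topic_weights):
    # Cosine similarity between the page's term counts and the topic weights.
    counts = Counter(tokenize(page_text))
    dot = sum(counts[term] * weight for term, weight in topic_weights.items())
    page_norm = math.sqrt(sum(c * c for c in counts.values()))
    topic_norm = math.sqrt(sum(w * w for w in topic_weights.values()))
    if page_norm == 0 or topic_norm == 0:
        return 0.0
    return dot / (page_norm * topic_norm)


def should_follow(page_text, topic_weights, threshold=0.2):
    # Hypothetical crawl decision: keep pages whose score reaches the threshold.
    return topic_relevance(page_text, topic_weights) >= threshold


# Illustrative topic description for the auction example (assumed weights).
AUCTION_TOPIC = {"auction": 1.0, "bid": 1.0}
```

A spider built along these lines would score each fetched page and only queue links from pages for which `should_follow` returns true, which is one simple way to keep a crawl close to its topic.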
The design also uses the Chord protocol to organize the P2P index nodes and DHT technology to organize the index files. Experiments show that organizing the index files with DHT technology improves the search engine's efficiency and reduces the number of keyword comparisons per query compared with the traditional inverted index. Users send their query information directly to the query server, and the query server locates the index file and the topic-related URLs by looking up the Chord routing table.
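
The way a Chord-style DHT maps a keyword to the index node responsible for it can be sketched as follows. This is a single-process illustration with a global view of the ring, not the thesis's implementation: the class and method names, the 16-bit identifier space, and the node names are assumptions chosen for readability. In a deployed Chord network each node would hold only its own finger table and route lookups hop by hop.

```python
import hashlib
from bisect import bisect_left

M = 16  # identifier bits (illustrative; Chord is usually described with 160-bit SHA-1 ids)


def chord_id(key, m=M):
    # Hash a node name or keyword onto the identifier ring [0, 2**m).
    digest = hashlib.sha1(key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % (2 ** m)


class Ring:
    def __init__(self, node_names, m=M):
        self.m = m
        self.ids = sorted({chord_id(name, m) for name in node_names})
        # Each index node stores a keyword -> list of URLs mapping.
        self.index = {node_id: {} for node_id in self.ids}

    def successor(self, ident):
        # First node whose id is equal to or follows ident, wrapping around the ring.
        i = bisect_left(self.ids, ident % (2 ** self.m))
        return self.ids[i % len(self.ids)]

    def finger_table(self, node_id):
        # Entry k points at successor(node_id + 2**k), as in Chord's routing table.
        return [self.successor(node_id + 2 ** k) for k in range(self.m)]

    def publish(self, keyword, url):
        node = self.successor(chord_id(keyword, self.m))
        self.index[node].setdefault(keyword, []).append(url)

    def query(self, keyword):
        # Returns the responsible node id and the URLs stored for the keyword.
        node = self.successor(chord_id(keyword, self.m))
        return node, self.index[node].get(keyword, [])
```

For example, after `ring = Ring(["node-a", "node-b", "node-c"])` and `ring.publish("auction", "http://example.com/a")`, calling `ring.query("auction")` goes straight to the one node responsible for that keyword, which illustrates why a query needs no keyword comparisons on the other nodes.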
Keywords/Search Tags:search engine, P2P, index, theme spider, correlation