
Research And Implementation Of Distributed Network Search Engine

Posted on: 2012-02-28
Degree: Master
Type: Thesis
Country: China
Candidate: T Zhang
GTID: 2218330371962578
Subject: Computer application technology
Abstract/Summary:
The rise of search engines helped drive the economic recovery of the Internet and showed the world that the Internet still holds large untapped potential. As a result, the industry began to pay more attention to search engine performance, traffic, and related concerns. With the information explosion of the information age, the volume of information on the Internet grows exponentially, and both industry and individual users rely on search engine technology to process data, from searching local files to searching massive Internet data. To meet the needs of different search scenarios, we propose and implement an architecture for a scalable distributed search engine. Building on a detailed description of the theory and technology of web search engines, we study the key technologies in depth, with the goal of building a system that can be deployed with distributed storage and that provides network data indexing and retrieval for a particular industry and its related software systems. The main contents of this paper are as follows:
Page recommendation can satisfy users' demand for information efficiently and conveniently. Considering the deficiencies of traditional personalization techniques, this paper proposes a self-adaptive, semantics-based method for personalized Web page recommendation. The method constructs a self-adaptive semantic user model using a semantic ontology and a mechanism that tracks drift in the user's interests, and it uses the centers of the semantic clusters to improve the precision of recommendation. Experimental results show that the new method achieves higher precision and recall than the other recommendation methods compared.
The paper discusses the current state of search engines in China and abroad, together with their problems and trends; explains the working principle of a search engine and the main functions of its components; and systematically introduces the core principles of search engines and how they are implemented.
Based on a plug-in mechanism, the paper designs and implements a scalable search engine architecture that can be deployed in a distributed fashion. Each machine is responsible for collecting and indexing the information of a specific domain, so that web page data stored on different machines can be retrieved in parallel. The focus here is on the implementation of the search framework: the paper gives not only the relationships between the modules but also the principles and ideas behind the implementation of each module. In conclusion, this paper discusses a plug-in mechanism for search engines and a design method for distributed queries. Experiments verify that the implemented search engine system has good usability. Finally, the paper summarizes the search engine service in light of practical conditions.
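The parallel retrieval described above, in which each machine indexes one domain and a query is answered by all machines at once, can be illustrated with a minimal Python sketch. This is not the thesis's implementation: the names `DomainIndexNode` and `distributed_search` are hypothetical, threads stand in for separate machines, and the term-frequency scoring is an assumption chosen only to keep the example short.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


class DomainIndexNode:
    """Stands in for one machine that indexes pages of a single domain (hypothetical)."""

    def __init__(self, domain):
        self.domain = domain
        self.postings = defaultdict(dict)  # term -> {url: term frequency}

    def add_page(self, url, text):
        # Build a tiny inverted index from whitespace-separated terms.
        for term in text.lower().split():
            self.postings[term][url] = self.postings[term].get(url, 0) + 1

    def search(self, query):
        # Assumed scoring: sum of the frequencies of the query terms in a page.
        scores = defaultdict(int)
        for term in query.lower().split():
            for url, tf in self.postings.get(term, {}).items():
                scores[url] += tf
        return [(url, score, self.domain) for url, score in scores.items()]


def distributed_search(nodes, query, top_k=10):
    """Send the query to every node in parallel and merge the partial results."""
    with ThreadPoolExecutor(max_workers=max(1, len(nodes))) as pool:
        partials = list(pool.map(lambda node: node.search(query), nodes))
    merged = [hit for part in partials for hit in part]
    # Highest score first; the URL breaks ties so the order is deterministic.
    merged.sort(key=lambda hit: (-hit[1], hit[0]))
    return merged[:top_k]
```

In an actual deployment each node would run on its own machine and be reached over the network; the merge step would stay the same, since it only needs each node's partial result list.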
Keywords/Search Tags: Search engine, network spider, segment, distributed search, recommendation system