Font Size: a A A

Search Engine Research And Implementation

Posted on:2009-07-12Degree:MasterType:Thesis
Country:ChinaCandidate:W W ZhangFull Text:PDF
GTID:2178360242975231Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
Facing the explosive growth of the Internet Network Information, people depend on the Search Engine more and more,so research for the Search Engine has the very importantly theoretical meanings and practical values.This paper systematically discusses the system structure and work tenet of the Search Engine, on the basis of the relevant theory, making use of the Java technique to realize the nuclear parts of the News Search Engine, and putting forward the improvement on Webpage Search Algorithm and search result ranking algorithm.On the basis of analyzing the existing Web Crawling ,the article puts forward a Heuristics Searching Algorithm Based on Non- Greedy Policy and introduces the process of algorithm and Performance Analysis in detail , adoptting an incremental information extraction strategy based on index page to improve the efficiency of Indexed Database maintenance and insure the refreshment of Webpage Indexed Data in time. Aimming at the currently widespreadly adoptted the weakness of page ranking algorithm based on link analysis, put forward an improvement in page ranking algorithm of comprehensively consideratted various factors e.g.,Website Performance, Webpage contents,Webpage refresh time and the visiting rates of customers Etc., to filter the rubbish website and improve the search performance; Finally,the Realizing System of the News Search Engine Based on Java Technology is introduced.
Keywords/Search Tags:Search engine, Web spider, Page analysis, Index page
PDF Full Text Request
Related items