Font Size: a A A

The Design And Implementation Of Forestry Focused Search Engine

Posted on:2012-03-21Degree:MasterType:Thesis
Country:ChinaCandidate:Y F GuoFull Text:PDF
GTID:2178330335967227Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Because the Internet was expanding speedily, the amount of information has been larger than ever before. Relatively, it's uneasy to find the most interesting information what users need. General search engines try everything to cover more information, and provide various services. But it appears to be far from enough in front of millions of web pages. So, in order to adapt the development of WWW and the user requirements, topical search engine showed up.At first, the paper provided a general introduction about the history and current situation of search engine, structure of traditional general search engine and the problem of it. Then compared it with the topical search engine and introduce some open source frameworks which common used in building a search engine. And then, after research the core technologies of building a topical search engine, such as topic representation, topical crawler and Chinese word segmentation, the paper designed its own crawler and build a forestry focused search engine. It based on a forestry dictionary and a candidate dictionary. At last, the paper detailed the implementation, including crawling pages from web, analyzing the downloaded pages, indexing it, and showing users with structured results. According to the data from the experiments, compared to general search engine such as Google and Baidu, this search engine can improve precision rate greatly and have certain utility value.
Keywords/Search Tags:Topical Search Engine, Topical Crawler, Shark-Search Algorithm, Forestry Dictionary
PDF Full Text Request
Related items