
Research And Design Of Intelligent Search Engine Based On Java

Posted on: 2011-11-19
Degree: Master
Type: Thesis
Country: China
Candidate: B Lei
Full Text: PDF
GTID: 2178330332959853
Subject: Detection Technology and Automation
Abstract/Summary:
With the rapid development of the Internet and the growing volume of online information, search engines have become increasingly important for finding useful information. Because general-purpose search engines cover such a broad scope, their coverage and accuracy are low, which reduces search efficiency and makes them unfriendly to users. Intelligent search engines that focus on particular domains have therefore become more important than ever. Based on a comparison of general-purpose and intelligent search engines, this thesis designs an intelligent search engine system that crawls, indexes, and searches within a specific domain. First, the thesis reviews the history of search engines in China and abroad, their classification, and the technical characteristics of vertical search engines, and highlights the technology and application features of the Lucene toolkit. Then, by analyzing the architecture and techniques of search engines, it designs the workflow and methods of the system and analyzes the features of each module. Next, it analyzes a key module, the topic-focused web crawler, and improves the crawling algorithm on the basis of the traditional Fish-search algorithm by proposing a strategy that splits the single waiting_list into two parts; the results show that this improvement increases the accuracy of the crawler. Finally, the indexing and search modules are designed and discussed. A new method is proposed for creating index files, in which only the summary content and the keywords are indexed; the results show that this method reduces the size of the index files and increases both the precision of search results and the efficiency of the entire search engine.
Keywords/Search Tags: theme crawler, search engine, Lucene, automatic summarization
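
The abstract does not say how the single waiting_list is split. As one possible reading only, the Java sketch below keeps links found on relevant pages in one list and links found on irrelevant pages in another, and always drains the first list before the second. The class name TwoListFrontier, the Entry type, the relevance flag, and the depth rule (a simplified form of Fish-search, where links from relevant pages get the full depth and links from irrelevant pages get one less) are assumptions made for illustration and are not taken from the thesis.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical two-list crawl frontier; an illustration, not the thesis's code. */
final class TwoListFrontier {

    /** A URL waiting to be fetched, with its remaining Fish-search depth. */
    static final class Entry {
        final String url;
        final int depth;

        Entry(String url, int depth) {
            this.url = url;
            this.depth = depth;
        }
    }

    private final int initialDepth;
    // Assumed split: links discovered on relevant pages.
    private final Deque<Entry> hot = new ArrayDeque<>();
    // Assumed split: links discovered on irrelevant pages.
    private final Deque<Entry> cold = new ArrayDeque<>();
    private final Set<String> seen = new HashSet<>();

    TwoListFrontier(int initialDepth) {
        this.initialDepth = initialDepth;
    }

    /** Seeds start in the high-priority list with full depth. */
    void addSeed(String url) {
        if (seen.add(url)) {
            hot.addLast(new Entry(url, initialDepth));
        }
    }

    /**
     * Enqueues the out-links of a fetched page. Links from a relevant page
     * get the full depth again; links from an irrelevant page get the
     * parent's depth minus one and are dropped once that reaches zero.
     */
    void addChildren(Entry parent, boolean parentRelevant, List<String> links) {
        int childDepth = parentRelevant ? initialDepth : parent.depth - 1;
        if (childDepth <= 0) {
            return; // this branch has gone too long without a relevant page
        }
        Deque<Entry> target = parentRelevant ? hot : cold;
        for (String link : links) {
            if (seen.add(link)) { // skip URLs that were already queued
                target.addLast(new Entry(link, childDepth));
            }
        }
    }

    /** Returns the next URL to fetch, or null when both lists are empty. */
    Entry next() {
        Entry e = hot.pollFirst();
        return e != null ? e : cold.pollFirst();
    }
}
```

In this sketch, how a page is judged relevant is left to the caller; a real crawler would fetch each URL returned by next(), score the page against the topic, and pass the result to addChildren.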