Font Size: a A A

The Implementation Of Web Search Engine Based On Lucene

Posted on:2009-04-03Degree:MasterType:Thesis
Country:ChinaCandidate:L T YaoFull Text:PDF
GTID:2178360272978301Subject:Software engineering
Abstract/Summary:PDF Full Text Request
As the information technology is being developed constantly, Internet's information is being enriched, the development of the search engine is faster, the webpage that Google includes exceeds 8 billion at present. Following the developing of the search engine, some porblems begin to appear: The index database becoming larger, the maintenance cost needed is higher and higher; The data source of the index is single, but the information type kind of and Internet is more abundant; The index lacks commonability and flexibility. Lucene because its opening, easy expanding, there are very good application prospects to the settlement of above-mentioned. problems. This text realizes a small-scale simple search engine based on Lucene, to study Lucene application prospect in the field of search engine, the main work includes:(1) Have analysed the composition structure and operation principle of Web search engine, have carried on the whole structural design based on Web search engine of Lucene on this basis, and divided into the three pieces as webpage and collect the subsystem, index and search the subsystem, result arrange subsystem.(2) Have designed developing to the function of each module, having designed the memory mechanism of relevant information, and the one that has realized between every module according to the operation principle of the search engine is mutual.(3) Have analysed in depth and realized the key technology of Web search engine based on Lucene: multi-thread collected of webpage, use Lucene realize index and search, the priority of search result.
Keywords/Search Tags:Search engine, Lucene, Full-Text Search, Priority of webpage, PageRank
PDF Full Text Request
Related items