Font Size: a A A

Design And Implementation Of Electronic File Retrival System Based On Lucene

Posted on:2011-04-28Degree:MasterType:Thesis
Country:ChinaCandidate:L ZhangFull Text:PDF
GTID:2178360302491763Subject:Software engineering
Abstract/Summary:PDF Full Text Request
"Information overload" is an increasingly serious problem as the development of information technology. The existing searching engine problems, such as the high costs of maintenance, the singleness of index data source and the lack of flexibility, etc., has bottlenecked the development of searching engine technology.This paper first introduces the classification and development of the information retrieval system. At the same time, the basic principle and performance indicator of searching engine together with the index and searching technologies in Lucene technique are studied. Then, the functional and non-functional requirements in an electronic file retrieval system are intensively studied and analyzed. At last, a high-performance Lucene-based site file retrieval system is designed and realized according to the demand of UNIS electronic file retrieval system based on Lucene which is an open source software package. The designed system can be divided into five modules which include heterogeneous document analysis, data processing, document indexing, document searching and custom retrieval service. These modules not only can fulfill their own function, but also can interate mutually.The system has also been tested and analyzed in this paper. The tested data and analysis results show that the performance indictors including the recall rate and precision rate in the new Lucene-based information retrieval system designed in this paper compeletely meet the design requirements, this new system is high efficient and feasible.
Keywords/Search Tags:Full-text Retrieval, Index, Lucene, Searching Engine
PDF Full Text Request
Related items