Font Size: a A A

The Research And Implementation Of Full-text Retrieval System Based On Lucene

Posted on:2006-02-03Degree:MasterType:Thesis
Country:ChinaCandidate:X Q ZhangFull Text:PDF
GTID:2168360152975698Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Full-text retrieval is an important information retrieval technology. It is a powerful tool for dealing with nonstmctural data, and is one of the key technologies of the search engine. This paper deeply research on Chinese full-text retrieval technology. In the filed of full-text index based on word inverted table, a improved word-based Chinese inverted index structure is proposed which has a better performance than traditional approaches, and convenient for constructing, maintaining and updating index. According to its characteristic, we design its corresponding optimized search method. Analysis shows that better dynamic performance and high indexing speed is possible using this structure. This paper pays more attention in application of full-text retrieval technologies. How to use new technique, optimize the structure of retrieval system, improve performance and efficiency, quicken search speed and adapt the development of current web is also discussed in this paper.Full-text retrieval is an I/O intensive application. Its previous developments are carried on the basis of relation database. This paper deeply discusses the abuse and deficiency of this mode according to its characteristic. Because the development platform of full-text retrieval is absent currently, Lucene, a full-text search engine toolkit, is introduced into the paper. It has powerful performance and its body is cabinet, capable and .vigorous, this convenient for it embedded applications. At present, Lucene is employed world abroad, so that many professional companies such as IBM also use its core code. As an open source code soft, Lucene offer a superexcellent chance to study search engine key technology. It is worthful to take a parse research and carry second development to it.In the application aspect, this paper work mostly in the design and implement of the degree dissertation full-text database in university. Its retrieval subsystem realize constructing indexer, database memory design and searcher design on the basis of relative work such as document data process, information extracting and sorter. Finally, the system realizes many functions such as navigation browser of document, full-text retrieval and meta data retrieval. As for the retrieval results,, the system accomplish primal design target on the whole.
Keywords/Search Tags:Full-text Retrieval, Single Chinese Character Indexing, Inverted File, Lucene, Full-text Database
PDF Full Text Request
Related items