Research and Application of Full-Text Search Technology in the "2008 Olympic Games" Multi-Language System

Posted on: 2010-06-19
Degree: Master
Type: Thesis
Country: China
Candidate: S Li
GTID: 2178360275951209
Subject: Computer application technology
Abstract/Summary:
Full-text retrieval is an important information retrieval technology. It is a powerful tool for handling unstructured data and one of the key technologies behind search engines. This thesis studies Chinese full-text retrieval in depth. In the field of full-text indexing based on word-level inverted tables, it proposes an improved word-based Chinese inverted index structure that performs better than traditional approaches and is convenient to construct, maintain, and update. A corresponding optimized search method is designed around the characteristics of this structure. Analysis shows that the structure makes better dynamic performance and high indexing speed possible. The thesis pays particular attention to the application of full-text retrieval technologies. It also discusses how to use new techniques to optimize the structure of a retrieval system, improve performance and efficiency, speed up searching, and keep pace with the development of the current web. Full-text retrieval is an I/O-intensive application, and earlier implementations were built on relational databases. The thesis discusses in depth the drawbacks and deficiencies of that approach in light of its characteristics. Because a dedicated development platform for full-text retrieval is currently lacking, the thesis introduces Lucene, a full-text search engine toolkit. Lucene is powerful yet compact, capable, and robust, which makes it convenient to embed in applications. It is now used worldwide, and professional companies such as IBM also use its core code. As open-source software, Lucene offers an excellent opportunity to study key search engine technologies, so it is worthwhile to analyze it and to carry out secondary development on it. On the application side, the work focuses mainly on the design and implementation of the Multi-Language System. Judging by the retrieval results, the system on the whole meets its original design goals.
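To make the idea of an inverted index over Chinese text concrete, here is a minimal sketch of single-character indexing with positional postings. It is not the improved structure proposed in the thesis, whose details the abstract does not give, and it does not use Lucene. The names `build_index` and `search_phrase` and the sample documents are assumptions made for illustration only.

```python
from collections import defaultdict


def build_index(docs):
    """Build a positional inverted index: character -> {doc_id: [positions]}.

    `docs` maps a document id to its text. Whitespace is not indexed.
    """
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, ch in enumerate(text):
            if not ch.isspace():
                index[ch][doc_id].append(pos)
    return index


def search_phrase(index, query):
    """Return sorted ids of documents containing `query` as a contiguous run.

    Assumes the query contains no whitespace (illustrative simplification).
    """
    chars = list(query)
    if not chars or any(c not in index for c in chars):
        return []

    # Step 1: keep only documents that contain every query character.
    candidates = set(index[chars[0]])
    for c in chars[1:]:
        candidates &= set(index[c])

    # Step 2: check that the characters occur at consecutive positions.
    hits = []
    for doc_id in candidates:
        position_sets = [set(index[c][doc_id]) for c in chars]
        for start in index[chars[0]][doc_id]:
            if all(start + k in position_sets[k] for k in range(len(chars))):
                hits.append(doc_id)
                break
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical sample documents.
    docs = {1: "北京奥运会", 2: "奥运会在北京举行", 3: "北京欢迎你"}
    index = build_index(docs)
    print(search_phrase(index, "奥运会"))
    print(search_phrase(index, "北京"))
```

Storing positions alongside document ids is what lets a character-level index answer multi-character queries without a word segmenter. The cost is longer posting lists than a word-based index would need.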
Keywords/Search Tags:Full-text Retrieval, Single Chinese Character Indexing, Lucene, Inverted File, Full-text Database