Font Size: a A A

Design And Implementation Of The Multi-type Data Full-text Retrieval System Base On Lucene

Posted on:2009-11-12Degree:MasterType:Thesis
Country:ChinaCandidate:Q Y LiuFull Text:PDF
GTID:2178360272974530Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
The full-text retrieval refers that the computer indexing program retrieval the article by scanning every word in whole article.It is the retrieval type of making the indexing for every word in articel, point out the location and the number of the term of appearing in article, when user query, the search program can retrieval the indexing basis of pre-established, and fed back the results to the user.With the advent of the information age, a variety of information resources grow apidly, it is increasingly concerned about how quickly and efficiently search from the mass of information resources for a potential and valuable information which can make it effective in the management and decision-making in army. At the same time, as the basis of some unit intelligence information, a wide range of applications documents, operational instruments, digital instruments, information databases and other types of digital information carriers has been increasing constantly.How safely and rapidly search out precise and effective information from the millions, even tens of millions and more intelligence and information has become an important task in a new period time of these unit infomationization building. However, as information processing technology in the most basic information retrieval technology, the full-text retrieval application of technology in information retrieval research and application are still at a less advanced stage, how to make the full text of the advanced information retrieval technology to the military informatization building, have increasingly been of great importance in some unit at all levels.This article analyzed the current army and foreign army forces in the field of information retrieval research and application of the status quo, studied the characteristics the main algorithm of full-text retrieval, the relevant theories and the hot trends and technology of full-text retrieval. On the popular open-source toolkit Lucene.Net full-text search system structure and function of the main module for the analysis of the main Lucene indexing algorithm: incremental algorithm, and merging algorithm to find the algorithm for analysis. At the same time, combined with information-based military construction, based in Lucene.Net kit on the basis of the analysis and design for the informatization of the armed forces of the army more than the full text of the source data retrieval system. For intelligence information resource for information security to the special requirements of the user based on security rights of full-text search methods, effective control user access retrieval system security permissions. The more data sources (such as doc, pdf, html, database ..) and plug-in technology, research-based interface and plug-in technology development mode, a good solution to the unknown file format, style and a new type of database index The expansion of the problem. Through this system to retrieve the performance test and application of the experiment, sum up the characteristics of the system, to verify the full text retrieval system of indicators to the unit information retrieval system standards.
Keywords/Search Tags:Multi-type Data, Full-text Retrieval System, Lucene.net
PDF Full Text Request
Related items