Font Size: a A A

Research And Application, Based On The Lucene Database To Retrieve

Posted on:2011-11-08Degree:MasterType:Thesis
Country:ChinaCandidate:Z G GeFull Text:PDF
GTID:2208360308970474Subject:Computer software and theory
Abstract/Summary:
With the popularity of the network, most of the enterprises are strengthening their construction of information, and a lot of business to be extended to network. With the business continues to grow rapidly for large amounts of data. How to quickly and accurately retrieve the desired content, is an urgent problem. Traditional fuzzy query of the database, due to its low efficiency, high delay, can't meet the needs of fast retrieval. Therefore we must use the full text of the relevant technology to achieve fast retrieval of information.Because of the need for rapid retrieval, search techniques have been developed. Whether a professional search engine company, or open-source community, have launched a number of techniques for the search fields. Such as Google, Baidu and other companies, they are dedicated to Internet information retrieval. Not only the big companies concern information retrieval, a variety of open source search technology is constantly improving.Lucene is open-source search technology. It's an outstanding representative of its compact, portable, extensible and other features. There are many user concerns Lucene. Using Lucene technology full-text search applications can be completed well.This paper comprehensively researchs the open-source Lucene search engine, extends Lucene word breaker, uses forward maximum matching algorithm for Chinese word segmentation. Based on Lucene designing a full-text scheme for the database information search, using the XML configuration file to the database table needs to be done to configure full-text search. When the retrieved information, you can display a detailed table data. Apply to this scheme, "Oilfield Chemical Technology expert support system" in the construction operations guidance module.
Keywords/Search Tags:Full text search, Index, Lucene, Chinese word segmentation
Related items