Font Size: a A A

Research And Application On Full-Text Retrieval Of Mine Law

Posted on:2006-12-04Degree:MasterType:Thesis
Country:ChinaCandidate:S L HeFull Text:PDF
GTID:2168360152493730Subject:Management Science and Engineering
Abstract/Summary:PDF Full Text Request
Full-Text Retrieval means you can directly get the chapters, paragraphs or sentences from the source text. It creates Full-Database with regarding text information as the searching object, which is the most efficient method of locating the needing information from the vast document warehouse. However, Chinese words isn't separated from each other by blank like English, and it is very difficult to dispose by the computer .So this article deeply studies the problems of Chinese full-text system, with the combination of the project named full-text Retrieval system of Mine Law.In this paper, the author mainly does these below jobs.(1) For avoiding the syncopate of Chinese words, the author take the Chinese text search on single word. Based on this point the author discussed in detail about a series of Chinese information-disposing technologies such as the syncopate technology, inverted index technology, model of index compressing and searching arithmetic.(2) The author summarizes and compares the index compressing arithmetic. In this paper, the author brings forward the method of Bernoulli model, on the basis of the fact of the project.(3) Based on full-text searching algorithm for single Chinese word, the author improves the searching algorithm, and uses parallel calculating technology and buffering technology. At the same time, the author discusses the searching design deeply.(4) Aimed at the needing fact of the enterprise, the author embeds the Chinese information processing components in lucene that supports western language only, and creates a tool package of full-text index on which a Chinese full-text system of mine law is builded.(5) The author uses B/S mode, the architecture of MVC, the thinking of OOA and OOP.Besides that the author also uses the language of Java.The running successfully of the system shows that it has a good performance based on the above technologies, and makes the documents of mine laws be keeped orderly, and makes a great advance in getting the information and knowledge of mine lawsv for the people.
Keywords/Search Tags:Chinese Full-Text Rretrieval, Full-Ttext Database, Lucene, Chinese Word Index
PDF Full Text Request
Related items