Font Size: a A A

Research And Realization On Full Text Retrieval System Based On Indexing Of Single Chinese Character

Posted on:2011-05-14Degree:MasterType:Thesis
Country:ChinaCandidate:M XiFull Text:PDF
GTID:2178330332988303Subject:Computer system architecture
Abstract/Summary:PDF Full Text Request
Full-text retrieval is a very important branch of modern information retrieval technology, and it is a powerful tool for dealing with unstructured data. In recent year, one of the most important field of full-text retrieval is office automation, and with the quick development of office automation, full-text retrieval technology especially Chinese full-text retrieval technology is needed increasingly.In this paper, the existing full-text retrieval technology was analyzed, in particular, this paper has carried on the comparison based on words and based on single Chinese character different full-text retrieval algorithm, and analyzed their respective advantages and disadvantages and implementation difficulties. In view of the local chronicles information this professional field's characteristic, this paper proposed an effective single Chinese character-based inverted index files storage structure and retrieval method, which has the recall rate of 100%.In the application, this paper designed and implemented to local chronicles information center database system, and in view of PDF documents to create the single Chinese character-based index and retrieval mechanisms, then located the specific location of keywords to the page and highlighted the keywords. According to the actual needs, this paper designed and implemented two kinds of indices, the first index located the keywords to the PDF documents which contained the keywords, the second index located the keywords to the specific coordinate position of the page.
Keywords/Search Tags:Local Chronicles information, Full-Text Retrieval, Single Chinese character-based index, Inverted files
PDF Full Text Request
Related items