Font Size: a A A

Based On B-Tree Indexes Of Fuzzy Query Library System Design And Implementation

Posted on:2016-05-28Degree:MasterType:Thesis
Country:ChinaCandidate:W XiaoFull Text:PDF
GTID:2348330488477270Subject:Software engineering
Abstract/Summary:PDF Full Text Request
With the rapid development of network and science technology, a better word library support is especially necessary if you want to make retrieval more accurate or to be with a higher querying efficiency while the computer configuration is not a factor which affects users' retrieval efficiency. However, the existing proofreading software published by the press cannot check out many wrong words, such as the latest Internet words, the updated words in Chinese dictionary, and the names and positions of national or local leaders. For this demand, related words fuzzy query in efficient thesaurus and recognition strategy are proposed.In this paper, we carry on the thesaurus search by crawler technology combined with fuzzy algorithm. And consequently design a related words fuzzy query thesaurus system based on b-tree index. The main steps of this topic contains: Firstly, gaining system requirements and confirmation by the analysis of the actual demand of the users and collected data. After confirming system requirements, we set up the thesaurus. Due to the rapid change of information, web crawler is applied in this article in order to ensure that the data can be updated in time and regularly update the existing thesaurus with expands network words. Then, use the b-tree index as the index method of words and polyphone, words and related phrases, words or phrases and its synonyms and antonyms. Design b-tree index structure of the thesaurus,establish subject index according to demands. Finally, on the basis of precise query we develop fuzzy query by analyzing the existing precise query algorithm, we adjusted the scope of fuzzy via combining the concept of fuzzy membership function and fuzzy operators, gradually complete a better required querying process from precise to fuzzy, so that the system can meet the higher needs of most users.This paper chooses thesaurus management software to begin the research and experiment of the fuzzy search which has been applied in a real environment. The designed system in this paper is verified to be more effective and have a high rate of query; the result of the query can be effectively sorted which can meet the needs of the users' information searching.
Keywords/Search Tags:thesaurus management, fuzzy query, Web crawler, B-tree index
PDF Full Text Request
Related items