Font Size: a A A

Design And Implementation Of Clucene-Based Personal Data Retrieval Support System

Posted on:2012-04-06Degree:MasterType:Thesis
Country:ChinaCandidate:T T HanFull Text:PDF
GTID:2178330335960736Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
With the development of Internet and the growing popularity of computer, more and more people have a lot of personal digital information, including SMS, photo, video, email, contact, blog, document and so on. People urgently need to have a handy system to manage and retrieve the personal data. People produce these data mostly depend on events. Hyper-Photo-Link system is a personal data management and retrieval system which retrieves personal data based events as a retrieval unit; the infrastructure of the retrieval system is clucene. It can help users to get other personal data based event from a picture as the starting point.Firstly, the thesis analysis relationship of daily data generated and the event, demonstrate the the feasibility of retrieve the event as the unit; and then gives the system needs analysis, and describes the hardware and software environment; and then describe the system from a functional point of view the overall design, including spatial and temporal clustering and storage of photos, personal data access, personal data retrieval, interface display, which focuses on personal data retrieval section and describes the design of each module; finally focus on description the implementation of each module of individual data retrieval system based clucene, including clucene architecture, clucene indexing and retrieval, and synonym index build, synonyms expansion and improvement of calculating similarity in clucene, highlighte search results, online upgrade of synonyms indexes.Personal data retrieval based on clucene systems description firstly introduces the basic concepts of information retrieval, including query expansion method, similarity calculation method, clucene modules and architecture, wordnet thesaurus introduction; then describe clucene index and retrieval; then describes the design and implementation of Synonyms Index building; and then describe the implementation of clucene retrieval module and the similarity algorithm and how to add the synonym expansion in clucene; and then describes the design and implementation highlight module, and finally describes design and implementation the online upgrade module of synonym Index.System testing part firstly shows the test results of synonyms index, and then show the text synonyms expanded search and sort results, and then shows the test results of synonyms index update; finally shows the photo-super-link system test results.Finally, the thesis made a summary of the text, and describes the shortcomings of the system and further work, summed up the work and results the in period of my graduate students.
Keywords/Search Tags:information retrieval, Clucene, query expansion, WordNet
PDF Full Text Request
Related items