Design And Implementation Of A Search Engine Used In Optical Jukebox

Posted on: 2014-04-07  Degree: Master  Type: Thesis
Country: China  Candidate: Y Lu  Full Text: PDF
GTID: 2268330422964738  Subject: Computer technology
Abstract/Summary:
In the era of big data, enterprises and organizations are rapidly accumulating large amounts of important data. Optical jukeboxes provide a technical means for the long-term preservation and convenient use of such data, and are mainly used in backup and archiving systems. To manage and use the data conveniently, both kinds of system need a corresponding retrieval mechanism. Traditional retrieval mechanisms search on manually added descriptive information, which results in low efficiency and low availability. Search engines, widely used for information retrieval on the Internet, generally rely on full-text search, letting users find comprehensive and accurate resources in a short time from keywords. Introducing a search engine into an optical jukebox system can give users more convenient retrieval services and, at the same time, improve the efficiency of the optical jukebox.

To establish a full-text search system for optical jukeboxes, the principles of search engines were studied, the architecture of the optical disk library system and the properties of multi-level storage systems were analyzed, and the common retrieval models were evaluated. Several text extraction tools were used to extract content from different types of documents; the ICTCLAS30 tool was used to segment Chinese text into words; an index database was built according to the traditional inverted index algorithm; to reduce the frequency of disc replacement, a sorting method suited to an optical jukebox search system was proposed; a query retrieval server supporting Boolean queries and ranked retrieval was implemented; and a client query program was implemented, with a user interface built with the VC++ 6.0 development tool.

In testing, the search engine returned the correct list of documents related to the given keyword, with relatively high recall and moderate precision. It also supports queries joined by the basic logical operators, and responds to user queries in less than 1 second.
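
The pipeline described in the abstract (inverted index, Boolean queries, and a result ordering that limits disc replacement) can be illustrated with a minimal Python sketch. The abstract does not specify the thesis's actual data structures or sorting method, so everything here is an assumption made for illustration: whitespace tokenization stands in for ICTCLAS segmentation, and the document texts, `disc_of` mapping, `score` values, and all function names are hypothetical.

```python
from collections import defaultdict


def build_index(docs):
    """Build an inverted index: term -> set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        # Assumption: whitespace tokenization replaces real word segmentation.
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def boolean_and(index, terms):
    """Documents that contain every term."""
    postings = [index.get(t.lower(), set()) for t in terms]
    return set.intersection(*postings) if postings else set()


def boolean_or(index, terms):
    """Documents that contain at least one term."""
    result = set()
    for t in terms:
        result |= index.get(t.lower(), set())
    return result


def boolean_not(index, all_ids, term):
    """Documents that do not contain the term."""
    return set(all_ids) - index.get(term.lower(), set())


def order_by_disc(hits, disc_of, score):
    """Order hits so that each disc has to be loaded only once.

    Discs are ranked by their best-scoring hit; within a disc, hits are
    ranked by score. This is one possible disc-aware ordering, not
    necessarily the method used in the thesis.
    """
    by_disc = defaultdict(list)
    for doc_id in hits:
        by_disc[disc_of[doc_id]].append(doc_id)
    discs = sorted(
        by_disc,
        key=lambda disc: (-max(score[d] for d in by_disc[disc]), disc),
    )
    ordered = []
    for disc in discs:
        ordered.extend(sorted(by_disc[disc], key=lambda d: (-score[d], d)))
    return ordered


# Hypothetical sample data: four documents stored on two discs.
docs = {
    1: "optical jukebox backup",
    2: "search engine index",
    3: "optical disc archive search",
    4: "backup archive retrieval",
}
disc_of = {1: "A", 2: "B", 3: "A", 4: "B"}
score = {1: 0.9, 2: 0.8, 3: 0.4, 4: 0.7}  # assumed relevance scores

index = build_index(docs)
and_hits = boolean_and(index, ["optical", "search"])
or_hits = boolean_or(index, ["optical", "backup", "search"])
not_hits = boolean_not(index, docs.keys(), "backup")
ranked = order_by_disc(or_hits, disc_of, score)

print(sorted(and_hits))  # [3]
print(ranked)            # [1, 3, 2, 4]
```

With this sample data, ordering purely by score would give 1, 2, 4, 3, which visits disc A, then B, then A again. Grouping by disc returns both documents on disc A before moving to disc B, at the cost of showing the lowest-scoring hit earlier than a pure relevance ranking would.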
Keywords/Search Tags: Optical jukebox, Search engine, Backup, Archive, Retrieval