Font Size: a A A

Research And Implement Of Compression System Based On Network Information For Search Engine

Posted on:2013-03-11Degree:MasterType:Thesis
Country:ChinaCandidate:Z X BaoFull Text:PDF
GTID:2248330392957640Subject:Circuits and Systems
Abstract/Summary:PDF Full Text Request
With the rapid development of Internet, network information shows an explosiveincreasing trend. In order to obtain these vast amounts of information the search enginecame into being. How to use effective technology to store and retrieve the massive searchengine information is of great significance. At present, search engines commonly usedatabase or index to store this information and provide an interface to retrieve. This paperimplements a combination of compression and indexing of technical methods to solve thesearch engine network information storage and retrieval problems.This paper first introduces the search engine related technology, and expounds theZip and GZip algorithms, which are the theoretical basis for the final system of mine.Through analyzing the characteristics of network information for search engine,which are large amount, time-intensive, multimedia information and hyperlink, I design ascheme in which the Zip and GZip algorithms are used for text and non-text information,respectively, and which based on network information for search engine.Then we implement the system. The system is divided into three parts, and in the firstpart, it collect network information. In the second part, it processes information usingcompression algorithms and stores data. In the final part, it decompresses the data andretrieves the original information.As the final work, I test the system and use accuracy, compression ratio, compressionspeed three indicators to evaluate the effectiveness of the system based on networkinformation for search engine. The experimental data prove that the compressionalgorithm of the system has high compression ratio and compression speed, andpracticability, thus we can say that it effectively solves the search engine’s data storageand retrieval problems.
Keywords/Search Tags:Search engine, Compression, Decompression, Compression ratio, Compressing velocity
PDF Full Text Request
Related items