Font Size: a A A

The Reserach And Optimization Of Dedupli Cation Key Technologies And SRC Routing Protocol

Posted on:2014-02-08Degree:MasterType:Thesis
Country:ChinaCandidate:Q Q XingFull Text:PDF
GTID:2248330398461096Subject:Computer system architecture
Abstract/Summary:PDF Full Text Request
The explosive growth of data as well as large-scale centralized storage makes waste of space caused by the duplication of data; the problem is getting worse, which prompted the emergence and development of data deduplication technology. The concept of deduplication is very simple, if we do one minute’elevator speech":a vast Britannia Encyclopedia Series include44million words, a total of more than30,000words, and all papers really have simple26letters. Vast amounts of data up to EB level data deduplication technology is to find’letter’s in the massive data and the data is in the form of "letters", which would duplicate data for better storage space cost-effective.The current research work in duplication has made a series of valuable results to eliminate redundancy data, such as performance optimization, distributed routing algorithm which effectively promotes the application of the technology. In service-oriented distributed deduplication, deduplication system needs to support the regulation of the quality of service; this project is to study this problem, which targets to deduplication technology to establish duplicate data-based multi-strategy design and optimization technology.First, deduplication technology system is composed by key technologies, key indicators.and in order to establish a data deduplication system prototype to eliminate redundancy data, we build a engines as a key core deduplication technology, which includs the routing algorithm, block data warehouse, parallel pipelined control network communication protocols. Based on the key technology of model analysis to index model, data model analysis, performance model analysis and validation from the theoretical point of view. Second, the routing algorithm is a key technology in distributed deduplication storage systems, but existing routing algorithms cannot meet the requirements of distributed systems eliminate redundant efficiency, data migration, and cluster elastic Therefore, our innovative design based on Chord complete convergence of the algorithm the similar route detection algorithm SRC (Similarity Routing Based on Chord), and from a theoretical point of view, to prove consistency, further details the three stages of the SRC routing algorithm. Finally, starting from the model analysis results of the three key technologies, specific technology strategy improved optimization program including the organization of graded index optimization, optimization of migration based on data value, based on the the read request recombinant performance optimization.During the experiment, the completion of a distributed cluster environment to build, and select a set of test data and experimental tools to complete the read and write to concurrency corresponding time test cluster literacy test, routing algorithms, load balancing, node fault tolerance testing. The experiments prove that the strategy optimization of key technologies in distributed deduplication system, and SRC routing algorithm design significantly overcome the original technology system in the hot spot bottlenecks and performance deficiencies, multi-angle, deep-seated, and the completion of the wide-ranging The key technology research deduplication, deduplication technology to further improve service quality, and promote the green storage concept deduplication technology center in the further application of the concept of cloud storage.
Keywords/Search Tags:Data deduplication, Key Technology, SRC Routing Algorithms, DataMigration
PDF Full Text Request
Related items