Font Size: a A A

Data Redundancy Strategy Based On RS Erasure Codes

Posted on:2016-05-27Degree:MasterType:Thesis
Country:ChinaCandidate:X L LiFull Text:PDF
GTID:2348330542475782Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
Due to the distance education and education resource sharing are based on the data network structure,more convenient and highly efficient education resources are still urgently desired.The characteristics of education resources is that the visit amount is larger for the resource which is relatively new,but the traffic is very small for historical resource data which reliability of data is not high.The two data availability require quite different.This paper analyzes the principle and classification of erasure codes in detail.In this paper,the application of erasure codes in data redundancy is also researched in detail.At the same time,it introduces the relevant theoretical knowledge of HDFS and the function of data redundancy strategy.According to the characteristics of education resource management,the difference of page view in different periods is large.Then the requirement of data availability is large.The strategy of data redundancy is 3 replicas by default in traditional system.It exists the problems of costing too much storage and the unbalanced load.In order to solve the above problem,it puts forward the dynamic copy strategy based on RS erasure codes.Firstly,according to the characteristics of education resource,the strategy of 3 replicas as default is adjusted to the strategy of file heat which is calculated by the visits of information.According to the heat level,the number of replicas is adjusted.On the one hand,the problem of load imbalance caused by high traffic have been solved,on the other hand,the problem of the waste of storage space due to less traffic have been solved.History traffic is introduced into the formula to improve the formula of heat file.It will make the threshold calculation conform to the characteristics of education resources.Secondly,it introduces the RS erasure codes in order to solve the problem of occupying more storage space by low heat files.Low heat file is encoded by RS erasure codes.It ensures the high reliability and availability while it saves the storage space.Finally,it puts forward the replica placement strategy based on grey prediction system to solve the problem of resource jitter and replication latency.It can use gray system to predict the traffic of next cycle based on historical traffic.Then the file heat is calculated to realize replications adjustment in advance.According to the characteristics of education resources,the related parameters of RS erasure codes are selected through experimental analysis.In this paper,it uses MATLAB as an experimental tool.It can predict the traffic of next cycle based on historical traffic.The proposedstrategy is validated through simulation experiments.The experimental results show that the proposed strategy can improve the reliability and availability of system data and reduce the cost of storage space effectively.The proposed strategy in this paper is verified in the treatment of the effectiveness of the education resources data redundancy.
Keywords/Search Tags:Education resource management, Replica placement, RS erasure codes, Heat, Prediction
PDF Full Text Request
Related items