Optimization Of Cloud Storage Qo S Based On Deep Reinforcement Learning

Posted on:2021-02-21

Degree:Master

Type:Thesis

Country:China

Candidate:X P Li

Full Text:PDF

GTID:2428330647461960

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

With the rapid growth of the global data volume,human beings have entered the era of big data,and the first problem facing them is how to store these massive data.Because the traditional centralized data storage methods have many restrictions,such as high cost and poor scalability.It can no longer meet the storage requirements of massive data.Therefore,various distributed storage systems have emerged at the historic moment.The QoS performance of the system has become a research hotspot.An excellent data distribution strategy is the key to designing a distributed storage system,which determines how to distribute massive data to data storage nodes as evenly as possible,and the balance of data distribution has a great impact on data read and write performance.Affects the QoS performance of the cloud storage system.CRUSH algorithm,as the core algorithm of data distribution in Ceph distributed storage system,is a pseudorandom data distribution algorithm with high efficiency and high scalability.However,the CRUSH algorithm has the problem that the data is unevenly distributed on the storage nodes and affects the read and write QoS performance,which is specifically manifested in two aspects: storage nodes with a large amount of data will inevitably bear the load of heavy reads and writes,and it is easy to become a system performance bottle;on the other hand,the relatively low utilization of storage nodes with a small amount of data,is a waste of resources and reduces the QoS performance of the system.Aiming at the above problems,the work of this paper is based on the current widely studied and used cloud storage system Ceph.Its main work and innovation are as follows:(1)In a small-scale scenario,aiming at the problem that the Ceph data distribution algorithm CRUSH has an uneven distribution of data on the device space,which ultimately affects the read and write QoS performance of the cloud storage system,this paper designs and implements a data based on reinforcement learning The distribution algorithm RLCRUSH,first performs theoretical analysis and experimental verification from the data distribution process and the algorithm itself.It is concluded that the uneven distribution of PGs on the OSD nodes is the cause of the uneven distribution of user data on various storage devices.The three steps of modeling,designing a training model based on reinforcement learning,and designing and implementing a new model-based algorithm are used to improve the existing CRUSH algorithm,so that the PG can be approximately evenly distributed to each OSD node,eliminating uneven data distribution.The resulting system bottleneck,which improves OSD disk usage and QoS performance of cloud storage systems.(2)For medium and large-scale scenarios,the environment becomes more complicated due to the increase in scale.We have designed and implemented a new data distribution algorithm DRL-CRUSH based on deep reinforcement learning.The ability to quickly extract and transform information combined with the ability of reinforcement learning to make automatic decisions and control improves the original data distribution algorithm.The experimental results show that the algorithm still has obvious distribution equilibrium of PG among OSD nodes in medium and large-scale scenarios.Improvement,and has better time performance.(3)At the same time,the current algorithms based on reinforcement learning are mostly used in video games,robot control,and unmanned driving.We try to introduce them into the cloud storage field,and use reinforcement learning to continuously interact with the environment,feedback and automatic decision-making.And the ability to optimize goals to optimize the QoS performance of cloud storage systems,and hope to bring some inspiration to researchers in different fields.

Keywords/Search Tags:

Cloud storage, Data placement, Lad-balance, Reinforcement learning, QoS

PDF Full Text Request

Related items

1	Research And Implementation On Data Placement And Load Balance Strategy For Multicloud Storage System
2	Research And Experiment About The Data Replica Placement Algorithm In Cloud Storage System
3	Research On Cloud Data Placement Method For Cloud Storage System
4	Data Placement Strategy For Data-intensive Applications In Cloud Storage System
5	Data Placement Optimization Strategy For Social Networks In Cloud Computing
6	Research On Strategy Of Data Replica Placement For Geo-distributed Cloud Storage Services
7	The Research On Data Placement And Schedule Scheme In Large-Scale Multimedia Storage System
8	Research On Data Replication Strategy In Cloud Storage
9	Research On Big Data Placement For Low Latency Based On Refinforcement Learning
10	Research Of Replica Placement Strategy In Cloud Storage System