Research On Data Placement Strategy For Data-intensive Applications In Cloud

Posted on:2014-02-24

Degree:Master

Type:Thesis

Country:China

Candidate:J Kang

Full Text:PDF

GTID:2248330395984277

Subject:Software engineering

Abstract/Summary:

In recent years, with the continuous development of the Internet and cloud computing, data-intensive applications have received more and more attention. These applications produce large quantities of data, which poses major challenges for building cloud computing systems. To achieve rational data placement, this thesis analyzes the related concepts and key technologies of cloud computing in depth, and then proposes two data placement strategies: one using an improved K-means clustering algorithm, and another using an improved K-means clustering algorithm based on Fisher's linear discriminant analysis. Both aim to reduce the number of data movements in the system.

In the first strategy, the improved K-means clustering algorithm optimizes the selection of the initial cluster centers, which improves the quality of the clustering results. A corresponding data placement strategy is designed on the basis of this algorithm. The results show that it reduces the number of data movements by 43% compared with the random algorithm.

The second strategy combines Fisher's linear discriminant analysis with the improved K-means clustering algorithm in order to refine the boundary of the data centers. A corresponding data placement strategy is again designed on the basis of the improved algorithm. The results show that it reduces the number of data movements by 26.6% compared with the random algorithm.

Together, the two strategies can reduce the number of data movements that data-intensive applications make across multiple data centers at runtime, and effectively improve the clustering quality and the overall utility.
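The clustering-based placement idea in the abstract can be illustrated with a small sketch. The abstract does not say how the improved algorithm selects its initial centers, so the farthest-first seeding below is only an assumed stand-in, not the thesis's method. The dependency-vector encoding (one 0/1 entry per task) and the movement model (each task runs at the data center that already holds most of its inputs) are likewise assumptions made for illustration, and all function names are hypothetical.

```python
from collections import Counter

def dependency_vectors(task_inputs, n_datasets):
    """One 0/1 vector per dataset: entry t is 1 if task t reads that dataset."""
    vectors = [[0] * len(task_inputs) for _ in range(n_datasets)]
    for t, inputs in enumerate(task_inputs):
        for d in inputs:
            vectors[d][t] = 1
    return vectors

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def farthest_first_centers(points, k):
    """Assumed seeding rule: start from the first point, then repeatedly
    add the point farthest from all centers chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        nxt = max(points, key=lambda p: min(sq_dist(p, c) for c in centers))
        centers.append(nxt)
    return [list(c) for c in centers]

def kmeans(points, k, iters=20):
    """Plain K-means on top of the seeding above; returns one label per point."""
    centers = farthest_first_centers(points, k)
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centers[j]))
                      for p in points]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:  # keep the old center if a cluster is empty
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def count_movements(task_inputs, placement):
    """Assumed cost model: a task runs where most of its inputs live,
    and every other input counts as one data movement."""
    moves = 0
    for inputs in task_inputs:
        per_center = Counter(placement[d] for d in inputs)
        moves += len(inputs) - max(per_center.values())
    return moves

# Hypothetical workload: 4 tasks over 6 datasets, placed on 2 data centers.
task_inputs = [[0, 1, 2], [0, 1], [3, 4, 5], [4, 5]]
vectors = dependency_vectors(task_inputs, n_datasets=6)
clustered = kmeans(vectors, k=2)
interleaved = [0, 1, 0, 1, 0, 1]  # a deliberately poor placement for contrast
print(count_movements(task_inputs, clustered),
      count_movements(task_inputs, interleaved))
```

Datasets read by the same tasks have similar dependency vectors, so clustering them onto the same data center tends to keep each task's inputs together. The toy workload is far too small to say anything about the 43% and 26.6% figures reported in the abstract.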

Keywords/Search Tags:

Cloud computing, Data-intensive, Data Placement, K-means clustering algorithm, Fisher's linear discriminant analysis

Related items

1	Research On Clustering Analysis Algorithm And Implementation In Data-intensive Computing Environments
2	Research On Optimization Of Map Reduce For Interactive Analysis On Big Data
3	Data Placement Strategy Research For Scientific Workflow In Hybrid Cloud Computing
4	Replica Optimization Mechanism For Data-Intensive Applications Under Cloud-Edge Collaboration
5	Parallel Optimization Of Data Intensive Computing On Sunway TaihuLight
6	Data Placement Strategy For Data-intensive Applications In Cloud Storage System
7	Research On Several Key Technologies For Data-intensive Heterogeneous Environments
8	Research On Cloud Computing Search Engine Design And Parallelization K-means Clustering Algorithms For Big Data
9	Clustering Algorithm Based On The Background Of Big Data
10	Data Placement Strategy Towards Efficient Execution Of Scientific Workflows In Cloud Computing Platform