Font Size: a A A

Research On Technology Of Massive Data Stores Based On Cloud Computing

Posted on:2012-07-28Degree:MasterType:Thesis
Country:ChinaCandidate:L ChenFull Text:PDF
GTID:2178330335499596Subject:Computer applications
Abstract/Summary:PDF Full Text Request
Cloud computing is for the first three quarters in 2007 before the birth process, and only after half year, Cloud computing is a hot point in computer industry today. Research on cloud computing have started in many companies and universities. Cloud computing is a emerging way of Shared infrastructure, will eventually become a universal service.Essentially, Cloud computing is an extension of distributed computing and grid computing.Base on the starting point of this problem, after analyzing the existing key technologies of distributed storage and computing, combined with Hadoop cluster technology research,which is based on the massive Hadoop data storage and computing model, and from the data structure design, program flow and use of programming to introduce several aspects of the development of this model.The model is applied to the data process of large-scale(Frequency Statistics). The model could also applied on web site log,search engines and large-scale files storage.This study using leading edge distributed technical framework to meet the demand of the project and deploy the model to actual instance. The characteristic of this study is the integration of model research and business applications. Using leading edge distributed technical framework to meet the demand of the project and deploy the model to actual instance. With the experimental results for testing models of practical value, such as high-efficiency, low-cost, scalability and maintenance and so on. We also perform the performance optimization against the basic model on the basis of the integration with original pre-process system, example for the refinement of simplified rules and so on. With the experimental results for testing models of practical value. The experimental results show that using Hadoop which is a cloud computing platform can effectively enhance the speed of massive data processing. It provides a good solution for large-scale data-processing.
Keywords/Search Tags:cloud computing, Hadoop, distributed, massive data, Frequency Statistics
PDF Full Text Request
Related items