Dynamic Resource Adjustment Of Hadoop-based Big Data Services

Posted on:2014-02-15

Degree:Master

Type:Thesis

Country:China

Candidate:F H Yang

Full Text:PDF

GTID:2248330398494451

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

With the development of technology. From the traditional Internet is widely usedin recent years and the explosive growth of the mobile Internet and Internet ofthings,the data which dependent on the network data growing, according toInternational Data Corporation (IDC),EMC companyâ€™s research shows that in themobile network equipment and video surveillance, the amount of data around theworld has reached487billion GB, and the amount of data in the2007report that only161billion GB. These data include a large number of telephone, mail, photos,SNS,news, and video content. How to effectively use these data to provide users withhigh-quality user experience in scientific research, with a large number of GPS datacollected by the device data collection and research are urgently need to get technicalsupport.In the future, with the further development of the Internet of Things, a largenumber of services is based on the data of the location-based services (LBS), will alsohave a large number of requests services based on LBS or personal preference. Thisalso requires the services in the future is different from now.Google has a wealth ofexperience in data mining,The Hadoop, which is drawing on Googleâ€™s GFS conceptused by many companies whom have a large data store, search and processing.Distributed parallel computing, the processing of large data decomposition to eachnode to do parallel computing, and thus more rapid completion of the data processingis applied to the Web logs, transaction flow, data backup has a huge advantage.Thereserach trying to ming information from structured or unstructured data, summed upthe characteristics of the data, and put data stored in Hadoop cloud computing system,application side access services, issued by the application server request, the nearest access to resources. In this process, Hadoop achieve traversal of resources and toadjust the location of the node and the node number of resource distribution toachieve the best application for the hardware and software resources. Provides a newidea for the LBS-based big data services.The main works are:First, Analysis the big data storage research, including the GFS file system and theHadoop framework, technical support for data stored in large data services.Second, Research data characteristics and the related to data mining algorithmsmining unstructured data summarized read structured data analysis, data hotspotweights initialization.The main results are:First, Based on the existing Hadoop framework.to modify Hadoop storage backupalgorithm right through the resource list of values to achieve the resource file storageresources Points weights in accordance with the data in a modified frame.Second, In this paper, on the basis of Hadoop design resource weight initializationalgorithm, when the resource file access increased or import settings via an external hotweight, calculated and distributed file storage node data resources.Three, Achieve weight adjustment in the process of library resource file hotspotschanging the resource file, and pass the value of the rights to redistribute the dataresource file and adjusted to achieve the service resources amplification algorithm andservice resources contraction algorithm.

Keywords/Search Tags:

Hadoop could, computing, big data service, GFS

PDF Full Text Request

Related items

1	Hadoop Method And System Design Of Massive Sensor Information Processing Based On Service Platform Of The Internet Of Things
2	Research And Construction On Data Acquisition Model Of The Tourism Information Based On Hadoop Cloud Computing
3	The Reseach Of Data Mining Based On HADOOP
4	The Research Of Algorithm About Social Network Recommendation Service Based On Hadoop
5	GPU Computing In Massive Data Processing
6	Research On Optimization Of Map Reduce For Interactive Analysis On Big Data
7	The Design Of The Cloud Computing System Based On Hadoop
8	The Research Of Data Mining Based On Hadoop Platform
9	Design And Implementation Of The Data Analysis System Besed On Hadoop
10	Research On Some Key Technologies Of Service Oriented Data Integration