A Priority-based Scheduling Algorithm For Hadoop

Posted on:2013-12-12

Degree:Master

Type:Thesis

Country:China

Candidate:F Fan

Full Text:PDF

GTID:2248330395951217

Subject:Software and theory

Abstract/Summary:

PDF Full Text Request

With the progress of science and technology, cloud computing is deeply root-ed among the people. The distributed platform based on cloud has become a hot spot in research. Hadoop is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. It provides a set of open, stable and reliable dataflow. The Google MapReduce is achieved on Hadoop. Programs could be divided into large amounts of work units, and every unit could be executed on the nodes in the cluster. My research mainly focuses on the field of program scheduling. Hadoop has delivered capacity sched-uler, fair scheduler and HOD scheduler. Nowadays, there are two major directions in the research on Hadoop scheduling algorithms. One bases on the architecture of MapReduce which is trying to reach the goal of optimization through less data shuffling, less I/O throughput and time estimation. Another bases on the Hadoop fair scheduling algorithm which optimizes the strategy of scheduling.According to the deseriptions above, no scheduling algorithm with priority is involved. We develop a priority-based scheduling algorithm in order to reduce the average waiting time of high priority works in a priority work queue. We describe the definition of work priority and architecture of priority-based Hadoop MapReduce. The comparison of average waiting time of every priority level be-tween general scheduling algorithm and and priority-based scheduling algorithm is given through experiments.

Keywords/Search Tags:

cloud computing, Hadoop MapReduce, scheduling algorithm, priori-ty

PDF Full Text Request

Related items

1	Research On Scheduling Algroithm In Hadoop Mapreduce
2	The Mapreduce Model In The Hadoop Implementation Of Performance Analysis And Optimization Improvements
3	The Research Of MapReduce Job Scheduling Algorithm Based On The Hadoop Platform
4	Research And Improvement Of MapReduce Scheduling Mechanism On Cloud Computing
5	Research On Optimization And Improvement Of MapReduce Job Scheduling Algorithm
6	The Research And Implementation Of Hadoop Scheduling Algorithm
7	Research On Algorithm Analysis And Modificating Of Job Scheduling For Hadoop
8	Research On Hadoop Platform And Its Job Scheduling Algorithm
9	Research And Improvement Of Job Scheduling Algorithm Based On Hadoop
10	The Research Of Hadoop Scheduling Algorithm And Improvement Strategy