The Research Of Data Warehouse Etl Distribution And Scheduling Model

Posted on:2011-04-07

Degree:Master

Type:Thesis

Country:China

Candidate:E Z Hou

Full Text:PDF

GTID:2198330332483437

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

In recent years, with the development of society, competition between enterprises is becoming increasingly fierce. So enterprises have to improve the competitiveness of the company for its own survival and development. It is the key to business success depends on the correct business decision or not. Therefore, it is growing that enterprise dependent on the decision support system. Enterprise have called for the establishment of data warehouse to store, analyze, process data, and ultimately provide the basis for the corporate decision-making. Data warehouse system in the process of handling data, often dealing with the relationship between the data precedence constraints. This process is a data warehouse in one of the most challenging. This article is a precedence constraint to deal with this multi-task scheduling problem.This paper analyzes the research situation, and introduces related knowledge, and then constraints on task scheduling problem in making the required number. With a directed acyclic graph that the relationship between tasks, a mathematical model. In solving the model, the presented popular method, and we compare the advantages and disadvantages of each method. Finally, we select genetic algorithm to solve this model.In the process of genetic algorithm, task, the processor that are encoded. It has precedence constraints between the tasks. The individual by encoded may not satisfy the constraint, so this paper adjusts the result by a reverse adjacency list. Depending on the requirements of the shortest, and giving the corresponding fitness function. When making individual choices, using roulette wheel method and the best combining individual preservation. According to the individual on the task of encoding processors, respectively, cross-individual variation also used different strategies. At last, we confirm the validity of the algorithm by the simulation experiment...

Keywords/Search Tags:

Data Mining system, genetic algorithms, Inverse adjacency list, Distributed task scheduling, Precedence constraints

PDF Full Text Request

Related items

1	The Research Of Data Warehouse Etl Distribution And Scheduling Model
2	Real-time scheduling of robotic and control systems with task dependencies
3	Task Scheduling Problem In Distributed Systems And Genetic Algorithms Applied Research
4	Research On Task Scheduling Algorithms For Distributed Systems Based On Computational Intelligence
5	Research On Causal Consistency Model Based On Grouping Strategy And Adjacency List
6	Research Of Scheduling Algorithms In Distributed Systems
7	Research On Policy And Algorithm Of Task Scheduling In Real-time System
8	Research On Real-time Task Scheduling Algorithm Of Reconfigurable System
9	Two Styles Of Hybrid Genetic Algorithms For Task Scheduling Problem In Optical Network Based On Distributed Computing System
10	Research On Task Scheduling Algorithms Based On Pre-Release Resource List In Hadoop