Font Size: a A A

The Research And Implement About ETL Tool Based On Workflow And Metadata

Posted on:2007-02-11Degree:MasterType:Thesis
Country:ChinaCandidate:H ZhangFull Text:PDF
GTID:2178360215995271Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Under the global economic integration background, corresponding with the fact that Internet and the IT fleet developing, enterprise building information system has become an inevitable trend. However enterprise in the process of information construction has been mostly put into effect in a lot of different enterprise application system. Therefore, every enterprise have to reinforce information share and application integrating among deferent branches and deferent information systems. So collecting information and analysis manage have become key. The ETL (Extract-Transform-Load) tool provide data to data warehousing by extracting, transforming and loading deferent data storage format. So it can share resource and provide data to decision-maker. It is to say the ETL tool resolve the problem of isolated information island from data integrated angle.According to the paper research aim, it elaborated the research and implement about ETL tool. The paper firstly introduced related background knowledge of the ETL tool, and has analyzed the research present situation of ETL tool. According to the character of data sources with different architecture, the ETL tool with common technology of data accessing (ADO.NET), workflow engine, management of repository, abundant data cleaning functions, friendly user interface and mufti-thread data manipulation is provided. The design and implement of this system is completed in the article.In this article, all function modules have the corresponding model. The main research work and innovating spot of this topic is: the model of work flow, the model of ETL metadata, the model of data cushion, the model of concurrent rule execution. The model of work flow is the foundation of every task that is automatic execution on time and event. The realization of metadata is the foundation of the error control of data, the quality examination of data, the definition of ETL. The design of data cushion model has overcome the erroneous question in the data sheet records, and provided the good quality of data for the data conversion. The model of concurrent rule execution establishes above the model of data cushion. The many sub-cushions of the model of data cushion have provided the foundation for the concurrent execution of multi-thread. The model of concurrent rule execution enhanced the performance of system and the throughput of system.The paper finally has carried on the summary to the work of ETL, and elaborated some work which might further consummate in the future.
Keywords/Search Tags:etl, work flow, metadata, data cleaning, data transformation, data mapping
PDF Full Text Request
Related items