Font Size: a A A

Analysis And Implement Of Data Management Based On ETL

Posted on:2009-02-09Degree:MasterType:Thesis
Country:ChinaCandidate:B WangFull Text:PDF
GTID:2178360245474046Subject:Software engineering
Abstract/Summary:PDF Full Text Request
With the rapid development of computer networks and database technology, people have more and more ways to get data, the volume and amount of data are increasing dramatically. As an important member of the society, the university has changed a lot at information acquiring and utilizing. There are much more officers in the department of university who use the right software to finish their work to improve the efficiency.But all kinds of information would make some trouble which needs us to figure out on our database management, especially at data cleaning and replication, such as how to correct data errors, avoid wrong decisions and reduce the risk of making decision? How to exchange and share resources and meanwhile we can manage and use? Data cleaning and replication can make it. Firstly, we get the credible, safety and consistency data from the data cleaning tool; Secondly, we integrate all these cleaned data to our public database, so every departments are able to share the data they need.This paper is about data cleaning and replication based on the ETL structure, and its main tasks are as follow:(1) Get a survey about the data cleaning and replication in china and abroad;(2) Point out the data problems between the departments, such as data source, data quality and data consistence.(3)Analyze the data problems about quality and consistence, and make data cleaning plan and replication plan.(4) In order to share the data sources, the paper described how to extract data from distinguished data sources ,and transfer them after finishing data cleaning according to the required rules, then replicate them to the target database, that's public database with the relevant tool, called Oracle Data Integrator(abbr. ODI)(5) The paper need to make progress at how to draw up the stratagem of data cleaning and how to balance the efficiency and property for the data replication.
Keywords/Search Tags:data cleaning, data replication, ODI
PDF Full Text Request
Related items