Font Size: a A A

Research On The Problem Of Data Quality Control In Data Warehouse

Posted on:2005-02-21Degree:MasterType:Thesis
Country:ChinaCandidate:X XiongFull Text:PDF
GTID:2168360125956419Subject:Information Science
Abstract/Summary:PDF Full Text Request
With the continuous advancement of various computer technology such as data model, database technology and application & development technology, data warehouse technology is also constantly developing and playing an important part in practical application since 1990s. At the same time, the great benefit produced by using data warehouse also stimulates the need for data warehouse technology, data warehouse market is advancing at a rapid trend: on one hand, the market need for data warehouse is becoming larger and larger; on the other hand, data warehouse products become more mature, the factories that produce data warehouse tool are becoming more and more. After many years' development, data warehouse technology is being perfected, but it still exists some problems, for example, it is necessary to improve the content of database, to improve its usability and to improve the quality of operating data warehouse. To make data warehouse more perfect and make it serve the senior managers of enterprise for their scientific decisions much better, the key is to strictly grasp the control of data quality in data warehouse, that is to say, sorting out the "dirty" data, and supplying DSS(Decision Support System ) with clean, integrated, consistent, correct, accurate, harmonious and higher quality data. Studying the problem of data quality control in data warehouse is a thing with practical significance. In view of this idea, this paper analyses and discusses the control of data quality in data warehouse.The first part of this paper analyses the basic theories of database, data mart and data mining that are related to data warehouse, and details the characteristics and system structure of data warehouse .The second part specifies the concepts and quality composition of data, the definition of data quality and the source & classification of data errors in data warehouse.In the third part, the author tries to build the assessment indicators system of data quality, then discusses the control measures of data quality respectively from the problems of simple and multiple data sources, and also puts forward the realization methods of data quality in data warehouse.The last part takes a health care data warehouse as an example, initially analyses its design and the control measures of data quality in health care data warehouse.As for my idea, dividing and evaluating reasonably the problem of data quality in data warehouse is the base to control data quality, and setting up overall quality evaluation index system is the focal point. It is the key to put forward the control measures and realization methods of data quality to solve the problem.
Keywords/Search Tags:data warehouse, data quality, quality control
PDF Full Text Request
Related items