
Design And Implementation Of Data Quality Management Tool Of Data Center

Posted on: 2014-11-12
Degree: Master
Type: Thesis
Country: China
Candidate: Z C Liu
GTID: 2268330422963438
Subject: Computer software and theory
Abstract/Summary:
With the development of information technology, large amounts of business data have accumulated across industries. Data centers have been built to make good use of this data, and a variety of data cleansing tools have been developed so that data extracted into a data center meets data quality requirements. However, the extracted data may still have quality problems, for example because of logic errors or differing concerns during the cleansing process. It is therefore necessary to detect the quality of the data extracted into the data center.

To address the quality of data in the data center, a data quality management tool based on the data center is designed, including an analysis of the data quality model and the design of the tool's architecture. The tool has four management modules: the data source management module, the standardization management module, the data detecting management module, and the analysis and visualization module. The data source module manages information about the heterogeneous data sources in the data center. The standardization module manages the analysis and implementation of the standardization meta rules, together with the standardization process, which standardizes the data sources according to the associations between standardization rules and data sources. The data detecting module manages the implementation of four data detecting rules proposed for the data quality properties, together with the detecting process, which applies the related detecting rules to the data sets in the data sources. The analysis and visualization module uses the detected data to analyze each data quality property and the overall data quality of the data sets, and then gives advice based on the results.

Testing of the data quality management tool and analysis of the results show that the tool's functions are useful and that it can handle the data in the data center effectively.
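The detecting process described above (rules applied to data sets, followed by analysis and advice) can be illustrated with a minimal Python sketch. The abstract does not specify the four detecting rules or any interfaces, so everything here is a hypothetical assumption for illustration: the `DetectingRule` class, the `detect` and `advise` functions, the two sample rules, and the 0.9 threshold.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# A record is one row of a data set; values may be missing (None).
Record = Dict[str, Optional[str]]


@dataclass
class DetectingRule:
    """Hypothetical detecting rule: a named check on one column."""
    name: str
    column: str
    check: Callable[[Optional[str]], bool]


def detect(records: List[Record], rules: List[DetectingRule]) -> Dict[str, float]:
    """Return, for each rule, the fraction of records that pass it."""
    results: Dict[str, float] = {}
    for rule in rules:
        passed = sum(1 for r in records if rule.check(r.get(rule.column)))
        # An empty data set is treated as passing (an assumption).
        results[rule.name] = passed / len(records) if records else 1.0
    return results


def advise(results: Dict[str, float], threshold: float = 0.9) -> List[str]:
    """Give simple advice for rules whose pass rate is below the threshold."""
    return [
        f"Review rule '{name}': pass rate {rate:.0%}"
        for name, rate in results.items()
        if rate < threshold
    ]


# Illustrative data and rules (not taken from the thesis).
records: List[Record] = [
    {"id": "1", "phone": "13800000000"},
    {"id": "2", "phone": None},
    {"id": "3", "phone": "abc"},
    {"id": "4", "phone": "13900000000"},
]
rules = [
    DetectingRule("phone_not_null", "phone", lambda v: v is not None),
    DetectingRule("phone_is_digits", "phone", lambda v: v is not None and v.isdigit()),
]

results = detect(records, rules)
for line in advise(results):
    print(line)
```

Keeping each rule as data (a name, a column, and a check) is one way to let rules be associated with different data sources without changing the detecting loop; the thesis may organize this differently.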
Keywords/Search Tags:data center, data quality, data quality property, standardization