Font Size: a A A

Analysis And Design Of Domain Based Chinese Data Cleansing System

Posted on:2009-09-12Degree:MasterType:Thesis
Country:ChinaCandidate:X S ZhengFull Text:PDF
GTID:2178360242992466Subject:Management Science and Engineering
Abstract/Summary:PDF Full Text Request
As a new means of marketing, Database Marketing can greatly help enterprise hold their customers'info to carry out personalized customer services or marketing activities. But as the increase of information system data and the integration of different systems, the"dirty data"problem which affects the data quality arisen. The aim of data cleansing is to solve kinds of"dirty data"problem, and improve the data quality to support the analysis and other applications for a enterprise more strongly. On the basis of the existing theories of data cleansing, imported the conception of domain.Designed verify rules and connotations of twelve domains and relationship between some of them. Then designed a domain based data cleansing system. And treated it as a support of the usual checking methods of duplicated data. Designed corresponding data analysis reports and the ways to handle null and aberrant data for the domain based thought. Also proposed a"Bifurcate B- Tree"data structure, which is helpful for the efficiency of data cutting.
Keywords/Search Tags:DATA CLEANSING, DOMAIN, DUPLICATED DATA, DATA ANALYSIS REPORT, DATA CUTTING
PDF Full Text Request
Related items