Font Size: a A A

Data quality rules in the Analytic Health Repository

Posted on:2013-08-31Degree:M.SType:Thesis
University:The University of UtahCandidate:Pollock, Susan ElizabethFull Text:PDF
GTID:2458390008983450Subject:Information Technology
Abstract/Summary:
Data quality has become a significant issue in healthcare as large preexisting databases are integrated to provide greater depth for research and process improvement. Large scale data integration exposes and compounds data quality issues latent in source systems. Although the problems related to data quality in transactional databases have been identified and well-addressed, the application of data quality constraints to large scale data repositories has not and requires novel applications of traditional concepts and methodologies.;Despite an abundance of data quality theory, tools and software, there is no consensual technique available to guide developers in the identification of data integrity issues and the application of data quality rules in warehouse-type applications. Data quality measures are frequently developed on an ad hoc basis or methods designed to assure data quality in transactional systems are loosely applied to analytic data stores. These measures are inadequate to address the complex data quality issues in large, integrated data repositories particularly in the healthcare domain with its heterogeneous source systems.;This study derives a taxonomy of data quality rules from relational database theory. It describes the development and implementation of data quality rules in the Analytic Health Repository at Intermountain Healthcare and situates the data quality rules in the taxonomy. Further, it identifies areas in which more rigorous data quality should be explored. This comparison demonstrates the superiority of a structured approach to data quality rule identification.
Keywords/Search Tags:Data quality, Health, Large scale data
Related items