Font Size: a A A

Research On NRC-Based Data Provenance Tracing And Their Applications

Posted on:2012-08-11Degree:MasterType:Thesis
Country:ChinaCandidate:B LiuFull Text:PDF
GTID:2178330338496195Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
With the information flow accelerates as a result of the emergence of cloud storage and popularization of the Internet, a enormous, complex and heterogeneous data environment is forming in kinds of industries. In these complex data environments, some public databases are of utmost importance in professional fields as biology and astronomy and so on. The lack of unified standard as these data were collected and processed causes that data may vary widely in terms of quality. Therefore, analysing and restoring the process of data's generation and evolution means a lot to evaluation of data quality and correction of data fault. Data provenance aims at studying this problem.The research of this paper mainly consists of the following aspects:1,Studies related concepts and techniques of relational data provenance at present and focuses on the research of data provenance including queries with aggregate functions.2,According to the problems of query equivalence and aggregate function of relational data provenance, extends nested relational calculation expression and takes aggregate functions as basic operators. The NRC can transform with relational algebra expression after extended, laying foundations for implementing specific provenance tracing system.3,Builds a provenance tracing model based on NRC, introduces annotation model, derives query expression's actions by analysing output annotations from query expression, thus gets the dependency between outputs and inputs.4,Studies how to build a workable provenance tracing system. At last, in order to increase query and storage efficiency of provenance tracing system, this paper improves the storage model. Experiment results show that the improved storage model has a better performance compared to the original one.
Keywords/Search Tags:data provenance, nested relational calculation, aggregate function, relational algebra
PDF Full Text Request
Related items