
Research on XML Data Repair and Transformation

Posted on: 2013-01-26    Degree: Master    Type: Thesis
Country: China    Candidate: L Y Zhang    Full Text: PDF
GTID: 2248330395950573    Subject: Computer software and theory
Abstract/Summary:
We study the problem of XML data exchange, which has two aspects: the quality of the data, and the mapping between different schemas together with its implementation. Before data exchange, the data must be clean with respect to its functional dependencies; otherwise the exchange will fail. We therefore repair the XML data first. We use XML functional dependencies to describe data consistency and repair violations by value modifications. Each leaf node in the XML document is assigned a weight, and the cost of a repair is measured by the total weight of the modified nodes. We provide an efficient two-step heuristic method to repair XML functional dependency violations. First, the initial violations are captured and fixed by leveraging the conflict hypergraph. Second, the remaining conflicts are resolved by modifying the violating nodes and their related nodes, called determinants, in a way that guarantees no new violations are introduced. We then perform data exchange on the repaired result. For data exchange, we enhance the tree-pattern-based mapping language by adding support for sets, so that a mapping can express nodes whose number is not fixed. We then introduce a "path query language" that works on SAX streams. Mappings can be easily converted to this language, and all the data needed for data exchange can be extracted in a single pass over the SAX stream. Our implementation consists of three steps: extract data, convert data, and merge data. Because it is based on SAX streams, it can exchange large data sets with complex mappings efficiently. The experimental results demonstrate that our algorithm is efficient and scales well, and that the repair algorithm is effective in terms of quality.
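The single-pass extraction idea can be illustrated with a minimal Python sketch. It does not reproduce the thesis's path query language, whose syntax the abstract does not give. It assumes a query is a plain root-to-node path such as /db/course/student, and the names PathExtractor, extract, and DOC are hypothetical. Each query returns a list of values, which loosely mirrors the set support described above, since the number of matching nodes is not fixed.

```python
import xml.sax


class PathExtractor(xml.sax.ContentHandler):
    """Collect the text of every element whose root-to-node path matches a query.

    Assumption: a query is a plain absolute path like "/db/course/student".
    This is an illustrative stand-in, not the thesis's path query language.
    """

    def __init__(self, queries):
        super().__init__()
        # Store each query as a tuple of tag names for cheap comparison.
        self.results = {tuple(q.strip("/").split("/")): [] for q in queries}
        self.stack = []    # tags of the currently open elements
        self.buffers = []  # one text buffer per open element

    def startElement(self, name, attrs):
        self.stack.append(name)
        self.buffers.append([])

    def characters(self, content):
        # SAX may deliver text in several chunks, so accumulate them.
        if self.buffers:
            self.buffers[-1].append(content)

    def endElement(self, name):
        path = tuple(self.stack)
        text = "".join(self.buffers.pop()).strip()
        if path in self.results:
            self.results[path].append(text)
        self.stack.pop()


def extract(xml_text, queries):
    """Run all queries in one pass over the SAX event stream."""
    handler = PathExtractor(queries)
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return {"/" + "/".join(p): v for p, v in handler.results.items()}


# Hypothetical sample document: the number of <student> nodes per course varies.
DOC = (
    "<db>"
    "<course><name>DB</name><student>Ann</student><student>Bob</student></course>"
    "<course><name>OS</name><student>Cy</student></course>"
    "</db>"
)

result = extract(DOC, ["/db/course/name", "/db/course/student"])
print(result)
```

The handler keeps only the stack of open tags and their text buffers, so memory use depends on document depth rather than document size, which is the property that makes stream-based extraction attractive for large inputs.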
Keywords/Search Tags: repair, functional dependency, hypergraph, data exchange, mapping, tree pattern