
Heterogeneous Data Integration Study Based On XML

Posted on: 2009-11-24
Degree: Master
Type: Thesis
Country: China
Candidate: X J Zhang
Full Text: PDF
GTID: 2178360272474098
Subject: Computer system architecture
Abstract/Summary:
With the rapid development of computer networks and the steady advance of informatization, the amount of information available online is growing at a tremendous rate. However, this information serves many different applications, is stored independently in a great variety of data sources, and is managed by different systems; its content, structure, and quality vary widely. To use this information more effectively, information from multiple distributed, heterogeneous, and autonomous sources must be integrated so that the differences are hidden and all users get uniform, transparent access to the data. It is also necessary to preserve data integrity and consistency across systems. Resolving these differences efficiently is therefore a serious challenge in both the research and the application of information integration.

In recent years, as XML, a language for describing document structure, has matured, XML-based technologies and related technologies that can interpret semi-structured information have been reshaping the information technology field, and major changes continue to take place in computer technology. This paper discusses how to use XML technology to integrate structured and unstructured data. Its content covers the following aspects:

(1) Classify the data to be integrated into structured data and unstructured data; semi-structured data is treated as a special case of unstructured data. The paper presents a classification-based integration strategy and uses the Mediator/Wrapper approach, together with a data pool, to integrate structured and unstructured data.
(2) Describe in detail the functional modules and architecture of an information integration system named XHDIS.
(3) Analyze in depth the technologies relevant to information integration, such as schema integration, the common data model, and Wrapper templates.
(4) For unstructured data, apply the required rules to transform it into a semi-structured XML form for integration, focusing mainly on rules for HTML/XML Web pages.

Finally, based on a summary of the research results and of development trends in technologies pertinent to information integration, some suggestions for further research and exploration are proposed.
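The Mediator/Wrapper approach with a data pool, mentioned in item (1), can be illustrated with a minimal Python sketch. It is not the XHDIS implementation: the class names (`RelationalWrapper`, `HtmlWrapper`, `Mediator`), the `datapool`/`source`/`record` element names, and the rule "each HTML `<li>` becomes one record" are all assumptions made for illustration. One wrapper exposes a relational table as XML, another applies a simple rule to an HTML page, and the mediator collects both into a single XML data pool.

```python
import sqlite3
import xml.etree.ElementTree as ET
from html.parser import HTMLParser


class RelationalWrapper:
    """Hypothetical wrapper: exposes the rows of one SQL table as XML records."""

    def __init__(self, conn, table):
        self.conn = conn
        self.table = table  # assumed to be a trusted table name

    def to_xml(self):
        cur = self.conn.execute(f"SELECT * FROM {self.table}")
        columns = [d[0] for d in cur.description]
        source = ET.Element("source", name=self.table, kind="structured")
        for row in cur:
            record = ET.SubElement(source, "record")
            for col, value in zip(columns, row):
                ET.SubElement(record, col).text = str(value)
        return source


class _ListItemParser(HTMLParser):
    """Collects the text of every <li> element (an assumed extraction rule)."""

    def __init__(self):
        super().__init__()
        self.items = []
        self._in_li = False

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self._in_li = True
            self.items.append("")

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_li = False

    def handle_data(self, data):
        if self._in_li:
            self.items[-1] += data


class HtmlWrapper:
    """Hypothetical wrapper: turns an HTML page into semi-structured XML."""

    def __init__(self, name, html):
        self.name = name
        self.html = html

    def to_xml(self):
        parser = _ListItemParser()
        parser.feed(self.html)
        source = ET.Element("source", name=self.name, kind="unstructured")
        for text in parser.items:
            record = ET.SubElement(source, "record")
            ET.SubElement(record, "text").text = text.strip()
        return source


class Mediator:
    """Gathers the XML produced by every wrapper into one data pool."""

    def __init__(self, wrappers):
        self.wrappers = wrappers

    def build_pool(self):
        pool = ET.Element("datapool")
        for wrapper in self.wrappers:
            pool.append(wrapper.to_xml())
        return pool


def demo():
    # Illustrative data only; the table and page contents are made up.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE book (id INTEGER, title TEXT)")
    conn.execute("INSERT INTO book VALUES (1, 'XML Basics')")
    html = "<ul><li>Data Integration Notes</li></ul>"
    mediator = Mediator([RelationalWrapper(conn, "book"),
                         HtmlWrapper("page", html)])
    return mediator.build_pool()
```

Calling `ET.tostring(demo(), encoding="unicode")` yields one XML document with a `source` element per wrapper, so a query layer only has to deal with a single uniform representation regardless of where the data originated.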
Keywords/Search Tags:Classification Integration, Common Data Model, Schema Integration, Data Pool, Data Grain