Font Size: a A A

Design And Implementation Of Heterogeneous Data Query System In Health Field

Posted on:2019-11-21Degree:MasterType:Thesis
Country:ChinaCandidate:Y L TianFull Text:PDF
GTID:2428330593450416Subject:Computer technology
Abstract/Summary:PDF Full Text Request
With the development of society and the improvement of economic level,people are paying more and more attention to their own health problems.More researchers began to pay attention to data processing in the health field.Health data is more complex than other industries,including a lot of structured data and semi-structured data.At this stage,the data in the health field is mainly based on data from medical institutions.A large amount of data is stored in relational databases and XML documents.Due to the strong heterogeneity of data,a large number of information islands have formed in the health field.The data lies in the use.For the query of heterogeneous data in the health field,developers need to pay more attention to the method of acquiring data and waste a lot of time and energy.In order to solve this problem,this paper proposes to establish a heterogeneous data query system in the health field to solve the unified query problem of heterogeneous data,so that developers can devote more energy to the use of data,saving the developer's time and development costs.First of all,this paper designs the architecture of the heterogeneous data query system in the health field,chooses the mediator-wrapper approach,uses XML Schema as the common public data model,and uses XQuery as the unified query language.The system mainly implements data source registration,pattern extraction and conversion,pattern integration,and query decomposition four modules.Secondly,aiming at the deficiency of traditional manual configuration to complete the pattern integration,this paper studies the semantic similarity calculation and structural similarity calculation based on XML Schema,and defines three types of structural conflict detection and solution in pattern integration.Complete pattern integration work and generate schema mapping files,which greatly simplifies the development of pattern integration.Aiming at the structural conflict detection in the global pattern reconstruction process,combined with the tree structure characteristics of XML Schema,the definition of relationship nesting conflict,relationship direction conflict and entity attribute conflict is given.The path length between tree nodes is used to perform structural conflict.The detection further reduces the redundancy in the global mode and completes the mode integration function in this system.Finally,on the basis of the schema mapping file,this paper uses the XQuery query decomposition algorithm to implement the decomposition of the XQuery global query statement,and completes the unified query of relational data and XML document data in the health field,which facilitates the development of the business.
Keywords/Search Tags:Heterogeneous data query, schema integration, query decomposition, health data
PDF Full Text Request
Related items