Font Size: a A A

The Research And Implementation Of Distribution Of Medical Data Sharing And Integration Methods

Posted on:2015-12-28Degree:MasterType:Thesis
Country:ChinaCandidate:W Y LiFull Text:PDF
GTID:2308330482956052Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the rapid development of the Internet, electronic medical data trends are becoming evident, and an increasing number of medical data resources is available on the network. However, because of the database system, the system business logic and data formats vary widely between hospitals, they produce large amounts of distributed, heterogeneous data, thereby forming a plurality of mutually isolated "islands of information". But isolated data also seriously affected the comprehensive analysis of data. Hence it is urgent need to integrate and share those heterogeneous data among hospitals. After analysis of medical data, it can provide data for doctors to support the diagnosis of patients, and it enables more efficient and accurate diagnosis. However, due to the heterogeneity of the data, the data can not be used to conduct a comprehensive analysis of existing data can not make full use of the value of.To solve this problem, this paper based on the research of heterogeneous data integration technologies and theories, after analysis of existing data integration, using an integrated approach which is combining the data warehouse approach and Mediator/Wrapper approach, propose based on data warehouse approach and Mediator/Wrapper way of combining IHDS architecture. And from the design goals, layered structure model, the interaction structure model and analysis of the main modules of IHDS architecture described in detail IHDS architecture. Finally, from many aspects of IHDS architecture analyzes its characteristics and advantages.For the problem of heterogeneous data sources, previous methods usually require settings different processing module depending on the data source, which will reduce the extensibility and maintainability. This paper propose a heterogeneous data processing method based on conversion of storage, the data is converted to a standard intermediate data model and then send to the target data source, in order to avoid a large set of different data source processing module, improving scalability and maintainability. And the use of XML as an intermediate data representation, extracting intermediate data to XML standard data and then sends it to the target data source, in order to solve the problem of heterogeneous.Based on the research distributed, heterogeneous data integration, this paper implement a distributed data sharing and integrated system prototype for medical data, and then done a detailed explanation for the main module implement of the integrated system.On the basis of the distribution of medical data integration system, this paper has optuomized the ETL data processing method. Firstly, the original ETL optimization algorithm based on data fragmentation due to the unequal division of each section, resulting in a bottleneck segment has been busy, while the other segments are idle, while resulting in waste of resources in a certain extent. Then based on the disadvantage of this algorithm, this paper proposed a ETL optimization method based on duplication bottleneck. Finally, experiments were carried out to verify the feasibility of ETL optimization method.
Keywords/Search Tags:distribution data, heterogeneous, ETL, XML
PDF Full Text Request
Related items