Font Size: a A A

Biomedical XML Data Pre-processing & Conversion Of Downloadable Text Dataset

Posted on:2016-05-06Degree:MasterType:Thesis
Country:ChinaCandidate:Z K NingFull Text:PDF
GTID:2308330461467256Subject:Computer technology
Abstract/Summary:PDF Full Text Request
With the rapid development of biomedical technology, biomedical data exploding, and a lot of medical information is accurately recorded; collection, storage and management of data are becoming increasingly diverse. But the diversity of information makes so many biomedical databases to form a "data-rich, lack of information" phenomenon. Aiming at these "orphaned" database, how to organize, integrate together of these databases plays a crucial role about sharing data on biomedical engineering in further development. XML technology is a cross-platform in network environment, and based document content. It can be a good deal of information structured document, its plasticity, scalability, cross-platform and other advantages. So XML files can be used as the data exchange between the various databases transition document, and plays a crucial role in data mining preprocessing stage.This dissertation introduces the research background, purpose and significance, and XML related technology are analyzed and described. It then describes the function of the data conversion tool to achieve the requirements, respectively, data format conversion module designed for XML data and text; defined by the XML schema file corresponding to the document DTD and Schema, builds mapping rules between the original data and the relational database, and also establish the mapping between XML documents and relational databases for non-schema definition files; After builds the mapping rules based on specific biological data, design tools and the corresponding database structure conversion. Finally, for different concrete practice of biomedical database makes applying. And comparing data conversion efficiency and handling post-data on downloadable text datasets in tools or database interface. Among the biomedical data mining, data preprocessing is essential to the plate, the subject of research reflected in the biomedical data mining project necessity for data conversion.Thesis solves the following problems:1)Making mapping rules between data and database about specific XML data and downloadable text datasets, and draw the relevant algorithms.2) Analyzing of conversion technology between XML data and downloadable text datasets and making conversion tools, implementation of data conversion, and describing the conversion process about correlative specific data.3) On the basis of the data conversion, How to improve the conversion efficiency of data and reduce the complexity of database design.
Keywords/Search Tags:XML, mapping rules, downloadable text dataset, data conversion
PDF Full Text Request
Related items