Font Size: a A A

Data Organization Storage And Retrieval For Biomedical Big Data

Posted on:2019-12-17Degree:MasterType:Thesis
Country:ChinaCandidate:X S BuFull Text:PDF
GTID:2428330551454361Subject:Engineering
Abstract/Summary:PDF Full Text Request
In the semantic Web,the Resource Description Framework(RDF)has become a standard representation of Web resources.RDF data consists of triples of subjects,predicates,and objects.According to the W3C team statistics released by the end of 2016,the number of triples RDF data set on the Internet has reached 52 billion,of which the core in the field of biomedical data sets have 42,contains more than 30 billion RDF triples,these data are currently on the Internet is growing at the rate of exponential,these also spawned a lot of medical related data such as Uniprot,DrugBank many medical fields such as RDF knowledge base.Experts and scholars at home and abroad in this article,through analysis of RDF in medical field with an overview of the data research,combing,summarizes relevant literature,in combination with the actual situation of this topic,this paper discusses the distributed system of data storage and RDF query technology.The research on distributed storage and query of RDF big data will lay a foundation for efficient analysis and understanding of medical RDF big data.This paper studies and implements a storage and query system for biomedical data.The system is divided into three modules:dictionary module,basic operation module and visual module.Based on the research of the distributed architecture,deployment on CentOS system Hadoop and HBase,Java EE architecture based on Windows system,complete to serving client requests,at the front of the background by comparison with the dictionary will get data types of data into a dictionary,and then go to the server for the corresponding operation.The basic operation modules of the system include:insert data module,delete data module,modify data module;The visualization module is mainly:query module and query result diagram conversion module.The system realizes the storage of data in HBase,through the Windows client's query and update operation of data and the visual display of query results.
Keywords/Search Tags:Biomedical Big Data, Distributed Storage, Distributed Query, RDF Data
PDF Full Text Request
Related items