
Research And Implementation Of Multi-source Biodata Indexing And Processing Technology Based On Distributed Computing System

Posted on: 2021-04-15
Degree: Master
Type: Thesis
Country: China
Candidate: M L Yang
Full Text: PDF
GTID: 2518306308975729
Subject: Electronic Science and Technology
Abstract/Summary:
At present, research on and application of biological data face three problems: heterogeneous data spread across multiple biological data centers, very large data volumes, and a lack of privacy protection. These problems make file management and data sharing inconvenient for many users. By combining computer technology, cryptography, and biological data, this thesis adapts scheme designs and algorithms to this scenario so that data can be managed and applied more effectively and shared data can be better protected.

Based on an investigation and analysis of how biological data is currently used, this thesis studies these problems in three directions. First, it proposes a unified, hierarchical file management scheme covering multiple biological centers and data models. Second, it proposes an encryption management scheme for biological files, together with a technique for combining homomorphically encrypted files of different versions. Third, building on these, it proposes a scheme for sequence retrieval in the homomorphically encrypted ciphertext domain.

For the first direction, given that multi-source heterogeneous data is currently stored separately, a hierarchical file management and indexing scheme with a unified physical layer and a logical layer is designed and built. It provides a single entry point for global file management, which improves management capability and search efficiency.

For the second direction, given that biological files are large in volume, exist in many versions, and are highly privacy-sensitive, a compact storage and security management technique based on bit encryption and homomorphic encryption is proposed. File formats such as standard files and difference files are defined to reduce the storage required for multi-version files, and bit homomorphism is used to merge and modify versions of ciphertext files. Experiments show that storage is reduced while data security is preserved.

For the third direction, which addresses sequence retrieval, a common operation in biological data research that is difficult to perform in the ciphertext domain, the original biological data is first digitized, and an improved KMP algorithm is then combined with a search method over homomorphically encrypted ciphertext sequences. Sequence search is thereby carried out while strong security is maintained. The effectiveness of this scheme is verified through comparative experiments.

This thesis applies computer technology and cryptography to file management and restricted access for biological data, and completes the algorithms, scheme design, and a basic implementation. In a laboratory environment, the three proposed schemes address the existing problems of simplified data management, security protection, and ciphertext search, and they generalize to some extent to data of the same type. Performance can be optimized further in subsequent studies.
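As a rough illustration of the sequence-retrieval step mentioned in the abstract, the sketch below digitizes a nucleotide string and searches it with the standard KMP algorithm. It works on plaintext only. It is not the improved KMP variant of the thesis and includes no homomorphic encryption, since the abstract gives no details of either. The A/C/G/T to 0..3 mapping and all function names are assumptions made for this example.

```python
# Illustrative sketch only: standard KMP over a digitized DNA sequence.
# The encoding and all names are hypothetical, not taken from the thesis.
from typing import List, Sequence

NUCLEOTIDE_CODES = {"A": 0, "C": 1, "G": 2, "T": 3}  # assumed mapping


def digitize(sequence: str) -> List[int]:
    """Map a nucleotide string to a list of small integers."""
    return [NUCLEOTIDE_CODES[base] for base in sequence.upper()]


def build_failure_table(pattern: Sequence[int]) -> List[int]:
    """failure[i] is the length of the longest proper prefix of
    pattern[:i + 1] that is also a suffix of it."""
    failure = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        while k > 0 and pattern[i] != pattern[k]:
            k = failure[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        failure[i] = k
    return failure


def kmp_search(text: Sequence[int], pattern: Sequence[int]) -> List[int]:
    """Return the start index of every occurrence of pattern in text."""
    if not pattern:
        return []
    failure = build_failure_table(pattern)
    matches = []
    k = 0  # number of pattern symbols currently matched
    for i, symbol in enumerate(text):
        while k > 0 and symbol != pattern[k]:
            k = failure[k - 1]  # fall back without re-reading the text
        if symbol == pattern[k]:
            k += 1
        if k == len(pattern):
            matches.append(i - k + 1)
            k = failure[k - 1]  # keep going to find overlapping matches
    return matches


if __name__ == "__main__":
    genome = digitize("ACGTACGTTACG")
    query = digitize("ACG")
    print(kmp_search(genome, query))  # prints [0, 4, 9]
```

Digitizing first means the comparison inside `kmp_search` is a plain integer equality test. In a ciphertext-domain scheme, that comparison is the step that would have to be replaced by an operation on encrypted values; how the thesis does this is not stated in the abstract.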
Keywords/Search Tags: Biological Data, Homomorphic Encryption, Version Management, Sequence Alignment, Ciphertext Search