
Constraint And Design Of DNA Storage Coding Set Based On Improved MVO Algorithm

Posted on: 2022-01-06
Degree: Master
Type: Thesis
Country: China
Candidate: B Cao
GTID: 2518306332970129
Subject: Computer Science and Technology
Abstract/Summary:
The development of information technology produces massive amounts of data, and storing that data efficiently and at low cost has become a problem. Because DNA offers high storage density and high durability, DNA storage has become a feasible solution. DNA data storage is a process of encoding and decoding between binary data and synthetic DNA strands; it is a new storage method that uses DNA molecules as information carriers. Data is written by converting binary information into base sequences through coding rules, and it is read back by sequencing technology. Experts and scholars at home and abroad are paying close attention to DNA storage and actively participating in its research, and the related technology has progressed from the theoretical stage to the biological experiment stage. A DNA storage system must first address the problem of DNA coding, which converts information into DNA sequences. Efficient coding not only helps improve the accuracy of reading and writing in DNA storage, but also helps reduce storage costs. To construct codes effectively, this thesis proposes three coding constraints based on physical and thermodynamic properties, together with three improvements to the MVO (Multi-Verse Optimizer) algorithm. The work is as follows:

(1) To construct a set of DNA storage codes, an improved KMVO (K-means Multi-Verse Optimizer) algorithm is proposed. The theoretical basis of the MVO algorithm comes from the theory of the multiverse. Because MVO converges slowly in the early iterations, K-means is introduced to improve it. On 13 test functions, KMVO is compared with algorithms such as MVO, GA, and PSO, and the statistical results (mean and standard deviation) are significantly improved. The constraints of the DNA coding set are then added, the size of the coding set under the given constraints is calculated, and the results are compared with previous work. The code sets constructed by the KMVO algorithm achieve better bounds than those constructed by the MVO algorithm. A larger set of DNA storage codes can store more effective information in a given length, thereby improving storage utilization.

(2) To address the errors that are prone to occur during the synthesis and sequencing of DNA storage, a new combined constraint that includes a storage edit distance constraint is proposed. An improved DMVO (Damp Multi-Verse Optimizer) algorithm is used to construct a set of DNA codes that satisfies the combined constraint, with these codewords used as non-data bits. The MVO algorithm works by exchanging matter through black holes and white holes until a stable state is reached. To prevent MVO from falling into a local optimum, a damping factor is added to the algorithm as a disturbance. Compared with previous work, the DMVO algorithm constructs code sets that are both larger and of higher quality. Larger code sets can use shorter address bits to index more data, and high-quality codes make the DNA storage system more robust and stable.

(3) The progress of a biochemical reaction is accompanied by a change in heat, so the change in heat reflects the nature of the reaction well. A thermodynamic MFE (minimum free energy) constraint is therefore proposed and applied to the construction of the DNA storage code set. This part of the work uses a new BMVO (Brownian Multi-Verse Optimizer) algorithm. To reduce the adverse effect on optimization when a poor initial position leaves the initial values far from the optimum, a Brownian motion update method is added to the MVO algorithm, and the BMVO algorithm builds a larger set of DNA storage codes. Compared with previous work in terms of set size, code rate, thermodynamic properties, and other measures, the code sets constructed by the BMVO algorithm are very competitive. Building a larger DNA code set can reduce the cost of DNA storage, and building a higher-quality code set can improve the read and write accuracy of the DNA storage system.
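
To make the edit distance constraint in part (2) concrete, the following is a minimal Python sketch. It is not the thesis's method: the abstract does not give the exact definition of its "storage edit distance" constraint, so the sketch assumes plain Levenshtein distance, and it replaces the DMVO search with a simple greedy baseline. The function names `edit_distance` and `greedy_code_set` and the parameters `n` (codeword length) and `d` (minimum pairwise distance) are hypothetical and chosen only for illustration.

```python
from itertools import product


def edit_distance(a, b):
    """Levenshtein distance between strings a and b (row-by-row DP)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # deleting the first i characters of a
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def greedy_code_set(n, d):
    """Greedy baseline (an assumption, not MVO): scan all length-n words
    over {A, C, G, T} in lexicographic order and keep a word only if it is
    at edit distance >= d from every codeword kept so far."""
    codes = []
    for word in map("".join, product("ACGT", repeat=n)):
        if all(edit_distance(word, c) >= d for c in codes):
            codes.append(word)
    return codes


if __name__ == "__main__":
    codes = greedy_code_set(4, 3)
    print(len(codes), codes[:5])
```

The size of the set returned for a given `n` and `d` is a lower bound on the largest code set satisfying this one constraint. A metaheuristic search such as the MVO variants described above aims to find larger sets than a greedy scan like this one, under additional constraints that this sketch does not model.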
Keywords/Search Tags:DNA storage, DNA coding, Multi-Verse algorithm, Sequencing, Thermodynamic