Font Size: a A A

Analysis Of Similarity Of Biological Sequences Based On Dynamic Time Warping Distance

Posted on:2011-03-30Degree:MasterType:Thesis
Country:ChinaCandidate:M LiFull Text:PDF
GTID:2120360302473619Subject:Applied Mathematics
Abstract/Summary:PDF Full Text Request
With the completion/development of the genome projects of human and some model organism,the focus of biology shifts from accumulation of biological data to the analysis and interpretation of them,and thus bioinformatics,also named computational molecular biology,emerges as a new developing interdiscipline. Biological sequences comparison is one of the main research contents of bioinformatics. It can unscramble the various information that contained in the biological sequence.Especially,the genetic and regulatory information and also the relationship between the structure and function of protein sequence can be analyzed distinctly.Whenever we gain a new DNA sequence,we always expect that it is similar with some known sequences by comparing the comparability of them.Then we can predict the structure and function of the new DNA sequence. This is the main purpose of biological sequences comparison. Based on the comparative analysis of biological sequences for background.We propose a new measurement method which can compare DNA sequence similarity. A new method is provided for the classification, analysis and comparison, etc of the biological sequences comparison. With the new method we Enumerate the specific application of Biological sequences comparison.1. In Chapter 2, This paper proposed a DNA sequence similarity measure method based on DTW(Dynamic Time Warping)distance.Based on one graphical representation of DNA sequences and main characterization of complex vector:module and phase, then we can translate the complex subsequence into time series is also the module sequence on the complex plane. And calculate DTW distance to measure sequence similarity to attribute characterization of DNA sequences. We also use this method to compare and analyze the gene sequence similarity of seven Buthus martensi Karsch neurotoxin, verify the effectiveness and accuracy of the method.2. In Chapter 3, based on a 3-D graphical representations of DNA sequences we get the corresponding time series.With the measure method based on DTW(Dynamic Time Warping)distance to compare the similarity of DNA sequence.The algorithm effectiveness and accuracy of the method is verified through an example about eleven species:human,chimpanzee etc.3. In Chapter 4,based on a kind of numerical description of protein sequences,with the measure method based on DTW(Dynamic Time Warping)distance .We make the similarity and dissimilarity analysis of the nine species of the protein sequences base on the method based on DTW diatance. And we find that the results is similar with the fact.
Keywords/Search Tags:DNA sequences, Time series, DTW distance, Similarity analysis
PDF Full Text Request
Related items