Font Size: a A A

Based On Wavelet Transform Technique To Predict The Dna Sequence Coding Region

Posted on:2007-06-10Degree:MasterType:Thesis
Country:ChinaCandidate:Y WangFull Text:PDF
GTID:2190360185456235Subject:Biomedical engineering
Abstract/Summary:PDF Full Text Request
In recent years, Genome projects have given rise to an exponentially growing amount of genetic information. How to find out useful information in the huge amounts of data is the problem that scientists focus on in current and future. One of the most important and basic problems is the gene identification, namely the identification of protein-coding regions in DNA sequences through computational means. In present, a number of methods for gene detection, based on distinctive features of protein coding sequences have been proposed. For example, neural net-based method, the method based on correlation function, Fourier-based analysis, and so on. At the same time, comprehensive evaluation of various methods suggest that they can't work equally well for all genes, and constant refinement is needed to evolve better methodologies, there is also a need for new method of gene predication.For most of DNA sequence, one of the principle feature is that the dominant signal in coding regions of genomic sequences exhibits a 1/3 periodicity which is evidenced as a sharp peak at frequency f=1/3 in the power spectrum. Such periodicity is universal for protein-coding sequences and is absent in genomic sequences that do not code for proteins thus making this parameter a convenient criterion for recognizing coding sequences in DNA sequence. However, statistical analysis by Fourier technology for DNA sequence will bring noise inevitably, only according to the result by Fourier analysis for DNA sequences, it is difficult to assess the probable genes of DNA sequence exactly. Wavelet transform is a space-scale analysis. It is called as a mathematical microscope for analysis signal. The wavelet transform can eliminate the noise in a certain scale, namely, separate the useful signal from noise signal. Thereby, we can utilize wavelet transform to analyze the result by Fourier analysis for DNA sequences, and can predicate the protein-coding region conveniently, and develop the prediction accuracy.In this paper, we analyze 1/3 periodicity of protein coding region using wavelet transformation, and the result from theoretical analysis and experiments show that wavelet transform is a useful tool for detecting the periodicity. Thereby, we develop a new method to recognize the probable genes of genomic sequence. This method is...
Keywords/Search Tags:Fourier technique, wavelet transform, DNA sequence, protein-coding region
PDF Full Text Request
Related items