Font Size: a A A

2D Graphical Representation Of DNA Sequences And Its Application

Posted on:2011-01-05Degree:MasterType:Thesis
Country:ChinaCandidate:Y S LiuFull Text:PDF
GTID:2248330395485557Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
With the implement of the Human Genome Project (HGP), life sciences have entered the post-genome era. Genes and proteins have become the main research subjects of life sciences. In the past, biological scientists studied a single gene or protein, but now, even in the future, scientists will focus on the study of DNA sequence information, protein structure information, as well as the interaction between them. In order to decipher the biological information of each level, gene or protein-related statistics and combinatorial mathematics problems are proposed, and the rapid growth of biological information has also brought the challenge of computer science. Thus, the computational molecular biology and bioinformatics have emerged.Computational molecular biology is one of the emerging field of cross mathematics, computer science and biological sciences, which mainly research gene or protein sequences-relate combination and computation, and its main topics are: sequence alignment, sequence analysis, Bio-information database, Genes identified, phylogenetic tree constructed as well as the molecular structure prediction and so on.With the development of our researches on life sciences, more and more molecular sequences data have been generated. The analyses and processing of these data accelerate the development of Bioinformatics. Sequence analysis is one of the most important operations in bioinformatics. Sequence analysis helps to predict the functions of novel genes within any species. On a broader scale these algorithms have also been used to determine homologies between sequences in order to predict structural and functional relationships.Graphical representation of DNA sequence provides a simple way of viewing, sorting and comparing various gene structures. Graphical techniques have emerged as a very powerful tool for the visualization and analysis of long DNA sequences. These techniques provide useful insights into local and global characteristics and the occurrences, variations and repetition of the nucleotides along a sequence which are not as easily obtainable by other methods。In this paper, we proposed a2D graphical representation of the DNA sequences based on the20amino acids and a stop code of protein sequences. Based on this2D graphical representation, we outlined a new approach to analyze the phylogenetic relationships of24coronaviruses based on UPGMA. The evolutionary distances are obtained through measuring the differences among the two-dimensional curves. Unlike most existing phylogeny construction methods, the proposed method does not require multiple alignment.
Keywords/Search Tags:DNA sequences, Graphical representation of sequence, Phylogenetic treeconstructing
PDF Full Text Request
Related items