Design And Implementation Of Automatic Construction System For Species Tree Based On Genome-scale Data Of Whole Species

Posted on: 2016-04-28
Degree: Master
Type: Thesis
Country: China
Candidate: F Q Li
Full Text: PDF
GTID: 2370330473464817
Subject: Software engineering
Abstract/Summary:
As the study of genes has advanced, it has become clear that many diseases are caused by changes in gene structure and function. At the frontier of biotechnology, scientists aim not only to discover defective genes but also to master their diagnosis, repair, treatment, and prevention. Such results will bring immeasurable benefits to human health and life. The work in this thesis covers three aspects.

First, gene trees are constructed from whole-species gene data and analyzed. The system takes whole-species gene data as its data source, compares the gene sequences with BLAST, computes distances under the Kimura two-parameter model, and then uses MEGA to build the gene tree. Experimental results show that the proposed system generates gene trees that are more consistent in tree topology and reliable as a model reference.

Second, a multispecies model tree is constructed against its biological background. To better understand tree construction methods and to reduce the error between the generated tree and the true biological tree, the thesis introduces the multispecies coalescent model. Taking the phylogenetic tree as the center and the gene trees and species tree as guides, it seeks the most reasonable construction.

Third, the STAR model is validated, with the help of the GTR and BEST molecular evolution models, on 2245 genes from 48 species. The experimental results show that the species trees produced by the STAR model from large-scale gene data remained consistent across all species. At some nodes, statistical support improved significantly as more gene data were added. Even at nodes that join closely related species and had very low statistical support, adding gene data directly produced a significant increase in support. In addition, judging from the trees built with BEST and with GTR, and taking into account that GTR is the most complex molecular evolution model, BEST can be selected directly as the molecular evolution model in further research.
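The Kimura two-parameter distance used in the first step can be illustrated with a minimal sketch. It assumes two already aligned DNA sequences of equal length and applies the standard formula d = -1/2 ln(1 - 2P - Q) - 1/4 ln(1 - 2Q), where P and Q are the proportions of transition and transversion differences. The function name `k2p_distance` and the handling of gaps are assumptions made for illustration; this is not the thesis's own implementation, which relies on BLAST and MEGA.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a, seq_b):
    """Kimura two-parameter distance between two aligned DNA sequences.

    Hypothetical helper for illustration only. Sites where either
    sequence has a gap or an ambiguous base are skipped (an assumption).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    transitions = transversions = compared = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue  # skip gaps and ambiguity codes
        compared += 1
        if x == y:
            continue
        pair = {x, y}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine

    if compared == 0:
        raise ValueError("no comparable sites")

    p = transitions / compared    # proportion of transitions (P)
    q = transversions / compared  # proportion of transversions (Q)
    a = 1.0 - 2.0 * p - q
    b = 1.0 - 2.0 * q
    if a <= 0.0 or b <= 0.0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(a) - 0.25 * math.log(b)
```

Computing this distance for every pair of sequences gives the distance matrix from which a distance-based method (for example neighbor joining, as offered in MEGA) can build a gene tree.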
Keywords/Search Tags:Gene tree, phylogenetic tree, GTR, Gene sequence comparison