Managing and analyzing phylogenetic databases

Posted on:2014-10-18

Degree:Ph.D

Type:Dissertation

University:Iowa State University

Candidate:Deepak, Akshay

GTID:1450390005989160

Subject:Computer Science

Abstract/Summary:

The ever-growing availability of phylogenomic data makes it increasingly possible to study and analyze phylogenetic relationships across a wide range of species. Indeed, current phylogenetic analyses are now producing enormous collections of trees that vary greatly in size. Our proposed research addresses the challenges posed by storing, querying, and analyzing such phylogenetic databases.

Our first contribution is the further development of STBase, a phylogenetic tree database consisting of a billion trees whose leaf sets range in size from four to 20,000. STBase applies techniques from different areas of computer science for efficient tree storage and retrieval. It also introduces new ideas that are specific to tree databases.

STBase provides a unique opportunity to explore innovative ways to analyze the results from queries on large sets of phylogenetic trees. We propose new ways of extracting consensus information from a collection of phylogenetic trees. Specifically, this involves extending the maximum agreement subtree problem. We greatly improve upon an existing approach based on frequent subtrees and propose two new approaches, based on agreement subtrees and frequent subtrees, respectively.

The final part of our proposed work deals with the problem of simplifying multi-labeled trees and handling "rogue" taxa. We propose a novel technique to extract conflict-free information from multi-labeled trees in the form of a much smaller single-labeled tree. We show that the problem inherent in identifying rogue taxa is NP-hard and give fixed-parameter tractable and integer linear programming solutions.

Keywords/Search Tags:

Phylogenetic

Related items

1	Molecular Phylogenetic Study On The Fungi Of The Genus Cyclamenes From China
2	Research On Algorithms Of Constructing Phylogenetic Network Based On The Rooted Phylogenetic Trees
3	Research On Algorithms And Platform Construction Of Distance Computing For Phylogenetic Networks
4	The Development Of Efficient Strategies For Resolving Contentious Phylogenetic Relationships In Phylogenomic Studies
5	Phylogenetic Relationship And Character Evolution Study Of Zygophyllum(Zygophyllaceae) Members In China
6	The Construction Of A Phylogenetic Tree Based On DNA Barcode
7	Research On Phylogenetic Network Construction Algorithm Analysis And Platform Construction
8	Feature Extraction Of Genome Sequence And Research On Phylogenetic Tree Method
9	Characteristics And Influencing Factors Of The Phylogenetic Structure Of Herbaceous Plant Communities In The Middle-eastern Qilian Mountains
10	Studies on the phylogenetic relationships of the superfamily Opomyzoidea with an assessment of the phylogenetic utility of the internal female reproductive tract and a generic revision of Aulacigaster Macquart (Diptera, Cyclorrhapha)