Font Size: a A A

Pattern discovery for hypothesis generation in biology

Posted on:2007-08-29Degree:Ph.DType:Thesis
University:New York UniversityCandidate:Tsirigos, AristotelisFull Text:PDF
GTID:2448390005463947Subject:Computer Science
Abstract/Summary:
In recent years, the increase in the amounts of available genomic as well as gene expression data has provided researchers with the necessary information to train and test various models of gene origin, evolution, function and regulation. In this thesis, we present novel solutions to key problems in computational biology that deal with nucleotide sequences (horizontal gene transfer detection), amino-acid sequences (protein sub-cellular localization prediction), and gene expression data (transcription factor - binding site pair discovery). Different pattern discovery techniques are utilized, such as maximal sequence motif discovery and maximal itemset discovery, and combined with support vector machines in order to achieve significant improvements against previously proposed methods.
Keywords/Search Tags:Discovery, Gene
Related items