Font Size: a A A

Clustering algorithms for time series gene expression in microarray data

Posted on:2013-10-23Degree:M.SType:Thesis
University:University of North TexasCandidate:Zhang, GuilinFull Text:PDF
GTID:2458390008472196Subject:Bioinformatics
Abstract/Summary:
Clustering techniques are important for gene expression data analysis. However, efficient computational algorithms for clustering time-series data are still lacking. This work documents two improvements on an existing profile-based greedy algorithm for short time-series data; the first one is implementation of a scaling method on the pre-processing of the raw data to handle some extreme cases; the second improvement is modifying the strategy to generate better clusters. Simulation data and real microarray data were used to evaluate these improvements; this approach could efficiently generate more accurate clusters. A new feature-based algorithm was also developed in which steady state value; overshoot, rise time, settling time and peak time are generated by the 2nd order control system for the clustering purpose. This feature-based approach is much faster and more accurate than the existing profile-based algorithm for long time-series data.
Keywords/Search Tags:Algorithm, Gene expression, Time-series data, Clustering, Microarray data, Existing profile-based
Related items