
Bayesian Feature Selection Methods for Complex Biomedical Data

Posted on: 2015-03-28
Degree: Ph.D.
Type: Dissertation
University: Emory University
Candidate: Zhao, Yize
Full Text: PDF
GTID: 1478390017997648
Subject: Biostatistics
Abstract/Summary:
Motivated by three different biomedical studies, this dissertation investigates novel Bayesian feature selection methods for analyzing complex biomedical data.

In the first project, motivated by a colorectal cancer study, we propose a unified Bayesian approach for hierarchical feature selection of structured functional predictors in Generalized Functional Linear Models (GFLMs). Feature selection here is inherently hierarchical, involving both the selection of functional predictors and the selection of regions within them. To achieve hierarchical feature selection, we construct a class of mixture priors for the functional coefficients based on Gaussian processes. In addition, we use Ising priors on the model space to incorporate hierarchical structural information. Applying our approach to the motivating study, we find that one functional biomarker and its expression level in the transitional region between the proliferation and differentiation zones are associated with the risk of colorectal cancer.

In the second project, motivated by the Autism Brain Imaging Data Exchange (ABIDE) study, we are interested in identifying important biomarkers for early detection of autism spectrum disorder (ASD) from high-resolution brain imaging data. We propose a novel multiresolution variable selection procedure under a Bayesian probit regression framework that recursively uses posterior samples from variable selection at a lower resolution to guide variable selection at a higher resolution. The proposed algorithms are computationally feasible for ultra-high-dimensional data. We also incorporate two levels of structural information into variable selection. Applied to the resting-state functional magnetic resonance imaging (R-fMRI) data in the ABIDE study, our methods identify imaging biomarkers predictive of ASD in several brain regions, which are biologically meaningful and interpretable.

Finally, with the goal of selecting genes and gene subnetworks with periodic behavior in a microarray dataset, we propose a nonparametric Bayesian model incorporating network information. In addition to identifying genes that have a strong association with a clinical outcome, our model can select genes with particular expression behavior. We show that our proposed model is equivalent to an infinite mixture model, for which we develop a posterior computation algorithm. We also propose two fast computing algorithms that approximate the posterior simulation with good gene selection accuracy at low computational cost.
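
As a rough illustration of how an Ising prior on binary inclusion indicators can encode structural information, the sketch below evaluates an unnormalized log-prior of one common form, log p(gamma) = a * sum_j gamma_j + b * sum_{j~k} gamma_j * gamma_k + const. The function name `ising_log_prior`, the hyperparameter values `a` and `b`, and the chain-shaped neighborhood are assumptions made for illustration only; the abstract does not state the parameterization or graph structure used in the dissertation.

```python
import numpy as np


def ising_log_prior(gamma, edges, a=-2.0, b=0.5):
    """Unnormalized log-probability of binary inclusion indicators.

    gamma : sequence of 0/1 values, one per candidate feature.
    edges : pairs (j, k) of features treated as neighbors (hypothetical graph).
    a     : sparsity term; negative values penalize each selected feature.
    b     : interaction term; positive values reward selecting neighbors jointly.
    """
    gamma = np.asarray(gamma, dtype=float)
    sparsity = a * gamma.sum()
    smoothness = b * sum(gamma[j] * gamma[k] for j, k in edges)
    return sparsity + smoothness


# Four features arranged on a chain: 0 - 1 - 2 - 3 (illustrative only).
edges = [(0, 1), (1, 2), (2, 3)]

adjacent = ising_log_prior([1, 1, 0, 0], edges)   # two neighboring features
scattered = ising_log_prior([1, 0, 1, 0], edges)  # two non-neighboring features
print(adjacent > scattered)
```

With a positive interaction term, configurations that select neighboring features receive higher prior mass than configurations that select the same number of scattered features, which is the general mechanism by which such priors favor structured selections.
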
Keywords/Search Tags: Selection, Bayesian, Methods, Biomedical, Propose, Data