Font Size: a A A

Integrative statistical methods for the analysis of transcriptomic and metabolomic data

Posted on:2011-10-07Degree:Ph.DType:Dissertation
University:University of MichiganCandidate:Poisson, Laila MFull Text:PDF
GTID:1440390002464463Subject:Biology
Abstract/Summary:
Cancer research is embracing the multiple "-omics" technologies available for global scale measurement of molecular events. Transcriptomics, as the global measure of gene expression, has been well developed through microarray technology. Metabolomics, an emerging omics field, involves chromatography-coupled mass spectrometry to measure the global activity of metabolites or small molecules. The aim of this dissertation is to integrate the analysis of these two data sources to enhance the ability to find molecular changes between two disease states. This work is motivated by a prostate cancer progression study in which tumor samples of varying stage and benign tissue were assessed for both gene expression and metabolomic levels. Using a pathway-directed approach, transcriptomics and metabolomics data can be mapped using publicly available metabolic pathways. Here we describe three integrative methods.;In the first topic we begin with a classification method that utilizes a differential list of elements from a prior study to make prognostic or diagnostic predictions about samples in a current study. We extend the classification method by providing a testing scenario for the classifier. Though motivated by the integration of in vitro and in vivo gene expression datasets we show that it is applicable across omics platforms as well.;The second topic explores the use of p-value weighting to improve the power of per-metabolite tests of differential intensity. We use gene-set enrichment testing to capture the gene expression information contained in pre-defined pathways. The results of these tests are used to devise pathway-based weights. In this way, metabolites that are involved in a pathway that is dysregulated in its gene expression are given higher importance.;Finally, the third topic extends two univariate set enrichment tests to jointly search for sets of genes and metabolites that are coordinately differential. We compare these methods to their univariate counterparts and to enrichment testing on the concatenated datasets. In almost all scenarios explored, testing the datasets jointly is preferred.;Each of these methods is applied to the motivating metabolomics and matched gene expression datasets and results are discussed.
Keywords/Search Tags:Gene expression, Methods, Datasets
Related items