Font Size: a A A

The proportion of true null hypotheses in microarray gene expression data

Posted on:2011-08-15Degree:Ph.DType:Dissertation
University:The George Washington UniversityCandidate:Markitsis, AnastasiosFull Text:PDF
GTID:1444390002964405Subject:Biology
Abstract/Summary:
Microarray technology is extensively used today in many applications. One of its uses is discovering sets of genes that are most likely to be related to cancer. Typically, based on microarray data, a statistical test is conducted for each of thousands of genes simultaneously. In a two-group comparative microarray experiment, an important parameter for controlling the rate of false positives and also for determining the appropriate sample size is the proportion of true null hypotheses (pi0).;To obtain an improved estimation method for pi0, we modify an existing simple method by introducing artificial censoring to p-values. The model is a two-component mixture of a censored Uniform(0,1) and a censored Beta(alpha,1) distribution. The model fitting is achieved through the Expectation Maximization algorithm. In a comprehensive simulation study and applications to experimental data sets, we illustrate the benefits of using our method by comparing its performance to other existing methods.;Furthermore, we develop and study the properties of a likelihood ratio test for pi0 at different combinations of sample size, number of genes, and H0 value of pi0. Maximization of the restricted likelihood is achieved by incorporating a linear constraint of the parameters into the Expectation Maximization algorithm. The p-value of the test is based on a parametric bootstrap approach. We illustrate the usefulness of the test through applications to experimental datasets.
Keywords/Search Tags:Microarray, Applications, Test
Related items