Bioinformatic tools for genetic epidemiology: Application to candidate gene analysis in breast cancer

Posted on: 2011-11-24
Degree: Ph.D.
Type: Dissertation
University: The University of Utah
Candidate: Abo, Ryan P
Full Text: PDF
GTID: 1444390002968479
Subject: Biology
Abstract/Summary:
This dissertation describes the development, implementation, and application of bioinformatic methods and tools. The objective was to provide enhanced methods for identifying the genetic factors underlying common, complex human diseases. In general, the developments were designed for genetic epidemiology applications that analyze candidate genes or regions using single nucleotide polymorphism (SNP) genotype data from resources of independent or pedigree-based subjects. The major theme of the methodological developments was to provide valid joint analyses across multiple SNPs that improve upon standard single-SNP analyses. Methodological developments include a novel haplotype-phasing and association analysis method, a haplotype-mining method, and a haplotype-haplotype interaction method, each of which allows for independent and/or related subjects. The novel haplotype-phasing algorithm and association method were implemented as additional modules in the Genie software. The haplotype-mining approach was implemented in the program hapConstructor, which uses a stepwise heuristic to search for optimal multi-SNP associations. To explore gene-gene effects and identify interactions between unlinked genetic variants (which may be undetectable by single-SNP or haplotype analyses), we implemented a gene-gene interaction module in hapConstructor. All of these novel developments were illustrated with applications to real and simulated data to demonstrate the utility of, and appropriate settings for, such analyses. The main application was to a two-site breast cancer resource comprising 3,888 subjects with data for 89 tagging SNPs across seven genes in the apoptosis pathway. Applications to colon cancer and chronic lymphocytic leukemia (CLL) resources are also shown. Use of hapConstructor was valuable in refining the association evidence for CASP8 and breast cancer.
Risk and protective haplotypes were identified and are currently undergoing next-generation sequencing to identify the underlying critical variants. Novel haplotype associations, including interaction effects, were also identified for other apoptosis genes in breast cancer, and for CASP8 in colon cancer. Finally, use of hapConstructor in a genome-wide association study for CLL led to new regions of interest that were not identified in the original single-SNP screen. These newly identified associations will require follow-up analyses for verification. Nonetheless, these potentially interesting findings indicate the utility of the tools developed.
Keywords/Search Tags:Tools, Breast cancer, Application, Single SNP, Genetic, Method, Analyses