Bayesian Methods for Genetic Association Studies

We develop statistical methods for tackling two important problems in genetic association studies. First, we propose a Bayesian approach to overcome the winner's curse in genetic studies. Second, we consider a Bayesian latent variable model for analyzing longitudinal family data with pleiotr...

Full description

Bibliographic Details
Main Author: Xu, Lizhen
Other Authors: Craiu, Radu V.
Language:en_ca
Published: 2012
Subjects:
Online Access:http://hdl.handle.net/1807/34972
id ndltd-TORONTO-oai-tspace.library.utoronto.ca-1807-34972
record_format oai_dc
spelling ndltd-TORONTO-oai-tspace.library.utoronto.ca-1807-349722013-11-01T04:11:06ZBayesian Methods for Genetic Association StudiesXu, Lizhenwinner's cursespike and slab priorHierarchical Bayes ModelBayesian Model AveragingLatent variable modelpleiotropygenetic association studiesMarkov chain Monte Carlopath samplingBayesian inference0463We develop statistical methods for tackling two important problems in genetic association studies. First, we propose a Bayesian approach to overcome the winner's curse in genetic studies. Second, we consider a Bayesian latent variable model for analyzing longitudinal family data with pleiotropic phenotypes. Winner's curse in genetic association studies refers to the estimation bias of the reported odds ratios (OR) for an associated genetic variant from the initial discovery samples. It is a consequence of the sequential procedure in which the estimated effect of an associated genetic marker must first pass a stringent significance threshold. We propose a hierarchical Bayes method in which a spike-and-slab prior is used to account for the possibility that the significant test result may be due to chance. We examine the robustness of the method using different priors corresponding to different degrees of confidence in the testing results and propose a Bayesian model averaging procedure to combine estimates produced by different models. The Bayesian estimators yield smaller variance compared to the conditional likelihood estimator and outperform the latter in the low power studies. We investigate the performance of the method with simulations and applications to four real data examples. Pleiotropy occurs when a single genetic factor influences multiple quantitative or qualitative phenotypes, and it is present in many genetic studies of complex human traits. The longitudinal family studies combine the features of longitudinal studies in individuals and cross-sectional studies in families. Therefore, they provide more information about the genetic and environmental factors associated with the trait of interest. We propose a Bayesian latent variable modeling approach to model multiple phenotypes simultaneously in order to detect the pleiotropic effect and allow for longitudinal and/or family data. An efficient MCMC algorithm is developed to obtain the posterior samples by using hierarchical centering and parameter expansion techniques. We apply spike and slab prior methods to test whether the phenotypes are significantly associated with the latent disease status. We compute Bayes factors using path sampling and discuss their application in testing the significance of factor loadings and the indirect fixed effects. We examine the performance of our methods via extensive simulations and apply them to the blood pressure data from a genetic study of type 1 diabetes (T1D) complications.Craiu, Radu V.Sun, Lei2012-112013-01-08T15:06:22ZNO_RESTRICTION2013-01-08T15:06:22Z2013-01-08Thesishttp://hdl.handle.net/1807/34972en_ca
collection NDLTD
language en_ca
sources NDLTD
topic winner's curse
spike and slab prior
Hierarchical Bayes Model
Bayesian Model Averaging
Latent variable model
pleiotropy
genetic association studies
Markov chain Monte Carlo
path sampling
Bayesian inference
0463
spellingShingle winner's curse
spike and slab prior
Hierarchical Bayes Model
Bayesian Model Averaging
Latent variable model
pleiotropy
genetic association studies
Markov chain Monte Carlo
path sampling
Bayesian inference
0463
Xu, Lizhen
Bayesian Methods for Genetic Association Studies
description We develop statistical methods for tackling two important problems in genetic association studies. First, we propose a Bayesian approach to overcome the winner's curse in genetic studies. Second, we consider a Bayesian latent variable model for analyzing longitudinal family data with pleiotropic phenotypes. Winner's curse in genetic association studies refers to the estimation bias of the reported odds ratios (OR) for an associated genetic variant from the initial discovery samples. It is a consequence of the sequential procedure in which the estimated effect of an associated genetic marker must first pass a stringent significance threshold. We propose a hierarchical Bayes method in which a spike-and-slab prior is used to account for the possibility that the significant test result may be due to chance. We examine the robustness of the method using different priors corresponding to different degrees of confidence in the testing results and propose a Bayesian model averaging procedure to combine estimates produced by different models. The Bayesian estimators yield smaller variance compared to the conditional likelihood estimator and outperform the latter in the low power studies. We investigate the performance of the method with simulations and applications to four real data examples. Pleiotropy occurs when a single genetic factor influences multiple quantitative or qualitative phenotypes, and it is present in many genetic studies of complex human traits. The longitudinal family studies combine the features of longitudinal studies in individuals and cross-sectional studies in families. Therefore, they provide more information about the genetic and environmental factors associated with the trait of interest. We propose a Bayesian latent variable modeling approach to model multiple phenotypes simultaneously in order to detect the pleiotropic effect and allow for longitudinal and/or family data. An efficient MCMC algorithm is developed to obtain the posterior samples by using hierarchical centering and parameter expansion techniques. We apply spike and slab prior methods to test whether the phenotypes are significantly associated with the latent disease status. We compute Bayes factors using path sampling and discuss their application in testing the significance of factor loadings and the indirect fixed effects. We examine the performance of our methods via extensive simulations and apply them to the blood pressure data from a genetic study of type 1 diabetes (T1D) complications.
author2 Craiu, Radu V.
author_facet Craiu, Radu V.
Xu, Lizhen
author Xu, Lizhen
author_sort Xu, Lizhen
title Bayesian Methods for Genetic Association Studies
title_short Bayesian Methods for Genetic Association Studies
title_full Bayesian Methods for Genetic Association Studies
title_fullStr Bayesian Methods for Genetic Association Studies
title_full_unstemmed Bayesian Methods for Genetic Association Studies
title_sort bayesian methods for genetic association studies
publishDate 2012
url http://hdl.handle.net/1807/34972
work_keys_str_mv AT xulizhen bayesianmethodsforgeneticassociationstudies
_version_ 1716612122251624448