Bayesian Methods for Genetic Association Studies
We develop statistical methods for tackling two important problems in genetic association studies. First, we propose a Bayesian approach to overcome the winner's curse in genetic studies. Second, we consider a Bayesian latent variable model for analyzing longitudinal family data with pleiotr...
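Main Author: | Xu, Lizhen |
---|---|
Other Authors: | Craiu, Radu V.; Sun, Lei |
Language: | en_ca |
Published: | 2012 |
Subjects: | winner's curse; spike and slab prior; Hierarchical Bayes Model; Bayesian Model Averaging; Latent variable model; pleiotropy; genetic association studies; Markov chain Monte Carlo; path sampling; Bayesian inference; 0463 |
Online Access: | http://hdl.handle.net/1807/34972 |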
id |
ndltd-TORONTO-oai-tspace.library.utoronto.ca-1807-34972 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TORONTO-oai-tspace.library.utoronto.ca-1807-34972; 2013-11-01T04:11:06Z; Bayesian Methods for Genetic Association Studies; Xu, Lizhen; winner's curse; spike and slab prior; Hierarchical Bayes Model; Bayesian Model Averaging; Latent variable model; pleiotropy; genetic association studies; Markov chain Monte Carlo; path sampling; Bayesian inference; 0463; Craiu, Radu V.; Sun, Lei; 2012-11; 2013-01-08T15:06:22Z; NO_RESTRICTION; 2013-01-08T15:06:22Z; 2013-01-08; Thesis; http://hdl.handle.net/1807/34972; en_ca |
collection |
NDLTD |
language |
en_ca |
sources |
NDLTD |
topic |
winner's curse; spike and slab prior; Hierarchical Bayes Model; Bayesian Model Averaging; Latent variable model; pleiotropy; genetic association studies; Markov chain Monte Carlo; path sampling; Bayesian inference; 0463 |
description |
We develop statistical methods for two important problems in genetic association studies. First, we propose a Bayesian approach to overcome the winner's curse in genetic studies. Second, we consider a Bayesian latent variable model for analyzing longitudinal family data with pleiotropic phenotypes.
The winner's curse in genetic association studies refers to the estimation bias of the reported odds ratios (OR) for an associated genetic variant in the initial discovery samples. It is a consequence of the sequential procedure in which the estimated effect of an associated genetic marker must first pass a stringent significance threshold. We propose a hierarchical Bayes method in which a spike-and-slab prior accounts for the possibility that the significant test result is due to chance. We examine the robustness of the method using different priors corresponding to different degrees of confidence in the testing results, and we propose a Bayesian model averaging procedure to combine the estimates produced by the different models. The Bayesian estimators have smaller variance than the conditional likelihood estimator and outperform it in low-power studies. We investigate the performance of the method with simulations and applications to four real data examples.
Pleiotropy occurs when a single genetic factor influences multiple quantitative or qualitative phenotypes, and it is present in many genetic studies of complex human traits. Longitudinal family studies combine the features of longitudinal studies in individuals and cross-sectional studies in families, and therefore provide more information about the genetic and environmental factors associated with the trait of interest. We propose a Bayesian latent variable modeling approach that models multiple phenotypes simultaneously in order to detect pleiotropic effects and that accommodates longitudinal and/or family data. We develop an efficient MCMC algorithm that uses hierarchical centering and parameter expansion to obtain posterior samples. We apply spike-and-slab priors to test whether the phenotypes are significantly associated with the latent disease status. We compute Bayes factors using path sampling and discuss their use in testing the significance of factor loadings and indirect fixed effects. We examine the performance of our methods via extensive simulations and apply them to blood pressure data from a genetic study of type 1 diabetes (T1D) complications. |
author2 |
Craiu, Radu V. |
author |
Xu, Lizhen |
title |
Bayesian Methods for Genetic Association Studies |
publishDate |
2012 |
url |
http://hdl.handle.net/1807/34972 |
_version_ |
1716612122251624448 |
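
The winner's curse described in the abstract can be illustrated with a small simulation. The sketch below is not the thesis's hierarchical Bayes method; it only shows the bias that the method is meant to correct. All names and numbers (`simulate_winners_curse`, the true log odds ratio, the standard error, the z threshold) are hypothetical values chosen for illustration. Each simulated study draws an estimated log odds ratio from a normal distribution around the true value and reports it only if it passes the significance threshold.

```python
import math
import random


def simulate_winners_curse(true_log_or=0.10, se=0.05, z_threshold=3.0,
                           n_studies=50_000, seed=1):
    """Return (power, mean reported log OR) over simulated discovery studies.

    All default values are hypothetical and chosen only for illustration.
    """
    rng = random.Random(seed)
    reported = []
    for _ in range(n_studies):
        # Estimated log odds ratio from one discovery study.
        estimate = rng.gauss(true_log_or, se)
        # Only estimates that pass the significance threshold get reported.
        if abs(estimate / se) > z_threshold:
            reported.append(estimate)
    power = len(reported) / n_studies
    mean_reported = sum(reported) / len(reported) if reported else float("nan")
    return power, mean_reported


if __name__ == "__main__":
    power, mean_reported = simulate_winners_curse()
    print(f"power: {power:.3f}")
    print(f"true OR: {math.exp(0.10):.3f}")
    print(f"OR implied by mean reported log OR: {math.exp(mean_reported):.3f}")
```

Because a reported estimate must exceed `z_threshold * se` in absolute value, the average reported effect sits above the true effect whenever power is low, which is the setting in which the abstract says the Bayesian estimators outperform the conditional likelihood estimator.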