A unified framework for association analysis with multiple related phenotypes.

We consider the problem of assessing associations between multiple related outcome variables, and a single explanatory variable of interest. This problem arises in many settings, including genetic association studies, where the explanatory variable is genotype at a genetic variant. We outline a fram...

Full description

Bibliographic Details
Main Author: Matthew Stephens
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3702528?pdf=render
id doaj-a81808208b1e4682b8a6f6de363e6785
record_format Article
spelling doaj-a81808208b1e4682b8a6f6de363e67852020-11-25T01:17:10ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0187e6524510.1371/journal.pone.0065245A unified framework for association analysis with multiple related phenotypes.Matthew StephensWe consider the problem of assessing associations between multiple related outcome variables, and a single explanatory variable of interest. This problem arises in many settings, including genetic association studies, where the explanatory variable is genotype at a genetic variant. We outline a framework for conducting this type of analysis, based on Bayesian model comparison and model averaging for multivariate regressions. This framework unifies several common approaches to this problem, and includes both standard univariate and standard multivariate association tests as special cases. The framework also unifies the problems of testing for associations and explaining associations - that is, identifying which outcome variables are associated with genotype. This provides an alternative to the usual, but conceptually unsatisfying, approach of resorting to univariate tests when explaining and interpreting significant multivariate findings. The method is computationally tractable genome-wide for modest numbers of phenotypes (e.g. 5-10), and can be applied to summary data, without access to raw genotype and phenotype data. We illustrate the methods on both simulated examples, and to a genome-wide association study of blood lipid traits where we identify 18 potential novel genetic associations that were not identified by univariate analyses of the same data.http://europepmc.org/articles/PMC3702528?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Matthew Stephens
spellingShingle Matthew Stephens
A unified framework for association analysis with multiple related phenotypes.
PLoS ONE
author_facet Matthew Stephens
author_sort Matthew Stephens
title A unified framework for association analysis with multiple related phenotypes.
title_short A unified framework for association analysis with multiple related phenotypes.
title_full A unified framework for association analysis with multiple related phenotypes.
title_fullStr A unified framework for association analysis with multiple related phenotypes.
title_full_unstemmed A unified framework for association analysis with multiple related phenotypes.
title_sort unified framework for association analysis with multiple related phenotypes.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description We consider the problem of assessing associations between multiple related outcome variables, and a single explanatory variable of interest. This problem arises in many settings, including genetic association studies, where the explanatory variable is genotype at a genetic variant. We outline a framework for conducting this type of analysis, based on Bayesian model comparison and model averaging for multivariate regressions. This framework unifies several common approaches to this problem, and includes both standard univariate and standard multivariate association tests as special cases. The framework also unifies the problems of testing for associations and explaining associations - that is, identifying which outcome variables are associated with genotype. This provides an alternative to the usual, but conceptually unsatisfying, approach of resorting to univariate tests when explaining and interpreting significant multivariate findings. The method is computationally tractable genome-wide for modest numbers of phenotypes (e.g. 5-10), and can be applied to summary data, without access to raw genotype and phenotype data. We illustrate the methods on both simulated examples, and to a genome-wide association study of blood lipid traits where we identify 18 potential novel genetic associations that were not identified by univariate analyses of the same data.
url http://europepmc.org/articles/PMC3702528?pdf=render
work_keys_str_mv AT matthewstephens aunifiedframeworkforassociationanalysiswithmultiplerelatedphenotypes
AT matthewstephens unifiedframeworkforassociationanalysiswithmultiplerelatedphenotypes
_version_ 1725147678961041408