Learning gene networks underlying clinical phenotypes using SNP perturbation.

Availability of genome sequence, molecular, and clinical phenotype data for large patient cohorts generated by recent technological advances provides an opportunity to dissect the genetic architecture of complex diseases at system level. However, previous analyses of such data have largely focused o...

Full description

Bibliographic Details
Main Authors: Calvin McCarter, Judie Howrylak, Seyoung Kim
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-10-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1007940
id doaj-f0cc68211ed841309a1231f33042060d
record_format Article
spelling doaj-f0cc68211ed841309a1231f33042060d2021-04-21T16:41:14ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582020-10-011610e100794010.1371/journal.pcbi.1007940Learning gene networks underlying clinical phenotypes using SNP perturbation.Calvin McCarterJudie HowrylakSeyoung KimAvailability of genome sequence, molecular, and clinical phenotype data for large patient cohorts generated by recent technological advances provides an opportunity to dissect the genetic architecture of complex diseases at system level. However, previous analyses of such data have largely focused on the co-localization of SNPs associated with clinical and expression traits, each identified from genome-wide association studies and expression quantitative trait locus mapping. Thus, their description of the molecular mechanisms behind the SNPs influencing clinical phenotypes was limited to the single gene linked to the co-localized SNP. Here we introduce PerturbNet, a statistical framework for learning gene networks that modulate the influence of genetic variants on phenotypes, using genetic variants as naturally occurring perturbation of a biological system. PerturbNet uses a probabilistic graphical model to directly model the cascade of perturbation from genetic variants to the gene network to the phenotype network along with the networks at each layer of the biological system. PerturbNet learns the entire model by solving a single optimization problem with an efficient algorithm that can analyze human genome-wide data within a few hours. PerturbNet inference procedures extract a detailed description of how the gene network modulates the genetic effects on phenotypes. Using simulated and asthma data, we demonstrate that PerturbNet improves statistical power for detecting disease-linked SNPs and identifies gene networks and network modules mediating the SNP effects on traits, providing deeper insights into the underlying molecular mechanisms.https://doi.org/10.1371/journal.pcbi.1007940
collection DOAJ
language English
format Article
sources DOAJ
author Calvin McCarter
Judie Howrylak
Seyoung Kim
spellingShingle Calvin McCarter
Judie Howrylak
Seyoung Kim
Learning gene networks underlying clinical phenotypes using SNP perturbation.
PLoS Computational Biology
author_facet Calvin McCarter
Judie Howrylak
Seyoung Kim
author_sort Calvin McCarter
title Learning gene networks underlying clinical phenotypes using SNP perturbation.
title_short Learning gene networks underlying clinical phenotypes using SNP perturbation.
title_full Learning gene networks underlying clinical phenotypes using SNP perturbation.
title_fullStr Learning gene networks underlying clinical phenotypes using SNP perturbation.
title_full_unstemmed Learning gene networks underlying clinical phenotypes using SNP perturbation.
title_sort learning gene networks underlying clinical phenotypes using snp perturbation.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2020-10-01
description Availability of genome sequence, molecular, and clinical phenotype data for large patient cohorts generated by recent technological advances provides an opportunity to dissect the genetic architecture of complex diseases at system level. However, previous analyses of such data have largely focused on the co-localization of SNPs associated with clinical and expression traits, each identified from genome-wide association studies and expression quantitative trait locus mapping. Thus, their description of the molecular mechanisms behind the SNPs influencing clinical phenotypes was limited to the single gene linked to the co-localized SNP. Here we introduce PerturbNet, a statistical framework for learning gene networks that modulate the influence of genetic variants on phenotypes, using genetic variants as naturally occurring perturbation of a biological system. PerturbNet uses a probabilistic graphical model to directly model the cascade of perturbation from genetic variants to the gene network to the phenotype network along with the networks at each layer of the biological system. PerturbNet learns the entire model by solving a single optimization problem with an efficient algorithm that can analyze human genome-wide data within a few hours. PerturbNet inference procedures extract a detailed description of how the gene network modulates the genetic effects on phenotypes. Using simulated and asthma data, we demonstrate that PerturbNet improves statistical power for detecting disease-linked SNPs and identifies gene networks and network modules mediating the SNP effects on traits, providing deeper insights into the underlying molecular mechanisms.
url https://doi.org/10.1371/journal.pcbi.1007940
work_keys_str_mv AT calvinmccarter learninggenenetworksunderlyingclinicalphenotypesusingsnpperturbation
AT judiehowrylak learninggenenetworksunderlyingclinicalphenotypesusingsnpperturbation
AT seyoungkim learninggenenetworksunderlyingclinicalphenotypesusingsnpperturbation
_version_ 1714666815110512640