Learning causal networks with latent variables from multivariate information in genomic data.

Learning causal networks from large-scale genomic data remains challenging in absence of time series or controlled perturbation experiments. We report an information- theoretic method which learns a large class of causal or non-causal graphical models from purely observational data, while including...

Full description

Bibliographic Details
Main Authors: Louis Verny, Nadir Sella, Séverine Affeldt, Param Priya Singh, Hervé Isambert
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-10-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC5685645?pdf=render
id doaj-0efbd8c3c5ba441a89182fca2af90171
record_format Article
spelling doaj-0efbd8c3c5ba441a89182fca2af901712020-11-24T21:55:55ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-10-011310e100566210.1371/journal.pcbi.1005662Learning causal networks with latent variables from multivariate information in genomic data.Louis VernyNadir SellaSéverine AffeldtParam Priya SinghHervé IsambertLearning causal networks from large-scale genomic data remains challenging in absence of time series or controlled perturbation experiments. We report an information- theoretic method which learns a large class of causal or non-causal graphical models from purely observational data, while including the effects of unobserved latent variables, commonly found in many genomic datasets. Starting from a complete graph, the method iteratively removes dispensable edges, by uncovering significant information contributions from indirect paths, and assesses edge-specific confidences from randomization of available data. The remaining edges are then oriented based on the signature of causality in observational data. The approach and associated algorithm, miic, outperform earlier methods on a broad range of benchmark networks. Causal network reconstructions are presented at different biological size and time scales, from gene regulation in single cells to whole genome duplication in tumor development as well as long term evolution of vertebrates. Miic is publicly available at https://github.com/miicTeam/MIIC.http://europepmc.org/articles/PMC5685645?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Louis Verny
Nadir Sella
Séverine Affeldt
Param Priya Singh
Hervé Isambert
spellingShingle Louis Verny
Nadir Sella
Séverine Affeldt
Param Priya Singh
Hervé Isambert
Learning causal networks with latent variables from multivariate information in genomic data.
PLoS Computational Biology
author_facet Louis Verny
Nadir Sella
Séverine Affeldt
Param Priya Singh
Hervé Isambert
author_sort Louis Verny
title Learning causal networks with latent variables from multivariate information in genomic data.
title_short Learning causal networks with latent variables from multivariate information in genomic data.
title_full Learning causal networks with latent variables from multivariate information in genomic data.
title_fullStr Learning causal networks with latent variables from multivariate information in genomic data.
title_full_unstemmed Learning causal networks with latent variables from multivariate information in genomic data.
title_sort learning causal networks with latent variables from multivariate information in genomic data.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2017-10-01
description Learning causal networks from large-scale genomic data remains challenging in absence of time series or controlled perturbation experiments. We report an information- theoretic method which learns a large class of causal or non-causal graphical models from purely observational data, while including the effects of unobserved latent variables, commonly found in many genomic datasets. Starting from a complete graph, the method iteratively removes dispensable edges, by uncovering significant information contributions from indirect paths, and assesses edge-specific confidences from randomization of available data. The remaining edges are then oriented based on the signature of causality in observational data. The approach and associated algorithm, miic, outperform earlier methods on a broad range of benchmark networks. Causal network reconstructions are presented at different biological size and time scales, from gene regulation in single cells to whole genome duplication in tumor development as well as long term evolution of vertebrates. Miic is publicly available at https://github.com/miicTeam/MIIC.
url http://europepmc.org/articles/PMC5685645?pdf=render
work_keys_str_mv AT louisverny learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata
AT nadirsella learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata
AT severineaffeldt learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata
AT parampriyasingh learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata
AT herveisambert learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata
_version_ 1725860505323569152