Learning causal networks with latent variables from multivariate information in genomic data.

Learning causal networks from large-scale genomic data remains challenging in absence of time series or controlled perturbation experiments. We report an information- theoretic method which learns a large class of causal or non-causal graphical models from purely observational data, while including...

Full description

Bibliographic Details
Main Authors:	Louis Verny, Nadir Sella, Séverine Affeldt, Param Priya Singh, Hervé Isambert
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2017-10-01
Series:	PLoS Computational Biology
Online Access:	http://europepmc.org/articles/PMC5685645?pdf=render

id	doaj-0efbd8c3c5ba441a89182fca2af90171
record_format	Article
spelling	doaj-0efbd8c3c5ba441a89182fca2af901712020-11-24T21:55:55ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-10-011310e100566210.1371/journal.pcbi.1005662Learning causal networks with latent variables from multivariate information in genomic data.Louis VernyNadir SellaSéverine AffeldtParam Priya SinghHervé IsambertLearning causal networks from large-scale genomic data remains challenging in absence of time series or controlled perturbation experiments. We report an information- theoretic method which learns a large class of causal or non-causal graphical models from purely observational data, while including the effects of unobserved latent variables, commonly found in many genomic datasets. Starting from a complete graph, the method iteratively removes dispensable edges, by uncovering significant information contributions from indirect paths, and assesses edge-specific confidences from randomization of available data. The remaining edges are then oriented based on the signature of causality in observational data. The approach and associated algorithm, miic, outperform earlier methods on a broad range of benchmark networks. Causal network reconstructions are presented at different biological size and time scales, from gene regulation in single cells to whole genome duplication in tumor development as well as long term evolution of vertebrates. Miic is publicly available at https://github.com/miicTeam/MIIC.http://europepmc.org/articles/PMC5685645?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Louis Verny Nadir Sella Séverine Affeldt Param Priya Singh Hervé Isambert
spellingShingle	Louis Verny Nadir Sella Séverine Affeldt Param Priya Singh Hervé Isambert Learning causal networks with latent variables from multivariate information in genomic data. PLoS Computational Biology
author_facet	Louis Verny Nadir Sella Séverine Affeldt Param Priya Singh Hervé Isambert
author_sort	Louis Verny
title	Learning causal networks with latent variables from multivariate information in genomic data.
title_short	Learning causal networks with latent variables from multivariate information in genomic data.
title_full	Learning causal networks with latent variables from multivariate information in genomic data.
title_fullStr	Learning causal networks with latent variables from multivariate information in genomic data.
title_full_unstemmed	Learning causal networks with latent variables from multivariate information in genomic data.
title_sort	learning causal networks with latent variables from multivariate information in genomic data.
publisher	Public Library of Science (PLoS)
series	PLoS Computational Biology
issn	1553-734X 1553-7358
publishDate	2017-10-01
description	Learning causal networks from large-scale genomic data remains challenging in absence of time series or controlled perturbation experiments. We report an information- theoretic method which learns a large class of causal or non-causal graphical models from purely observational data, while including the effects of unobserved latent variables, commonly found in many genomic datasets. Starting from a complete graph, the method iteratively removes dispensable edges, by uncovering significant information contributions from indirect paths, and assesses edge-specific confidences from randomization of available data. The remaining edges are then oriented based on the signature of causality in observational data. The approach and associated algorithm, miic, outperform earlier methods on a broad range of benchmark networks. Causal network reconstructions are presented at different biological size and time scales, from gene regulation in single cells to whole genome duplication in tumor development as well as long term evolution of vertebrates. Miic is publicly available at https://github.com/miicTeam/MIIC.
url	http://europepmc.org/articles/PMC5685645?pdf=render
work_keys_str_mv	AT louisverny learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata AT nadirsella learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata AT severineaffeldt learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata AT parampriyasingh learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata AT herveisambert learningcausalnetworkswithlatentvariablesfrommultivariateinformationingenomicdata
_version_	1725860505323569152

Learning causal networks with latent variables from multivariate information in genomic data.

Similar Items