Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite

Abstract Background High-throughput transcriptomic datasets are often examined to discover new actors and regulators of a biological response. To this end, graphical interfaces have been developed and allow a broad range of users to conduct standard analyses from RNA-seq data, even with little progr...

Full description

Bibliographic Details
Main Authors: Océane Cassan, Sophie Lèbre, Antoine Martin
Format: Article
Language:English
Published: BMC 2021-05-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-021-07659-2
id doaj-00a18c8eb72c436192022653058f0c61
record_format Article
spelling doaj-00a18c8eb72c436192022653058f0c612021-05-30T11:25:54ZengBMCBMC Genomics1471-21642021-05-0122111510.1186/s12864-021-07659-2Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suiteOcéane Cassan0Sophie Lèbre1Antoine Martin2BPMP, CNRS, INRAE, Institut Agro, Univ MontpellierIMAG, Univ. Montpellier, CNRSBPMP, CNRS, INRAE, Institut Agro, Univ MontpellierAbstract Background High-throughput transcriptomic datasets are often examined to discover new actors and regulators of a biological response. To this end, graphical interfaces have been developed and allow a broad range of users to conduct standard analyses from RNA-seq data, even with little programming experience. Although existing solutions usually provide adequate procedures for normalization, exploration or differential expression, more advanced features, such as gene clustering or regulatory network inference, often miss or do not reflect current state of the art methodologies. Results We developed here a user interface called DIANE (Dashboard for the Inference and Analysis of Networks from Expression data) designed to harness the potential of multi-factorial expression datasets from any organisms through a precise set of methods. DIANE interactive workflow provides normalization, dimensionality reduction, differential expression and ontology enrichment. Gene clustering can be performed and explored via configurable Mixture Models, and Random Forests are used to infer gene regulatory networks. DIANE also includes a novel procedure to assess the statistical significance of regulator-target influence measures based on permutations for Random Forest importance metrics. All along the pipeline, session reports and results can be downloaded to ensure clear and reproducible analyses. Conclusions We demonstrate the value and the benefits of DIANE using a recently published data set describing the transcriptional response of Arabidopsis thaliana under the combination of temperature, drought and salinity perturbations. We show that DIANE can intuitively carry out informative exploration and statistical procedures with RNA-Seq data, perform model based gene expression profiles clustering and go further into gene network reconstruction, providing relevant candidate genes or signalling pathways to explore. DIANE is available as a web service ( https://diane.bpmp.inrae.fr ), or can be installed and locally launched as a complete R package.https://doi.org/10.1186/s12864-021-07659-2Gene regulatory network inferenceGraphical user interfaceMultifactorial transcriptomic analysisModel-based clusteringAnalysis workflow
collection DOAJ
language English
format Article
sources DOAJ
author Océane Cassan
Sophie Lèbre
Antoine Martin
spellingShingle Océane Cassan
Sophie Lèbre
Antoine Martin
Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite
BMC Genomics
Gene regulatory network inference
Graphical user interface
Multifactorial transcriptomic analysis
Model-based clustering
Analysis workflow
author_facet Océane Cassan
Sophie Lèbre
Antoine Martin
author_sort Océane Cassan
title Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite
title_short Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite
title_full Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite
title_fullStr Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite
title_full_unstemmed Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite
title_sort inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2021-05-01
description Abstract Background High-throughput transcriptomic datasets are often examined to discover new actors and regulators of a biological response. To this end, graphical interfaces have been developed and allow a broad range of users to conduct standard analyses from RNA-seq data, even with little programming experience. Although existing solutions usually provide adequate procedures for normalization, exploration or differential expression, more advanced features, such as gene clustering or regulatory network inference, often miss or do not reflect current state of the art methodologies. Results We developed here a user interface called DIANE (Dashboard for the Inference and Analysis of Networks from Expression data) designed to harness the potential of multi-factorial expression datasets from any organisms through a precise set of methods. DIANE interactive workflow provides normalization, dimensionality reduction, differential expression and ontology enrichment. Gene clustering can be performed and explored via configurable Mixture Models, and Random Forests are used to infer gene regulatory networks. DIANE also includes a novel procedure to assess the statistical significance of regulator-target influence measures based on permutations for Random Forest importance metrics. All along the pipeline, session reports and results can be downloaded to ensure clear and reproducible analyses. Conclusions We demonstrate the value and the benefits of DIANE using a recently published data set describing the transcriptional response of Arabidopsis thaliana under the combination of temperature, drought and salinity perturbations. We show that DIANE can intuitively carry out informative exploration and statistical procedures with RNA-Seq data, perform model based gene expression profiles clustering and go further into gene network reconstruction, providing relevant candidate genes or signalling pathways to explore. DIANE is available as a web service ( https://diane.bpmp.inrae.fr ), or can be installed and locally launched as a complete R package.
topic Gene regulatory network inference
Graphical user interface
Multifactorial transcriptomic analysis
Model-based clustering
Analysis workflow
url https://doi.org/10.1186/s12864-021-07659-2
work_keys_str_mv AT oceanecassan inferringandanalyzinggeneregulatorynetworksfrommultifactorialexpressiondataacompleteandinteractivesuite
AT sophielebre inferringandanalyzinggeneregulatorynetworksfrommultifactorialexpressiondataacompleteandinteractivesuite
AT antoinemartin inferringandanalyzinggeneregulatorynetworksfrommultifactorialexpressiondataacompleteandinteractivesuite
_version_ 1721420324782735360