Latent network-based representations for large-scale gene expression data analysis

Abstract Background With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high...

Full description

Bibliographic Details
Main Authors: Wajdi Dhifli, Julia Puig, Aurélien Dispot, Mohamed Elati
Format: Article
Language:English
Published: BMC 2019-02-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-018-2481-y
id doaj-e0701edc078a4bebaa5f59e00c0d6ded
record_format Article
spelling doaj-e0701edc078a4bebaa5f59e00c0d6ded2020-11-25T03:20:12ZengBMCBMC Bioinformatics1471-21052019-02-0119S13819010.1186/s12859-018-2481-yLatent network-based representations for large-scale gene expression data analysisWajdi Dhifli0Julia Puig1Aurélien Dispot2Mohamed Elati3University of LilleUniversity of LilleUniversity of LilleUniversity of LilleAbstract Background With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-throughput biological data while taking into account its biological context. In gene expression data analysis, genes have widely been considered as independent components. However, a systemic view shows that they act synergistically in living cells, forming functional complexes and more generally a biological system. Results In this paper, we propose LatNet, a signal transformation framework that, starting from an initial large-scale gene expression data, allows to generate new representations based on latent network-based relationships between the genes. LatNet aims to leverage system level relations between the genes as an underlying hidden structure to derive the new transformed latent signals. We present a concrete implementation of our framework, based on a gene regulatory network structure and two signal transformation approaches, to quantify latent network-based activity of regulators, as well as gene perturbation signals. The new gene/regulator signals are at the level of each sample of the input data and, thus, could directly be used instead of the initial expression signals for major bioinformatics analysis, including diagnosis and personalized medicine. Conclusion Multiple patterns could be hidden or weakly observed in expression data. LatNet helps in uncovering latent signals that could emphasize hidden patterns based on the relations between the genes and, thus, enhancing the performance of gene expression-based analysis algorithms. We use LatNet for the analysis of real-world gene expression data of bladder cancer and we show the efficiency of our transformation framework as compared to using the initial expression data.http://link.springer.com/article/10.1186/s12859-018-2481-yLatent signalsNetwork-based transformationsGene expressionGene perturbationRegulator activity
collection DOAJ
language English
format Article
sources DOAJ
author Wajdi Dhifli
Julia Puig
Aurélien Dispot
Mohamed Elati
spellingShingle Wajdi Dhifli
Julia Puig
Aurélien Dispot
Mohamed Elati
Latent network-based representations for large-scale gene expression data analysis
BMC Bioinformatics
Latent signals
Network-based transformations
Gene expression
Gene perturbation
Regulator activity
author_facet Wajdi Dhifli
Julia Puig
Aurélien Dispot
Mohamed Elati
author_sort Wajdi Dhifli
title Latent network-based representations for large-scale gene expression data analysis
title_short Latent network-based representations for large-scale gene expression data analysis
title_full Latent network-based representations for large-scale gene expression data analysis
title_fullStr Latent network-based representations for large-scale gene expression data analysis
title_full_unstemmed Latent network-based representations for large-scale gene expression data analysis
title_sort latent network-based representations for large-scale gene expression data analysis
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2019-02-01
description Abstract Background With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-throughput biological data while taking into account its biological context. In gene expression data analysis, genes have widely been considered as independent components. However, a systemic view shows that they act synergistically in living cells, forming functional complexes and more generally a biological system. Results In this paper, we propose LatNet, a signal transformation framework that, starting from an initial large-scale gene expression data, allows to generate new representations based on latent network-based relationships between the genes. LatNet aims to leverage system level relations between the genes as an underlying hidden structure to derive the new transformed latent signals. We present a concrete implementation of our framework, based on a gene regulatory network structure and two signal transformation approaches, to quantify latent network-based activity of regulators, as well as gene perturbation signals. The new gene/regulator signals are at the level of each sample of the input data and, thus, could directly be used instead of the initial expression signals for major bioinformatics analysis, including diagnosis and personalized medicine. Conclusion Multiple patterns could be hidden or weakly observed in expression data. LatNet helps in uncovering latent signals that could emphasize hidden patterns based on the relations between the genes and, thus, enhancing the performance of gene expression-based analysis algorithms. We use LatNet for the analysis of real-world gene expression data of bladder cancer and we show the efficiency of our transformation framework as compared to using the initial expression data.
topic Latent signals
Network-based transformations
Gene expression
Gene perturbation
Regulator activity
url http://link.springer.com/article/10.1186/s12859-018-2481-y
work_keys_str_mv AT wajdidhifli latentnetworkbasedrepresentationsforlargescalegeneexpressiondataanalysis
AT juliapuig latentnetworkbasedrepresentationsforlargescalegeneexpressiondataanalysis
AT aureliendispot latentnetworkbasedrepresentationsforlargescalegeneexpressiondataanalysis
AT mohamedelati latentnetworkbasedrepresentationsforlargescalegeneexpressiondataanalysis
_version_ 1724618864130523136