Detection of network motifs using three-way ANOVA.

MOTIVATION:Gene regulatory networks (GRN) can be determined via various experimental techniques, and also by computational methods, which infer networks from gene expression data. However, these techniques treat interactions separately such that interdependencies of interactions forming meaningful s...

Full description

Bibliographic Details
Main Authors: Pegah Tavakkolkhah, Ralf Zimmer, Robert Küffner
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC6078297?pdf=render
id doaj-842cc8815d184e7eba7b49f328afc99c
record_format Article
spelling doaj-842cc8815d184e7eba7b49f328afc99c2020-11-25T01:37:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01138e020138210.1371/journal.pone.0201382Detection of network motifs using three-way ANOVA.Pegah TavakkolkhahRalf ZimmerRobert KüffnerMOTIVATION:Gene regulatory networks (GRN) can be determined via various experimental techniques, and also by computational methods, which infer networks from gene expression data. However, these techniques treat interactions separately such that interdependencies of interactions forming meaningful subnetworks are typically not considered. METHODS:For the investigation of network properties and for the classification of different (sub-)networks based on gene expression data, we consider biological network motifs consisting of three genes and up to three interactions, e.g. the cascade chain (CSC), feed-forward loop (FFL), and dense-overlapping regulon (DOR). We examine several conventional methods for the inference of network motifs, which typically consider each interaction individually. In addition, we propose a new method based on three-way ANOVA (ANalysis Of VAriance) (3WA) that analyzes entire subnetworks at once. To demonstrate the advantages of such a more holistic perspective, we compare the ability of 3WA and other methods to detect and categorize network motifs on large real and artificial datasets. RESULTS:We find that conventional methods perform much better on artificial data (AUC up to 80%), than on real E. coli expression datasets (AUC 50% corresponding to random guessing). To explain this observation, we examine several important properties that differ between datasets and analyze predicted motifs in detail. We find that in case of real networks our new 3WA method outperforms (AUC 70% in E. coli) previous methods by exploiting the interdependencies in the full motif structure. Because of important differences between current artificial datasets and real measurements, the construction and testing of motif detection methods should focus on real data.http://europepmc.org/articles/PMC6078297?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Pegah Tavakkolkhah
Ralf Zimmer
Robert Küffner
spellingShingle Pegah Tavakkolkhah
Ralf Zimmer
Robert Küffner
Detection of network motifs using three-way ANOVA.
PLoS ONE
author_facet Pegah Tavakkolkhah
Ralf Zimmer
Robert Küffner
author_sort Pegah Tavakkolkhah
title Detection of network motifs using three-way ANOVA.
title_short Detection of network motifs using three-way ANOVA.
title_full Detection of network motifs using three-way ANOVA.
title_fullStr Detection of network motifs using three-way ANOVA.
title_full_unstemmed Detection of network motifs using three-way ANOVA.
title_sort detection of network motifs using three-way anova.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description MOTIVATION:Gene regulatory networks (GRN) can be determined via various experimental techniques, and also by computational methods, which infer networks from gene expression data. However, these techniques treat interactions separately such that interdependencies of interactions forming meaningful subnetworks are typically not considered. METHODS:For the investigation of network properties and for the classification of different (sub-)networks based on gene expression data, we consider biological network motifs consisting of three genes and up to three interactions, e.g. the cascade chain (CSC), feed-forward loop (FFL), and dense-overlapping regulon (DOR). We examine several conventional methods for the inference of network motifs, which typically consider each interaction individually. In addition, we propose a new method based on three-way ANOVA (ANalysis Of VAriance) (3WA) that analyzes entire subnetworks at once. To demonstrate the advantages of such a more holistic perspective, we compare the ability of 3WA and other methods to detect and categorize network motifs on large real and artificial datasets. RESULTS:We find that conventional methods perform much better on artificial data (AUC up to 80%), than on real E. coli expression datasets (AUC 50% corresponding to random guessing). To explain this observation, we examine several important properties that differ between datasets and analyze predicted motifs in detail. We find that in case of real networks our new 3WA method outperforms (AUC 70% in E. coli) previous methods by exploiting the interdependencies in the full motif structure. Because of important differences between current artificial datasets and real measurements, the construction and testing of motif detection methods should focus on real data.
url http://europepmc.org/articles/PMC6078297?pdf=render
work_keys_str_mv AT pegahtavakkolkhah detectionofnetworkmotifsusingthreewayanova
AT ralfzimmer detectionofnetworkmotifsusingthreewayanova
AT robertkuffner detectionofnetworkmotifsusingthreewayanova
_version_ 1725060378311786496