Detection of network motifs using three-way ANOVA.
MOTIVATION:Gene regulatory networks (GRN) can be determined via various experimental techniques, and also by computational methods, which infer networks from gene expression data. However, these techniques treat interactions separately such that interdependencies of interactions forming meaningful s...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2018-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC6078297?pdf=render |
id |
doaj-842cc8815d184e7eba7b49f328afc99c |
---|---|
record_format |
Article |
spelling |
doaj-842cc8815d184e7eba7b49f328afc99c2020-11-25T01:37:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01138e020138210.1371/journal.pone.0201382Detection of network motifs using three-way ANOVA.Pegah TavakkolkhahRalf ZimmerRobert KüffnerMOTIVATION:Gene regulatory networks (GRN) can be determined via various experimental techniques, and also by computational methods, which infer networks from gene expression data. However, these techniques treat interactions separately such that interdependencies of interactions forming meaningful subnetworks are typically not considered. METHODS:For the investigation of network properties and for the classification of different (sub-)networks based on gene expression data, we consider biological network motifs consisting of three genes and up to three interactions, e.g. the cascade chain (CSC), feed-forward loop (FFL), and dense-overlapping regulon (DOR). We examine several conventional methods for the inference of network motifs, which typically consider each interaction individually. In addition, we propose a new method based on three-way ANOVA (ANalysis Of VAriance) (3WA) that analyzes entire subnetworks at once. To demonstrate the advantages of such a more holistic perspective, we compare the ability of 3WA and other methods to detect and categorize network motifs on large real and artificial datasets. RESULTS:We find that conventional methods perform much better on artificial data (AUC up to 80%), than on real E. coli expression datasets (AUC 50% corresponding to random guessing). To explain this observation, we examine several important properties that differ between datasets and analyze predicted motifs in detail. We find that in case of real networks our new 3WA method outperforms (AUC 70% in E. coli) previous methods by exploiting the interdependencies in the full motif structure. Because of important differences between current artificial datasets and real measurements, the construction and testing of motif detection methods should focus on real data.http://europepmc.org/articles/PMC6078297?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Pegah Tavakkolkhah Ralf Zimmer Robert Küffner |
spellingShingle |
Pegah Tavakkolkhah Ralf Zimmer Robert Küffner Detection of network motifs using three-way ANOVA. PLoS ONE |
author_facet |
Pegah Tavakkolkhah Ralf Zimmer Robert Küffner |
author_sort |
Pegah Tavakkolkhah |
title |
Detection of network motifs using three-way ANOVA. |
title_short |
Detection of network motifs using three-way ANOVA. |
title_full |
Detection of network motifs using three-way ANOVA. |
title_fullStr |
Detection of network motifs using three-way ANOVA. |
title_full_unstemmed |
Detection of network motifs using three-way ANOVA. |
title_sort |
detection of network motifs using three-way anova. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2018-01-01 |
description |
MOTIVATION:Gene regulatory networks (GRN) can be determined via various experimental techniques, and also by computational methods, which infer networks from gene expression data. However, these techniques treat interactions separately such that interdependencies of interactions forming meaningful subnetworks are typically not considered. METHODS:For the investigation of network properties and for the classification of different (sub-)networks based on gene expression data, we consider biological network motifs consisting of three genes and up to three interactions, e.g. the cascade chain (CSC), feed-forward loop (FFL), and dense-overlapping regulon (DOR). We examine several conventional methods for the inference of network motifs, which typically consider each interaction individually. In addition, we propose a new method based on three-way ANOVA (ANalysis Of VAriance) (3WA) that analyzes entire subnetworks at once. To demonstrate the advantages of such a more holistic perspective, we compare the ability of 3WA and other methods to detect and categorize network motifs on large real and artificial datasets. RESULTS:We find that conventional methods perform much better on artificial data (AUC up to 80%), than on real E. coli expression datasets (AUC 50% corresponding to random guessing). To explain this observation, we examine several important properties that differ between datasets and analyze predicted motifs in detail. We find that in case of real networks our new 3WA method outperforms (AUC 70% in E. coli) previous methods by exploiting the interdependencies in the full motif structure. Because of important differences between current artificial datasets and real measurements, the construction and testing of motif detection methods should focus on real data. |
url |
http://europepmc.org/articles/PMC6078297?pdf=render |
work_keys_str_mv |
AT pegahtavakkolkhah detectionofnetworkmotifsusingthreewayanova AT ralfzimmer detectionofnetworkmotifsusingthreewayanova AT robertkuffner detectionofnetworkmotifsusingthreewayanova |
_version_ |
1725060378311786496 |