Modular biological function is most effectively captured by combining molecular interaction data types.

Large-scale molecular interaction data sets have the potential to provide a comprehensive, system-wide understanding of biological function. Although individual molecules can be promiscuous in terms of their contribution to function, molecular functions emerge from the specific interactions of molec...

Full description

Bibliographic Details
Main Authors: Ryan M Ames, Jamie I Macpherson, John W Pinney, Simon C Lovell, David L Robertson
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23658761/?tool=EBI
id doaj-7720d5fa451f47b68fd9e7e753877bf3
record_format Article
spelling doaj-7720d5fa451f47b68fd9e7e753877bf32021-03-03T20:23:27ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0185e6267010.1371/journal.pone.0062670Modular biological function is most effectively captured by combining molecular interaction data types.Ryan M AmesJamie I MacphersonJohn W PinneySimon C LovellDavid L RobertsonLarge-scale molecular interaction data sets have the potential to provide a comprehensive, system-wide understanding of biological function. Although individual molecules can be promiscuous in terms of their contribution to function, molecular functions emerge from the specific interactions of molecules giving rise to modular organisation. As functions often derive from a range of mechanisms, we demonstrate that they are best studied using networks derived from different sources. Implementing a graph partitioning algorithm we identify subnetworks in yeast protein-protein interaction (PPI), genetic interaction and gene co-regulation networks. Among these subnetworks we identify cohesive subgraphs that we expect to represent functional modules in the different data types. We demonstrate significant overlap between the subgraphs generated from the different data types and show these overlaps can represent related functions as represented by the Gene Ontology (GO). Next, we investigate the correspondence between our subgraphs and the Gene Ontology. This revealed varying degrees of coverage of the biological process, molecular function and cellular component ontologies, dependent on the data type. For example, subgraphs from the PPI show enrichment for 84%, 58% and 93% of annotated GO terms, respectively. Integrating the interaction data into a combined network increases the coverage of GO. Furthermore, the different annotation types of GO are not predominantly associated with one of the interaction data types. Collectively our results demonstrate that successful capture of functional relationships by network data depends on both the specific biological function being characterised and the type of network data being used. We identify functions that require integrated information to be accurately represented, demonstrating the limitations of individual data types. Combining interaction subnetworks across data types is therefore essential for fully understanding the complex and emergent nature of biological function.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23658761/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Ryan M Ames
Jamie I Macpherson
John W Pinney
Simon C Lovell
David L Robertson
spellingShingle Ryan M Ames
Jamie I Macpherson
John W Pinney
Simon C Lovell
David L Robertson
Modular biological function is most effectively captured by combining molecular interaction data types.
PLoS ONE
author_facet Ryan M Ames
Jamie I Macpherson
John W Pinney
Simon C Lovell
David L Robertson
author_sort Ryan M Ames
title Modular biological function is most effectively captured by combining molecular interaction data types.
title_short Modular biological function is most effectively captured by combining molecular interaction data types.
title_full Modular biological function is most effectively captured by combining molecular interaction data types.
title_fullStr Modular biological function is most effectively captured by combining molecular interaction data types.
title_full_unstemmed Modular biological function is most effectively captured by combining molecular interaction data types.
title_sort modular biological function is most effectively captured by combining molecular interaction data types.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description Large-scale molecular interaction data sets have the potential to provide a comprehensive, system-wide understanding of biological function. Although individual molecules can be promiscuous in terms of their contribution to function, molecular functions emerge from the specific interactions of molecules giving rise to modular organisation. As functions often derive from a range of mechanisms, we demonstrate that they are best studied using networks derived from different sources. Implementing a graph partitioning algorithm we identify subnetworks in yeast protein-protein interaction (PPI), genetic interaction and gene co-regulation networks. Among these subnetworks we identify cohesive subgraphs that we expect to represent functional modules in the different data types. We demonstrate significant overlap between the subgraphs generated from the different data types and show these overlaps can represent related functions as represented by the Gene Ontology (GO). Next, we investigate the correspondence between our subgraphs and the Gene Ontology. This revealed varying degrees of coverage of the biological process, molecular function and cellular component ontologies, dependent on the data type. For example, subgraphs from the PPI show enrichment for 84%, 58% and 93% of annotated GO terms, respectively. Integrating the interaction data into a combined network increases the coverage of GO. Furthermore, the different annotation types of GO are not predominantly associated with one of the interaction data types. Collectively our results demonstrate that successful capture of functional relationships by network data depends on both the specific biological function being characterised and the type of network data being used. We identify functions that require integrated information to be accurately represented, demonstrating the limitations of individual data types. Combining interaction subnetworks across data types is therefore essential for fully understanding the complex and emergent nature of biological function.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23658761/?tool=EBI
work_keys_str_mv AT ryanmames modularbiologicalfunctionismosteffectivelycapturedbycombiningmolecularinteractiondatatypes
AT jamieimacpherson modularbiologicalfunctionismosteffectivelycapturedbycombiningmolecularinteractiondatatypes
AT johnwpinney modularbiologicalfunctionismosteffectivelycapturedbycombiningmolecularinteractiondatatypes
AT simonclovell modularbiologicalfunctionismosteffectivelycapturedbycombiningmolecularinteractiondatatypes
AT davidlrobertson modularbiologicalfunctionismosteffectivelycapturedbycombiningmolecularinteractiondatatypes
_version_ 1714822744666800128