AGeNNT: annotation of enzyme families by means of refined neighborhood networks

Abstract Background Large enzyme families may contain functionally diverse members that give rise to clusters in a sequence similarity network (SSN). In prokaryotes, the genome neighborhood of a gene-product is indicative of its function and thus, a genome neighborhood network (GNN) deduced for an S...

Full description

Bibliographic Details
Main Authors: Florian Kandlinger, Maximilian G. Plach, Rainer Merkl
Format: Article
Language:English
Published: BMC 2017-05-01
Series:BMC Bioinformatics
Subjects:
SSN
GNN
Online Access:http://link.springer.com/article/10.1186/s12859-017-1689-6
id doaj-4333c0531144484db589b8ca8c9fc58a
record_format Article
spelling doaj-4333c0531144484db589b8ca8c9fc58a2020-11-25T00:43:27ZengBMCBMC Bioinformatics1471-21052017-05-0118111310.1186/s12859-017-1689-6AGeNNT: annotation of enzyme families by means of refined neighborhood networksFlorian Kandlinger0Maximilian G. Plach1Rainer Merkl2Institute of Biophysics and Physical Biochemistry, University of RegensburgInstitute of Biophysics and Physical Biochemistry, University of RegensburgInstitute of Biophysics and Physical Biochemistry, University of RegensburgAbstract Background Large enzyme families may contain functionally diverse members that give rise to clusters in a sequence similarity network (SSN). In prokaryotes, the genome neighborhood of a gene-product is indicative of its function and thus, a genome neighborhood network (GNN) deduced for an SSN provides strong clues to the specific function of enzymes constituting the different clusters. The Enzyme Function Initiative ( http://enzymefunction.org/ ) offers services that compute SSNs and GNNs. Results We have implemented AGeNNT that utilizes these services, albeit with datasets purged with respect to unspecific protein functions and overrepresented species. AGeNNT generates refined GNNs (rGNNs) that consist of cluster-nodes representing the sequences under study and Pfam-nodes representing enzyme functions encoded in the respective neighborhoods. For cluster-nodes, AGeNNT summarizes the phylogenetic relationships of the contributing species and a statistic indicates how unique nodes and GNs are within this rGNN. Pfam-nodes are annotated with additional features like GO terms describing protein function. For edges, the coverage is given, which is the relative number of neighborhoods containing the considered enzyme function (Pfam-node). AGeNNT is available at https://github.com/kandlinf/agennt . Conclusions An rGNN is easier to interpret than a conventional GNN, which commonly contains proteins without enzymatic function and overly specific neighborhoods due to phylogenetic bias. The implemented filter routines and the statistic allow the user to identify those neighborhoods that are most indicative of a specific metabolic capacity. Thus, AGeNNT facilitates to distinguish and annotate functionally different members of enzyme families.http://link.springer.com/article/10.1186/s12859-017-1689-6Sequence similarity networkSSNGenome neighborhood networkGNNGenome contentEnzyme function
collection DOAJ
language English
format Article
sources DOAJ
author Florian Kandlinger
Maximilian G. Plach
Rainer Merkl
spellingShingle Florian Kandlinger
Maximilian G. Plach
Rainer Merkl
AGeNNT: annotation of enzyme families by means of refined neighborhood networks
BMC Bioinformatics
Sequence similarity network
SSN
Genome neighborhood network
GNN
Genome content
Enzyme function
author_facet Florian Kandlinger
Maximilian G. Plach
Rainer Merkl
author_sort Florian Kandlinger
title AGeNNT: annotation of enzyme families by means of refined neighborhood networks
title_short AGeNNT: annotation of enzyme families by means of refined neighborhood networks
title_full AGeNNT: annotation of enzyme families by means of refined neighborhood networks
title_fullStr AGeNNT: annotation of enzyme families by means of refined neighborhood networks
title_full_unstemmed AGeNNT: annotation of enzyme families by means of refined neighborhood networks
title_sort agennt: annotation of enzyme families by means of refined neighborhood networks
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2017-05-01
description Abstract Background Large enzyme families may contain functionally diverse members that give rise to clusters in a sequence similarity network (SSN). In prokaryotes, the genome neighborhood of a gene-product is indicative of its function and thus, a genome neighborhood network (GNN) deduced for an SSN provides strong clues to the specific function of enzymes constituting the different clusters. The Enzyme Function Initiative ( http://enzymefunction.org/ ) offers services that compute SSNs and GNNs. Results We have implemented AGeNNT that utilizes these services, albeit with datasets purged with respect to unspecific protein functions and overrepresented species. AGeNNT generates refined GNNs (rGNNs) that consist of cluster-nodes representing the sequences under study and Pfam-nodes representing enzyme functions encoded in the respective neighborhoods. For cluster-nodes, AGeNNT summarizes the phylogenetic relationships of the contributing species and a statistic indicates how unique nodes and GNs are within this rGNN. Pfam-nodes are annotated with additional features like GO terms describing protein function. For edges, the coverage is given, which is the relative number of neighborhoods containing the considered enzyme function (Pfam-node). AGeNNT is available at https://github.com/kandlinf/agennt . Conclusions An rGNN is easier to interpret than a conventional GNN, which commonly contains proteins without enzymatic function and overly specific neighborhoods due to phylogenetic bias. The implemented filter routines and the statistic allow the user to identify those neighborhoods that are most indicative of a specific metabolic capacity. Thus, AGeNNT facilitates to distinguish and annotate functionally different members of enzyme families.
topic Sequence similarity network
SSN
Genome neighborhood network
GNN
Genome content
Enzyme function
url http://link.springer.com/article/10.1186/s12859-017-1689-6
work_keys_str_mv AT floriankandlinger agenntannotationofenzymefamiliesbymeansofrefinedneighborhoodnetworks
AT maximiliangplach agenntannotationofenzymefamiliesbymeansofrefinedneighborhoodnetworks
AT rainermerkl agenntannotationofenzymefamiliesbymeansofrefinedneighborhoodnetworks
_version_ 1725278321385668608