GO-based Functional Dissimilarity of Gene Sets

<p>Abstract</p> <p>Background</p> <p>The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions...

Full description

Bibliographic Details
Main Authors: Aguilar-Ruiz Jesús S, Díaz-Díaz Norberto
Format: Article
Language:English
Published: BMC 2011-09-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/360
id doaj-ff1c9846a68d4434b9688336b836eba0
record_format Article
spelling doaj-ff1c9846a68d4434b9688336b836eba02020-11-25T01:29:38ZengBMCBMC Bioinformatics1471-21052011-09-0112136010.1186/1471-2105-12-360GO-based Functional Dissimilarity of Gene SetsAguilar-Ruiz Jesús SDíaz-Díaz Norberto<p>Abstract</p> <p>Background</p> <p>The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise distances. A single gene can encode multiple proteins that may differ in function. For each functionality, other proteins that exhibit the same activity may also participate. Therefore, an identification of the most common function for all of the genes involved in a biological process is important in evaluating the functional similarity of groups of genes and a quantification of functional coherence can helps to clarify the role of a group of genes working together.</p> <p>Results</p> <p>To implement this approach to functional assessment, we present G<smcaps>FD</smcaps> (GO-based Functional Dissimilarity), a novel dissimilarity measure for evaluating groups of genes based on the most relevant functions of the whole set. The measure assigns a numerical value to the gene set for each of the three GO sub-ontologies.</p> <p>Conclusions</p> <p>Results show that G<smcaps>FD</smcaps> performs robustly when applied to gene set of known functionality (extracted from K<smcaps>EGG</smcaps>). It performs particularly well on randomly generated gene sets. An ROC analysis reveals that the performance of G<smcaps>FD</smcaps> in evaluating the functional dissimilarity of gene sets is very satisfactory. A comparative analysis against other functional measures, such as GS<sup>2 </sup>and those presented by Resnik and Wang, also demonstrates the robustness of G<smcaps>FD</smcaps>.</p> http://www.biomedcentral.com/1471-2105/12/360
collection DOAJ
language English
format Article
sources DOAJ
author Aguilar-Ruiz Jesús S
Díaz-Díaz Norberto
spellingShingle Aguilar-Ruiz Jesús S
Díaz-Díaz Norberto
GO-based Functional Dissimilarity of Gene Sets
BMC Bioinformatics
author_facet Aguilar-Ruiz Jesús S
Díaz-Díaz Norberto
author_sort Aguilar-Ruiz Jesús S
title GO-based Functional Dissimilarity of Gene Sets
title_short GO-based Functional Dissimilarity of Gene Sets
title_full GO-based Functional Dissimilarity of Gene Sets
title_fullStr GO-based Functional Dissimilarity of Gene Sets
title_full_unstemmed GO-based Functional Dissimilarity of Gene Sets
title_sort go-based functional dissimilarity of gene sets
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-09-01
description <p>Abstract</p> <p>Background</p> <p>The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise distances. A single gene can encode multiple proteins that may differ in function. For each functionality, other proteins that exhibit the same activity may also participate. Therefore, an identification of the most common function for all of the genes involved in a biological process is important in evaluating the functional similarity of groups of genes and a quantification of functional coherence can helps to clarify the role of a group of genes working together.</p> <p>Results</p> <p>To implement this approach to functional assessment, we present G<smcaps>FD</smcaps> (GO-based Functional Dissimilarity), a novel dissimilarity measure for evaluating groups of genes based on the most relevant functions of the whole set. The measure assigns a numerical value to the gene set for each of the three GO sub-ontologies.</p> <p>Conclusions</p> <p>Results show that G<smcaps>FD</smcaps> performs robustly when applied to gene set of known functionality (extracted from K<smcaps>EGG</smcaps>). It performs particularly well on randomly generated gene sets. An ROC analysis reveals that the performance of G<smcaps>FD</smcaps> in evaluating the functional dissimilarity of gene sets is very satisfactory. A comparative analysis against other functional measures, such as GS<sup>2 </sup>and those presented by Resnik and Wang, also demonstrates the robustness of G<smcaps>FD</smcaps>.</p>
url http://www.biomedcentral.com/1471-2105/12/360
work_keys_str_mv AT aguilarruizjesuss gobasedfunctionaldissimilarityofgenesets
AT diazdiaznorberto gobasedfunctionaldissimilarityofgenesets
_version_ 1725095812582604800