GROOLS: reactive graph reasoning for genome annotation through biological processes

Abstract Background High quality functional annotation is essential for understanding the phenotypic consequences encoded in a genome. Despite improvements in bioinformatics methods, millions of sequences in databanks are not assigned reliable functions. The curation of protein functions in the cont...

Full description

Bibliographic Details
Main Authors: Jonathan Mercier, Adrien Josso, Claudine Médigue, David Vallenet
Format: Article
Language:English
Published: BMC 2018-04-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-018-2126-1
id doaj-bd8f6ad3a72043d9a57f87a6dc99ce61
record_format Article
spelling doaj-bd8f6ad3a72043d9a57f87a6dc99ce612020-11-25T00:34:24ZengBMCBMC Bioinformatics1471-21052018-04-0119111210.1186/s12859-018-2126-1GROOLS: reactive graph reasoning for genome annotation through biological processesJonathan Mercier0Adrien Josso1Claudine Médigue2David Vallenet3LABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-SaclayLABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-SaclayLABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-SaclayLABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-SaclayAbstract Background High quality functional annotation is essential for understanding the phenotypic consequences encoded in a genome. Despite improvements in bioinformatics methods, millions of sequences in databanks are not assigned reliable functions. The curation of protein functions in the context of biological processes is a way to evaluate and improve their annotation. Results We developed an expert system using paraconsistent logic, named GROOLS (Genomic Rule Object-Oriented Logic System), that evaluates the completeness and the consistency of predicted functions through biological processes like metabolic pathways. Using a generic and hierarchical representation of knowledge, biological processes are modeled in a graph from which observations (i.e. predictions and expectations) are propagated by rules. At the end of the reasoning, conclusions are assigned to biological process components and highlight uncertainties and inconsistencies. Results on 14 microbial organisms are presented. Conclusions GROOLS software is designed to evaluate the overall accuracy of functional unit and pathway predictions according to organism experimental data like growth phenotypes. It assists biocurators in the functional annotation of proteins by focusing on missing or contradictory observations.http://link.springer.com/article/10.1186/s12859-018-2126-1Genome annotationCurationMetabolic pathwaysKnowledge representationParaconsistent logicExpert system
collection DOAJ
language English
format Article
sources DOAJ
author Jonathan Mercier
Adrien Josso
Claudine Médigue
David Vallenet
spellingShingle Jonathan Mercier
Adrien Josso
Claudine Médigue
David Vallenet
GROOLS: reactive graph reasoning for genome annotation through biological processes
BMC Bioinformatics
Genome annotation
Curation
Metabolic pathways
Knowledge representation
Paraconsistent logic
Expert system
author_facet Jonathan Mercier
Adrien Josso
Claudine Médigue
David Vallenet
author_sort Jonathan Mercier
title GROOLS: reactive graph reasoning for genome annotation through biological processes
title_short GROOLS: reactive graph reasoning for genome annotation through biological processes
title_full GROOLS: reactive graph reasoning for genome annotation through biological processes
title_fullStr GROOLS: reactive graph reasoning for genome annotation through biological processes
title_full_unstemmed GROOLS: reactive graph reasoning for genome annotation through biological processes
title_sort grools: reactive graph reasoning for genome annotation through biological processes
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2018-04-01
description Abstract Background High quality functional annotation is essential for understanding the phenotypic consequences encoded in a genome. Despite improvements in bioinformatics methods, millions of sequences in databanks are not assigned reliable functions. The curation of protein functions in the context of biological processes is a way to evaluate and improve their annotation. Results We developed an expert system using paraconsistent logic, named GROOLS (Genomic Rule Object-Oriented Logic System), that evaluates the completeness and the consistency of predicted functions through biological processes like metabolic pathways. Using a generic and hierarchical representation of knowledge, biological processes are modeled in a graph from which observations (i.e. predictions and expectations) are propagated by rules. At the end of the reasoning, conclusions are assigned to biological process components and highlight uncertainties and inconsistencies. Results on 14 microbial organisms are presented. Conclusions GROOLS software is designed to evaluate the overall accuracy of functional unit and pathway predictions according to organism experimental data like growth phenotypes. It assists biocurators in the functional annotation of proteins by focusing on missing or contradictory observations.
topic Genome annotation
Curation
Metabolic pathways
Knowledge representation
Paraconsistent logic
Expert system
url http://link.springer.com/article/10.1186/s12859-018-2126-1
work_keys_str_mv AT jonathanmercier groolsreactivegraphreasoningforgenomeannotationthroughbiologicalprocesses
AT adrienjosso groolsreactivegraphreasoningforgenomeannotationthroughbiologicalprocesses
AT claudinemedigue groolsreactivegraphreasoningforgenomeannotationthroughbiologicalprocesses
AT davidvallenet groolsreactivegraphreasoningforgenomeannotationthroughbiologicalprocesses
_version_ 1725313501947232256