GROOLS: reactive graph reasoning for genome annotation through biological processes
Abstract Background High quality functional annotation is essential for understanding the phenotypic consequences encoded in a genome. Despite improvements in bioinformatics methods, millions of sequences in databanks are not assigned reliable functions. The curation of protein functions in the cont...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-04-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-018-2126-1 |
id |
doaj-bd8f6ad3a72043d9a57f87a6dc99ce61 |
---|---|
record_format |
Article |
spelling |
doaj-bd8f6ad3a72043d9a57f87a6dc99ce612020-11-25T00:34:24ZengBMCBMC Bioinformatics1471-21052018-04-0119111210.1186/s12859-018-2126-1GROOLS: reactive graph reasoning for genome annotation through biological processesJonathan Mercier0Adrien Josso1Claudine Médigue2David Vallenet3LABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-SaclayLABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-SaclayLABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-SaclayLABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-SaclayAbstract Background High quality functional annotation is essential for understanding the phenotypic consequences encoded in a genome. Despite improvements in bioinformatics methods, millions of sequences in databanks are not assigned reliable functions. The curation of protein functions in the context of biological processes is a way to evaluate and improve their annotation. Results We developed an expert system using paraconsistent logic, named GROOLS (Genomic Rule Object-Oriented Logic System), that evaluates the completeness and the consistency of predicted functions through biological processes like metabolic pathways. Using a generic and hierarchical representation of knowledge, biological processes are modeled in a graph from which observations (i.e. predictions and expectations) are propagated by rules. At the end of the reasoning, conclusions are assigned to biological process components and highlight uncertainties and inconsistencies. Results on 14 microbial organisms are presented. Conclusions GROOLS software is designed to evaluate the overall accuracy of functional unit and pathway predictions according to organism experimental data like growth phenotypes. It assists biocurators in the functional annotation of proteins by focusing on missing or contradictory observations.http://link.springer.com/article/10.1186/s12859-018-2126-1Genome annotationCurationMetabolic pathwaysKnowledge representationParaconsistent logicExpert system |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jonathan Mercier Adrien Josso Claudine Médigue David Vallenet |
spellingShingle |
Jonathan Mercier Adrien Josso Claudine Médigue David Vallenet GROOLS: reactive graph reasoning for genome annotation through biological processes BMC Bioinformatics Genome annotation Curation Metabolic pathways Knowledge representation Paraconsistent logic Expert system |
author_facet |
Jonathan Mercier Adrien Josso Claudine Médigue David Vallenet |
author_sort |
Jonathan Mercier |
title |
GROOLS: reactive graph reasoning for genome annotation through biological processes |
title_short |
GROOLS: reactive graph reasoning for genome annotation through biological processes |
title_full |
GROOLS: reactive graph reasoning for genome annotation through biological processes |
title_fullStr |
GROOLS: reactive graph reasoning for genome annotation through biological processes |
title_full_unstemmed |
GROOLS: reactive graph reasoning for genome annotation through biological processes |
title_sort |
grools: reactive graph reasoning for genome annotation through biological processes |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2018-04-01 |
description |
Abstract Background High quality functional annotation is essential for understanding the phenotypic consequences encoded in a genome. Despite improvements in bioinformatics methods, millions of sequences in databanks are not assigned reliable functions. The curation of protein functions in the context of biological processes is a way to evaluate and improve their annotation. Results We developed an expert system using paraconsistent logic, named GROOLS (Genomic Rule Object-Oriented Logic System), that evaluates the completeness and the consistency of predicted functions through biological processes like metabolic pathways. Using a generic and hierarchical representation of knowledge, biological processes are modeled in a graph from which observations (i.e. predictions and expectations) are propagated by rules. At the end of the reasoning, conclusions are assigned to biological process components and highlight uncertainties and inconsistencies. Results on 14 microbial organisms are presented. Conclusions GROOLS software is designed to evaluate the overall accuracy of functional unit and pathway predictions according to organism experimental data like growth phenotypes. It assists biocurators in the functional annotation of proteins by focusing on missing or contradictory observations. |
topic |
Genome annotation Curation Metabolic pathways Knowledge representation Paraconsistent logic Expert system |
url |
http://link.springer.com/article/10.1186/s12859-018-2126-1 |
work_keys_str_mv |
AT jonathanmercier groolsreactivegraphreasoningforgenomeannotationthroughbiologicalprocesses AT adrienjosso groolsreactivegraphreasoningforgenomeannotationthroughbiologicalprocesses AT claudinemedigue groolsreactivegraphreasoningforgenomeannotationthroughbiologicalprocesses AT davidvallenet groolsreactivegraphreasoningforgenomeannotationthroughbiologicalprocesses |
_version_ |
1725313501947232256 |