Genome majority vote improves gene predictions.

Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome...

Full description

Bibliographic Details
Main Authors: Michael E Wall, Sindhu Raghavan, Judith D Cohn, John Dunbar
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2011-11-01
Series:PLoS Computational Biology
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22131910/?tool=EBI
id doaj-d0ed2a5550804a92ae45ece1ed78b69f
record_format Article
spelling doaj-d0ed2a5550804a92ae45ece1ed78b69f2021-04-21T15:09:54ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582011-11-01711e100228410.1371/journal.pcbi.1002284Genome majority vote improves gene predictions.Michael E WallSindhu RaghavanJudith D CohnJohn DunbarRecent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome majority vote (GMV) scheme to increase the consistency of gene starts among orthologs. We used a set of validated Escherichia coli genes as a standard to quantify accuracy. Results showed that the GMV algorithm can correct hundreds of gene prediction errors in sets of five or ten genomes while introducing few errors. Using a conservative calculation, we project that GMV would resolve many inconsistencies and errors in publicly available microbial gene maps. Our simple and logical solution provides a notable advance toward accurate gene maps.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22131910/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Michael E Wall
Sindhu Raghavan
Judith D Cohn
John Dunbar
spellingShingle Michael E Wall
Sindhu Raghavan
Judith D Cohn
John Dunbar
Genome majority vote improves gene predictions.
PLoS Computational Biology
author_facet Michael E Wall
Sindhu Raghavan
Judith D Cohn
John Dunbar
author_sort Michael E Wall
title Genome majority vote improves gene predictions.
title_short Genome majority vote improves gene predictions.
title_full Genome majority vote improves gene predictions.
title_fullStr Genome majority vote improves gene predictions.
title_full_unstemmed Genome majority vote improves gene predictions.
title_sort genome majority vote improves gene predictions.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2011-11-01
description Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome majority vote (GMV) scheme to increase the consistency of gene starts among orthologs. We used a set of validated Escherichia coli genes as a standard to quantify accuracy. Results showed that the GMV algorithm can correct hundreds of gene prediction errors in sets of five or ten genomes while introducing few errors. Using a conservative calculation, we project that GMV would resolve many inconsistencies and errors in publicly available microbial gene maps. Our simple and logical solution provides a notable advance toward accurate gene maps.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22131910/?tool=EBI
work_keys_str_mv AT michaelewall genomemajorityvoteimprovesgenepredictions
AT sindhuraghavan genomemajorityvoteimprovesgenepredictions
AT judithdcohn genomemajorityvoteimprovesgenepredictions
AT johndunbar genomemajorityvoteimprovesgenepredictions
_version_ 1714667963042234368