Analysis of two large functionally uncharacterized regions in the Methanopyrus kandleri AV19 genome

Abstract. Background: For most sequenced prokaryotic genomes, about a third of the protein coding genes annotated are "orphan proteins", that is, they lack homology to known proteins. These hypothetical genes are typically short and randomly sca...

Bibliographic Details
Main Authors: Petersen Nanna, Pedersen Corinna, Lundegaard Christiane, Jørgensen Merete, Sicheritz-Pontén Thomas, Skovgaard Marie, Jensen Lars, Ussery David
Format: Article
Language: English
Published: BMC 2003-04-01
Series: BMC Genomics
Online Access: http://www.biomedcentral.com/1471-2164/4/12
id doaj-99428d38aa564a2d8e1f9be69bbd0fd4
record_format Article
doi 10.1186/1471-2164-4-12
collection DOAJ
language English
format Article
sources DOAJ
author Petersen Nanna
Pedersen Corinna
Lundegaard Christiane
Jørgensen Merete
Sicheritz-Pontén Thomas
Skovgaard Marie
Jensen Lars
Ussery David
title Analysis of two large functionally uncharacterized regions in the Methanopyrus kandleri AV19 genome
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2003-04-01
description Abstract. Background: For most sequenced prokaryotic genomes, about a third of the protein coding genes annotated are "orphan proteins", that is, they lack homology to known proteins. These hypothetical genes are typically short and randomly scattered throughout the genome. This trend is seen for most of the bacterial and archaeal genomes published to date. Results: In contrast we have found that a large fraction of the genes coding for such orphan proteins in the Methanopyrus kandleri AV19 genome occur within two large regions. These genes have no known homologs except from other M. kandleri genes. However, analysis of their lengths, codon usage, and Ribosomal Binding Site (RBS) sequences shows that they are most likely true protein coding genes and not random open reading frames. Conclusions: Although these regions can be considered as candidates for massive lateral gene transfer, our bioinformatics analysis suggests that this is not the case. We predict many of the organism specific proteins to be transmembrane and belong to protein families that are non-randomly distributed between the regions. Consistent with this, we suggest that the two regions are most likely unrelated, and that they may be integrated plasmids.
url http://www.biomedcentral.com/1471-2164/4/12