Identification of Unannotated Small Genes in Salmonella
Increasing evidence indicates that many, if not all, small genes encoding proteins ≤100 aa are missing in annotations of bacterial genomes currently available. To uncover unannotated small genes in the model bacterium Salmonella enterica Typhimurium 14028s, we used the genomic technique ribosome pro...
Main Authors: | Jonghwan Baek, Jiyoung Lee, Kihoon Yoon, Hyunwoo Lee |
---|---|
Format: | Article |
Language: | English |
Published: | Oxford University Press, 2017-03-01 |
Series: | G3: Genes, Genomes, Genetics |
Subjects: | genome annotation, ribosome profiling, small proteins, small genes, short ORF |
Online Access: | http://g3journal.org/lookup/doi/10.1534/g3.116.036939 |
id |
doaj-f6a5fb71c18d46489b1065018cf9ec3a |
---|---|
record_format |
Article |
spelling |
doaj-f6a5fb71c18d46489b1065018cf9ec3a 2021-07-02T06:25:03Z eng Oxford University Press G3: Genes, Genomes, Genetics 2160-1836 2017-03-01 7 3 983 989 10.1534/g3.116.036939 21 Identification of Unannotated Small Genes in Salmonella Jonghwan Baek Jiyoung Lee Kihoon Yoon Hyunwoo Lee Increasing evidence indicates that many, if not all, small genes encoding proteins ≤100 aa are missing in annotations of bacterial genomes currently available. To uncover unannotated small genes in the model bacterium Salmonella enterica Typhimurium 14028s, we used the genomic technique ribosome profiling, which provides a snapshot of all mRNAs being translated (translatome) in a given growth condition. For comprehensive identification of unannotated small genes, we obtained Salmonella translatomes from four different growth conditions: LB, MOPS rich defined medium, and two infection-relevant conditions, low Mg²⁺ (10 µM) and low pH (5.8). To facilitate the identification of small genes, ribosome profiling data were analyzed in combination with in silico predicted putative open reading frames and transcriptome profiles. As a result, we uncovered 130 unannotated ORFs. Of them, 98% were small ORFs putatively encoding peptides/proteins ≤100 aa, and some of them were only expressed in the infection-relevant low Mg²⁺ and/or low pH condition. We validated the expression of 25 of these ORFs by western blot, including the smallest, which encodes a peptide of 7 aa residues. Our results suggest that many sequenced bacterial genomes are underannotated with regard to small genes and their gene annotations need to be revised. http://g3journal.org/lookup/doi/10.1534/g3.116.036939 genome annotation ribosome profiling small proteins small genes short ORF |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jonghwan Baek Jiyoung Lee Kihoon Yoon Hyunwoo Lee |
author_sort |
Jonghwan Baek |
title |
Identification of Unannotated Small Genes in Salmonella |
title_sort |
identification of unannotated small genes in salmonella |
publisher |
Oxford University Press |
series |
G3: Genes, Genomes, Genetics |
issn |
2160-1836 |
publishDate |
2017-03-01 |
description |
Increasing evidence indicates that many, if not all, small genes encoding proteins ≤100 aa are missing in annotations of bacterial genomes currently available. To uncover unannotated small genes in the model bacterium Salmonella enterica Typhimurium 14028s, we used the genomic technique ribosome profiling, which provides a snapshot of all mRNAs being translated (translatome) in a given growth condition. For comprehensive identification of unannotated small genes, we obtained Salmonella translatomes from four different growth conditions: LB, MOPS rich defined medium, and two infection-relevant conditions, low Mg²⁺ (10 µM) and low pH (5.8). To facilitate the identification of small genes, ribosome profiling data were analyzed in combination with in silico predicted putative open reading frames and transcriptome profiles. As a result, we uncovered 130 unannotated ORFs. Of them, 98% were small ORFs putatively encoding peptides/proteins ≤100 aa, and some of them were only expressed in the infection-relevant low Mg²⁺ and/or low pH condition. We validated the expression of 25 of these ORFs by western blot, including the smallest, which encodes a peptide of 7 aa residues. Our results suggest that many sequenced bacterial genomes are underannotated with regard to small genes and their gene annotations need to be revised. |
topic |
genome annotation ribosome profiling small proteins small genes short ORF |
url |
http://g3journal.org/lookup/doi/10.1534/g3.116.036939 |