Identification of Unannotated Small Genes in Salmonella

Increasing evidence indicates that many, if not all, small genes encoding proteins ≤100 aa are missing in annotations of bacterial genomes currently available. To uncover unannotated small genes in the model bacterium Salmonella enterica Typhimurium 14028s, we used the genomic technique ribosome pro...

Bibliographic Details
Main Authors: Jonghwan Baek, Jiyoung Lee, Kihoon Yoon, Hyunwoo Lee
Format: Article
Language: English
Published: Oxford University Press 2017-03-01
Series: G3: Genes, Genomes, Genetics
Subjects: genome annotation; ribosome profiling; small proteins; small genes; short ORF
Online Access: http://g3journal.org/lookup/doi/10.1534/g3.116.036939
id doaj-f6a5fb71c18d46489b1065018cf9ec3a
record_format Article
spelling doaj-f6a5fb71c18d46489b1065018cf9ec3a 2021-07-02T06:25:03Z eng Oxford University Press G3: Genes, Genomes, Genetics 2160-1836 2017-03-01 7 3 983 989 10.1534/g3.116.036939 21 Identification of Unannotated Small Genes in Salmonella Jonghwan Baek Jiyoung Lee Kihoon Yoon Hyunwoo Lee Increasing evidence indicates that many, if not all, small genes encoding proteins ≤100 aa are missing in annotations of bacterial genomes currently available. To uncover unannotated small genes in the model bacterium Salmonella enterica Typhimurium 14028s, we used the genomic technique ribosome profiling, which provides a snapshot of all mRNAs being translated (translatome) in a given growth condition. For comprehensive identification of unannotated small genes, we obtained Salmonella translatomes from four different growth conditions: LB, MOPS rich defined medium, and two infection-relevant conditions low Mg2+ (10 µM) and low pH (5.8). To facilitate the identification of small genes, ribosome profiling data were analyzed in combination with in silico predicted putative open reading frames and transcriptome profiles. As a result, we uncovered 130 unannotated ORFs. Of them, 98% were small ORFs putatively encoding peptides/proteins ≤100 aa, and some of them were only expressed in the infection-relevant low Mg2+ and/or low pH condition. We validated the expression of 25 of these ORFs by western blot, including the smallest, which encodes a peptide of 7 aa residues. Our results suggest that many sequenced bacterial genomes are underannotated with regard to small genes and their gene annotations need to be revised. http://g3journal.org/lookup/doi/10.1534/g3.116.036939 genome annotation ribosome profiling small proteins small genes short ORF
collection DOAJ
language English
format Article
sources DOAJ
author Jonghwan Baek
Jiyoung Lee
Kihoon Yoon
Hyunwoo Lee
spellingShingle Jonghwan Baek
Jiyoung Lee
Kihoon Yoon
Hyunwoo Lee
Identification of Unannotated Small Genes in Salmonella
G3: Genes, Genomes, Genetics
genome annotation
ribosome profiling
small proteins
small genes
short ORF
author_facet Jonghwan Baek
Jiyoung Lee
Kihoon Yoon
Hyunwoo Lee
author_sort Jonghwan Baek
title Identification of Unannotated Small Genes in Salmonella
title_short Identification of Unannotated Small Genes in Salmonella
title_full Identification of Unannotated Small Genes in Salmonella
title_fullStr Identification of Unannotated Small Genes in Salmonella
title_full_unstemmed Identification of Unannotated Small Genes in Salmonella
title_sort identification of unannotated small genes in salmonella
publisher Oxford University Press
series G3: Genes, Genomes, Genetics
issn 2160-1836
publishDate 2017-03-01
description Increasing evidence indicates that many, if not all, small genes encoding proteins ≤100 aa are missing in annotations of bacterial genomes currently available. To uncover unannotated small genes in the model bacterium Salmonella enterica Typhimurium 14028s, we used the genomic technique ribosome profiling, which provides a snapshot of all mRNAs being translated (translatome) in a given growth condition. For comprehensive identification of unannotated small genes, we obtained Salmonella translatomes from four different growth conditions: LB, MOPS rich defined medium, and two infection-relevant conditions low Mg2+ (10 µM) and low pH (5.8). To facilitate the identification of small genes, ribosome profiling data were analyzed in combination with in silico predicted putative open reading frames and transcriptome profiles. As a result, we uncovered 130 unannotated ORFs. Of them, 98% were small ORFs putatively encoding peptides/proteins ≤100 aa, and some of them were only expressed in the infection-relevant low Mg2+ and/or low pH condition. We validated the expression of 25 of these ORFs by western blot, including the smallest, which encodes a peptide of 7 aa residues. Our results suggest that many sequenced bacterial genomes are underannotated with regard to small genes and their gene annotations need to be revised.
topic genome annotation
ribosome profiling
small proteins
small genes
short ORF
url http://g3journal.org/lookup/doi/10.1534/g3.116.036939
work_keys_str_mv AT jonghwanbaek identificationofunannotatedsmallgenesinsalmonella
AT jiyounglee identificationofunannotatedsmallgenesinsalmonella
AT kihoonyoon identificationofunannotatedsmallgenesinsalmonella
AT hyunwoolee identificationofunannotatedsmallgenesinsalmonella
_version_ 1721337316449976320