Identification of Microbial Dark Matter in Antarctic Environments

Numerous studies have applied molecular techniques to understand the diversity, evolution, and ecological function of Antarctic bacteria and archaea. One common technique is sequencing of the 16S rRNA gene, which produces a nearly quantitative profile of community membership. However, the utility of...

Full description

Bibliographic Details
Main Author: Jeff S. Bowman
Format: Article
Language:English
Published: Frontiers Media S.A. 2018-12-01
Series:Frontiers in Microbiology
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fmicb.2018.03165/full
id doaj-1d3b82a5838c4c11a64fa00d752beee7
record_format Article
spelling doaj-1d3b82a5838c4c11a64fa00d752beee72020-11-24T22:05:37ZengFrontiers Media S.A.Frontiers in Microbiology1664-302X2018-12-01910.3389/fmicb.2018.03165422712Identification of Microbial Dark Matter in Antarctic EnvironmentsJeff S. Bowman0Jeff S. Bowman1Scripps Institution of Oceanography, University of California, San Diego, La Jolla, CA, United StatesCenter for Microbiome Innovation, University of California, San Diego, La Jolla, CA, United StatesNumerous studies have applied molecular techniques to understand the diversity, evolution, and ecological function of Antarctic bacteria and archaea. One common technique is sequencing of the 16S rRNA gene, which produces a nearly quantitative profile of community membership. However, the utility of this and similar approaches is limited by what is known about the evolution, physiology, and ecology of surveyed taxa. When representative genomes are available in public databases some of this information can be gleaned from genomic studies, and automated pipelines exist to carry out this task. Here the paprica metabolic inference pipeline was used to assess how well Antarctic microbial communities are represented by the available completed genomes. The NCBI’s Sequence Read Archive (SRA) was searched for Antarctic datasets that used one of the Illumina platforms to sequence the 16S rRNA gene. These data were quality controlled and denoised to identify unique reads, then analyzed with paprica to determine the degree of overlap with the closest phylogenetic neighbor with a completely sequenced genome. While some unique reads had perfect mapping to 16S rRNA genes from completed genomes, the mean percent overlap for all mapped reads was 86.6%. When samples were grouped by environment, some environments appeared more or less well represented by the available genomes. For the domain Bacteria, seawater was particularly poorly represented with a mean overlap of 80.2%, while for the domain Archaea glacial ice was particularly poorly represented with an overlap of only 48.0% for a single sample. These findings suggest that a considerable effort is needed to improve the representation of Antarctic microbes in genome sequence databases.https://www.frontiersin.org/article/10.3389/fmicb.2018.03165/fullAntarctica16S rRNAglaciersea icecryoconitesediment
collection DOAJ
language English
format Article
sources DOAJ
author Jeff S. Bowman
Jeff S. Bowman
spellingShingle Jeff S. Bowman
Jeff S. Bowman
Identification of Microbial Dark Matter in Antarctic Environments
Frontiers in Microbiology
Antarctica
16S rRNA
glacier
sea ice
cryoconite
sediment
author_facet Jeff S. Bowman
Jeff S. Bowman
author_sort Jeff S. Bowman
title Identification of Microbial Dark Matter in Antarctic Environments
title_short Identification of Microbial Dark Matter in Antarctic Environments
title_full Identification of Microbial Dark Matter in Antarctic Environments
title_fullStr Identification of Microbial Dark Matter in Antarctic Environments
title_full_unstemmed Identification of Microbial Dark Matter in Antarctic Environments
title_sort identification of microbial dark matter in antarctic environments
publisher Frontiers Media S.A.
series Frontiers in Microbiology
issn 1664-302X
publishDate 2018-12-01
description Numerous studies have applied molecular techniques to understand the diversity, evolution, and ecological function of Antarctic bacteria and archaea. One common technique is sequencing of the 16S rRNA gene, which produces a nearly quantitative profile of community membership. However, the utility of this and similar approaches is limited by what is known about the evolution, physiology, and ecology of surveyed taxa. When representative genomes are available in public databases some of this information can be gleaned from genomic studies, and automated pipelines exist to carry out this task. Here the paprica metabolic inference pipeline was used to assess how well Antarctic microbial communities are represented by the available completed genomes. The NCBI’s Sequence Read Archive (SRA) was searched for Antarctic datasets that used one of the Illumina platforms to sequence the 16S rRNA gene. These data were quality controlled and denoised to identify unique reads, then analyzed with paprica to determine the degree of overlap with the closest phylogenetic neighbor with a completely sequenced genome. While some unique reads had perfect mapping to 16S rRNA genes from completed genomes, the mean percent overlap for all mapped reads was 86.6%. When samples were grouped by environment, some environments appeared more or less well represented by the available genomes. For the domain Bacteria, seawater was particularly poorly represented with a mean overlap of 80.2%, while for the domain Archaea glacial ice was particularly poorly represented with an overlap of only 48.0% for a single sample. These findings suggest that a considerable effort is needed to improve the representation of Antarctic microbes in genome sequence databases.
topic Antarctica
16S rRNA
glacier
sea ice
cryoconite
sediment
url https://www.frontiersin.org/article/10.3389/fmicb.2018.03165/full
work_keys_str_mv AT jeffsbowman identificationofmicrobialdarkmatterinantarcticenvironments
AT jeffsbowman identificationofmicrobialdarkmatterinantarcticenvironments
_version_ 1725825460764409856