Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]

Shotgun metagenomics sequencing is a powerful tool for the characterization of complex biological matrices, enabling analysis of prokaryotic and eukaryotic organisms and viruses in a single experiment, with the possibility of reconstructing de novo the whole metagenome or a set of genes of interest....

Full description

Bibliographic Details
Main Authors: Federica Cattonaro, Alessandro Spadotto, Slobodanka Radovic, Fabio Marroni
Format: Article
Language:English
Published: F1000 Research Ltd 2020-01-01
Series:F1000Research
Online Access:https://f1000research.com/articles/7-1767/v4
id doaj-17d604b8d5b9417a89c79457f13c2bdf
record_format Article
spelling doaj-17d604b8d5b9417a89c79457f13c2bdf2020-11-25T02:50:36ZengF1000 Research LtdF1000Research2046-14022020-01-01710.12688/f1000research.16804.424347Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]Federica Cattonaro0Alessandro Spadotto1Slobodanka Radovic2Fabio Marroni3IGA Technology Services Srl, Udine, Udine, 33100, ItalyIGA Technology Services Srl, Udine, Udine, 33100, ItalyIGA Technology Services Srl, Udine, Udine, 33100, ItalyIGA Technology Services Srl, Udine, Udine, 33100, ItalyShotgun metagenomics sequencing is a powerful tool for the characterization of complex biological matrices, enabling analysis of prokaryotic and eukaryotic organisms and viruses in a single experiment, with the possibility of reconstructing de novo the whole metagenome or a set of genes of interest. One of the main factors limiting the use of shotgun metagenomics on wide scale projects is the high cost associated with the approach. We set out to determine if it is possible to use shallow shotgun metagenomics to characterize complex biological matrices while reducing costs. We used a staggered mock community to estimate the optimal threshold for species detection. We measured the variation of several summary statistics simulating a decrease in sequencing depth by randomly subsampling a number of reads. The main statistics that were compared are diversity estimates, species abundance, and ability of reconstructing de novo the metagenome in terms of length and completeness. Our results show that diversity indices of complex prokaryotic, eukaryotic and viral communities can be accurately estimated with 500,000 reads or less, although particularly complex samples may require 1,000,000 reads. On the contrary, any task involving the reconstruction of the metagenome performed poorly, even with the largest simulated subsample (1,000,000 reads). The length of the reconstructed assembly was smaller than the length obtained with the full dataset, and the proportion of conserved genes that were identified in the meta-genome was drastically reduced compared to the full sample. Shallow shotgun metagenomics can be a useful tool to describe the structure of complex matrices, but it is not adequate to reconstruct—even partially—the metagenome.https://f1000research.com/articles/7-1767/v4
collection DOAJ
language English
format Article
sources DOAJ
author Federica Cattonaro
Alessandro Spadotto
Slobodanka Radovic
Fabio Marroni
spellingShingle Federica Cattonaro
Alessandro Spadotto
Slobodanka Radovic
Fabio Marroni
Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]
F1000Research
author_facet Federica Cattonaro
Alessandro Spadotto
Slobodanka Radovic
Fabio Marroni
author_sort Federica Cattonaro
title Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]
title_short Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]
title_full Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]
title_fullStr Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]
title_full_unstemmed Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]
title_sort do you cov me? effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]
publisher F1000 Research Ltd
series F1000Research
issn 2046-1402
publishDate 2020-01-01
description Shotgun metagenomics sequencing is a powerful tool for the characterization of complex biological matrices, enabling analysis of prokaryotic and eukaryotic organisms and viruses in a single experiment, with the possibility of reconstructing de novo the whole metagenome or a set of genes of interest. One of the main factors limiting the use of shotgun metagenomics on wide scale projects is the high cost associated with the approach. We set out to determine if it is possible to use shallow shotgun metagenomics to characterize complex biological matrices while reducing costs. We used a staggered mock community to estimate the optimal threshold for species detection. We measured the variation of several summary statistics simulating a decrease in sequencing depth by randomly subsampling a number of reads. The main statistics that were compared are diversity estimates, species abundance, and ability of reconstructing de novo the metagenome in terms of length and completeness. Our results show that diversity indices of complex prokaryotic, eukaryotic and viral communities can be accurately estimated with 500,000 reads or less, although particularly complex samples may require 1,000,000 reads. On the contrary, any task involving the reconstruction of the metagenome performed poorly, even with the largest simulated subsample (1,000,000 reads). The length of the reconstructed assembly was smaller than the length obtained with the full dataset, and the proportion of conserved genes that were identified in the meta-genome was drastically reduced compared to the full sample. Shallow shotgun metagenomics can be a useful tool to describe the structure of complex matrices, but it is not adequate to reconstruct—even partially—the metagenome.
url https://f1000research.com/articles/7-1767/v4
work_keys_str_mv AT federicacattonaro doyoucovmeeffectofcoveragereductiononmetagenomeshotgunsequencingstudiesversion4peerreview2approved2notapproved
AT alessandrospadotto doyoucovmeeffectofcoveragereductiononmetagenomeshotgunsequencingstudiesversion4peerreview2approved2notapproved
AT slobodankaradovic doyoucovmeeffectofcoveragereductiononmetagenomeshotgunsequencingstudiesversion4peerreview2approved2notapproved
AT fabiomarroni doyoucovmeeffectofcoveragereductiononmetagenomeshotgunsequencingstudiesversion4peerreview2approved2notapproved
_version_ 1724737710989508608