A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform

Abstract Background Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the...

Full description

Bibliographic Details
Main Authors:	Eric J. de Muinck, Pål Trosvik, Gregor D. Gilfillan, Johannes R. Hov, Arvind Y. M. Sundaram
Format:	Article
Language:	English
Published:	BMC 2017-07-01
Series:	Microbiome
Subjects:	16S rRNA gene amplicon sequencing Illumina library preparation Indexed PCR Mock community Environmental sequencing Benchmarking
Online Access:	http://link.springer.com/article/10.1186/s40168-017-0279-1

id	doaj-b975e433f4f74d3589795b9e12907553
record_format	Article
spelling	doaj-b975e433f4f74d3589795b9e129075532020-11-25T00:42:35ZengBMCMicrobiome2049-26182017-07-015111510.1186/s40168-017-0279-1A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platformEric J. de Muinck0Pål Trosvik1Gregor D. Gilfillan2Johannes R. Hov3Arvind Y. M. Sundaram4Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of OsloCentre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of OsloDepartment of Medical Genetics, Oslo University Hospital and University of OsloNorwegian PSC Research Center and Research Institute of Internal Medicine, Oslo University Hospital Rikshospitalet and University of OsloDepartment of Medical Genetics, Oslo University Hospital and University of OsloAbstract Background Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the techniques so that potential sources of bias can be better characterized. Methods We present a triple-index amplicon sequencing strategy to sequence large numbers of samples at significantly lower c ost and in a shorter timeframe compared to existing methods. The design employs a two-stage PCR protocol, incorpo rating three barcodes to each sample, with the possibility to add a fourth-index. It also includes heterogeneity spacers to overcome low complexity issues faced when sequencing amplicons on Illumina platforms. Results The library preparation method was extensively benchmarked through analysis of a mock community in order to assess biases introduced by sample indexing, number of PCR cycles, and template concentration. We further evaluated the method through re-sequencing of a standardized environmental sample. Finally, we evaluated our protocol on a set of fecal samples from a small cohort of healthy adults, demonstrating good performance in a realistic experimental setting. Between-sample variation was mainly related to batch effects, such as DNA extraction, while sample indexing was also a significant source of bias. PCR cycle number strongly influenced chimera formation and affected relative abundance estimates of species with high GC content. Libraries were sequenced using the Illumina HiSeq and MiSeq platforms to demonstrate that this protocol is highly scalable to sequence thousands of samples at a very low cost. Conclusions Here, we provide the most comprehensive study of performance and bias inherent to a 16S rRNA gene amplicon sequencing method to date. Triple-indexing greatly reduces the number of long custom DNA oligos required for library preparation, while the inclusion of variable length heterogeneity spacers minimizes the need for PhiX spike-in. This design results in a significant cost reduction of highly multiplexed amplicon sequencing. The biases we characterize highlight the need for highly standardized protocols. Reassuringly, we find that the biological signal is a far stronger structuring factor than the various sources of bias.http://link.springer.com/article/10.1186/s40168-017-0279-116S rRNA gene amplicon sequencingIllumina library preparationIndexed PCRMock communityEnvironmental sequencingBenchmarking
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Eric J. de Muinck Pål Trosvik Gregor D. Gilfillan Johannes R. Hov Arvind Y. M. Sundaram
spellingShingle	Eric J. de Muinck Pål Trosvik Gregor D. Gilfillan Johannes R. Hov Arvind Y. M. Sundaram A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform Microbiome 16S rRNA gene amplicon sequencing Illumina library preparation Indexed PCR Mock community Environmental sequencing Benchmarking
author_facet	Eric J. de Muinck Pål Trosvik Gregor D. Gilfillan Johannes R. Hov Arvind Y. M. Sundaram
author_sort	Eric J. de Muinck
title	A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform
title_short	A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform
title_full	A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform
title_fullStr	A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform
title_full_unstemmed	A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform
title_sort	novel ultra high-throughput 16s rrna gene amplicon sequencing library preparation method for the illumina hiseq platform
publisher	BMC
series	Microbiome
issn	2049-2618
publishDate	2017-07-01
description	Abstract Background Advances in sequencing technologies and bioinformatics have made the analysis of microbial communities almost routine. Nonetheless, the need remains to improve on the techniques used for gathering such data, including increasing throughput while lowering cost and benchmarking the techniques so that potential sources of bias can be better characterized. Methods We present a triple-index amplicon sequencing strategy to sequence large numbers of samples at significantly lower c ost and in a shorter timeframe compared to existing methods. The design employs a two-stage PCR protocol, incorpo rating three barcodes to each sample, with the possibility to add a fourth-index. It also includes heterogeneity spacers to overcome low complexity issues faced when sequencing amplicons on Illumina platforms. Results The library preparation method was extensively benchmarked through analysis of a mock community in order to assess biases introduced by sample indexing, number of PCR cycles, and template concentration. We further evaluated the method through re-sequencing of a standardized environmental sample. Finally, we evaluated our protocol on a set of fecal samples from a small cohort of healthy adults, demonstrating good performance in a realistic experimental setting. Between-sample variation was mainly related to batch effects, such as DNA extraction, while sample indexing was also a significant source of bias. PCR cycle number strongly influenced chimera formation and affected relative abundance estimates of species with high GC content. Libraries were sequenced using the Illumina HiSeq and MiSeq platforms to demonstrate that this protocol is highly scalable to sequence thousands of samples at a very low cost. Conclusions Here, we provide the most comprehensive study of performance and bias inherent to a 16S rRNA gene amplicon sequencing method to date. Triple-indexing greatly reduces the number of long custom DNA oligos required for library preparation, while the inclusion of variable length heterogeneity spacers minimizes the need for PhiX spike-in. This design results in a significant cost reduction of highly multiplexed amplicon sequencing. The biases we characterize highlight the need for highly standardized protocols. Reassuringly, we find that the biological signal is a far stronger structuring factor than the various sources of bias.
topic	16S rRNA gene amplicon sequencing Illumina library preparation Indexed PCR Mock community Environmental sequencing Benchmarking
url	http://link.springer.com/article/10.1186/s40168-017-0279-1
work_keys_str_mv	AT ericjdemuinck anovelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT paltrosvik anovelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT gregordgilfillan anovelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT johannesrhov anovelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT arvindymsundaram anovelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT ericjdemuinck novelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT paltrosvik novelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT gregordgilfillan novelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT johannesrhov novelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform AT arvindymsundaram novelultrahighthroughput16srrnageneampliconsequencinglibrarypreparationmethodfortheilluminahiseqplatform
_version_	1725281536404619264

A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform

Similar Items