A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
Abstract Background Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsu...
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2017-04-01
|
Series: | Virology Journal |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12985-017-0741-5 |
id |
doaj-7c5f2abd192f469db2106fc3a171e0fd |
---|---|
record_format |
Article |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kiril M. Dimitrov Poonam Sharma Jeremy D. Volkening Iryna V. Goraichuk Abdul Wajid Shafqat Fatima Rehmani Asma Basharat Ismaila Shittu Tony M. Joannis Patti J. Miller Claudio L. Afonso |
spellingShingle |
Kiril M. Dimitrov Poonam Sharma Jeremy D. Volkening Iryna V. Goraichuk Abdul Wajid Shafqat Fatima Rehmani Asma Basharat Ismaila Shittu Tony M. Joannis Patti J. Miller Claudio L. Afonso A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses Virology Journal Newcastle disease virus Next-generation sequencing Multiplexing Galaxy De novo assembly Multiplexing |
author_facet |
Kiril M. Dimitrov Poonam Sharma Jeremy D. Volkening Iryna V. Goraichuk Abdul Wajid Shafqat Fatima Rehmani Asma Basharat Ismaila Shittu Tony M. Joannis Patti J. Miller Claudio L. Afonso |
author_sort |
Kiril M. Dimitrov |
title |
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses |
title_short |
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses |
title_full |
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses |
title_fullStr |
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses |
title_full_unstemmed |
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses |
title_sort |
robust and cost-effective approach to sequence and analyze complete genomes of small rna viruses |
publisher |
BMC |
series |
Virology Journal |
issn |
1743-422X |
publishDate |
2017-04-01 |
description |
Abstract Background Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected variants and co-infecting agents. However, NGS is not widely used for small RNA viruses because of incorrectly perceived cost estimates and inefficient utilization of freely available bioinformatics tools. Methods In this study, we have utilized NGS-based random sequencing of total RNA combined with barcode multiplexing of libraries to quickly, effectively and simultaneously characterize the genomic sequences of multiple avian paramyxoviruses. Thirty libraries were prepared from diagnostic samples amplified in allantoic fluids and their total RNAs were sequenced in a single flow cell on an Illumina MiSeq instrument. After digital normalization, data were assembled using the MIRA assembler within a customized workflow on the Galaxy platform. Results Twenty-eight avian paramyxovirus 1 (APMV-1), one APMV-13, four avian influenza and two infectious bronchitis virus complete or nearly complete genome sequences were obtained from the single run. The 29 avian paramyxovirus genomes displayed 99.6% mean coverage based on bases with Phred quality scores of 30 or more. The lower and upper quartiles of sample median depth per position for those 29 samples were 2984 and 6894, respectively, indicating coverage across samples sufficient for deep variant analysis. Sample processing and library preparation took approximately 25–30 h, the sequencing run took 39 h, and processing through the Galaxy workflow took approximately 2–3 h. The cost of all steps, excluding labor, was estimated to be 106 USD per sample. Conclusions This work describes an efficient multiplexing NGS approach, a detailed analysis workflow, and customized tools for the characterization of the genomes of RNA viruses. The combination of multiplexing NGS technology with the Galaxy workflow platform resulted in a fast, user-friendly, and cost-efficient protocol for the simultaneous characterization of multiple full-length viral genomes. Twenty-nine full-length or near-full-length APMV genomes with a high median depth were successfully sequenced out of 30 samples. The applied de novo assembly approach also allowed identification of mixed viral populations in some of the samples. |
topic |
Newcastle disease virus Next-generation sequencing Multiplexing Galaxy De novo assembly Multiplexing |
url |
http://link.springer.com/article/10.1186/s12985-017-0741-5 |
work_keys_str_mv |
AT kirilmdimitrov arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT poonamsharma arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT jeremydvolkening arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT irynavgoraichuk arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT abdulwajid arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT shafqatfatimarehmani arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT asmabasharat arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT ismailashittu arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT tonymjoannis arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT pattijmiller arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT claudiolafonso arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT kirilmdimitrov robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT poonamsharma robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT jeremydvolkening robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT irynavgoraichuk robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT abdulwajid robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT shafqatfatimarehmani robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT asmabasharat robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT ismailashittu robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT tonymjoannis robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT pattijmiller robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses AT claudiolafonso robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses |
_version_ |
1725031419087945728 |
spelling |
doaj-7c5f2abd192f469db2106fc3a171e0fd2020-11-25T01:43:48ZengBMCVirology Journal1743-422X2017-04-0114111410.1186/s12985-017-0741-5A robust and cost-effective approach to sequence and analyze complete genomes of small RNA virusesKiril M. Dimitrov0Poonam Sharma1Jeremy D. Volkening2Iryna V. Goraichuk3Abdul Wajid4Shafqat Fatima Rehmani5Asma Basharat6Ismaila Shittu7Tony M. Joannis8Patti J. Miller9Claudio L. Afonso10Exotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDAExotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDABASE2BIOExotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDAQuality Operations Laboratory (QOL), University of Veterinary and Animal SciencesQuality Operations Laboratory (QOL), University of Veterinary and Animal SciencesQuality Operations Laboratory (QOL), University of Veterinary and Animal SciencesRegional Laboratory for Animal Influenza and other Transboundary Animal Diseases, National Veterinary Research Institute, PMB01Regional Laboratory for Animal Influenza and other Transboundary Animal Diseases, National Veterinary Research Institute, PMB01Exotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDAExotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDAAbstract Background Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected variants and co-infecting agents. However, NGS is not widely used for small RNA viruses because of incorrectly perceived cost estimates and inefficient utilization of freely available bioinformatics tools. Methods In this study, we have utilized NGS-based random sequencing of total RNA combined with barcode multiplexing of libraries to quickly, effectively and simultaneously characterize the genomic sequences of multiple avian paramyxoviruses. Thirty libraries were prepared from diagnostic samples amplified in allantoic fluids and their total RNAs were sequenced in a single flow cell on an Illumina MiSeq instrument. After digital normalization, data were assembled using the MIRA assembler within a customized workflow on the Galaxy platform. Results Twenty-eight avian paramyxovirus 1 (APMV-1), one APMV-13, four avian influenza and two infectious bronchitis virus complete or nearly complete genome sequences were obtained from the single run. The 29 avian paramyxovirus genomes displayed 99.6% mean coverage based on bases with Phred quality scores of 30 or more. The lower and upper quartiles of sample median depth per position for those 29 samples were 2984 and 6894, respectively, indicating coverage across samples sufficient for deep variant analysis. Sample processing and library preparation took approximately 25–30 h, the sequencing run took 39 h, and processing through the Galaxy workflow took approximately 2–3 h. The cost of all steps, excluding labor, was estimated to be 106 USD per sample. Conclusions This work describes an efficient multiplexing NGS approach, a detailed analysis workflow, and customized tools for the characterization of the genomes of RNA viruses. The combination of multiplexing NGS technology with the Galaxy workflow platform resulted in a fast, user-friendly, and cost-efficient protocol for the simultaneous characterization of multiple full-length viral genomes. Twenty-nine full-length or near-full-length APMV genomes with a high median depth were successfully sequenced out of 30 samples. The applied de novo assembly approach also allowed identification of mixed viral populations in some of the samples.http://link.springer.com/article/10.1186/s12985-017-0741-5Newcastle disease virusNext-generation sequencingMultiplexingGalaxyDe novo assemblyMultiplexing |