A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses

Abstract Background Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsu...

Full description

Bibliographic Details
Main Authors: Kiril M. Dimitrov, Poonam Sharma, Jeremy D. Volkening, Iryna V. Goraichuk, Abdul Wajid, Shafqat Fatima Rehmani, Asma Basharat, Ismaila Shittu, Tony M. Joannis, Patti J. Miller, Claudio L. Afonso
Format: Article
Language:English
Published: BMC 2017-04-01
Series:Virology Journal
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12985-017-0741-5
id doaj-7c5f2abd192f469db2106fc3a171e0fd
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Kiril M. Dimitrov
Poonam Sharma
Jeremy D. Volkening
Iryna V. Goraichuk
Abdul Wajid
Shafqat Fatima Rehmani
Asma Basharat
Ismaila Shittu
Tony M. Joannis
Patti J. Miller
Claudio L. Afonso
spellingShingle Kiril M. Dimitrov
Poonam Sharma
Jeremy D. Volkening
Iryna V. Goraichuk
Abdul Wajid
Shafqat Fatima Rehmani
Asma Basharat
Ismaila Shittu
Tony M. Joannis
Patti J. Miller
Claudio L. Afonso
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
Virology Journal
Newcastle disease virus
Next-generation sequencing
Multiplexing
Galaxy
De novo assembly
Multiplexing
author_facet Kiril M. Dimitrov
Poonam Sharma
Jeremy D. Volkening
Iryna V. Goraichuk
Abdul Wajid
Shafqat Fatima Rehmani
Asma Basharat
Ismaila Shittu
Tony M. Joannis
Patti J. Miller
Claudio L. Afonso
author_sort Kiril M. Dimitrov
title A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
title_short A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
title_full A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
title_fullStr A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
title_full_unstemmed A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
title_sort robust and cost-effective approach to sequence and analyze complete genomes of small rna viruses
publisher BMC
series Virology Journal
issn 1743-422X
publishDate 2017-04-01
description Abstract Background Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected variants and co-infecting agents. However, NGS is not widely used for small RNA viruses because of incorrectly perceived cost estimates and inefficient utilization of freely available bioinformatics tools. Methods In this study, we have utilized NGS-based random sequencing of total RNA combined with barcode multiplexing of libraries to quickly, effectively and simultaneously characterize the genomic sequences of multiple avian paramyxoviruses. Thirty libraries were prepared from diagnostic samples amplified in allantoic fluids and their total RNAs were sequenced in a single flow cell on an Illumina MiSeq instrument. After digital normalization, data were assembled using the MIRA assembler within a customized workflow on the Galaxy platform. Results Twenty-eight avian paramyxovirus 1 (APMV-1), one APMV-13, four avian influenza and two infectious bronchitis virus complete or nearly complete genome sequences were obtained from the single run. The 29 avian paramyxovirus genomes displayed 99.6% mean coverage based on bases with Phred quality scores of 30 or more. The lower and upper quartiles of sample median depth per position for those 29 samples were 2984 and 6894, respectively, indicating coverage across samples sufficient for deep variant analysis. Sample processing and library preparation took approximately 25–30 h, the sequencing run took 39 h, and processing through the Galaxy workflow took approximately 2–3 h. The cost of all steps, excluding labor, was estimated to be 106 USD per sample. Conclusions This work describes an efficient multiplexing NGS approach, a detailed analysis workflow, and customized tools for the characterization of the genomes of RNA viruses. The combination of multiplexing NGS technology with the Galaxy workflow platform resulted in a fast, user-friendly, and cost-efficient protocol for the simultaneous characterization of multiple full-length viral genomes. Twenty-nine full-length or near-full-length APMV genomes with a high median depth were successfully sequenced out of 30 samples. The applied de novo assembly approach also allowed identification of mixed viral populations in some of the samples.
topic Newcastle disease virus
Next-generation sequencing
Multiplexing
Galaxy
De novo assembly
Multiplexing
url http://link.springer.com/article/10.1186/s12985-017-0741-5
work_keys_str_mv AT kirilmdimitrov arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT poonamsharma arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT jeremydvolkening arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT irynavgoraichuk arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT abdulwajid arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT shafqatfatimarehmani arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT asmabasharat arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT ismailashittu arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT tonymjoannis arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT pattijmiller arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT claudiolafonso arobustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT kirilmdimitrov robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT poonamsharma robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT jeremydvolkening robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT irynavgoraichuk robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT abdulwajid robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT shafqatfatimarehmani robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT asmabasharat robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT ismailashittu robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT tonymjoannis robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT pattijmiller robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
AT claudiolafonso robustandcosteffectiveapproachtosequenceandanalyzecompletegenomesofsmallrnaviruses
_version_ 1725031419087945728
spelling doaj-7c5f2abd192f469db2106fc3a171e0fd2020-11-25T01:43:48ZengBMCVirology Journal1743-422X2017-04-0114111410.1186/s12985-017-0741-5A robust and cost-effective approach to sequence and analyze complete genomes of small RNA virusesKiril M. Dimitrov0Poonam Sharma1Jeremy D. Volkening2Iryna V. Goraichuk3Abdul Wajid4Shafqat Fatima Rehmani5Asma Basharat6Ismaila Shittu7Tony M. Joannis8Patti J. Miller9Claudio L. Afonso10Exotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDAExotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDABASE2BIOExotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDAQuality Operations Laboratory (QOL), University of Veterinary and Animal SciencesQuality Operations Laboratory (QOL), University of Veterinary and Animal SciencesQuality Operations Laboratory (QOL), University of Veterinary and Animal SciencesRegional Laboratory for Animal Influenza and other Transboundary Animal Diseases, National Veterinary Research Institute, PMB01Regional Laboratory for Animal Influenza and other Transboundary Animal Diseases, National Veterinary Research Institute, PMB01Exotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDAExotic and Emerging Avian Viral Diseases Research Unit, Southeast Poultry Research Laboratory, US National Poultry Research Center, Agricultural Research Service, USDAAbstract Background Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected variants and co-infecting agents. However, NGS is not widely used for small RNA viruses because of incorrectly perceived cost estimates and inefficient utilization of freely available bioinformatics tools. Methods In this study, we have utilized NGS-based random sequencing of total RNA combined with barcode multiplexing of libraries to quickly, effectively and simultaneously characterize the genomic sequences of multiple avian paramyxoviruses. Thirty libraries were prepared from diagnostic samples amplified in allantoic fluids and their total RNAs were sequenced in a single flow cell on an Illumina MiSeq instrument. After digital normalization, data were assembled using the MIRA assembler within a customized workflow on the Galaxy platform. Results Twenty-eight avian paramyxovirus 1 (APMV-1), one APMV-13, four avian influenza and two infectious bronchitis virus complete or nearly complete genome sequences were obtained from the single run. The 29 avian paramyxovirus genomes displayed 99.6% mean coverage based on bases with Phred quality scores of 30 or more. The lower and upper quartiles of sample median depth per position for those 29 samples were 2984 and 6894, respectively, indicating coverage across samples sufficient for deep variant analysis. Sample processing and library preparation took approximately 25–30 h, the sequencing run took 39 h, and processing through the Galaxy workflow took approximately 2–3 h. The cost of all steps, excluding labor, was estimated to be 106 USD per sample. Conclusions This work describes an efficient multiplexing NGS approach, a detailed analysis workflow, and customized tools for the characterization of the genomes of RNA viruses. The combination of multiplexing NGS technology with the Galaxy workflow platform resulted in a fast, user-friendly, and cost-efficient protocol for the simultaneous characterization of multiple full-length viral genomes. Twenty-nine full-length or near-full-length APMV genomes with a high median depth were successfully sequenced out of 30 samples. The applied de novo assembly approach also allowed identification of mixed viral populations in some of the samples.http://link.springer.com/article/10.1186/s12985-017-0741-5Newcastle disease virusNext-generation sequencingMultiplexingGalaxyDe novo assemblyMultiplexing