Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes

Abstract In the fight to limit the global spread of antibiotic resistance, the assembly of environmental metagenomes has the potential to provide rich contextual information (e.g., taxonomic hosts, carriage on mobile genetic elements) about antibiotic resistance genes (ARG) in the environment. Howev...

Full description

Bibliographic Details
Main Authors: Connor L. Brown, Ishi M. Keenum, Dongjuan Dai, Liqing Zhang, Peter J. Vikesland, Amy Pruden
Format: Article
Language:English
Published: Nature Publishing Group 2021-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-83081-8
id doaj-2b75f9d0f0ba43eebd37b5e835d68635
record_format Article
spelling doaj-2b75f9d0f0ba43eebd37b5e835d686352021-02-14T12:33:55ZengNature Publishing GroupScientific Reports2045-23222021-02-0111111210.1038/s41598-021-83081-8Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomesConnor L. Brown0Ishi M. Keenum1Dongjuan Dai2Liqing Zhang3Peter J. Vikesland4Amy Pruden5Genetics, Bioinformatics, and Computational Biology, Virginia TechDepartment of Civil & Environmental Engineering, Virginia TechDepartment of Civil & Environmental Engineering, Virginia TechGenetics, Bioinformatics, and Computational Biology, Virginia TechDepartment of Civil & Environmental Engineering, Virginia TechDepartment of Civil & Environmental Engineering, Virginia TechAbstract In the fight to limit the global spread of antibiotic resistance, the assembly of environmental metagenomes has the potential to provide rich contextual information (e.g., taxonomic hosts, carriage on mobile genetic elements) about antibiotic resistance genes (ARG) in the environment. However, computational challenges associated with assembly can impact the accuracy of downstream analyses. This work critically evaluates the impact of assembly leveraging short reads, nanopore MinION long-reads, and a combination of the two (hybrid) on ARG contextualization for ten environmental metagenomes using seven prominent assemblers (IDBA-UD, MEGAHIT, Canu, Flye, Opera-MS, metaSpades and HybridSpades). While short-read and hybrid assemblies produced similar patterns of ARG contextualization, raw or assembled long nanopore reads produced distinct patterns. Based on an in-silico spike-in experiment using real and simulated reads, we show that low to intermediate coverage species are more likely to be incorporated into chimeric contigs across all assemblers and sequencing technologies, while more abundant species produce assemblies with a greater frequency of inversions and insertion/deletions (indels). In sum, our analyses support hybrid assembly as a valuable technique for boosting the reliability and accuracy of assembly-based analyses of ARGs and neighboring genes at environmentally-relevant coverages, provided that sufficient short-read sequencing depth is achieved.https://doi.org/10.1038/s41598-021-83081-8
collection DOAJ
language English
format Article
sources DOAJ
author Connor L. Brown
Ishi M. Keenum
Dongjuan Dai
Liqing Zhang
Peter J. Vikesland
Amy Pruden
spellingShingle Connor L. Brown
Ishi M. Keenum
Dongjuan Dai
Liqing Zhang
Peter J. Vikesland
Amy Pruden
Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes
Scientific Reports
author_facet Connor L. Brown
Ishi M. Keenum
Dongjuan Dai
Liqing Zhang
Peter J. Vikesland
Amy Pruden
author_sort Connor L. Brown
title Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes
title_short Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes
title_full Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes
title_fullStr Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes
title_full_unstemmed Critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes
title_sort critical evaluation of short, long, and hybrid assembly for contextual analysis of antibiotic resistance genes in complex environmental metagenomes
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-02-01
description Abstract In the fight to limit the global spread of antibiotic resistance, the assembly of environmental metagenomes has the potential to provide rich contextual information (e.g., taxonomic hosts, carriage on mobile genetic elements) about antibiotic resistance genes (ARG) in the environment. However, computational challenges associated with assembly can impact the accuracy of downstream analyses. This work critically evaluates the impact of assembly leveraging short reads, nanopore MinION long-reads, and a combination of the two (hybrid) on ARG contextualization for ten environmental metagenomes using seven prominent assemblers (IDBA-UD, MEGAHIT, Canu, Flye, Opera-MS, metaSpades and HybridSpades). While short-read and hybrid assemblies produced similar patterns of ARG contextualization, raw or assembled long nanopore reads produced distinct patterns. Based on an in-silico spike-in experiment using real and simulated reads, we show that low to intermediate coverage species are more likely to be incorporated into chimeric contigs across all assemblers and sequencing technologies, while more abundant species produce assemblies with a greater frequency of inversions and insertion/deletions (indels). In sum, our analyses support hybrid assembly as a valuable technique for boosting the reliability and accuracy of assembly-based analyses of ARGs and neighboring genes at environmentally-relevant coverages, provided that sufficient short-read sequencing depth is achieved.
url https://doi.org/10.1038/s41598-021-83081-8
work_keys_str_mv AT connorlbrown criticalevaluationofshortlongandhybridassemblyforcontextualanalysisofantibioticresistancegenesincomplexenvironmentalmetagenomes
AT ishimkeenum criticalevaluationofshortlongandhybridassemblyforcontextualanalysisofantibioticresistancegenesincomplexenvironmentalmetagenomes
AT dongjuandai criticalevaluationofshortlongandhybridassemblyforcontextualanalysisofantibioticresistancegenesincomplexenvironmentalmetagenomes
AT liqingzhang criticalevaluationofshortlongandhybridassemblyforcontextualanalysisofantibioticresistancegenesincomplexenvironmentalmetagenomes
AT peterjvikesland criticalevaluationofshortlongandhybridassemblyforcontextualanalysisofantibioticresistancegenesincomplexenvironmentalmetagenomes
AT amypruden criticalevaluationofshortlongandhybridassemblyforcontextualanalysisofantibioticresistancegenesincomplexenvironmentalmetagenomes
_version_ 1724270259708362752