Effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?

Abstract Background Next generation sequencing (NGS) can recover DNA data from valuable extant and extinct museum specimens. However, archived or preserved DNA is difficult to sequence because of its fragmented, damaged nature, such that the most successful NGS methods for preserved specimens remain...

Full description

Bibliographic Details
Main Author: Angela McGaughran
Format: Article
Language:English
Published: BMC 2020-02-01
Series:BMC Genomics
Subjects:
NGS
Online Access:http://link.springer.com/article/10.1186/s12864-020-6594-0
id doaj-66fd8618865949f3b000128222c40796
record_format Article
spelling doaj-66fd8618865949f3b000128222c407962020-11-25T01:24:55ZengBMCBMC Genomics1471-21642020-02-0121111010.1186/s12864-020-6594-0Effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?Angela McGaughran0Australian National University, Research School of Biology, Division of Ecology and EvolutionAbstract Background Next generation sequencing (NGS) can recover DNA data from valuable extant and extinct museum specimens. However, archived or preserved DNA is difficult to sequence because of its fragmented, damaged nature, such that the most successful NGS methods for preserved specimens remain sub-optimal. Improving wet-lab protocols and comprehensively determining the effects of sample age on NGS library quality are therefore of vital importance. Here, I examine the relationship between sample age and several indicators of library quality following targeted NGS sequencing of ~ 1300 loci using 271 samples of pinned moth specimens (Helicoverpa armigera) ranging in age from 5 to 117 years. Results I find that older samples have lower DNA concentrations following extraction and thus require a higher number of indexing PCR cycles during library preparation. When sequenced reads are aligned to a reference genome or to only the targeted region, older samples have a lower number of sequenced and mapped reads, lower mean coverage, and lower estimated library sizes, while the percentage of adapters in sequenced reads increases significantly as samples become older. Older samples also show the poorest capture success, with lower enrichment and a higher improved coverage anticipated from further sequencing. Conclusions Sample age has significant, measurable impacts on the quality of NGS data following targeted enrichment. However, incorporating a uracil-removing enzyme into the blunt end-repair step during library preparation could help to repair DNA damage, and using a method that prevents adapter-dimer formation may result in improved data yields.http://link.springer.com/article/10.1186/s12864-020-6594-0Historical DNALibrary qualityMuseum genomicsNGSTargeted capture
collection DOAJ
language English
format Article
sources DOAJ
author Angela McGaughran
spellingShingle Angela McGaughran
Effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?
BMC Genomics
Historical DNA
Library quality
Museum genomics
NGS
Targeted capture
author_facet Angela McGaughran
author_sort Angela McGaughran
title Effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?
title_short Effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?
title_full Effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?
title_fullStr Effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?
title_full_unstemmed Effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?
title_sort effects of sample age on data quality from targeted sequencing of museum specimens: what are we capturing in time?
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2020-02-01
description Abstract Background Next generation sequencing (NGS) can recover DNA data from valuable extant and extinct museum specimens. However, archived or preserved DNA is difficult to sequence because of its fragmented, damaged nature, such that the most successful NGS methods for preserved specimens remain sub-optimal. Improving wet-lab protocols and comprehensively determining the effects of sample age on NGS library quality are therefore of vital importance. Here, I examine the relationship between sample age and several indicators of library quality following targeted NGS sequencing of ~ 1300 loci using 271 samples of pinned moth specimens (Helicoverpa armigera) ranging in age from 5 to 117 years. Results I find that older samples have lower DNA concentrations following extraction and thus require a higher number of indexing PCR cycles during library preparation. When sequenced reads are aligned to a reference genome or to only the targeted region, older samples have a lower number of sequenced and mapped reads, lower mean coverage, and lower estimated library sizes, while the percentage of adapters in sequenced reads increases significantly as samples become older. Older samples also show the poorest capture success, with lower enrichment and a higher improved coverage anticipated from further sequencing. Conclusions Sample age has significant, measurable impacts on the quality of NGS data following targeted enrichment. However, incorporating a uracil-removing enzyme into the blunt end-repair step during library preparation could help to repair DNA damage, and using a method that prevents adapter-dimer formation may result in improved data yields.
topic Historical DNA
Library quality
Museum genomics
NGS
Targeted capture
url http://link.springer.com/article/10.1186/s12864-020-6594-0
work_keys_str_mv AT angelamcgaughran effectsofsampleageondataqualityfromtargetedsequencingofmuseumspecimenswhatarewecapturingintime
_version_ 1725116301557366784