Targeted long-read sequencing of a locus under long-term balancing selection in Capsella

Yes === Rapid advances in short-read DNA sequencing technologies have revolutionized population genomic studies, but there are genomic regions where this technology reaches its limits. Limitations mostly arise due to the difficulties in assembly or alignment to genomic regions of high sequence diver...

Full description

Bibliographic Details
Main Authors: Bachmann, J.A., Tedder, Andrew, Laenen, B., Steige, K.A., Slotte, T.
Language:en
Published: 2019
Subjects:
Online Access:http://hdl.handle.net/10454/17277
id ndltd-BRADFORD-oai-bradscholars.brad.ac.uk-10454-17277
record_format oai_dc
spelling ndltd-BRADFORD-oai-bradscholars.brad.ac.uk-10454-172772019-10-03T03:01:56Z Targeted long-read sequencing of a locus under long-term balancing selection in Capsella Bachmann, J.A. Tedder, Andrew Laenen, B. Steige, K.A. Slotte, T. Single-molecule real-time sequencing Bacterial artificial chromosomes Sequencing errors Assembly Self-incompatibility locus Capsella Brassicaceae Yes Rapid advances in short-read DNA sequencing technologies have revolutionized population genomic studies, but there are genomic regions where this technology reaches its limits. Limitations mostly arise due to the difficulties in assembly or alignment to genomic regions of high sequence divergence and high repeat content, which are typical characteristics for loci under strong long-term balancing selection. Studying genetic diversity at such loci therefore remains challenging. Here, we investigate the feasibility and error rates associated with targeted long-read sequencing of a locus under balancing selection. For this purpose, we generated bacterial artificial chromosomes (BACs) containing the Brassicaceae S-locus, a region under strong negative frequency-dependent selection which has previously proven difficult to assemble in its entirety using short reads. We sequence S-locus BACs with single-molecule long-read sequencing technology and conduct de novo assembly of these S-locus haplotypes. By comparing repeated assemblies resulting from independent long-read sequencing runs on the same BAC clone we do not detect any structural errors, suggesting that reliable assemblies are generated, but we estimate an indel error rate of 5.7×10−5. A similar error rate was estimated based on comparison of Illumina short-read sequences and BAC assemblies. Our results show that, until de novo assembly of multiple individuals using long-read sequencing becomes feasible, targeted long-read sequencing of loci under balancing selection is a viable option with low error rates for single nucleotide polymorphisms or structural variation. We further find that short-read sequencing is a valuable complement, allowing correction of the relatively high rate of indel errors that result from this approach. This study was supported by a grant from the Swedish Research Council to T.S. 2019-09-13T09:42:40Z 2019-10-01T15:16:31Z 2019-09-13T09:42:40Z 2019-10-01T15:16:31Z 2018-04 2018-02-20 2018-02-20 2019-09-13T08:42:41Z Article Published version Bachmann JA, Tedder A, Laenen B et al (2018) Targeted long-read sequencing of a locus under long-term balancing selection in Capsella. G3: Genes, Genomes, Genetics. 8(4): 1327-1333. http://hdl.handle.net/10454/17277 en https://doi.org/10.1534/g3.117.300467 © 2018 Bachmann et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/ by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
collection NDLTD
language en
sources NDLTD
topic Single-molecule real-time sequencing
Bacterial artificial chromosomes
Sequencing errors
Assembly
Self-incompatibility locus
Capsella
Brassicaceae
spellingShingle Single-molecule real-time sequencing
Bacterial artificial chromosomes
Sequencing errors
Assembly
Self-incompatibility locus
Capsella
Brassicaceae
Bachmann, J.A.
Tedder, Andrew
Laenen, B.
Steige, K.A.
Slotte, T.
Targeted long-read sequencing of a locus under long-term balancing selection in Capsella
description Yes === Rapid advances in short-read DNA sequencing technologies have revolutionized population genomic studies, but there are genomic regions where this technology reaches its limits. Limitations mostly arise due to the difficulties in assembly or alignment to genomic regions of high sequence divergence and high repeat content, which are typical characteristics for loci under strong long-term balancing selection. Studying genetic diversity at such loci therefore remains challenging. Here, we investigate the feasibility and error rates associated with targeted long-read sequencing of a locus under balancing selection. For this purpose, we generated bacterial artificial chromosomes (BACs) containing the Brassicaceae S-locus, a region under strong negative frequency-dependent selection which has previously proven difficult to assemble in its entirety using short reads. We sequence S-locus BACs with single-molecule long-read sequencing technology and conduct de novo assembly of these S-locus haplotypes. By comparing repeated assemblies resulting from independent long-read sequencing runs on the same BAC clone we do not detect any structural errors, suggesting that reliable assemblies are generated, but we estimate an indel error rate of 5.7×10−5. A similar error rate was estimated based on comparison of Illumina short-read sequences and BAC assemblies. Our results show that, until de novo assembly of multiple individuals using long-read sequencing becomes feasible, targeted long-read sequencing of loci under balancing selection is a viable option with low error rates for single nucleotide polymorphisms or structural variation. We further find that short-read sequencing is a valuable complement, allowing correction of the relatively high rate of indel errors that result from this approach. === This study was supported by a grant from the Swedish Research Council to T.S.
author Bachmann, J.A.
Tedder, Andrew
Laenen, B.
Steige, K.A.
Slotte, T.
author_facet Bachmann, J.A.
Tedder, Andrew
Laenen, B.
Steige, K.A.
Slotte, T.
author_sort Bachmann, J.A.
title Targeted long-read sequencing of a locus under long-term balancing selection in Capsella
title_short Targeted long-read sequencing of a locus under long-term balancing selection in Capsella
title_full Targeted long-read sequencing of a locus under long-term balancing selection in Capsella
title_fullStr Targeted long-read sequencing of a locus under long-term balancing selection in Capsella
title_full_unstemmed Targeted long-read sequencing of a locus under long-term balancing selection in Capsella
title_sort targeted long-read sequencing of a locus under long-term balancing selection in capsella
publishDate 2019
url http://hdl.handle.net/10454/17277
work_keys_str_mv AT bachmannja targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
AT tedderandrew targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
AT laenenb targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
AT steigeka targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
AT slottet targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
_version_ 1719259040915128320