Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.
DNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have e...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2020-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0227427 |
id |
doaj-9d69d4d00edc47cfb2da57a958e630c7 |
---|---|
record_format |
Article |
spelling |
doaj-9d69d4d00edc47cfb2da57a958e630c72021-03-03T21:23:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01151e022742710.1371/journal.pone.0227427Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.Norio TanakaAkihisa TakaharaTaichi HagioRika NishikoJunko KanayamaOsamu GotohSeiichi MoriDNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have employed library preparation methods that rely on enzymatic fragmentation with DNA endonucleases to minimize DNA loss, particularly in nano-quantity samples. Yet, despite their wide use, the effect of enzymatic fragmentation on the resultant sequences has not been carefully assessed. Here, we used pairwise comparisons of somatic variants of the same tumor DNA samples prepared using ultrasonic and enzymatic fragmentation methods. Our analysis revealed a substantially larger number of recurrent artifactual SNVs/indels in endonuclease-treated libraries as compared with those created through ultrasonication. These artifacts were marked by palindromic structure in the genomic context, positional bias in sequenced reads, and multi-nucleotide substitutions. Taking advantage of these distinctive features, we developed a filtering algorithm to distinguish genuine somatic mutations from artifactual noise with high specificity and sensitivity. Noise cancelling recovered the composition of the mutational signatures in the tumor samples. Thus, we provide an informatics algorithm as a solution to the sequencing errors produced as a consequence of endonuclease-mediated fragmentation, highlighted for the first time in this study.https://doi.org/10.1371/journal.pone.0227427 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Norio Tanaka Akihisa Takahara Taichi Hagio Rika Nishiko Junko Kanayama Osamu Gotoh Seiichi Mori |
spellingShingle |
Norio Tanaka Akihisa Takahara Taichi Hagio Rika Nishiko Junko Kanayama Osamu Gotoh Seiichi Mori Sequencing artifacts derived from a library preparation method using enzymatic fragmentation. PLoS ONE |
author_facet |
Norio Tanaka Akihisa Takahara Taichi Hagio Rika Nishiko Junko Kanayama Osamu Gotoh Seiichi Mori |
author_sort |
Norio Tanaka |
title |
Sequencing artifacts derived from a library preparation method using enzymatic fragmentation. |
title_short |
Sequencing artifacts derived from a library preparation method using enzymatic fragmentation. |
title_full |
Sequencing artifacts derived from a library preparation method using enzymatic fragmentation. |
title_fullStr |
Sequencing artifacts derived from a library preparation method using enzymatic fragmentation. |
title_full_unstemmed |
Sequencing artifacts derived from a library preparation method using enzymatic fragmentation. |
title_sort |
sequencing artifacts derived from a library preparation method using enzymatic fragmentation. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2020-01-01 |
description |
DNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have employed library preparation methods that rely on enzymatic fragmentation with DNA endonucleases to minimize DNA loss, particularly in nano-quantity samples. Yet, despite their wide use, the effect of enzymatic fragmentation on the resultant sequences has not been carefully assessed. Here, we used pairwise comparisons of somatic variants of the same tumor DNA samples prepared using ultrasonic and enzymatic fragmentation methods. Our analysis revealed a substantially larger number of recurrent artifactual SNVs/indels in endonuclease-treated libraries as compared with those created through ultrasonication. These artifacts were marked by palindromic structure in the genomic context, positional bias in sequenced reads, and multi-nucleotide substitutions. Taking advantage of these distinctive features, we developed a filtering algorithm to distinguish genuine somatic mutations from artifactual noise with high specificity and sensitivity. Noise cancelling recovered the composition of the mutational signatures in the tumor samples. Thus, we provide an informatics algorithm as a solution to the sequencing errors produced as a consequence of endonuclease-mediated fragmentation, highlighted for the first time in this study. |
url |
https://doi.org/10.1371/journal.pone.0227427 |
work_keys_str_mv |
AT noriotanaka sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation AT akihisatakahara sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation AT taichihagio sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation AT rikanishiko sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation AT junkokanayama sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation AT osamugotoh sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation AT seiichimori sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation |
_version_ |
1714817120724844544 |