Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.

DNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have e...

Full description

Bibliographic Details
Main Authors: Norio Tanaka, Akihisa Takahara, Taichi Hagio, Rika Nishiko, Junko Kanayama, Osamu Gotoh, Seiichi Mori
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0227427
id doaj-9d69d4d00edc47cfb2da57a958e630c7
record_format Article
spelling doaj-9d69d4d00edc47cfb2da57a958e630c72021-03-03T21:23:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01151e022742710.1371/journal.pone.0227427Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.Norio TanakaAkihisa TakaharaTaichi HagioRika NishikoJunko KanayamaOsamu GotohSeiichi MoriDNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have employed library preparation methods that rely on enzymatic fragmentation with DNA endonucleases to minimize DNA loss, particularly in nano-quantity samples. Yet, despite their wide use, the effect of enzymatic fragmentation on the resultant sequences has not been carefully assessed. Here, we used pairwise comparisons of somatic variants of the same tumor DNA samples prepared using ultrasonic and enzymatic fragmentation methods. Our analysis revealed a substantially larger number of recurrent artifactual SNVs/indels in endonuclease-treated libraries as compared with those created through ultrasonication. These artifacts were marked by palindromic structure in the genomic context, positional bias in sequenced reads, and multi-nucleotide substitutions. Taking advantage of these distinctive features, we developed a filtering algorithm to distinguish genuine somatic mutations from artifactual noise with high specificity and sensitivity. Noise cancelling recovered the composition of the mutational signatures in the tumor samples. Thus, we provide an informatics algorithm as a solution to the sequencing errors produced as a consequence of endonuclease-mediated fragmentation, highlighted for the first time in this study.https://doi.org/10.1371/journal.pone.0227427
collection DOAJ
language English
format Article
sources DOAJ
author Norio Tanaka
Akihisa Takahara
Taichi Hagio
Rika Nishiko
Junko Kanayama
Osamu Gotoh
Seiichi Mori
spellingShingle Norio Tanaka
Akihisa Takahara
Taichi Hagio
Rika Nishiko
Junko Kanayama
Osamu Gotoh
Seiichi Mori
Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.
PLoS ONE
author_facet Norio Tanaka
Akihisa Takahara
Taichi Hagio
Rika Nishiko
Junko Kanayama
Osamu Gotoh
Seiichi Mori
author_sort Norio Tanaka
title Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.
title_short Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.
title_full Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.
title_fullStr Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.
title_full_unstemmed Sequencing artifacts derived from a library preparation method using enzymatic fragmentation.
title_sort sequencing artifacts derived from a library preparation method using enzymatic fragmentation.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description DNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have employed library preparation methods that rely on enzymatic fragmentation with DNA endonucleases to minimize DNA loss, particularly in nano-quantity samples. Yet, despite their wide use, the effect of enzymatic fragmentation on the resultant sequences has not been carefully assessed. Here, we used pairwise comparisons of somatic variants of the same tumor DNA samples prepared using ultrasonic and enzymatic fragmentation methods. Our analysis revealed a substantially larger number of recurrent artifactual SNVs/indels in endonuclease-treated libraries as compared with those created through ultrasonication. These artifacts were marked by palindromic structure in the genomic context, positional bias in sequenced reads, and multi-nucleotide substitutions. Taking advantage of these distinctive features, we developed a filtering algorithm to distinguish genuine somatic mutations from artifactual noise with high specificity and sensitivity. Noise cancelling recovered the composition of the mutational signatures in the tumor samples. Thus, we provide an informatics algorithm as a solution to the sequencing errors produced as a consequence of endonuclease-mediated fragmentation, highlighted for the first time in this study.
url https://doi.org/10.1371/journal.pone.0227427
work_keys_str_mv AT noriotanaka sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT akihisatakahara sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT taichihagio sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT rikanishiko sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT junkokanayama sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT osamugotoh sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
AT seiichimori sequencingartifactsderivedfromalibrarypreparationmethodusingenzymaticfragmentation
_version_ 1714817120724844544