Decoding of superimposed traces produced by direct sequencing of heterozygous indels.

Direct Sanger sequencing of a diploid template containing a heterozygous insertion or deletion results in a difficult-to-interpret mixed trace formed by two allelic traces superimposed onto each other. Existing computational methods for deconvolution of such traces require knowledge of a reference s...

Full description

Bibliographic Details
Main Authors: Dmitry A Dmitriev, Roman A Rakitov
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2008-01-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC2429969?pdf=render
id doaj-84b06d43a34141888fcb10d16bb9c29f
record_format Article
spelling doaj-84b06d43a34141888fcb10d16bb9c29f2020-11-24T21:55:35ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582008-01-0147e100011310.1371/journal.pcbi.1000113Decoding of superimposed traces produced by direct sequencing of heterozygous indels.Dmitry A DmitrievRoman A RakitovDirect Sanger sequencing of a diploid template containing a heterozygous insertion or deletion results in a difficult-to-interpret mixed trace formed by two allelic traces superimposed onto each other. Existing computational methods for deconvolution of such traces require knowledge of a reference sequence or the availability of both direct and reverse mixed sequences of the same template. We describe a simple yet accurate method, which uses dynamic programming optimization to predict superimposed allelic sequences solely from a string of letters representing peaks within an individual mixed trace. We used the method to decode 104 human traces (mean length 294 bp) containing heterozygous indels 5 to 30 bp with a mean of 99.1% bases per allelic sequence reconstructed correctly and unambiguously. Simulations with artificial sequences have demonstrated that the method yields accurate reconstructions when (1) the allelic sequences forming the mixed trace are sufficiently similar, (2) the analyzed fragment is significantly longer than the indel, and (3) multiple indels, if present, are well-spaced. Because these conditions occur in most encountered DNA sequences, the method is widely applicable. It is available as a free Web application Indelligent at http://ctap.inhs.uiuc.edu/dmitriev/indel.asp.http://europepmc.org/articles/PMC2429969?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Dmitry A Dmitriev
Roman A Rakitov
spellingShingle Dmitry A Dmitriev
Roman A Rakitov
Decoding of superimposed traces produced by direct sequencing of heterozygous indels.
PLoS Computational Biology
author_facet Dmitry A Dmitriev
Roman A Rakitov
author_sort Dmitry A Dmitriev
title Decoding of superimposed traces produced by direct sequencing of heterozygous indels.
title_short Decoding of superimposed traces produced by direct sequencing of heterozygous indels.
title_full Decoding of superimposed traces produced by direct sequencing of heterozygous indels.
title_fullStr Decoding of superimposed traces produced by direct sequencing of heterozygous indels.
title_full_unstemmed Decoding of superimposed traces produced by direct sequencing of heterozygous indels.
title_sort decoding of superimposed traces produced by direct sequencing of heterozygous indels.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2008-01-01
description Direct Sanger sequencing of a diploid template containing a heterozygous insertion or deletion results in a difficult-to-interpret mixed trace formed by two allelic traces superimposed onto each other. Existing computational methods for deconvolution of such traces require knowledge of a reference sequence or the availability of both direct and reverse mixed sequences of the same template. We describe a simple yet accurate method, which uses dynamic programming optimization to predict superimposed allelic sequences solely from a string of letters representing peaks within an individual mixed trace. We used the method to decode 104 human traces (mean length 294 bp) containing heterozygous indels 5 to 30 bp with a mean of 99.1% bases per allelic sequence reconstructed correctly and unambiguously. Simulations with artificial sequences have demonstrated that the method yields accurate reconstructions when (1) the allelic sequences forming the mixed trace are sufficiently similar, (2) the analyzed fragment is significantly longer than the indel, and (3) multiple indels, if present, are well-spaced. Because these conditions occur in most encountered DNA sequences, the method is widely applicable. It is available as a free Web application Indelligent at http://ctap.inhs.uiuc.edu/dmitriev/indel.asp.
url http://europepmc.org/articles/PMC2429969?pdf=render
work_keys_str_mv AT dmitryadmitriev decodingofsuperimposedtracesproducedbydirectsequencingofheterozygousindels
AT romanarakitov decodingofsuperimposedtracesproducedbydirectsequencingofheterozygousindels
_version_ 1725861680152313856