Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants

Summary: De novo genetic variants are an important source of causative variation in complex genetic disorders. Many methods for variant discovery rely on mapping reads to a reference genome, detecting numerous inherited variants irrelevant to the phenotype of interest. To distinguish between inherit...

Full description

Bibliographic Details
Main Authors: Daniel S. Standage, C. Titus Brown, Fereydoun Hormozdiari
Format: Article
Language:English
Published: Elsevier 2019-08-01
Series:iScience
Online Access:http://www.sciencedirect.com/science/article/pii/S2589004219302597
id doaj-92be6aaf9a58490584377b34c194349e
record_format Article
spelling doaj-92be6aaf9a58490584377b34c194349e2020-11-25T02:11:17ZengElsevieriScience2589-00422019-08-01182836Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo VariantsDaniel S. Standage0C. Titus Brown1Fereydoun Hormozdiari2Population Health and Reproduction, University of California, Davis, USA; Corresponding authorPopulation Health and Reproduction, University of California, Davis, USA; Genome Center, University of California, Davis, USA; Corresponding authorGenome Center, University of California, Davis, USA; MIND Institute, University of California, Davis, USA; Biochemistry and Molecular Medicine, University of California, Davis, 1 Shields Avenue, Davis, CA 95616, USA; Corresponding authorSummary: De novo genetic variants are an important source of causative variation in complex genetic disorders. Many methods for variant discovery rely on mapping reads to a reference genome, detecting numerous inherited variants irrelevant to the phenotype of interest. To distinguish between inherited and de novo variation, sequencing of families (parents and siblings) is commonly pursued. However, standard mapping-based approaches tend to have a high false-discovery rate for de novo variant prediction. Kevlar is a mapping-free method for de novo variant discovery, based on direct comparison of sequences between related individuals. Kevlar identifies high-abundance k-mers unique to the individual of interest. Reads containing these k-mers are partitioned into disjoint sets by shared k-mer content for variant calling, and preliminary variant predictions are sorted using a probabilistic score. We evaluated Kevlar on simulated and real datasets, demonstrating its ability to detect both de novo single-nucleotide variants and indels with high accuracy. : Bioinformatics; Biological Sciences; Genetics Subject Areas: Bioinformatics, Biological Sciences, Geneticshttp://www.sciencedirect.com/science/article/pii/S2589004219302597
collection DOAJ
language English
format Article
sources DOAJ
author Daniel S. Standage
C. Titus Brown
Fereydoun Hormozdiari
spellingShingle Daniel S. Standage
C. Titus Brown
Fereydoun Hormozdiari
Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants
iScience
author_facet Daniel S. Standage
C. Titus Brown
Fereydoun Hormozdiari
author_sort Daniel S. Standage
title Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants
title_short Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants
title_full Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants
title_fullStr Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants
title_full_unstemmed Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants
title_sort kevlar: a mapping-free framework for accurate discovery of de novo variants
publisher Elsevier
series iScience
issn 2589-0042
publishDate 2019-08-01
description Summary: De novo genetic variants are an important source of causative variation in complex genetic disorders. Many methods for variant discovery rely on mapping reads to a reference genome, detecting numerous inherited variants irrelevant to the phenotype of interest. To distinguish between inherited and de novo variation, sequencing of families (parents and siblings) is commonly pursued. However, standard mapping-based approaches tend to have a high false-discovery rate for de novo variant prediction. Kevlar is a mapping-free method for de novo variant discovery, based on direct comparison of sequences between related individuals. Kevlar identifies high-abundance k-mers unique to the individual of interest. Reads containing these k-mers are partitioned into disjoint sets by shared k-mer content for variant calling, and preliminary variant predictions are sorted using a probabilistic score. We evaluated Kevlar on simulated and real datasets, demonstrating its ability to detect both de novo single-nucleotide variants and indels with high accuracy. : Bioinformatics; Biological Sciences; Genetics Subject Areas: Bioinformatics, Biological Sciences, Genetics
url http://www.sciencedirect.com/science/article/pii/S2589004219302597
work_keys_str_mv AT danielsstandage kevlaramappingfreeframeworkforaccuratediscoveryofdenovovariants
AT ctitusbrown kevlaramappingfreeframeworkforaccuratediscoveryofdenovovariants
AT fereydounhormozdiari kevlaramappingfreeframeworkforaccuratediscoveryofdenovovariants
_version_ 1724915269993758720