SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.

There is an increasing need to use genome and transcriptome sequencing to genetically diagnose patients suffering from suspected monogenic rare diseases. The proper detection of compound heterozygous variant combinations as disease-causing candidates is a challenge in diagnostic workflows as haploty...

Full description

Bibliographic Details
Main Authors: Paul Hager, Hans-Werner Mewes, Meino Rohlfs, Christoph Klein, Tim Jeske
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-02-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1007613
id doaj-e07d9376c1494f1bbb913a615ee66c6d
record_format Article
spelling doaj-e07d9376c1494f1bbb913a615ee66c6d2021-04-21T15:14:06ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582020-02-01162e100761310.1371/journal.pcbi.1007613SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.Paul HagerHans-Werner MewesMeino RohlfsChristoph KleinTim JeskeThere is an increasing need to use genome and transcriptome sequencing to genetically diagnose patients suffering from suspected monogenic rare diseases. The proper detection of compound heterozygous variant combinations as disease-causing candidates is a challenge in diagnostic workflows as haplotype information is lost by currently used next-generation sequencing technologies. Consequently, computational tools are required to phase, or resolve the haplotype of, the high number of heterozygous variants in the exome or genome of each patient. Here we present SmartPhase, a phasing tool designed to efficiently reduce the set of potential compound heterozygous variant pairs in genetic diagnoses pipelines. The phasing algorithm of SmartPhase creates haplotypes using both parental genotype information and reads generated by DNA or RNA sequencing and is thus well suited to resolve the phase of rare variants. To inform the user about the reliability of a phasing prediction, it computes a confidence score which is essential to select error-free predictions. It incorporates existing haplotype information and applies logical rules to determine variants that can be excluded as causing a recessive, monogenic disease. SmartPhase can phase either all possible variant pairs in predefined genetic loci or preselected variant pairs of interest, thus keeping the focus on clinically relevant results. We compared SmartPhase to WhatsHap, one of the leading comparable phasing tools, using simulated data and a real clinical cohort of 921 patients. On both data sets, SmartPhase generated error-free predictions using our derived confidence score threshold. It outperformed WhatsHap with regard to the percentage of resolved pairs when parental genotype information is available. On the cohort data, SmartPhase enabled on average the exclusion of approximately 22% of the input variant pairs in each singleton patient and 44% in each trio patient. SmartPhase is implemented as an open-source Java tool and freely available at http://ibis.helmholtz-muenchen.de/smartphase/.https://doi.org/10.1371/journal.pcbi.1007613
collection DOAJ
language English
format Article
sources DOAJ
author Paul Hager
Hans-Werner Mewes
Meino Rohlfs
Christoph Klein
Tim Jeske
spellingShingle Paul Hager
Hans-Werner Mewes
Meino Rohlfs
Christoph Klein
Tim Jeske
SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.
PLoS Computational Biology
author_facet Paul Hager
Hans-Werner Mewes
Meino Rohlfs
Christoph Klein
Tim Jeske
author_sort Paul Hager
title SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.
title_short SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.
title_full SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.
title_fullStr SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.
title_full_unstemmed SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.
title_sort smartphase: accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2020-02-01
description There is an increasing need to use genome and transcriptome sequencing to genetically diagnose patients suffering from suspected monogenic rare diseases. The proper detection of compound heterozygous variant combinations as disease-causing candidates is a challenge in diagnostic workflows as haplotype information is lost by currently used next-generation sequencing technologies. Consequently, computational tools are required to phase, or resolve the haplotype of, the high number of heterozygous variants in the exome or genome of each patient. Here we present SmartPhase, a phasing tool designed to efficiently reduce the set of potential compound heterozygous variant pairs in genetic diagnoses pipelines. The phasing algorithm of SmartPhase creates haplotypes using both parental genotype information and reads generated by DNA or RNA sequencing and is thus well suited to resolve the phase of rare variants. To inform the user about the reliability of a phasing prediction, it computes a confidence score which is essential to select error-free predictions. It incorporates existing haplotype information and applies logical rules to determine variants that can be excluded as causing a recessive, monogenic disease. SmartPhase can phase either all possible variant pairs in predefined genetic loci or preselected variant pairs of interest, thus keeping the focus on clinically relevant results. We compared SmartPhase to WhatsHap, one of the leading comparable phasing tools, using simulated data and a real clinical cohort of 921 patients. On both data sets, SmartPhase generated error-free predictions using our derived confidence score threshold. It outperformed WhatsHap with regard to the percentage of resolved pairs when parental genotype information is available. On the cohort data, SmartPhase enabled on average the exclusion of approximately 22% of the input variant pairs in each singleton patient and 44% in each trio patient. SmartPhase is implemented as an open-source Java tool and freely available at http://ibis.helmholtz-muenchen.de/smartphase/.
url https://doi.org/10.1371/journal.pcbi.1007613
work_keys_str_mv AT paulhager smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
AT hanswernermewes smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
AT meinorohlfs smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
AT christophklein smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
AT timjeske smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
_version_ 1714667712622362624