DeepPVP: phenotype-based prioritization of causative variants using deep learning

Abstract Background Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic sim...

Full description

Bibliographic Details
Main Authors: Imane Boudellioua, Maxat Kulmanov, Paul N. Schofield, Georgios V. Gkoutos, Robert Hoehndorf
Format: Article
Language:English
Published: BMC 2019-02-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-019-2633-8
id doaj-94b829447d024000b5f6956e0b2b7462
record_format Article
spelling doaj-94b829447d024000b5f6956e0b2b74622020-11-25T03:20:12ZengBMCBMC Bioinformatics1471-21052019-02-012011810.1186/s12859-019-2633-8DeepPVP: phenotype-based prioritization of causative variants using deep learningImane Boudellioua0Maxat Kulmanov1Paul N. Schofield2Georgios V. Gkoutos3Robert Hoehndorf4Computational Bioscience Research Center (CBRC), King Abdullah University of Science and TechnologyComputational Bioscience Research Center (CBRC), King Abdullah University of Science and TechnologyDepartment of Physiology, Development & Neuroscience, University of CambridgeCollege of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of BirminghamComputational Bioscience Research Center (CBRC), King Abdullah University of Science and TechnologyAbstract Background Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic similarity measure to prioritize not only variants that are likely to be dysfunctional but those that are likely involved in the pathogenesis of a patient’s phenotype. Results We have developed DeepPVP, a variant prioritization method that combined automated inference with deep neural networks to identify the likely causative variants in whole exome or whole genome sequence data. We demonstrate that DeepPVP performs significantly better than existing methods, including phenotype-based methods that use similar features. DeepPVP is freely available at https://github.com/bio-ontology-research-group/phenomenet-vp. Conclusions DeepPVP further improves on existing variant prioritization methods both in terms of speed as well as accuracy.http://link.springer.com/article/10.1186/s12859-019-2633-8Variant prioritizationPhenotypeMachine learningOntology
collection DOAJ
language English
format Article
sources DOAJ
author Imane Boudellioua
Maxat Kulmanov
Paul N. Schofield
Georgios V. Gkoutos
Robert Hoehndorf
spellingShingle Imane Boudellioua
Maxat Kulmanov
Paul N. Schofield
Georgios V. Gkoutos
Robert Hoehndorf
DeepPVP: phenotype-based prioritization of causative variants using deep learning
BMC Bioinformatics
Variant prioritization
Phenotype
Machine learning
Ontology
author_facet Imane Boudellioua
Maxat Kulmanov
Paul N. Schofield
Georgios V. Gkoutos
Robert Hoehndorf
author_sort Imane Boudellioua
title DeepPVP: phenotype-based prioritization of causative variants using deep learning
title_short DeepPVP: phenotype-based prioritization of causative variants using deep learning
title_full DeepPVP: phenotype-based prioritization of causative variants using deep learning
title_fullStr DeepPVP: phenotype-based prioritization of causative variants using deep learning
title_full_unstemmed DeepPVP: phenotype-based prioritization of causative variants using deep learning
title_sort deeppvp: phenotype-based prioritization of causative variants using deep learning
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2019-02-01
description Abstract Background Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic similarity measure to prioritize not only variants that are likely to be dysfunctional but those that are likely involved in the pathogenesis of a patient’s phenotype. Results We have developed DeepPVP, a variant prioritization method that combined automated inference with deep neural networks to identify the likely causative variants in whole exome or whole genome sequence data. We demonstrate that DeepPVP performs significantly better than existing methods, including phenotype-based methods that use similar features. DeepPVP is freely available at https://github.com/bio-ontology-research-group/phenomenet-vp. Conclusions DeepPVP further improves on existing variant prioritization methods both in terms of speed as well as accuracy.
topic Variant prioritization
Phenotype
Machine learning
Ontology
url http://link.springer.com/article/10.1186/s12859-019-2633-8
work_keys_str_mv AT imaneboudellioua deeppvpphenotypebasedprioritizationofcausativevariantsusingdeeplearning
AT maxatkulmanov deeppvpphenotypebasedprioritizationofcausativevariantsusingdeeplearning
AT paulnschofield deeppvpphenotypebasedprioritizationofcausativevariantsusingdeeplearning
AT georgiosvgkoutos deeppvpphenotypebasedprioritizationofcausativevariantsusingdeeplearning
AT roberthoehndorf deeppvpphenotypebasedprioritizationofcausativevariantsusingdeeplearning
_version_ 1724618815878201344