Deep GONet: self-explainable deep neural network based on Gene Ontology for phenotype prediction from gene expression data

Abstract Background With the rapid advancement of genomic sequencing techniques, massive production of gene expression data is becoming possible, which prompts the development of precision medicine. Deep learning is a promising approach for phenotype prediction (clinical diagnosis, prognosis, and dr...

Full description

Bibliographic Details
Main Authors: Victoria Bourgeais, Farida Zehraoui, Mohamed Ben Hamdoune, Blaise Hanczar
Format: Article
Language:English
Published: BMC 2021-09-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-021-04370-7
id doaj-23205e129b944739ab9be3fcd73ed5cd
record_format Article
spelling doaj-23205e129b944739ab9be3fcd73ed5cd2021-09-26T11:15:29ZengBMCBMC Bioinformatics1471-21052021-09-0122S1012510.1186/s12859-021-04370-7Deep GONet: self-explainable deep neural network based on Gene Ontology for phenotype prediction from gene expression dataVictoria Bourgeais0Farida Zehraoui1Mohamed Ben Hamdoune2Blaise Hanczar3IBISC, Univ Evry, Université Paris-SaclayIBISC, Univ Evry, Université Paris-SaclayIBISC, Univ Evry, Université Paris-SaclayIBISC, Univ Evry, Université Paris-SaclayAbstract Background With the rapid advancement of genomic sequencing techniques, massive production of gene expression data is becoming possible, which prompts the development of precision medicine. Deep learning is a promising approach for phenotype prediction (clinical diagnosis, prognosis, and drug response) based on gene expression profile. Existing deep learning models are usually considered as black-boxes that provide accurate predictions but are not interpretable. However, accuracy and interpretation are both essential for precision medicine. In addition, most models do not integrate the knowledge of the domain. Hence, making deep learning models interpretable for medical applications using prior biological knowledge is the main focus of this paper. Results In this paper, we propose a new self-explainable deep learning model, called Deep GONet, integrating the Gene Ontology into the hierarchical architecture of the neural network. This model is based on a fully-connected architecture constrained by the Gene Ontology annotations, such that each neuron represents a biological function. The experiments on cancer diagnosis datasets demonstrate that Deep GONet is both easily interpretable and highly performant to discriminate cancer and non-cancer samples. Conclusions Our model provides an explanation to its predictions by identifying the most important neurons and associating them with biological functions, making the model understandable for biologists and physicians.https://doi.org/10.1186/s12859-021-04370-7Gene expressionPhenotype predictionModel interpretationDeep learningGene Ontology
collection DOAJ
language English
format Article
sources DOAJ
author Victoria Bourgeais
Farida Zehraoui
Mohamed Ben Hamdoune
Blaise Hanczar
spellingShingle Victoria Bourgeais
Farida Zehraoui
Mohamed Ben Hamdoune
Blaise Hanczar
Deep GONet: self-explainable deep neural network based on Gene Ontology for phenotype prediction from gene expression data
BMC Bioinformatics
Gene expression
Phenotype prediction
Model interpretation
Deep learning
Gene Ontology
author_facet Victoria Bourgeais
Farida Zehraoui
Mohamed Ben Hamdoune
Blaise Hanczar
author_sort Victoria Bourgeais
title Deep GONet: self-explainable deep neural network based on Gene Ontology for phenotype prediction from gene expression data
title_short Deep GONet: self-explainable deep neural network based on Gene Ontology for phenotype prediction from gene expression data
title_full Deep GONet: self-explainable deep neural network based on Gene Ontology for phenotype prediction from gene expression data
title_fullStr Deep GONet: self-explainable deep neural network based on Gene Ontology for phenotype prediction from gene expression data
title_full_unstemmed Deep GONet: self-explainable deep neural network based on Gene Ontology for phenotype prediction from gene expression data
title_sort deep gonet: self-explainable deep neural network based on gene ontology for phenotype prediction from gene expression data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2021-09-01
description Abstract Background With the rapid advancement of genomic sequencing techniques, massive production of gene expression data is becoming possible, which prompts the development of precision medicine. Deep learning is a promising approach for phenotype prediction (clinical diagnosis, prognosis, and drug response) based on gene expression profile. Existing deep learning models are usually considered as black-boxes that provide accurate predictions but are not interpretable. However, accuracy and interpretation are both essential for precision medicine. In addition, most models do not integrate the knowledge of the domain. Hence, making deep learning models interpretable for medical applications using prior biological knowledge is the main focus of this paper. Results In this paper, we propose a new self-explainable deep learning model, called Deep GONet, integrating the Gene Ontology into the hierarchical architecture of the neural network. This model is based on a fully-connected architecture constrained by the Gene Ontology annotations, such that each neuron represents a biological function. The experiments on cancer diagnosis datasets demonstrate that Deep GONet is both easily interpretable and highly performant to discriminate cancer and non-cancer samples. Conclusions Our model provides an explanation to its predictions by identifying the most important neurons and associating them with biological functions, making the model understandable for biologists and physicians.
topic Gene expression
Phenotype prediction
Model interpretation
Deep learning
Gene Ontology
url https://doi.org/10.1186/s12859-021-04370-7
work_keys_str_mv AT victoriabourgeais deepgonetselfexplainabledeepneuralnetworkbasedongeneontologyforphenotypepredictionfromgeneexpressiondata
AT faridazehraoui deepgonetselfexplainabledeepneuralnetworkbasedongeneontologyforphenotypepredictionfromgeneexpressiondata
AT mohamedbenhamdoune deepgonetselfexplainabledeepneuralnetworkbasedongeneontologyforphenotypepredictionfromgeneexpressiondata
AT blaisehanczar deepgonetselfexplainabledeepneuralnetworkbasedongeneontologyforphenotypepredictionfromgeneexpressiondata
_version_ 1716868068701896704