Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis

Abstract Background Calcific aortic valve stenosis (CAVS) is a fatal disease and there is no pharmacological treatment to prevent the progression of CAVS. This study aims to identify genes potentially implicated with CAVS in patients with congenital bicuspid aortic valve (BAV) and tricuspid aortic v...

Full description

Bibliographic Details
Main Authors: Erika Cantor, Rodrigo Salas, Harvey Rosas, Sandra Guauque-Olarte
Format: Article
Language:English
Published: BMC 2021-07-01
Series:BioData Mining
Subjects:
Online Access:https://doi.org/10.1186/s13040-021-00269-4
id doaj-162e33cf5e174b8fa93d1c9b9153c544
record_format Article
spelling doaj-162e33cf5e174b8fa93d1c9b9153c5442021-07-25T11:04:07ZengBMCBioData Mining1756-03812021-07-0114111110.1186/s13040-021-00269-4Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosisErika Cantor0Rodrigo Salas1Harvey Rosas2Sandra Guauque-Olarte3Institute of Statistics, Universidad de ValparaísoSchool of Biomedical Engineering, Universidad de ValparaísoInstitute of Statistics, Universidad de ValparaísoFaculty of Dentistry, Universidad Cooperativa de ColombiaAbstract Background Calcific aortic valve stenosis (CAVS) is a fatal disease and there is no pharmacological treatment to prevent the progression of CAVS. This study aims to identify genes potentially implicated with CAVS in patients with congenital bicuspid aortic valve (BAV) and tricuspid aortic valve (TAV) in comparison with patients having normal valves, using a knowledge-slanted random forest (RF). Results This study implemented a knowledge-slanted random forest (RF) using information extracted from a protein-protein interactions network to rank genes in order to modify their selection probability to draw the candidate split-variables. A total of 15,191 genes were assessed in 19 valves with CAVS (BAV, n = 10; TAV, n = 9) and 8 normal valves. The performance of the model was evaluated using accuracy, sensitivity, and specificity to discriminate cases with CAVS. A comparison with conventional RF was also performed. The performance of this proposed approach reported improved accuracy in comparison with conventional RF to classify cases separately with BAV and TAV (Slanted RF: 59.3% versus 40.7%). When patients with BAV and TAV were grouped against patients with normal valves, the addition of prior biological information was not relevant with an accuracy of 92.6%. Conclusion The knowledge-slanted RF approach reflected prior biological knowledge, leading to better precision in distinguishing between cases with BAV, TAV, and normal valves. The results of this study suggest that the integration of biological knowledge can be useful during difficult classification tasks.https://doi.org/10.1186/s13040-021-00269-4Machine learningCalcific aortic valve diseaseRandom ForestPrior-knowledgeGene-selection
collection DOAJ
language English
format Article
sources DOAJ
author Erika Cantor
Rodrigo Salas
Harvey Rosas
Sandra Guauque-Olarte
spellingShingle Erika Cantor
Rodrigo Salas
Harvey Rosas
Sandra Guauque-Olarte
Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis
BioData Mining
Machine learning
Calcific aortic valve disease
Random Forest
Prior-knowledge
Gene-selection
author_facet Erika Cantor
Rodrigo Salas
Harvey Rosas
Sandra Guauque-Olarte
author_sort Erika Cantor
title Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis
title_short Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis
title_full Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis
title_fullStr Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis
title_full_unstemmed Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis
title_sort biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis
publisher BMC
series BioData Mining
issn 1756-0381
publishDate 2021-07-01
description Abstract Background Calcific aortic valve stenosis (CAVS) is a fatal disease and there is no pharmacological treatment to prevent the progression of CAVS. This study aims to identify genes potentially implicated with CAVS in patients with congenital bicuspid aortic valve (BAV) and tricuspid aortic valve (TAV) in comparison with patients having normal valves, using a knowledge-slanted random forest (RF). Results This study implemented a knowledge-slanted random forest (RF) using information extracted from a protein-protein interactions network to rank genes in order to modify their selection probability to draw the candidate split-variables. A total of 15,191 genes were assessed in 19 valves with CAVS (BAV, n = 10; TAV, n = 9) and 8 normal valves. The performance of the model was evaluated using accuracy, sensitivity, and specificity to discriminate cases with CAVS. A comparison with conventional RF was also performed. The performance of this proposed approach reported improved accuracy in comparison with conventional RF to classify cases separately with BAV and TAV (Slanted RF: 59.3% versus 40.7%). When patients with BAV and TAV were grouped against patients with normal valves, the addition of prior biological information was not relevant with an accuracy of 92.6%. Conclusion The knowledge-slanted RF approach reflected prior biological knowledge, leading to better precision in distinguishing between cases with BAV, TAV, and normal valves. The results of this study suggest that the integration of biological knowledge can be useful during difficult classification tasks.
topic Machine learning
Calcific aortic valve disease
Random Forest
Prior-knowledge
Gene-selection
url https://doi.org/10.1186/s13040-021-00269-4
work_keys_str_mv AT erikacantor biologicalknowledgeslantedrandomforestapproachfortheclassificationofcalcifiedaorticvalvestenosis
AT rodrigosalas biologicalknowledgeslantedrandomforestapproachfortheclassificationofcalcifiedaorticvalvestenosis
AT harveyrosas biologicalknowledgeslantedrandomforestapproachfortheclassificationofcalcifiedaorticvalvestenosis
AT sandraguauqueolarte biologicalknowledgeslantedrandomforestapproachfortheclassificationofcalcifiedaorticvalvestenosis
_version_ 1721283499796725760