Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis
Abstract Background Calcific aortic valve stenosis (CAVS) is a fatal disease and there is no pharmacological treatment to prevent the progression of CAVS. This study aims to identify genes potentially implicated with CAVS in patients with congenital bicuspid aortic valve (BAV) and tricuspid aortic v...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-07-01
|
Series: | BioData Mining |
Subjects: | |
Online Access: | https://doi.org/10.1186/s13040-021-00269-4 |
id |
doaj-162e33cf5e174b8fa93d1c9b9153c544 |
---|---|
record_format |
Article |
spelling |
doaj-162e33cf5e174b8fa93d1c9b9153c5442021-07-25T11:04:07ZengBMCBioData Mining1756-03812021-07-0114111110.1186/s13040-021-00269-4Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosisErika Cantor0Rodrigo Salas1Harvey Rosas2Sandra Guauque-Olarte3Institute of Statistics, Universidad de ValparaísoSchool of Biomedical Engineering, Universidad de ValparaísoInstitute of Statistics, Universidad de ValparaísoFaculty of Dentistry, Universidad Cooperativa de ColombiaAbstract Background Calcific aortic valve stenosis (CAVS) is a fatal disease and there is no pharmacological treatment to prevent the progression of CAVS. This study aims to identify genes potentially implicated with CAVS in patients with congenital bicuspid aortic valve (BAV) and tricuspid aortic valve (TAV) in comparison with patients having normal valves, using a knowledge-slanted random forest (RF). Results This study implemented a knowledge-slanted random forest (RF) using information extracted from a protein-protein interactions network to rank genes in order to modify their selection probability to draw the candidate split-variables. A total of 15,191 genes were assessed in 19 valves with CAVS (BAV, n = 10; TAV, n = 9) and 8 normal valves. The performance of the model was evaluated using accuracy, sensitivity, and specificity to discriminate cases with CAVS. A comparison with conventional RF was also performed. The performance of this proposed approach reported improved accuracy in comparison with conventional RF to classify cases separately with BAV and TAV (Slanted RF: 59.3% versus 40.7%). When patients with BAV and TAV were grouped against patients with normal valves, the addition of prior biological information was not relevant with an accuracy of 92.6%. Conclusion The knowledge-slanted RF approach reflected prior biological knowledge, leading to better precision in distinguishing between cases with BAV, TAV, and normal valves. The results of this study suggest that the integration of biological knowledge can be useful during difficult classification tasks.https://doi.org/10.1186/s13040-021-00269-4Machine learningCalcific aortic valve diseaseRandom ForestPrior-knowledgeGene-selection |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Erika Cantor Rodrigo Salas Harvey Rosas Sandra Guauque-Olarte |
spellingShingle |
Erika Cantor Rodrigo Salas Harvey Rosas Sandra Guauque-Olarte Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis BioData Mining Machine learning Calcific aortic valve disease Random Forest Prior-knowledge Gene-selection |
author_facet |
Erika Cantor Rodrigo Salas Harvey Rosas Sandra Guauque-Olarte |
author_sort |
Erika Cantor |
title |
Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis |
title_short |
Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis |
title_full |
Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis |
title_fullStr |
Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis |
title_full_unstemmed |
Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis |
title_sort |
biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis |
publisher |
BMC |
series |
BioData Mining |
issn |
1756-0381 |
publishDate |
2021-07-01 |
description |
Abstract Background Calcific aortic valve stenosis (CAVS) is a fatal disease and there is no pharmacological treatment to prevent the progression of CAVS. This study aims to identify genes potentially implicated with CAVS in patients with congenital bicuspid aortic valve (BAV) and tricuspid aortic valve (TAV) in comparison with patients having normal valves, using a knowledge-slanted random forest (RF). Results This study implemented a knowledge-slanted random forest (RF) using information extracted from a protein-protein interactions network to rank genes in order to modify their selection probability to draw the candidate split-variables. A total of 15,191 genes were assessed in 19 valves with CAVS (BAV, n = 10; TAV, n = 9) and 8 normal valves. The performance of the model was evaluated using accuracy, sensitivity, and specificity to discriminate cases with CAVS. A comparison with conventional RF was also performed. The performance of this proposed approach reported improved accuracy in comparison with conventional RF to classify cases separately with BAV and TAV (Slanted RF: 59.3% versus 40.7%). When patients with BAV and TAV were grouped against patients with normal valves, the addition of prior biological information was not relevant with an accuracy of 92.6%. Conclusion The knowledge-slanted RF approach reflected prior biological knowledge, leading to better precision in distinguishing between cases with BAV, TAV, and normal valves. The results of this study suggest that the integration of biological knowledge can be useful during difficult classification tasks. |
topic |
Machine learning Calcific aortic valve disease Random Forest Prior-knowledge Gene-selection |
url |
https://doi.org/10.1186/s13040-021-00269-4 |
work_keys_str_mv |
AT erikacantor biologicalknowledgeslantedrandomforestapproachfortheclassificationofcalcifiedaorticvalvestenosis AT rodrigosalas biologicalknowledgeslantedrandomforestapproachfortheclassificationofcalcifiedaorticvalvestenosis AT harveyrosas biologicalknowledgeslantedrandomforestapproachfortheclassificationofcalcifiedaorticvalvestenosis AT sandraguauqueolarte biologicalknowledgeslantedrandomforestapproachfortheclassificationofcalcifiedaorticvalvestenosis |
_version_ |
1721283499796725760 |