PMDFI: Predicting miRNA–Disease Associations Based on High-Order Feature Interaction
MicroRNAs (miRNAs) are non-coding RNA molecules that make a significant contribution to diverse biological processes, and their mutations and dysregulations are closely related to the occurrence, development, and treatment of human diseases. Therefore, identification of potential miRNA–disease assoc...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2021-04-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fgene.2021.656107/full |
id |
doaj-d2adab7c0e4444c79ff75664c87fd8ed |
---|---|
record_format |
Article |
spelling |
doaj-d2adab7c0e4444c79ff75664c87fd8ed2021-04-09T12:46:44ZengFrontiers Media S.A.Frontiers in Genetics1664-80212021-04-011210.3389/fgene.2021.656107656107PMDFI: Predicting miRNA–Disease Associations Based on High-Order Feature InteractionMingyan TangChenzhe LiuDayun LiuJunyi LiuJiaqi LiuLei DengMicroRNAs (miRNAs) are non-coding RNA molecules that make a significant contribution to diverse biological processes, and their mutations and dysregulations are closely related to the occurrence, development, and treatment of human diseases. Therefore, identification of potential miRNA–disease associations contributes to elucidating the pathogenesis of tumorigenesis and seeking the effective treatment method for diseases. Due to the expensive cost of traditional biological experiments of determining associations between miRNAs and diseases, increasing numbers of effective computational models are being used to compensate for this limitation. In this study, we propose a novel computational method, named PMDFI, which is an ensemble learning method to predict potential miRNA–disease associations based on high-order feature interactions. We initially use a stacked autoencoder to extract meaningful high-order features from the original similarity matrix, and then perform feature interactive learning, and finally utilize an integrated model composed of multiple random forests and logistic regression to make comprehensive predictions. The experimental results illustrate that PMDFI achieves excellent performance in predicting potential miRNA–disease associations, with the average area under the ROC curve scores of 0.9404 and 0.9415 in 5-fold and 10-fold cross-validation, respectively.https://www.frontiersin.org/articles/10.3389/fgene.2021.656107/fullmiRNA-disease associationshigh-order featuresfeature interactionsrandom forestlogistic regression |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mingyan Tang Chenzhe Liu Dayun Liu Junyi Liu Jiaqi Liu Lei Deng |
spellingShingle |
Mingyan Tang Chenzhe Liu Dayun Liu Junyi Liu Jiaqi Liu Lei Deng PMDFI: Predicting miRNA–Disease Associations Based on High-Order Feature Interaction Frontiers in Genetics miRNA-disease associations high-order features feature interactions random forest logistic regression |
author_facet |
Mingyan Tang Chenzhe Liu Dayun Liu Junyi Liu Jiaqi Liu Lei Deng |
author_sort |
Mingyan Tang |
title |
PMDFI: Predicting miRNA–Disease Associations Based on High-Order Feature Interaction |
title_short |
PMDFI: Predicting miRNA–Disease Associations Based on High-Order Feature Interaction |
title_full |
PMDFI: Predicting miRNA–Disease Associations Based on High-Order Feature Interaction |
title_fullStr |
PMDFI: Predicting miRNA–Disease Associations Based on High-Order Feature Interaction |
title_full_unstemmed |
PMDFI: Predicting miRNA–Disease Associations Based on High-Order Feature Interaction |
title_sort |
pmdfi: predicting mirna–disease associations based on high-order feature interaction |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Genetics |
issn |
1664-8021 |
publishDate |
2021-04-01 |
description |
MicroRNAs (miRNAs) are non-coding RNA molecules that make a significant contribution to diverse biological processes, and their mutations and dysregulations are closely related to the occurrence, development, and treatment of human diseases. Therefore, identification of potential miRNA–disease associations contributes to elucidating the pathogenesis of tumorigenesis and seeking the effective treatment method for diseases. Due to the expensive cost of traditional biological experiments of determining associations between miRNAs and diseases, increasing numbers of effective computational models are being used to compensate for this limitation. In this study, we propose a novel computational method, named PMDFI, which is an ensemble learning method to predict potential miRNA–disease associations based on high-order feature interactions. We initially use a stacked autoencoder to extract meaningful high-order features from the original similarity matrix, and then perform feature interactive learning, and finally utilize an integrated model composed of multiple random forests and logistic regression to make comprehensive predictions. The experimental results illustrate that PMDFI achieves excellent performance in predicting potential miRNA–disease associations, with the average area under the ROC curve scores of 0.9404 and 0.9415 in 5-fold and 10-fold cross-validation, respectively. |
topic |
miRNA-disease associations high-order features feature interactions random forest logistic regression |
url |
https://www.frontiersin.org/articles/10.3389/fgene.2021.656107/full |
work_keys_str_mv |
AT mingyantang pmdfipredictingmirnadiseaseassociationsbasedonhighorderfeatureinteraction AT chenzheliu pmdfipredictingmirnadiseaseassociationsbasedonhighorderfeatureinteraction AT dayunliu pmdfipredictingmirnadiseaseassociationsbasedonhighorderfeatureinteraction AT junyiliu pmdfipredictingmirnadiseaseassociationsbasedonhighorderfeatureinteraction AT jiaqiliu pmdfipredictingmirnadiseaseassociationsbasedonhighorderfeatureinteraction AT leideng pmdfipredictingmirnadiseaseassociationsbasedonhighorderfeatureinteraction |
_version_ |
1721532676903534592 |