Graph regularized L 2,1-nonnegative matrix factorization for miRNA-disease association prediction

Abstract Background The aberrant expression of microRNAs is closely connected to the occurrence and development of a great deal of human diseases. To study human diseases, numerous effective computational models that are valuable and meaningful have been presented by researchers. Results Here, we pr...

Full description

Bibliographic Details
Main Authors: Zhen Gao, Yu-Tian Wang, Qing-Wen Wu, Jian-Cheng Ni, Chun-Hou Zheng
Format: Article
Language:English
Published: BMC 2020-02-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-020-3409-x
id doaj-bf9d454bf77c491e85ec9b801f471714
record_format Article
spelling doaj-bf9d454bf77c491e85ec9b801f4717142020-11-25T00:33:38ZengBMCBMC Bioinformatics1471-21052020-02-0121111310.1186/s12859-020-3409-xGraph regularized L 2,1-nonnegative matrix factorization for miRNA-disease association predictionZhen Gao0Yu-Tian Wang1Qing-Wen Wu2Jian-Cheng Ni3Chun-Hou Zheng4School of Software, Qufu Normal UniversitySchool of Software, Qufu Normal UniversitySchool of Software, Qufu Normal UniversitySchool of Software, Qufu Normal UniversitySchool of Software, Qufu Normal UniversityAbstract Background The aberrant expression of microRNAs is closely connected to the occurrence and development of a great deal of human diseases. To study human diseases, numerous effective computational models that are valuable and meaningful have been presented by researchers. Results Here, we present a computational framework based on graph Laplacian regularized L 2, 1 -nonnegative matrix factorization (GRL 2, 1 -NMF) for inferring possible human disease-connected miRNAs. First, manually validated disease-connected microRNAs were integrated, and microRNA functional similarity information along with two kinds of disease semantic similarities were calculated. Next, we measured Gaussian interaction profile (GIP) kernel similarities for both diseases and microRNAs. Then, we adopted a preprocessing step, namely, weighted K nearest known neighbours (WKNKN), to decrease the sparsity of the miRNA-disease association matrix network. Finally, the GRL 2,1 -NMF framework was used to predict links between microRNAs and diseases. Conclusions The new method (GRL2, 1-NMF) achieved AUC values of 0.9280 and 0.9276 in global leave-one-out cross validation (global LOOCV) and five-fold cross validation (5-CV), respectively, showing that GRL2, 1-NMF can powerfully discover potential disease-related miRNAs, even if there is no known associated disease.http://link.springer.com/article/10.1186/s12859-020-3409-xmiRNADiseasemiRNA-disease associationsNMF L 2, 1-norm
collection DOAJ
language English
format Article
sources DOAJ
author Zhen Gao
Yu-Tian Wang
Qing-Wen Wu
Jian-Cheng Ni
Chun-Hou Zheng
spellingShingle Zhen Gao
Yu-Tian Wang
Qing-Wen Wu
Jian-Cheng Ni
Chun-Hou Zheng
Graph regularized L 2,1-nonnegative matrix factorization for miRNA-disease association prediction
BMC Bioinformatics
miRNA
Disease
miRNA-disease associations
NMF L 2, 1-norm
author_facet Zhen Gao
Yu-Tian Wang
Qing-Wen Wu
Jian-Cheng Ni
Chun-Hou Zheng
author_sort Zhen Gao
title Graph regularized L 2,1-nonnegative matrix factorization for miRNA-disease association prediction
title_short Graph regularized L 2,1-nonnegative matrix factorization for miRNA-disease association prediction
title_full Graph regularized L 2,1-nonnegative matrix factorization for miRNA-disease association prediction
title_fullStr Graph regularized L 2,1-nonnegative matrix factorization for miRNA-disease association prediction
title_full_unstemmed Graph regularized L 2,1-nonnegative matrix factorization for miRNA-disease association prediction
title_sort graph regularized l 2,1-nonnegative matrix factorization for mirna-disease association prediction
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2020-02-01
description Abstract Background The aberrant expression of microRNAs is closely connected to the occurrence and development of a great deal of human diseases. To study human diseases, numerous effective computational models that are valuable and meaningful have been presented by researchers. Results Here, we present a computational framework based on graph Laplacian regularized L 2, 1 -nonnegative matrix factorization (GRL 2, 1 -NMF) for inferring possible human disease-connected miRNAs. First, manually validated disease-connected microRNAs were integrated, and microRNA functional similarity information along with two kinds of disease semantic similarities were calculated. Next, we measured Gaussian interaction profile (GIP) kernel similarities for both diseases and microRNAs. Then, we adopted a preprocessing step, namely, weighted K nearest known neighbours (WKNKN), to decrease the sparsity of the miRNA-disease association matrix network. Finally, the GRL 2,1 -NMF framework was used to predict links between microRNAs and diseases. Conclusions The new method (GRL2, 1-NMF) achieved AUC values of 0.9280 and 0.9276 in global leave-one-out cross validation (global LOOCV) and five-fold cross validation (5-CV), respectively, showing that GRL2, 1-NMF can powerfully discover potential disease-related miRNAs, even if there is no known associated disease.
topic miRNA
Disease
miRNA-disease associations
NMF L 2, 1-norm
url http://link.springer.com/article/10.1186/s12859-020-3409-x
work_keys_str_mv AT zhengao graphregularizedl21nonnegativematrixfactorizationformirnadiseaseassociationprediction
AT yutianwang graphregularizedl21nonnegativematrixfactorizationformirnadiseaseassociationprediction
AT qingwenwu graphregularizedl21nonnegativematrixfactorizationformirnadiseaseassociationprediction
AT jianchengni graphregularizedl21nonnegativematrixfactorizationformirnadiseaseassociationprediction
AT chunhouzheng graphregularizedl21nonnegativematrixfactorizationformirnadiseaseassociationprediction
_version_ 1725315716253483008