IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method

Abstract Background It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations....

Full description

Bibliographic Details
Main Authors: Wenwen Fan, Junliang Shang, Feng Li, Yan Sun, Shasha Yuan, Jin-Xing Liu
Format: Article
Language:English
Published: BMC 2020-07-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-020-03699-9
id doaj-88ed1042b23e414ab580834ccf9e4aa1
record_format Article
spelling doaj-88ed1042b23e414ab580834ccf9e4aa12020-11-25T03:25:20ZengBMCBMC Bioinformatics1471-21052020-07-0121111410.1186/s12859-020-03699-9IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity methodWenwen Fan0Junliang Shang1Feng Li2Yan Sun3Shasha Yuan4Jin-Xing Liu5School of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversityAbstract Background It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Nevertheless, among them, little effort has been attempted to measure lncRNA functional similarity, which is an essential part of association prediction models. Results In this study, we presented an lncRNA functional similarity calculation model, IDSSIM for short, based on an improved disease semantic similarity method, highlight of which is the introduction of information content contribution factor into the semantic value calculation to take into account both the hierarchical structures of disease directed acyclic graphs and the disease specificities. IDSSIM and three state-of-the-art models, i.e., LNCSIM1, LNCSIM2, and ILNCSIM, were evaluated by applying their disease semantic similarity matrices and the lncRNA functional similarity matrices, as well as corresponding matrices of human lncRNA-disease associations coming from either lncRNADisease database or MNDR database, into an association prediction method WKNKN for lncRNA-disease association prediction. In addition, case studies of breast cancer and adenocarcinoma were also performed to validate the effectiveness of IDSSIM. Conclusions Results demonstrated that in terms of ROC curves and AUC values, IDSSIM is superior to compared models, and can improve accuracy of disease semantic similarity effectively, leading to increase the association prediction ability of the IDSSIM-WKNKN model; in terms of case studies, most of potential disease-associated lncRNAs predicted by IDSSIM can be confirmed by databases and literatures, implying that IDSSIM can serve as a promising tool for predicting lncRNA functions, identifying potential lncRNA-disease associations, and pre-screening candidate lncRNAs to perform biological experiments. The IDSSIM code, all experimental data and prediction results are available online at https://github.com/CDMB-lab/IDSSIM .http://link.springer.com/article/10.1186/s12859-020-03699-9LncRNA functional similarityDisease semantic similaritylncRNA-disease associations
collection DOAJ
language English
format Article
sources DOAJ
author Wenwen Fan
Junliang Shang
Feng Li
Yan Sun
Shasha Yuan
Jin-Xing Liu
spellingShingle Wenwen Fan
Junliang Shang
Feng Li
Yan Sun
Shasha Yuan
Jin-Xing Liu
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
BMC Bioinformatics
LncRNA functional similarity
Disease semantic similarity
lncRNA-disease associations
author_facet Wenwen Fan
Junliang Shang
Feng Li
Yan Sun
Shasha Yuan
Jin-Xing Liu
author_sort Wenwen Fan
title IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_short IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_full IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_fullStr IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_full_unstemmed IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_sort idssim: an lncrna functional similarity calculation model based on an improved disease semantic similarity method
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2020-07-01
description Abstract Background It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Nevertheless, among them, little effort has been attempted to measure lncRNA functional similarity, which is an essential part of association prediction models. Results In this study, we presented an lncRNA functional similarity calculation model, IDSSIM for short, based on an improved disease semantic similarity method, highlight of which is the introduction of information content contribution factor into the semantic value calculation to take into account both the hierarchical structures of disease directed acyclic graphs and the disease specificities. IDSSIM and three state-of-the-art models, i.e., LNCSIM1, LNCSIM2, and ILNCSIM, were evaluated by applying their disease semantic similarity matrices and the lncRNA functional similarity matrices, as well as corresponding matrices of human lncRNA-disease associations coming from either lncRNADisease database or MNDR database, into an association prediction method WKNKN for lncRNA-disease association prediction. In addition, case studies of breast cancer and adenocarcinoma were also performed to validate the effectiveness of IDSSIM. Conclusions Results demonstrated that in terms of ROC curves and AUC values, IDSSIM is superior to compared models, and can improve accuracy of disease semantic similarity effectively, leading to increase the association prediction ability of the IDSSIM-WKNKN model; in terms of case studies, most of potential disease-associated lncRNAs predicted by IDSSIM can be confirmed by databases and literatures, implying that IDSSIM can serve as a promising tool for predicting lncRNA functions, identifying potential lncRNA-disease associations, and pre-screening candidate lncRNAs to perform biological experiments. The IDSSIM code, all experimental data and prediction results are available online at https://github.com/CDMB-lab/IDSSIM .
topic LncRNA functional similarity
Disease semantic similarity
lncRNA-disease associations
url http://link.springer.com/article/10.1186/s12859-020-03699-9
work_keys_str_mv AT wenwenfan idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT junliangshang idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT fengli idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT yansun idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT shashayuan idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT jinxingliu idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
_version_ 1724597446474989568