IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
Abstract Background It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations....
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2020-07-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-020-03699-9 |
id |
doaj-88ed1042b23e414ab580834ccf9e4aa1 |
---|---|
record_format |
Article |
spelling |
doaj-88ed1042b23e414ab580834ccf9e4aa12020-11-25T03:25:20ZengBMCBMC Bioinformatics1471-21052020-07-0121111410.1186/s12859-020-03699-9IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity methodWenwen Fan0Junliang Shang1Feng Li2Yan Sun3Shasha Yuan4Jin-Xing Liu5School of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversitySchool of Information Science and Engineering, Qufu Normal UniversityAbstract Background It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Nevertheless, among them, little effort has been attempted to measure lncRNA functional similarity, which is an essential part of association prediction models. Results In this study, we presented an lncRNA functional similarity calculation model, IDSSIM for short, based on an improved disease semantic similarity method, highlight of which is the introduction of information content contribution factor into the semantic value calculation to take into account both the hierarchical structures of disease directed acyclic graphs and the disease specificities. IDSSIM and three state-of-the-art models, i.e., LNCSIM1, LNCSIM2, and ILNCSIM, were evaluated by applying their disease semantic similarity matrices and the lncRNA functional similarity matrices, as well as corresponding matrices of human lncRNA-disease associations coming from either lncRNADisease database or MNDR database, into an association prediction method WKNKN for lncRNA-disease association prediction. In addition, case studies of breast cancer and adenocarcinoma were also performed to validate the effectiveness of IDSSIM. Conclusions Results demonstrated that in terms of ROC curves and AUC values, IDSSIM is superior to compared models, and can improve accuracy of disease semantic similarity effectively, leading to increase the association prediction ability of the IDSSIM-WKNKN model; in terms of case studies, most of potential disease-associated lncRNAs predicted by IDSSIM can be confirmed by databases and literatures, implying that IDSSIM can serve as a promising tool for predicting lncRNA functions, identifying potential lncRNA-disease associations, and pre-screening candidate lncRNAs to perform biological experiments. The IDSSIM code, all experimental data and prediction results are available online at https://github.com/CDMB-lab/IDSSIM .http://link.springer.com/article/10.1186/s12859-020-03699-9LncRNA functional similarityDisease semantic similaritylncRNA-disease associations |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wenwen Fan Junliang Shang Feng Li Yan Sun Shasha Yuan Jin-Xing Liu |
spellingShingle |
Wenwen Fan Junliang Shang Feng Li Yan Sun Shasha Yuan Jin-Xing Liu IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method BMC Bioinformatics LncRNA functional similarity Disease semantic similarity lncRNA-disease associations |
author_facet |
Wenwen Fan Junliang Shang Feng Li Yan Sun Shasha Yuan Jin-Xing Liu |
author_sort |
Wenwen Fan |
title |
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_short |
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_full |
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_fullStr |
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_full_unstemmed |
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_sort |
idssim: an lncrna functional similarity calculation model based on an improved disease semantic similarity method |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2020-07-01 |
description |
Abstract Background It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Nevertheless, among them, little effort has been attempted to measure lncRNA functional similarity, which is an essential part of association prediction models. Results In this study, we presented an lncRNA functional similarity calculation model, IDSSIM for short, based on an improved disease semantic similarity method, highlight of which is the introduction of information content contribution factor into the semantic value calculation to take into account both the hierarchical structures of disease directed acyclic graphs and the disease specificities. IDSSIM and three state-of-the-art models, i.e., LNCSIM1, LNCSIM2, and ILNCSIM, were evaluated by applying their disease semantic similarity matrices and the lncRNA functional similarity matrices, as well as corresponding matrices of human lncRNA-disease associations coming from either lncRNADisease database or MNDR database, into an association prediction method WKNKN for lncRNA-disease association prediction. In addition, case studies of breast cancer and adenocarcinoma were also performed to validate the effectiveness of IDSSIM. Conclusions Results demonstrated that in terms of ROC curves and AUC values, IDSSIM is superior to compared models, and can improve accuracy of disease semantic similarity effectively, leading to increase the association prediction ability of the IDSSIM-WKNKN model; in terms of case studies, most of potential disease-associated lncRNAs predicted by IDSSIM can be confirmed by databases and literatures, implying that IDSSIM can serve as a promising tool for predicting lncRNA functions, identifying potential lncRNA-disease associations, and pre-screening candidate lncRNAs to perform biological experiments. The IDSSIM code, all experimental data and prediction results are available online at https://github.com/CDMB-lab/IDSSIM . |
topic |
LncRNA functional similarity Disease semantic similarity lncRNA-disease associations |
url |
http://link.springer.com/article/10.1186/s12859-020-03699-9 |
work_keys_str_mv |
AT wenwenfan idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT junliangshang idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT fengli idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT yansun idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT shashayuan idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT jinxingliu idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod |
_version_ |
1724597446474989568 |