SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
Abstract Background Blood cancers (BCs) are responsible for over 720 K yearly deaths worldwide. Their prevalence and mortality-rate uphold the relevance of research related to BCs. Despite the availability of different resources establishing Disease-Disease Associations (DDAs), the knowledge is scat...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-10-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-021-04397-w |
id |
doaj-f4bd4457bc00426bb737384175949fb5 |
---|---|
record_format |
Article |
spelling |
doaj-f4bd4457bc00426bb737384175949fb52021-10-10T11:14:27ZengBMCBMC Bioinformatics1471-21052021-10-0122111210.1186/s12859-021-04397-wSicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associationsNícia Rosário-Ferreira0Victor Guimarães1Vítor S. Costa2Irina S. Moreira3CQC - Coimbra Chemistry Center, Chemistry Department, Faculty of Science and Technology, University of CoimbraDepartment of Sciences, University of PortoDepartment of Sciences, University of PortoDepartment of Life Sciences, University of CoimbraAbstract Background Blood cancers (BCs) are responsible for over 720 K yearly deaths worldwide. Their prevalence and mortality-rate uphold the relevance of research related to BCs. Despite the availability of different resources establishing Disease-Disease Associations (DDAs), the knowledge is scattered and not accessible in a straightforward way to the scientific community. Here, we propose SicknessMiner, a biomedical Text-Mining (TM) approach towards the centralization of DDAs. Our methodology encompasses Named Entity Recognition (NER) and Named Entity Normalization (NEN) steps, and the DDAs retrieved were compared to the DisGeNET resource for qualitative and quantitative comparison. Results We obtained the DDAs via co-mention using our SicknessMiner or gene- or variant-disease similarity on DisGeNET. SicknessMiner was able to retrieve around 92% of the DisGeNET results and nearly 15% of the SicknessMiner results were specific to our pipeline. Conclusions SicknessMiner is a valuable tool to extract disease-disease relationship from RAW input corpus.https://doi.org/10.1186/s12859-021-04397-wDisease-disease associationsNatural language processingBiomedical text-miningDeep learningBlood cancers |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Nícia Rosário-Ferreira Victor Guimarães Vítor S. Costa Irina S. Moreira |
spellingShingle |
Nícia Rosário-Ferreira Victor Guimarães Vítor S. Costa Irina S. Moreira SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations BMC Bioinformatics Disease-disease associations Natural language processing Biomedical text-mining Deep learning Blood cancers |
author_facet |
Nícia Rosário-Ferreira Victor Guimarães Vítor S. Costa Irina S. Moreira |
author_sort |
Nícia Rosário-Ferreira |
title |
SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations |
title_short |
SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations |
title_full |
SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations |
title_fullStr |
SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations |
title_full_unstemmed |
SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations |
title_sort |
sicknessminer: a deep-learning-driven text-mining tool to abridge disease-disease associations |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2021-10-01 |
description |
Abstract Background Blood cancers (BCs) are responsible for over 720 K yearly deaths worldwide. Their prevalence and mortality-rate uphold the relevance of research related to BCs. Despite the availability of different resources establishing Disease-Disease Associations (DDAs), the knowledge is scattered and not accessible in a straightforward way to the scientific community. Here, we propose SicknessMiner, a biomedical Text-Mining (TM) approach towards the centralization of DDAs. Our methodology encompasses Named Entity Recognition (NER) and Named Entity Normalization (NEN) steps, and the DDAs retrieved were compared to the DisGeNET resource for qualitative and quantitative comparison. Results We obtained the DDAs via co-mention using our SicknessMiner or gene- or variant-disease similarity on DisGeNET. SicknessMiner was able to retrieve around 92% of the DisGeNET results and nearly 15% of the SicknessMiner results were specific to our pipeline. Conclusions SicknessMiner is a valuable tool to extract disease-disease relationship from RAW input corpus. |
topic |
Disease-disease associations Natural language processing Biomedical text-mining Deep learning Blood cancers |
url |
https://doi.org/10.1186/s12859-021-04397-w |
work_keys_str_mv |
AT niciarosarioferreira sicknessmineradeeplearningdriventextminingtooltoabridgediseasediseaseassociations AT victorguimaraes sicknessmineradeeplearningdriventextminingtooltoabridgediseasediseaseassociations AT vitorscosta sicknessmineradeeplearningdriventextminingtooltoabridgediseasediseaseassociations AT irinasmoreira sicknessmineradeeplearningdriventextminingtooltoabridgediseasediseaseassociations |
_version_ |
1716829950931107840 |