SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations

Abstract Background Blood cancers (BCs) are responsible for over 720 K yearly deaths worldwide. Their prevalence and mortality-rate uphold the relevance of research related to BCs. Despite the availability of different resources establishing Disease-Disease Associations (DDAs), the knowledge is scat...

Full description

Bibliographic Details
Main Authors: Nícia Rosário-Ferreira, Victor Guimarães, Vítor S. Costa, Irina S. Moreira
Format: Article
Language:English
Published: BMC 2021-10-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-021-04397-w
id doaj-f4bd4457bc00426bb737384175949fb5
record_format Article
spelling doaj-f4bd4457bc00426bb737384175949fb52021-10-10T11:14:27ZengBMCBMC Bioinformatics1471-21052021-10-0122111210.1186/s12859-021-04397-wSicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associationsNícia Rosário-Ferreira0Victor Guimarães1Vítor S. Costa2Irina S. Moreira3CQC - Coimbra Chemistry Center, Chemistry Department, Faculty of Science and Technology, University of CoimbraDepartment of Sciences, University of PortoDepartment of Sciences, University of PortoDepartment of Life Sciences, University of CoimbraAbstract Background Blood cancers (BCs) are responsible for over 720 K yearly deaths worldwide. Their prevalence and mortality-rate uphold the relevance of research related to BCs. Despite the availability of different resources establishing Disease-Disease Associations (DDAs), the knowledge is scattered and not accessible in a straightforward way to the scientific community. Here, we propose SicknessMiner, a biomedical Text-Mining (TM) approach towards the centralization of DDAs. Our methodology encompasses Named Entity Recognition (NER) and Named Entity Normalization (NEN) steps, and the DDAs retrieved were compared to the DisGeNET resource for qualitative and quantitative comparison. Results We obtained the DDAs via co-mention using our SicknessMiner or gene- or variant-disease similarity on DisGeNET. SicknessMiner was able to retrieve around 92% of the DisGeNET results and nearly 15% of the SicknessMiner results were specific to our pipeline. Conclusions SicknessMiner is a valuable tool to extract disease-disease relationship from RAW input corpus.https://doi.org/10.1186/s12859-021-04397-wDisease-disease associationsNatural language processingBiomedical text-miningDeep learningBlood cancers
collection DOAJ
language English
format Article
sources DOAJ
author Nícia Rosário-Ferreira
Victor Guimarães
Vítor S. Costa
Irina S. Moreira
spellingShingle Nícia Rosário-Ferreira
Victor Guimarães
Vítor S. Costa
Irina S. Moreira
SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
BMC Bioinformatics
Disease-disease associations
Natural language processing
Biomedical text-mining
Deep learning
Blood cancers
author_facet Nícia Rosário-Ferreira
Victor Guimarães
Vítor S. Costa
Irina S. Moreira
author_sort Nícia Rosário-Ferreira
title SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
title_short SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
title_full SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
title_fullStr SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
title_full_unstemmed SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations
title_sort sicknessminer: a deep-learning-driven text-mining tool to abridge disease-disease associations
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2021-10-01
description Abstract Background Blood cancers (BCs) are responsible for over 720 K yearly deaths worldwide. Their prevalence and mortality-rate uphold the relevance of research related to BCs. Despite the availability of different resources establishing Disease-Disease Associations (DDAs), the knowledge is scattered and not accessible in a straightforward way to the scientific community. Here, we propose SicknessMiner, a biomedical Text-Mining (TM) approach towards the centralization of DDAs. Our methodology encompasses Named Entity Recognition (NER) and Named Entity Normalization (NEN) steps, and the DDAs retrieved were compared to the DisGeNET resource for qualitative and quantitative comparison. Results We obtained the DDAs via co-mention using our SicknessMiner or gene- or variant-disease similarity on DisGeNET. SicknessMiner was able to retrieve around 92% of the DisGeNET results and nearly 15% of the SicknessMiner results were specific to our pipeline. Conclusions SicknessMiner is a valuable tool to extract disease-disease relationship from RAW input corpus.
topic Disease-disease associations
Natural language processing
Biomedical text-mining
Deep learning
Blood cancers
url https://doi.org/10.1186/s12859-021-04397-w
work_keys_str_mv AT niciarosarioferreira sicknessmineradeeplearningdriventextminingtooltoabridgediseasediseaseassociations
AT victorguimaraes sicknessmineradeeplearningdriventextminingtooltoabridgediseasediseaseassociations
AT vitorscosta sicknessmineradeeplearningdriventextminingtooltoabridgediseasediseaseassociations
AT irinasmoreira sicknessmineradeeplearningdriventextminingtooltoabridgediseasediseaseassociations
_version_ 1716829950931107840