A new chemoinformatics approach with improved strategies for effective predictions of potential drugs
Abstract Background Fast and accurate identification of potential drug candidates against therapeutic targets (i.e., drug–target interactions, DTIs) is a fundamental step in the early drug discovery process. However, experimental determination of DTIs is time-consuming and costly, especially for tes...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-10-01
|
Series: | Journal of Cheminformatics |
Online Access: | http://link.springer.com/article/10.1186/s13321-018-0303-x |
id |
doaj-200f396a903c4c4eb1c3a60335b0ec9b |
---|---|
record_format |
Article |
spelling |
doaj-200f396a903c4c4eb1c3a60335b0ec9b2020-11-25T02:19:02ZengBMCJournal of Cheminformatics1758-29462018-10-011011910.1186/s13321-018-0303-xA new chemoinformatics approach with improved strategies for effective predictions of potential drugsMing Hao0Stephen H. Bryant1Yanli Wang2National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthNational Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthNational Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthAbstract Background Fast and accurate identification of potential drug candidates against therapeutic targets (i.e., drug–target interactions, DTIs) is a fundamental step in the early drug discovery process. However, experimental determination of DTIs is time-consuming and costly, especially for testing the associations between the entire chemical and genomic spaces. Therefore, computationally efficient algorithms with accurate predictions are required to achieve such a challenging task. In this work, we design a new chemoinformatics approach derived from neighbor-based collaborative filtering (NBCF) to infer potential drug candidates for targets of interest. One of the fundamental steps of NBCF in the application of DTI predictions is to accurately measure the similarity between drugs solely based on the DTI profiles of known knowledge. However, commonly used similarity calculation methods such as COSINE may be noise-prone due to the extremely sparse property of the DTI bipartite network, which decreases the model performance of NBCF. We herein propose three strategies to remedy such a dilemma, which include: (1) adopting a positive pointwise mutual information (PPMI)-based similarity metric, which is noise-immune to some extent; (2) performing low-rank approximation of the original prediction scores; (3) incorporating auxiliary (complementary) information to produce the final predictions. Results We test the proposed methods in three benchmark datasets and the results indicate that our strategies are helpful to improve the NBCF performance for DTI predictions. Comparing to the prior algorithm, our methods exhibit better results assessed by a recall-based evaluation metric. Conclusions A new chemoinformatics approach with improved strategies was successfully developed to predict potential DTIs. Among them, the model based on the sparsity resistant PPMI similarity metric exhibits the best performance, which may be helpful to researchers for identifying potential drugs against therapeutic targets of interest, and can also be applied to related research such as identifying candidate disease genes.http://link.springer.com/article/10.1186/s13321-018-0303-x |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ming Hao Stephen H. Bryant Yanli Wang |
spellingShingle |
Ming Hao Stephen H. Bryant Yanli Wang A new chemoinformatics approach with improved strategies for effective predictions of potential drugs Journal of Cheminformatics |
author_facet |
Ming Hao Stephen H. Bryant Yanli Wang |
author_sort |
Ming Hao |
title |
A new chemoinformatics approach with improved strategies for effective predictions of potential drugs |
title_short |
A new chemoinformatics approach with improved strategies for effective predictions of potential drugs |
title_full |
A new chemoinformatics approach with improved strategies for effective predictions of potential drugs |
title_fullStr |
A new chemoinformatics approach with improved strategies for effective predictions of potential drugs |
title_full_unstemmed |
A new chemoinformatics approach with improved strategies for effective predictions of potential drugs |
title_sort |
new chemoinformatics approach with improved strategies for effective predictions of potential drugs |
publisher |
BMC |
series |
Journal of Cheminformatics |
issn |
1758-2946 |
publishDate |
2018-10-01 |
description |
Abstract Background Fast and accurate identification of potential drug candidates against therapeutic targets (i.e., drug–target interactions, DTIs) is a fundamental step in the early drug discovery process. However, experimental determination of DTIs is time-consuming and costly, especially for testing the associations between the entire chemical and genomic spaces. Therefore, computationally efficient algorithms with accurate predictions are required to achieve such a challenging task. In this work, we design a new chemoinformatics approach derived from neighbor-based collaborative filtering (NBCF) to infer potential drug candidates for targets of interest. One of the fundamental steps of NBCF in the application of DTI predictions is to accurately measure the similarity between drugs solely based on the DTI profiles of known knowledge. However, commonly used similarity calculation methods such as COSINE may be noise-prone due to the extremely sparse property of the DTI bipartite network, which decreases the model performance of NBCF. We herein propose three strategies to remedy such a dilemma, which include: (1) adopting a positive pointwise mutual information (PPMI)-based similarity metric, which is noise-immune to some extent; (2) performing low-rank approximation of the original prediction scores; (3) incorporating auxiliary (complementary) information to produce the final predictions. Results We test the proposed methods in three benchmark datasets and the results indicate that our strategies are helpful to improve the NBCF performance for DTI predictions. Comparing to the prior algorithm, our methods exhibit better results assessed by a recall-based evaluation metric. Conclusions A new chemoinformatics approach with improved strategies was successfully developed to predict potential DTIs. Among them, the model based on the sparsity resistant PPMI similarity metric exhibits the best performance, which may be helpful to researchers for identifying potential drugs against therapeutic targets of interest, and can also be applied to related research such as identifying candidate disease genes. |
url |
http://link.springer.com/article/10.1186/s13321-018-0303-x |
work_keys_str_mv |
AT minghao anewchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs AT stephenhbryant anewchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs AT yanliwang anewchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs AT minghao newchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs AT stephenhbryant newchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs AT yanliwang newchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs |
_version_ |
1724879109100666880 |