A new chemoinformatics approach with improved strategies for effective predictions of potential drugs

Abstract Background Fast and accurate identification of potential drug candidates against therapeutic targets (i.e., drug–target interactions, DTIs) is a fundamental step in the early drug discovery process. However, experimental determination of DTIs is time-consuming and costly, especially for tes...

Full description

Bibliographic Details
Main Authors: Ming Hao, Stephen H. Bryant, Yanli Wang
Format: Article
Language:English
Published: BMC 2018-10-01
Series:Journal of Cheminformatics
Online Access:http://link.springer.com/article/10.1186/s13321-018-0303-x
id doaj-200f396a903c4c4eb1c3a60335b0ec9b
record_format Article
spelling doaj-200f396a903c4c4eb1c3a60335b0ec9b2020-11-25T02:19:02ZengBMCJournal of Cheminformatics1758-29462018-10-011011910.1186/s13321-018-0303-xA new chemoinformatics approach with improved strategies for effective predictions of potential drugsMing Hao0Stephen H. Bryant1Yanli Wang2National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthNational Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthNational Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthAbstract Background Fast and accurate identification of potential drug candidates against therapeutic targets (i.e., drug–target interactions, DTIs) is a fundamental step in the early drug discovery process. However, experimental determination of DTIs is time-consuming and costly, especially for testing the associations between the entire chemical and genomic spaces. Therefore, computationally efficient algorithms with accurate predictions are required to achieve such a challenging task. In this work, we design a new chemoinformatics approach derived from neighbor-based collaborative filtering (NBCF) to infer potential drug candidates for targets of interest. One of the fundamental steps of NBCF in the application of DTI predictions is to accurately measure the similarity between drugs solely based on the DTI profiles of known knowledge. However, commonly used similarity calculation methods such as COSINE may be noise-prone due to the extremely sparse property of the DTI bipartite network, which decreases the model performance of NBCF. We herein propose three strategies to remedy such a dilemma, which include: (1) adopting a positive pointwise mutual information (PPMI)-based similarity metric, which is noise-immune to some extent; (2) performing low-rank approximation of the original prediction scores; (3) incorporating auxiliary (complementary) information to produce the final predictions. Results We test the proposed methods in three benchmark datasets and the results indicate that our strategies are helpful to improve the NBCF performance for DTI predictions. Comparing to the prior algorithm, our methods exhibit better results assessed by a recall-based evaluation metric. Conclusions A new chemoinformatics approach with improved strategies was successfully developed to predict potential DTIs. Among them, the model based on the sparsity resistant PPMI similarity metric exhibits the best performance, which may be helpful to researchers for identifying potential drugs against therapeutic targets of interest, and can also be applied to related research such as identifying candidate disease genes.http://link.springer.com/article/10.1186/s13321-018-0303-x
collection DOAJ
language English
format Article
sources DOAJ
author Ming Hao
Stephen H. Bryant
Yanli Wang
spellingShingle Ming Hao
Stephen H. Bryant
Yanli Wang
A new chemoinformatics approach with improved strategies for effective predictions of potential drugs
Journal of Cheminformatics
author_facet Ming Hao
Stephen H. Bryant
Yanli Wang
author_sort Ming Hao
title A new chemoinformatics approach with improved strategies for effective predictions of potential drugs
title_short A new chemoinformatics approach with improved strategies for effective predictions of potential drugs
title_full A new chemoinformatics approach with improved strategies for effective predictions of potential drugs
title_fullStr A new chemoinformatics approach with improved strategies for effective predictions of potential drugs
title_full_unstemmed A new chemoinformatics approach with improved strategies for effective predictions of potential drugs
title_sort new chemoinformatics approach with improved strategies for effective predictions of potential drugs
publisher BMC
series Journal of Cheminformatics
issn 1758-2946
publishDate 2018-10-01
description Abstract Background Fast and accurate identification of potential drug candidates against therapeutic targets (i.e., drug–target interactions, DTIs) is a fundamental step in the early drug discovery process. However, experimental determination of DTIs is time-consuming and costly, especially for testing the associations between the entire chemical and genomic spaces. Therefore, computationally efficient algorithms with accurate predictions are required to achieve such a challenging task. In this work, we design a new chemoinformatics approach derived from neighbor-based collaborative filtering (NBCF) to infer potential drug candidates for targets of interest. One of the fundamental steps of NBCF in the application of DTI predictions is to accurately measure the similarity between drugs solely based on the DTI profiles of known knowledge. However, commonly used similarity calculation methods such as COSINE may be noise-prone due to the extremely sparse property of the DTI bipartite network, which decreases the model performance of NBCF. We herein propose three strategies to remedy such a dilemma, which include: (1) adopting a positive pointwise mutual information (PPMI)-based similarity metric, which is noise-immune to some extent; (2) performing low-rank approximation of the original prediction scores; (3) incorporating auxiliary (complementary) information to produce the final predictions. Results We test the proposed methods in three benchmark datasets and the results indicate that our strategies are helpful to improve the NBCF performance for DTI predictions. Comparing to the prior algorithm, our methods exhibit better results assessed by a recall-based evaluation metric. Conclusions A new chemoinformatics approach with improved strategies was successfully developed to predict potential DTIs. Among them, the model based on the sparsity resistant PPMI similarity metric exhibits the best performance, which may be helpful to researchers for identifying potential drugs against therapeutic targets of interest, and can also be applied to related research such as identifying candidate disease genes.
url http://link.springer.com/article/10.1186/s13321-018-0303-x
work_keys_str_mv AT minghao anewchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs
AT stephenhbryant anewchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs
AT yanliwang anewchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs
AT minghao newchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs
AT stephenhbryant newchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs
AT yanliwang newchemoinformaticsapproachwithimprovedstrategiesforeffectivepredictionsofpotentialdrugs
_version_ 1724879109100666880