A new chemoinformatics approach with improved strategies for effective predictions of potential drugs

Abstract Background Fast and accurate identification of potential drug candidates against therapeutic targets (i.e., drug–target interactions, DTIs) is a fundamental step in the early drug discovery process. However, experimental determination of DTIs is time-consuming and costly, especially for tes...

Bibliographic Details
Main Authors:	Ming Hao, Stephen H. Bryant, Yanli Wang
Format:	Article
Language:	English
Published:	BMC 2018-10-01
Series:	Journal of Cheminformatics
Online Access:	http://link.springer.com/article/10.1186/s13321-018-0303-x

id	doaj-200f396a903c4c4eb1c3a60335b0ec9b
record_format	Article
spelling	doaj-200f396a903c4c4eb1c3a60335b0ec9b | 2020-11-25T02:19:02Z | eng | BMC | Journal of Cheminformatics | 1758-2946 | 2018-10-01 | 10119 | 10.1186/s13321-018-0303-x | A new chemoinformatics approach with improved strategies for effective predictions of potential drugs | Ming Hao, Stephen H. Bryant, Yanli Wang (all: National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health) | http://link.springer.com/article/10.1186/s13321-018-0303-x
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Ming Hao Stephen H. Bryant Yanli Wang
title	A new chemoinformatics approach with improved strategies for effective predictions of potential drugs
publisher	BMC
series	Journal of Cheminformatics
issn	1758-2946
publishDate	2018-10-01
description	Background: Fast and accurate identification of potential drug candidates against therapeutic targets (i.e., drug–target interactions, DTIs) is a fundamental step in early drug discovery. However, experimental determination of DTIs is time-consuming and costly, especially when testing associations across the entire chemical and genomic spaces. Computationally efficient algorithms with accurate predictions are therefore required. In this work, we design a new chemoinformatics approach derived from neighbor-based collaborative filtering (NBCF) to infer potential drug candidates for targets of interest. A fundamental step in applying NBCF to DTI prediction is to measure the similarity between drugs accurately, based solely on their known DTI profiles. However, commonly used similarity measures such as cosine similarity (COSINE) may be noise-prone because the DTI bipartite network is extremely sparse, which degrades NBCF performance. We propose three strategies to remedy this problem: (1) adopting a positive pointwise mutual information (PPMI)-based similarity metric, which is noise-immune to some extent; (2) performing low-rank approximation of the original prediction scores; and (3) incorporating auxiliary (complementary) information to produce the final predictions. Results: We test the proposed methods on three benchmark datasets, and the results indicate that our strategies improve NBCF performance for DTI prediction. Compared with the prior algorithm, our methods achieve better results as assessed by a recall-based evaluation metric. Conclusions: A new chemoinformatics approach with improved strategies was successfully developed to predict potential DTIs. Among the resulting models, the one based on the sparsity-resistant PPMI similarity metric performs best; it may help researchers identify potential drugs against therapeutic targets of interest and can also be applied to related problems such as identifying candidate disease genes.
url	http://link.springer.com/article/10.1186/s13321-018-0303-x
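
The abstract contrasts cosine similarity with a PPMI-based similarity inside neighbor-based collaborative filtering. The following is a minimal Python sketch of that idea only, not the authors' implementation: the abstract does not give the exact PPMI formulation, so the version here (co-occurrence of two drugs counted as the number of targets they share) is an assumption, as are all function names and the scoring rule. Strategies (2) and (3) are not covered.

```python
import math


def cosine_similarity(profiles):
    """Drug-drug cosine similarity from binary DTI profiles (rows are drugs)."""
    n = len(profiles)
    norms = [math.sqrt(sum(v * v for v in row)) for row in profiles]
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if norms[i] and norms[j]:
                dot = sum(a * b for a, b in zip(profiles[i], profiles[j]))
                sim[i][j] = dot / (norms[i] * norms[j])
    return sim


def ppmi_similarity(profiles):
    """Positive PMI between drugs.

    Assumption: the co-occurrence count of two distinct drugs is the number
    of targets they share; the diagonal is left at zero.
    """
    n = len(profiles)
    co = [
        [
            sum(a * b for a, b in zip(profiles[i], profiles[j])) if i != j else 0
            for j in range(n)
        ]
        for i in range(n)
    ]
    row_sums = [sum(r) for r in co]
    total = sum(row_sums)
    sim = [[0.0] * n for _ in range(n)]
    if total == 0:
        return sim
    for i in range(n):
        for j in range(n):
            if co[i][j] > 0:
                pmi = math.log(co[i][j] * total / (row_sums[i] * row_sums[j]))
                sim[i][j] = max(pmi, 0.0)  # keep only positive associations
    return sim


def nbcf_scores(profiles, sim):
    """Score each drug-target pair by similarity-weighted votes of other drugs."""
    n, m = len(profiles), len(profiles[0])
    return [
        [
            sum(sim[d][e] * profiles[e][t] for e in range(n) if e != d)
            for t in range(m)
        ]
        for d in range(n)
    ]


def rank_candidates(profiles, scores, drug):
    """Targets not yet linked to `drug`, highest score first."""
    unknown = [t for t, v in enumerate(profiles[drug]) if v == 0]
    return sorted(unknown, key=lambda t: -scores[drug][t])


if __name__ == "__main__":
    # Hypothetical toy network: 4 drugs (rows) x 4 targets (columns).
    toy = [
        [1, 1, 0, 0],
        [1, 1, 1, 0],
        [0, 0, 1, 1],
        [0, 0, 0, 1],
    ]
    scores = nbcf_scores(toy, ppmi_similarity(toy))
    print(rank_candidates(toy, scores, 0))
```

Swapping `ppmi_similarity` for `cosine_similarity` in the call to `nbcf_scores` gives the baseline the abstract describes, which makes it easy to compare the two rankings on the same profiles.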

Similar Items