Scoring functions and enrichment: a case study on Hsp90

<p>Abstract</p> <p>Background</p> <p>The need for fast and accurate scoring functions has been driven by the increased use of <it>in silico </it>virtual screening twinned with high-throughput screening as a method to rapidly identify potential candidates in...

Full description

Bibliographic Details
Main Authors: Mitchell John BO, Konstantinou-Kirtay Chrysi, Lumley James A
Format: Article
Language:English
Published: BMC 2007-01-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/8/27
id doaj-e2024fb762c44eba8cc9cb290f8021ac
record_format Article
spelling doaj-e2024fb762c44eba8cc9cb290f8021ac2020-11-24T21:16:06ZengBMCBMC Bioinformatics1471-21052007-01-01812710.1186/1471-2105-8-27Scoring functions and enrichment: a case study on Hsp90Mitchell John BOKonstantinou-Kirtay ChrysiLumley James A<p>Abstract</p> <p>Background</p> <p>The need for fast and accurate scoring functions has been driven by the increased use of <it>in silico </it>virtual screening twinned with high-throughput screening as a method to rapidly identify potential candidates in the early stages of drug development. We examine the ability of some the most common scoring functions (GOLD, ChemScore, DOCK, PMF, BLEEP and Consensus) to discriminate correctly and efficiently between active and non-active compounds among a library of ~3,600 diverse decoy compounds in a virtual screening experiment against heat shock protein 90 (Hsp90).</p> <p>Results</p> <p>Firstly, we investigated two ranking methodologies, GOLD<sub>rank </sub>and BestScore<sub>rank</sub>. GOLD<sub><it>rank </it></sub>is based on ranks generated using GOLD. The various scoring functions, GOLD, ChemScore, DOCK, PMF, BLEEP and Consensus, are applied to the pose ranked number one by GOLD for that ligand. BestScore<sub><it>rank </it></sub>uses multiple poses for each ligand and independently chooses the best ranked pose of the ligand according to each different scoring function. Secondly, we considered the effect of introducing the Thr184 hydrogen bond tether to guide the docking process towards a particular solution, and its effect on enrichment. Thirdly, we considered normalisation to account for the known bias of scoring functions to select larger molecules. All the scoring functions gave fairly similar enrichments, with the exception of PMF which was consistently the poorest performer. In most cases, GOLD was marginally the best performing individual function; the Consensus score usually performed similarly to the best single scoring function. Our best results were obtained using the Thr184 tether in combination with the BestScore<sub>rank </sub>protocol and normalisation for molecular weight. For that particular combination, DOCK was the best individual function; DOCK recovered 90% of the actives in the top 10% of the ranked list; Consensus similarly recovered 89% of the actives in its top 10%.</p> <p>Conclusion</p> <p>Overall, we demonstrate the validity of virtual screening as a method for identifying new leads from a pool of ligands with similar physicochemical properties and we believe that the outcome of this study provides useful insight into the setting up of a suitable docking and scoring protocol, resulting in enrichment of '<it>target active</it>' compounds.</p> http://www.biomedcentral.com/1471-2105/8/27
collection DOAJ
language English
format Article
sources DOAJ
author Mitchell John BO
Konstantinou-Kirtay Chrysi
Lumley James A
spellingShingle Mitchell John BO
Konstantinou-Kirtay Chrysi
Lumley James A
Scoring functions and enrichment: a case study on Hsp90
BMC Bioinformatics
author_facet Mitchell John BO
Konstantinou-Kirtay Chrysi
Lumley James A
author_sort Mitchell John BO
title Scoring functions and enrichment: a case study on Hsp90
title_short Scoring functions and enrichment: a case study on Hsp90
title_full Scoring functions and enrichment: a case study on Hsp90
title_fullStr Scoring functions and enrichment: a case study on Hsp90
title_full_unstemmed Scoring functions and enrichment: a case study on Hsp90
title_sort scoring functions and enrichment: a case study on hsp90
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2007-01-01
description <p>Abstract</p> <p>Background</p> <p>The need for fast and accurate scoring functions has been driven by the increased use of <it>in silico </it>virtual screening twinned with high-throughput screening as a method to rapidly identify potential candidates in the early stages of drug development. We examine the ability of some the most common scoring functions (GOLD, ChemScore, DOCK, PMF, BLEEP and Consensus) to discriminate correctly and efficiently between active and non-active compounds among a library of ~3,600 diverse decoy compounds in a virtual screening experiment against heat shock protein 90 (Hsp90).</p> <p>Results</p> <p>Firstly, we investigated two ranking methodologies, GOLD<sub>rank </sub>and BestScore<sub>rank</sub>. GOLD<sub><it>rank </it></sub>is based on ranks generated using GOLD. The various scoring functions, GOLD, ChemScore, DOCK, PMF, BLEEP and Consensus, are applied to the pose ranked number one by GOLD for that ligand. BestScore<sub><it>rank </it></sub>uses multiple poses for each ligand and independently chooses the best ranked pose of the ligand according to each different scoring function. Secondly, we considered the effect of introducing the Thr184 hydrogen bond tether to guide the docking process towards a particular solution, and its effect on enrichment. Thirdly, we considered normalisation to account for the known bias of scoring functions to select larger molecules. All the scoring functions gave fairly similar enrichments, with the exception of PMF which was consistently the poorest performer. In most cases, GOLD was marginally the best performing individual function; the Consensus score usually performed similarly to the best single scoring function. Our best results were obtained using the Thr184 tether in combination with the BestScore<sub>rank </sub>protocol and normalisation for molecular weight. For that particular combination, DOCK was the best individual function; DOCK recovered 90% of the actives in the top 10% of the ranked list; Consensus similarly recovered 89% of the actives in its top 10%.</p> <p>Conclusion</p> <p>Overall, we demonstrate the validity of virtual screening as a method for identifying new leads from a pool of ligands with similar physicochemical properties and we believe that the outcome of this study provides useful insight into the setting up of a suitable docking and scoring protocol, resulting in enrichment of '<it>target active</it>' compounds.</p>
url http://www.biomedcentral.com/1471-2105/8/27
work_keys_str_mv AT mitchelljohnbo scoringfunctionsandenrichmentacasestudyonhsp90
AT konstantinoukirtaychrysi scoringfunctionsandenrichmentacasestudyonhsp90
AT lumleyjamesa scoringfunctionsandenrichmentacasestudyonhsp90
_version_ 1726017071162064896