Large-scale bioactivity analysis of the small-molecule assayed proteome.

This study presents an analysis of the small molecule bioactivity profiles across large quantities of diverse protein families represented in PubChem BioAssay. We compared the bioactivity profiles of FDA approved drugs to non-FDA approved compounds, and report several distinct patterns characteristi...

Full description

Bibliographic Details
Main Authors: Tyler William H Backman, Daniel S Evans, Thomas Girke
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0171413
id doaj-5caa865cdf674bb296c05b5300d341e8
record_format Article
spelling doaj-5caa865cdf674bb296c05b5300d341e82021-03-03T20:32:50ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01122e017141310.1371/journal.pone.0171413Large-scale bioactivity analysis of the small-molecule assayed proteome.Tyler William H BackmanDaniel S EvansThomas GirkeThis study presents an analysis of the small molecule bioactivity profiles across large quantities of diverse protein families represented in PubChem BioAssay. We compared the bioactivity profiles of FDA approved drugs to non-FDA approved compounds, and report several distinct patterns characteristic of the approved drugs. We found that a large fraction of the previously reported higher target promiscuity among FDA approved compounds, compared to non-FDA approved bioactives, was frequently due to cross-reactivity within rather than across protein families. We identified 804 potentially novel protein target candidates for FDA approved drugs, as well as 901 potentially novel target candidates with active non-FDA approved compounds, but no FDA approved drugs with activity against these targets. We also identified 486348 potentially novel compounds active against the same targets as FDA approved drugs, as well as 153402 potentially novel compounds active against targets without active FDA approved drugs. By quantifying the agreement among replicated screens, we estimated that more than half of these novel outcomes are reproducible. Using biclustering, we identified many dense clusters of FDA approved drugs with enriched activity against a common set of protein targets. We also report the distribution of compound promiscuity using a Bayesian statistical model, and report the sensitivity and specificity of two common methods for identifying promiscuous compounds. Aggregator assays exhibited greater accuracy in identifying highly promiscuous compounds, while PAINS substructures were able to identify a much larger set of "middle range" promiscuous compounds. Additionally, we report a large number of promiscuous compounds not identified as aggregators or PAINS. In summary, the results of this study represent a rich reference for selecting novel drug and target protein candidates, as well as for eliminating candidate compounds with unselective activities.https://doi.org/10.1371/journal.pone.0171413
collection DOAJ
language English
format Article
sources DOAJ
author Tyler William H Backman
Daniel S Evans
Thomas Girke
spellingShingle Tyler William H Backman
Daniel S Evans
Thomas Girke
Large-scale bioactivity analysis of the small-molecule assayed proteome.
PLoS ONE
author_facet Tyler William H Backman
Daniel S Evans
Thomas Girke
author_sort Tyler William H Backman
title Large-scale bioactivity analysis of the small-molecule assayed proteome.
title_short Large-scale bioactivity analysis of the small-molecule assayed proteome.
title_full Large-scale bioactivity analysis of the small-molecule assayed proteome.
title_fullStr Large-scale bioactivity analysis of the small-molecule assayed proteome.
title_full_unstemmed Large-scale bioactivity analysis of the small-molecule assayed proteome.
title_sort large-scale bioactivity analysis of the small-molecule assayed proteome.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description This study presents an analysis of the small molecule bioactivity profiles across large quantities of diverse protein families represented in PubChem BioAssay. We compared the bioactivity profiles of FDA approved drugs to non-FDA approved compounds, and report several distinct patterns characteristic of the approved drugs. We found that a large fraction of the previously reported higher target promiscuity among FDA approved compounds, compared to non-FDA approved bioactives, was frequently due to cross-reactivity within rather than across protein families. We identified 804 potentially novel protein target candidates for FDA approved drugs, as well as 901 potentially novel target candidates with active non-FDA approved compounds, but no FDA approved drugs with activity against these targets. We also identified 486348 potentially novel compounds active against the same targets as FDA approved drugs, as well as 153402 potentially novel compounds active against targets without active FDA approved drugs. By quantifying the agreement among replicated screens, we estimated that more than half of these novel outcomes are reproducible. Using biclustering, we identified many dense clusters of FDA approved drugs with enriched activity against a common set of protein targets. We also report the distribution of compound promiscuity using a Bayesian statistical model, and report the sensitivity and specificity of two common methods for identifying promiscuous compounds. Aggregator assays exhibited greater accuracy in identifying highly promiscuous compounds, while PAINS substructures were able to identify a much larger set of "middle range" promiscuous compounds. Additionally, we report a large number of promiscuous compounds not identified as aggregators or PAINS. In summary, the results of this study represent a rich reference for selecting novel drug and target protein candidates, as well as for eliminating candidate compounds with unselective activities.
url https://doi.org/10.1371/journal.pone.0171413
work_keys_str_mv AT tylerwilliamhbackman largescalebioactivityanalysisofthesmallmoleculeassayedproteome
AT danielsevans largescalebioactivityanalysisofthesmallmoleculeassayedproteome
AT thomasgirke largescalebioactivityanalysisofthesmallmoleculeassayedproteome
_version_ 1714821883124252672