Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence
G-protein-coupled receptors (GPCRs) are the largest and most diverse group of cell surface receptors that respond to various extracellular signals. The allosteric modulation of GPCRs has emerged in recent years as a promising approach for developing target-selective therapies. Moreover, the discover...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-06-01
|
Series: | Biomolecules |
Subjects: | |
Online Access: | https://www.mdpi.com/2218-273X/11/6/870 |
id |
doaj-dff3e293143048f184b7fff29493de08 |
---|---|
record_format |
Article |
spelling |
doaj-dff3e293143048f184b7fff29493de082021-06-30T23:55:59ZengMDPI AGBiomolecules2218-273X2021-06-011187087010.3390/biom11060870Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning IntelligenceTianling Hou0Yuemin Bian1Terence McGuire2Xiang-Qun Xie3Department of Pharmaceutical Sciences, Computational Chemical Genomics Screen (CCGS) Center and Pharmacometrics System Pharmacology Program, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USADepartment of Pharmaceutical Sciences, Computational Chemical Genomics Screen (CCGS) Center and Pharmacometrics System Pharmacology Program, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USADepartment of Pharmaceutical Sciences, Computational Chemical Genomics Screen (CCGS) Center and Pharmacometrics System Pharmacology Program, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USADepartment of Pharmaceutical Sciences, Computational Chemical Genomics Screen (CCGS) Center and Pharmacometrics System Pharmacology Program, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USAG-protein-coupled receptors (GPCRs) are the largest and most diverse group of cell surface receptors that respond to various extracellular signals. The allosteric modulation of GPCRs has emerged in recent years as a promising approach for developing target-selective therapies. Moreover, the discovery of new GPCR allosteric modulators can greatly benefit the further understanding of GPCR cell signaling mechanisms. It is critical but also challenging to make an accurate distinction of modulators for different GPCR groups in an efficient and effective manner. In this study, we focus on an 11-class classification task with 10 GPCR subtype classes and a random compounds class. We used a dataset containing 34,434 compounds with allosteric modulators collected from classical GPCR families A, B, and C, as well as random drug-like compounds. Six types of machine learning models, including support vector machine, naïve Bayes, decision tree, random forest, logistic regression, and multilayer perceptron, were trained using different combinations of features including molecular descriptors, Atom-pair fingerprints, MACCS fingerprints, and ECFP6 fingerprints. The performances of trained machine learning models with different feature combinations were closely investigated and discussed. To the best of our knowledge, this is the first work on the multi-class classification of GPCR allosteric modulators. We believe that the classification models developed in this study can be used as simple and accurate tools for the discovery and development of GPCR allosteric modulators.https://www.mdpi.com/2218-273X/11/6/870GPCRsallosteric regulationmachine learningfinger-printsdrug design |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Tianling Hou Yuemin Bian Terence McGuire Xiang-Qun Xie |
spellingShingle |
Tianling Hou Yuemin Bian Terence McGuire Xiang-Qun Xie Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence Biomolecules GPCRs allosteric regulation machine learning finger-prints drug design |
author_facet |
Tianling Hou Yuemin Bian Terence McGuire Xiang-Qun Xie |
author_sort |
Tianling Hou |
title |
Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence |
title_short |
Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence |
title_full |
Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence |
title_fullStr |
Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence |
title_full_unstemmed |
Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence |
title_sort |
integrated multi-class classification and prediction of gpcr allosteric modulators by machine learning intelligence |
publisher |
MDPI AG |
series |
Biomolecules |
issn |
2218-273X |
publishDate |
2021-06-01 |
description |
G-protein-coupled receptors (GPCRs) are the largest and most diverse group of cell surface receptors that respond to various extracellular signals. The allosteric modulation of GPCRs has emerged in recent years as a promising approach for developing target-selective therapies. Moreover, the discovery of new GPCR allosteric modulators can greatly benefit the further understanding of GPCR cell signaling mechanisms. It is critical but also challenging to make an accurate distinction of modulators for different GPCR groups in an efficient and effective manner. In this study, we focus on an 11-class classification task with 10 GPCR subtype classes and a random compounds class. We used a dataset containing 34,434 compounds with allosteric modulators collected from classical GPCR families A, B, and C, as well as random drug-like compounds. Six types of machine learning models, including support vector machine, naïve Bayes, decision tree, random forest, logistic regression, and multilayer perceptron, were trained using different combinations of features including molecular descriptors, Atom-pair fingerprints, MACCS fingerprints, and ECFP6 fingerprints. The performances of trained machine learning models with different feature combinations were closely investigated and discussed. To the best of our knowledge, this is the first work on the multi-class classification of GPCR allosteric modulators. We believe that the classification models developed in this study can be used as simple and accurate tools for the discovery and development of GPCR allosteric modulators. |
topic |
GPCRs allosteric regulation machine learning finger-prints drug design |
url |
https://www.mdpi.com/2218-273X/11/6/870 |
work_keys_str_mv |
AT tianlinghou integratedmulticlassclassificationandpredictionofgpcrallostericmodulatorsbymachinelearningintelligence AT yueminbian integratedmulticlassclassificationandpredictionofgpcrallostericmodulatorsbymachinelearningintelligence AT terencemcguire integratedmulticlassclassificationandpredictionofgpcrallostericmodulatorsbymachinelearningintelligence AT xiangqunxie integratedmulticlassclassificationandpredictionofgpcrallostericmodulatorsbymachinelearningintelligence |
_version_ |
1721350151160725504 |