Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence

G-protein-coupled receptors (GPCRs) are the largest and most diverse group of cell surface receptors that respond to various extracellular signals. The allosteric modulation of GPCRs has emerged in recent years as a promising approach for developing target-selective therapies. Moreover, the discover...

Full description

Bibliographic Details
Main Authors: Tianling Hou, Yuemin Bian, Terence McGuire, Xiang-Qun Xie
Format: Article
Language:English
Published: MDPI AG 2021-06-01
Series:Biomolecules
Subjects:
Online Access:https://www.mdpi.com/2218-273X/11/6/870
id doaj-dff3e293143048f184b7fff29493de08
record_format Article
spelling doaj-dff3e293143048f184b7fff29493de082021-06-30T23:55:59ZengMDPI AGBiomolecules2218-273X2021-06-011187087010.3390/biom11060870Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning IntelligenceTianling Hou0Yuemin Bian1Terence McGuire2Xiang-Qun Xie3Department of Pharmaceutical Sciences, Computational Chemical Genomics Screen (CCGS) Center and Pharmacometrics System Pharmacology Program, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USADepartment of Pharmaceutical Sciences, Computational Chemical Genomics Screen (CCGS) Center and Pharmacometrics System Pharmacology Program, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USADepartment of Pharmaceutical Sciences, Computational Chemical Genomics Screen (CCGS) Center and Pharmacometrics System Pharmacology Program, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USADepartment of Pharmaceutical Sciences, Computational Chemical Genomics Screen (CCGS) Center and Pharmacometrics System Pharmacology Program, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USAG-protein-coupled receptors (GPCRs) are the largest and most diverse group of cell surface receptors that respond to various extracellular signals. The allosteric modulation of GPCRs has emerged in recent years as a promising approach for developing target-selective therapies. Moreover, the discovery of new GPCR allosteric modulators can greatly benefit the further understanding of GPCR cell signaling mechanisms. It is critical but also challenging to make an accurate distinction of modulators for different GPCR groups in an efficient and effective manner. In this study, we focus on an 11-class classification task with 10 GPCR subtype classes and a random compounds class. We used a dataset containing 34,434 compounds with allosteric modulators collected from classical GPCR families A, B, and C, as well as random drug-like compounds. Six types of machine learning models, including support vector machine, naïve Bayes, decision tree, random forest, logistic regression, and multilayer perceptron, were trained using different combinations of features including molecular descriptors, Atom-pair fingerprints, MACCS fingerprints, and ECFP6 fingerprints. The performances of trained machine learning models with different feature combinations were closely investigated and discussed. To the best of our knowledge, this is the first work on the multi-class classification of GPCR allosteric modulators. We believe that the classification models developed in this study can be used as simple and accurate tools for the discovery and development of GPCR allosteric modulators.https://www.mdpi.com/2218-273X/11/6/870GPCRsallosteric regulationmachine learningfinger-printsdrug design
collection DOAJ
language English
format Article
sources DOAJ
author Tianling Hou
Yuemin Bian
Terence McGuire
Xiang-Qun Xie
spellingShingle Tianling Hou
Yuemin Bian
Terence McGuire
Xiang-Qun Xie
Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence
Biomolecules
GPCRs
allosteric regulation
machine learning
finger-prints
drug design
author_facet Tianling Hou
Yuemin Bian
Terence McGuire
Xiang-Qun Xie
author_sort Tianling Hou
title Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence
title_short Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence
title_full Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence
title_fullStr Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence
title_full_unstemmed Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence
title_sort integrated multi-class classification and prediction of gpcr allosteric modulators by machine learning intelligence
publisher MDPI AG
series Biomolecules
issn 2218-273X
publishDate 2021-06-01
description G-protein-coupled receptors (GPCRs) are the largest and most diverse group of cell surface receptors that respond to various extracellular signals. The allosteric modulation of GPCRs has emerged in recent years as a promising approach for developing target-selective therapies. Moreover, the discovery of new GPCR allosteric modulators can greatly benefit the further understanding of GPCR cell signaling mechanisms. It is critical but also challenging to make an accurate distinction of modulators for different GPCR groups in an efficient and effective manner. In this study, we focus on an 11-class classification task with 10 GPCR subtype classes and a random compounds class. We used a dataset containing 34,434 compounds with allosteric modulators collected from classical GPCR families A, B, and C, as well as random drug-like compounds. Six types of machine learning models, including support vector machine, naïve Bayes, decision tree, random forest, logistic regression, and multilayer perceptron, were trained using different combinations of features including molecular descriptors, Atom-pair fingerprints, MACCS fingerprints, and ECFP6 fingerprints. The performances of trained machine learning models with different feature combinations were closely investigated and discussed. To the best of our knowledge, this is the first work on the multi-class classification of GPCR allosteric modulators. We believe that the classification models developed in this study can be used as simple and accurate tools for the discovery and development of GPCR allosteric modulators.
topic GPCRs
allosteric regulation
machine learning
finger-prints
drug design
url https://www.mdpi.com/2218-273X/11/6/870
work_keys_str_mv AT tianlinghou integratedmulticlassclassificationandpredictionofgpcrallostericmodulatorsbymachinelearningintelligence
AT yueminbian integratedmulticlassclassificationandpredictionofgpcrallostericmodulatorsbymachinelearningintelligence
AT terencemcguire integratedmulticlassclassificationandpredictionofgpcrallostericmodulatorsbymachinelearningintelligence
AT xiangqunxie integratedmulticlassclassificationandpredictionofgpcrallostericmodulatorsbymachinelearningintelligence
_version_ 1721350151160725504