Identification of novel small molecule inhibitors for solute carrier SGLT1 using proteochemometric modeling

Abstract Sodium-dependent glucose co-transporter 1 (SGLT1) is a solute carrier responsible for active glucose absorption. SGLT1 is present in both the renal tubules and small intestine. In contrast, the closely related sodium-dependent glucose co-transporter 2 (SGLT2), a protein that is targeted in...

Full description

Bibliographic Details
Main Authors: Lindsey Burggraaff, Paul Oranje, Robin Gouka, Pieter van der Pijl, Marian Geldof, Herman W. T. van Vlijmen, Adriaan P. IJzerman, Gerard J. P. van Westen
Format: Article
Language:English
Published: BMC 2019-02-01
Series:Journal of Cheminformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13321-019-0337-8
id doaj-b9669e4d11ac4a33aff529bb88597da9
record_format Article
spelling doaj-b9669e4d11ac4a33aff529bb88597da92020-11-25T02:04:41ZengBMCJournal of Cheminformatics1758-29462019-02-0111111010.1186/s13321-019-0337-8Identification of novel small molecule inhibitors for solute carrier SGLT1 using proteochemometric modelingLindsey Burggraaff0Paul Oranje1Robin Gouka2Pieter van der Pijl3Marian Geldof4Herman W. T. van Vlijmen5Adriaan P. IJzerman6Gerard J. P. van Westen7Division of Drug Discovery & Safety, Leiden Academic Centre for Drug Research, Leiden UniversityUnilever Research & DevelopmentUnilever Research & DevelopmentUnilever Research & DevelopmentUnilever Research & DevelopmentDivision of Drug Discovery & Safety, Leiden Academic Centre for Drug Research, Leiden UniversityDivision of Drug Discovery & Safety, Leiden Academic Centre for Drug Research, Leiden UniversityDivision of Drug Discovery & Safety, Leiden Academic Centre for Drug Research, Leiden UniversityAbstract Sodium-dependent glucose co-transporter 1 (SGLT1) is a solute carrier responsible for active glucose absorption. SGLT1 is present in both the renal tubules and small intestine. In contrast, the closely related sodium-dependent glucose co-transporter 2 (SGLT2), a protein that is targeted in the treatment of diabetes type II, is only expressed in the renal tubules. Although dual inhibitors for both SGLT1 and SGLT2 have been developed, no drugs on the market are targeted at decreasing dietary glucose uptake by SGLT1 in the gastrointestinal tract. Here we aim at identifying SGLT1 inhibitors in silico by applying a machine learning approach that does not require structural information, which is absent for SGLT1. We applied proteochemometrics by implementation of compound- and protein-based information into random forest models. We obtained a predictive model with a sensitivity of 0.64 ± 0.06, specificity of 0.93 ± 0.01, positive predictive value of 0.47 ± 0.07, negative predictive value of 0.96 ± 0.01, and Matthews correlation coefficient of 0.49 ± 0.05. Subsequent to model training, we applied our model in virtual screening to identify novel SGLT1 inhibitors. Of the 77 tested compounds, 30 were experimentally confirmed for SGLT1-inhibiting activity in vitro, leading to a hit rate of 39% with activities in the low micromolar range. Moreover, the hit compounds included novel molecules, which is reflected by the low similarity of these compounds with the training set (< 0.3). Conclusively, proteochemometric modeling of SGLT1 is a viable strategy for identifying active small molecules. Therefore, this method may also be applied in detection of novel small molecules for other transporter proteins.http://link.springer.com/article/10.1186/s13321-019-0337-8Sodium-dependent glucose co-transporterSodium-glucose linked transporterSGLT1ProteochemometricsMolecular modelingMachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Lindsey Burggraaff
Paul Oranje
Robin Gouka
Pieter van der Pijl
Marian Geldof
Herman W. T. van Vlijmen
Adriaan P. IJzerman
Gerard J. P. van Westen
spellingShingle Lindsey Burggraaff
Paul Oranje
Robin Gouka
Pieter van der Pijl
Marian Geldof
Herman W. T. van Vlijmen
Adriaan P. IJzerman
Gerard J. P. van Westen
Identification of novel small molecule inhibitors for solute carrier SGLT1 using proteochemometric modeling
Journal of Cheminformatics
Sodium-dependent glucose co-transporter
Sodium-glucose linked transporter
SGLT1
Proteochemometrics
Molecular modeling
Machine learning
author_facet Lindsey Burggraaff
Paul Oranje
Robin Gouka
Pieter van der Pijl
Marian Geldof
Herman W. T. van Vlijmen
Adriaan P. IJzerman
Gerard J. P. van Westen
author_sort Lindsey Burggraaff
title Identification of novel small molecule inhibitors for solute carrier SGLT1 using proteochemometric modeling
title_short Identification of novel small molecule inhibitors for solute carrier SGLT1 using proteochemometric modeling
title_full Identification of novel small molecule inhibitors for solute carrier SGLT1 using proteochemometric modeling
title_fullStr Identification of novel small molecule inhibitors for solute carrier SGLT1 using proteochemometric modeling
title_full_unstemmed Identification of novel small molecule inhibitors for solute carrier SGLT1 using proteochemometric modeling
title_sort identification of novel small molecule inhibitors for solute carrier sglt1 using proteochemometric modeling
publisher BMC
series Journal of Cheminformatics
issn 1758-2946
publishDate 2019-02-01
description Abstract Sodium-dependent glucose co-transporter 1 (SGLT1) is a solute carrier responsible for active glucose absorption. SGLT1 is present in both the renal tubules and small intestine. In contrast, the closely related sodium-dependent glucose co-transporter 2 (SGLT2), a protein that is targeted in the treatment of diabetes type II, is only expressed in the renal tubules. Although dual inhibitors for both SGLT1 and SGLT2 have been developed, no drugs on the market are targeted at decreasing dietary glucose uptake by SGLT1 in the gastrointestinal tract. Here we aim at identifying SGLT1 inhibitors in silico by applying a machine learning approach that does not require structural information, which is absent for SGLT1. We applied proteochemometrics by implementation of compound- and protein-based information into random forest models. We obtained a predictive model with a sensitivity of 0.64 ± 0.06, specificity of 0.93 ± 0.01, positive predictive value of 0.47 ± 0.07, negative predictive value of 0.96 ± 0.01, and Matthews correlation coefficient of 0.49 ± 0.05. Subsequent to model training, we applied our model in virtual screening to identify novel SGLT1 inhibitors. Of the 77 tested compounds, 30 were experimentally confirmed for SGLT1-inhibiting activity in vitro, leading to a hit rate of 39% with activities in the low micromolar range. Moreover, the hit compounds included novel molecules, which is reflected by the low similarity of these compounds with the training set (< 0.3). Conclusively, proteochemometric modeling of SGLT1 is a viable strategy for identifying active small molecules. Therefore, this method may also be applied in detection of novel small molecules for other transporter proteins.
topic Sodium-dependent glucose co-transporter
Sodium-glucose linked transporter
SGLT1
Proteochemometrics
Molecular modeling
Machine learning
url http://link.springer.com/article/10.1186/s13321-019-0337-8
work_keys_str_mv AT lindseyburggraaff identificationofnovelsmallmoleculeinhibitorsforsolutecarriersglt1usingproteochemometricmodeling
AT pauloranje identificationofnovelsmallmoleculeinhibitorsforsolutecarriersglt1usingproteochemometricmodeling
AT robingouka identificationofnovelsmallmoleculeinhibitorsforsolutecarriersglt1usingproteochemometricmodeling
AT pietervanderpijl identificationofnovelsmallmoleculeinhibitorsforsolutecarriersglt1usingproteochemometricmodeling
AT mariangeldof identificationofnovelsmallmoleculeinhibitorsforsolutecarriersglt1usingproteochemometricmodeling
AT hermanwtvanvlijmen identificationofnovelsmallmoleculeinhibitorsforsolutecarriersglt1usingproteochemometricmodeling
AT adriaanpijzerman identificationofnovelsmallmoleculeinhibitorsforsolutecarriersglt1usingproteochemometricmodeling
AT gerardjpvanwesten identificationofnovelsmallmoleculeinhibitorsforsolutecarriersglt1usingproteochemometricmodeling
_version_ 1724941733673828352