PUResNet: prediction of protein-ligand binding sites using deep residual neural network
Abstract Background Predicting protein-ligand binding sites is a fundamental step in understanding the functional characteristics of proteins, which plays a vital role in elucidating different biological functions and is a crucial step in drug discovery. A protein exhibits its true nature after bind...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-09-01
|
Series: | Journal of Cheminformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s13321-021-00547-7 |
id |
doaj-a7630a2be27a4607a95bb61a296fe985 |
---|---|
record_format |
Article |
spelling |
doaj-a7630a2be27a4607a95bb61a296fe9852021-09-12T11:42:42ZengBMCJournal of Cheminformatics1758-29462021-09-0113111410.1186/s13321-021-00547-7PUResNet: prediction of protein-ligand binding sites using deep residual neural networkJeevan Kandel0Hilal Tayara1Kil To Chong2Graduate School of Integrated Energy-AI, Jeonbuk National UniversitySchool of International Engineering and Science, Jeonbuk National UniversityDepartment of Electronics and Information Engineering, Jeonbuk National UniversityAbstract Background Predicting protein-ligand binding sites is a fundamental step in understanding the functional characteristics of proteins, which plays a vital role in elucidating different biological functions and is a crucial step in drug discovery. A protein exhibits its true nature after binding to its interacting molecule known as a ligand that binds only in the favorable binding site of the protein structure. Different computational methods exploiting the features of proteins have been developed to identify the binding sites in the protein structure, but none seems to provide promising results, and therefore, further investigation is required. Results In this study, we present a deep learning model PUResNet and a novel data cleaning process based on structural similarity for predicting protein-ligand binding sites. From the whole scPDB (an annotated database of druggable binding sites extracted from the Protein DataBank) database, 5020 protein structures were selected to address this problem, which were used to train PUResNet. With this, we achieved better and justifiable performance than the existing methods while evaluating two independent sets using distance, volume and proportion metrics.https://doi.org/10.1186/s13321-021-00547-7Ligand binding sitesBinding site predictionDeep residual networkConvolutional neural networkData cleaning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jeevan Kandel Hilal Tayara Kil To Chong |
spellingShingle |
Jeevan Kandel Hilal Tayara Kil To Chong PUResNet: prediction of protein-ligand binding sites using deep residual neural network Journal of Cheminformatics Ligand binding sites Binding site prediction Deep residual network Convolutional neural network Data cleaning |
author_facet |
Jeevan Kandel Hilal Tayara Kil To Chong |
author_sort |
Jeevan Kandel |
title |
PUResNet: prediction of protein-ligand binding sites using deep residual neural network |
title_short |
PUResNet: prediction of protein-ligand binding sites using deep residual neural network |
title_full |
PUResNet: prediction of protein-ligand binding sites using deep residual neural network |
title_fullStr |
PUResNet: prediction of protein-ligand binding sites using deep residual neural network |
title_full_unstemmed |
PUResNet: prediction of protein-ligand binding sites using deep residual neural network |
title_sort |
puresnet: prediction of protein-ligand binding sites using deep residual neural network |
publisher |
BMC |
series |
Journal of Cheminformatics |
issn |
1758-2946 |
publishDate |
2021-09-01 |
description |
Abstract Background Predicting protein-ligand binding sites is a fundamental step in understanding the functional characteristics of proteins, which plays a vital role in elucidating different biological functions and is a crucial step in drug discovery. A protein exhibits its true nature after binding to its interacting molecule known as a ligand that binds only in the favorable binding site of the protein structure. Different computational methods exploiting the features of proteins have been developed to identify the binding sites in the protein structure, but none seems to provide promising results, and therefore, further investigation is required. Results In this study, we present a deep learning model PUResNet and a novel data cleaning process based on structural similarity for predicting protein-ligand binding sites. From the whole scPDB (an annotated database of druggable binding sites extracted from the Protein DataBank) database, 5020 protein structures were selected to address this problem, which were used to train PUResNet. With this, we achieved better and justifiable performance than the existing methods while evaluating two independent sets using distance, volume and proportion metrics. |
topic |
Ligand binding sites Binding site prediction Deep residual network Convolutional neural network Data cleaning |
url |
https://doi.org/10.1186/s13321-021-00547-7 |
work_keys_str_mv |
AT jeevankandel puresnetpredictionofproteinligandbindingsitesusingdeepresidualneuralnetwork AT hilaltayara puresnetpredictionofproteinligandbindingsitesusingdeepresidualneuralnetwork AT kiltochong puresnetpredictionofproteinligandbindingsitesusingdeepresidualneuralnetwork |
_version_ |
1717755505119592448 |