PUResNet: prediction of protein-ligand binding sites using deep residual neural network

Abstract Background Predicting protein-ligand binding sites is a fundamental step in understanding the functional characteristics of proteins, which plays a vital role in elucidating different biological functions and is a crucial step in drug discovery. A protein exhibits its true nature after bind...

Full description

Bibliographic Details
Main Authors: Jeevan Kandel, Hilal Tayara, Kil To Chong
Format: Article
Language:English
Published: BMC 2021-09-01
Series:Journal of Cheminformatics
Subjects:
Online Access:https://doi.org/10.1186/s13321-021-00547-7
id doaj-a7630a2be27a4607a95bb61a296fe985
record_format Article
spelling doaj-a7630a2be27a4607a95bb61a296fe9852021-09-12T11:42:42ZengBMCJournal of Cheminformatics1758-29462021-09-0113111410.1186/s13321-021-00547-7PUResNet: prediction of protein-ligand binding sites using deep residual neural networkJeevan Kandel0Hilal Tayara1Kil To Chong2Graduate School of Integrated Energy-AI, Jeonbuk National UniversitySchool of International Engineering and Science, Jeonbuk National UniversityDepartment of Electronics and Information Engineering, Jeonbuk National UniversityAbstract Background Predicting protein-ligand binding sites is a fundamental step in understanding the functional characteristics of proteins, which plays a vital role in elucidating different biological functions and is a crucial step in drug discovery. A protein exhibits its true nature after binding to its interacting molecule known as a ligand that binds only in the favorable binding site of the protein structure. Different computational methods exploiting the features of proteins have been developed to identify the binding sites in the protein structure, but none seems to provide promising results, and therefore, further investigation is required. Results In this study, we present a deep learning model PUResNet and a novel data cleaning process based on structural similarity for predicting protein-ligand binding sites. From the whole scPDB (an annotated database of druggable binding sites extracted from the Protein DataBank) database, 5020 protein structures were selected to address this problem, which were used to train PUResNet. With this, we achieved better and justifiable performance than the existing methods while evaluating two independent sets using distance, volume and proportion metrics.https://doi.org/10.1186/s13321-021-00547-7Ligand binding sitesBinding site predictionDeep residual networkConvolutional neural networkData cleaning
collection DOAJ
language English
format Article
sources DOAJ
author Jeevan Kandel
Hilal Tayara
Kil To Chong
spellingShingle Jeevan Kandel
Hilal Tayara
Kil To Chong
PUResNet: prediction of protein-ligand binding sites using deep residual neural network
Journal of Cheminformatics
Ligand binding sites
Binding site prediction
Deep residual network
Convolutional neural network
Data cleaning
author_facet Jeevan Kandel
Hilal Tayara
Kil To Chong
author_sort Jeevan Kandel
title PUResNet: prediction of protein-ligand binding sites using deep residual neural network
title_short PUResNet: prediction of protein-ligand binding sites using deep residual neural network
title_full PUResNet: prediction of protein-ligand binding sites using deep residual neural network
title_fullStr PUResNet: prediction of protein-ligand binding sites using deep residual neural network
title_full_unstemmed PUResNet: prediction of protein-ligand binding sites using deep residual neural network
title_sort puresnet: prediction of protein-ligand binding sites using deep residual neural network
publisher BMC
series Journal of Cheminformatics
issn 1758-2946
publishDate 2021-09-01
description Abstract Background Predicting protein-ligand binding sites is a fundamental step in understanding the functional characteristics of proteins, which plays a vital role in elucidating different biological functions and is a crucial step in drug discovery. A protein exhibits its true nature after binding to its interacting molecule known as a ligand that binds only in the favorable binding site of the protein structure. Different computational methods exploiting the features of proteins have been developed to identify the binding sites in the protein structure, but none seems to provide promising results, and therefore, further investigation is required. Results In this study, we present a deep learning model PUResNet and a novel data cleaning process based on structural similarity for predicting protein-ligand binding sites. From the whole scPDB (an annotated database of druggable binding sites extracted from the Protein DataBank) database, 5020 protein structures were selected to address this problem, which were used to train PUResNet. With this, we achieved better and justifiable performance than the existing methods while evaluating two independent sets using distance, volume and proportion metrics.
topic Ligand binding sites
Binding site prediction
Deep residual network
Convolutional neural network
Data cleaning
url https://doi.org/10.1186/s13321-021-00547-7
work_keys_str_mv AT jeevankandel puresnetpredictionofproteinligandbindingsitesusingdeepresidualneuralnetwork
AT hilaltayara puresnetpredictionofproteinligandbindingsitesusingdeepresidualneuralnetwork
AT kiltochong puresnetpredictionofproteinligandbindingsitesusingdeepresidualneuralnetwork
_version_ 1717755505119592448