An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method

The zinc ion is the second richest metal ion in organisms. The proteins binding to zinc ions have important biological functions. However, few scholars have integrated the existing tools to predict the zinc-binding sites in proteins. To make up for this gap, this paper combines three well-known pred...

Full description

Bibliographic Details
Main Authors: Hui Li, Dechang Pi, Chuanming Chen
Format: Article
Language:English
Published: Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek 2019-01-01
Series:Tehnički Vjesnik
Subjects:
Online Access:https://hrcak.srce.hr/file/329381
id doaj-546c0a9ee0d4492f974eb83f120f5203
record_format Article
spelling doaj-546c0a9ee0d4492f974eb83f120f52032020-11-25T02:15:33ZengFaculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek Tehnički Vjesnik1330-36511848-63392019-01-0126514221426An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian MethodHui Li0Dechang Pi1Chuanming Chen2(1) School of Software Engineering, Jinling Institute of Technology, Jiangsu 211169, China / (2) Center for Intelligent Computer Human Interaction, Nanjing Institute of Big Data, Jiangsu, 211169 China / (3) College of Computer Science and Technology, NanjCollege of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Jiangsu, 211169 ChinaCollege of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Jiangsu, 211169 ChinaThe zinc ion is the second richest metal ion in organisms. The proteins binding to zinc ions have important biological functions. However, few scholars have integrated the existing tools to predict the zinc-binding sites in proteins. To make up for this gap, this paper combines three well-known prediction tools into an improved model called IBayes_Zinc to predict the zinc-binding sites, and utilizes the advantages of the Bayesian method in handling incomplete or partial missing data. Specifically, the prediction scores of three existing sequence-based prediction tools were adopted, and the missing values were padded, forming an integrated classification tool. Then, the probabilities of positive and negative samples were computed and categorized as the class with higher probabilities. Experiments were conducted on a non-redundant training dataset and an independent testing dataset. The results show that our method surpassed the other three methods by nearly 5–13% in Matthew correlation coefficient (MCC) and outperformed the latter in recall and precision. The research findings promote the detection of zinc-binding sites in protein sequence and the identification of metalloprotein functions.https://hrcak.srce.hr/file/329381Bayesianmissingprotein predictionzinc-binding sites
collection DOAJ
language English
format Article
sources DOAJ
author Hui Li
Dechang Pi
Chuanming Chen
spellingShingle Hui Li
Dechang Pi
Chuanming Chen
An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method
Tehnički Vjesnik
Bayesian
missing
protein prediction
zinc-binding sites
author_facet Hui Li
Dechang Pi
Chuanming Chen
author_sort Hui Li
title An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method
title_short An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method
title_full An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method
title_fullStr An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method
title_full_unstemmed An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method
title_sort improved prediction model for zinc-binding sites in proteins based on bayesian method
publisher Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek
series Tehnički Vjesnik
issn 1330-3651
1848-6339
publishDate 2019-01-01
description The zinc ion is the second richest metal ion in organisms. The proteins binding to zinc ions have important biological functions. However, few scholars have integrated the existing tools to predict the zinc-binding sites in proteins. To make up for this gap, this paper combines three well-known prediction tools into an improved model called IBayes_Zinc to predict the zinc-binding sites, and utilizes the advantages of the Bayesian method in handling incomplete or partial missing data. Specifically, the prediction scores of three existing sequence-based prediction tools were adopted, and the missing values were padded, forming an integrated classification tool. Then, the probabilities of positive and negative samples were computed and categorized as the class with higher probabilities. Experiments were conducted on a non-redundant training dataset and an independent testing dataset. The results show that our method surpassed the other three methods by nearly 5–13% in Matthew correlation coefficient (MCC) and outperformed the latter in recall and precision. The research findings promote the detection of zinc-binding sites in protein sequence and the identification of metalloprotein functions.
topic Bayesian
missing
protein prediction
zinc-binding sites
url https://hrcak.srce.hr/file/329381
work_keys_str_mv AT huili animprovedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod
AT dechangpi animprovedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod
AT chuanmingchen animprovedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod
AT huili improvedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod
AT dechangpi improvedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod
AT chuanmingchen improvedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod
_version_ 1724895422222172160