An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method
The zinc ion is the second richest metal ion in organisms. The proteins binding to zinc ions have important biological functions. However, few scholars have integrated the existing tools to predict the zinc-binding sites in proteins. To make up for this gap, this paper combines three well-known pred...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek
2019-01-01
|
Series: | Tehnički Vjesnik |
Subjects: | |
Online Access: | https://hrcak.srce.hr/file/329381 |
id |
doaj-546c0a9ee0d4492f974eb83f120f5203 |
---|---|
record_format |
Article |
spelling |
doaj-546c0a9ee0d4492f974eb83f120f52032020-11-25T02:15:33ZengFaculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek Tehnički Vjesnik1330-36511848-63392019-01-0126514221426An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian MethodHui Li0Dechang Pi1Chuanming Chen2(1) School of Software Engineering, Jinling Institute of Technology, Jiangsu 211169, China / (2) Center for Intelligent Computer Human Interaction, Nanjing Institute of Big Data, Jiangsu, 211169 China / (3) College of Computer Science and Technology, NanjCollege of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Jiangsu, 211169 ChinaCollege of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Jiangsu, 211169 ChinaThe zinc ion is the second richest metal ion in organisms. The proteins binding to zinc ions have important biological functions. However, few scholars have integrated the existing tools to predict the zinc-binding sites in proteins. To make up for this gap, this paper combines three well-known prediction tools into an improved model called IBayes_Zinc to predict the zinc-binding sites, and utilizes the advantages of the Bayesian method in handling incomplete or partial missing data. Specifically, the prediction scores of three existing sequence-based prediction tools were adopted, and the missing values were padded, forming an integrated classification tool. Then, the probabilities of positive and negative samples were computed and categorized as the class with higher probabilities. Experiments were conducted on a non-redundant training dataset and an independent testing dataset. The results show that our method surpassed the other three methods by nearly 5–13% in Matthew correlation coefficient (MCC) and outperformed the latter in recall and precision. The research findings promote the detection of zinc-binding sites in protein sequence and the identification of metalloprotein functions.https://hrcak.srce.hr/file/329381Bayesianmissingprotein predictionzinc-binding sites |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hui Li Dechang Pi Chuanming Chen |
spellingShingle |
Hui Li Dechang Pi Chuanming Chen An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method Tehnički Vjesnik Bayesian missing protein prediction zinc-binding sites |
author_facet |
Hui Li Dechang Pi Chuanming Chen |
author_sort |
Hui Li |
title |
An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method |
title_short |
An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method |
title_full |
An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method |
title_fullStr |
An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method |
title_full_unstemmed |
An Improved Prediction Model for Zinc-Binding Sites in Proteins Based on Bayesian Method |
title_sort |
improved prediction model for zinc-binding sites in proteins based on bayesian method |
publisher |
Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek |
series |
Tehnički Vjesnik |
issn |
1330-3651 1848-6339 |
publishDate |
2019-01-01 |
description |
The zinc ion is the second richest metal ion in organisms. The proteins binding to zinc ions have important biological functions. However, few scholars have integrated the existing tools to predict the zinc-binding sites in proteins. To make up for this gap, this paper combines three well-known prediction tools into an improved model called IBayes_Zinc to predict the zinc-binding sites, and utilizes the advantages of the Bayesian method in handling incomplete or partial missing data. Specifically, the prediction scores of three existing sequence-based prediction tools were adopted, and the missing values were padded, forming an integrated classification tool. Then, the probabilities of positive and negative samples were computed and categorized as the class with higher probabilities. Experiments were conducted on a non-redundant training dataset and an independent testing dataset. The results show that our method surpassed the other three methods by nearly 5–13% in Matthew correlation coefficient (MCC) and outperformed the latter in recall and precision. The research findings promote the detection of zinc-binding sites in protein sequence and the identification of metalloprotein functions. |
topic |
Bayesian missing protein prediction zinc-binding sites |
url |
https://hrcak.srce.hr/file/329381 |
work_keys_str_mv |
AT huili animprovedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod AT dechangpi animprovedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod AT chuanmingchen animprovedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod AT huili improvedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod AT dechangpi improvedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod AT chuanmingchen improvedpredictionmodelforzincbindingsitesinproteinsbasedonbayesianmethod |
_version_ |
1724895422222172160 |