An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation

Butyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone...

Full description

Bibliographic Details
Main Authors: Guohua Huang, Yang Zheng, Yao-Qun Wu, Guo-Sheng Han, Zu-Guo Yu
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-02-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2019.01325/full
id doaj-497475d4d8af4af98f07e7ed2d8457ff
record_format Article
spelling doaj-497475d4d8af4af98f07e7ed2d8457ff2020-11-25T01:25:39ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-02-011010.3389/fgene.2019.01325497634An Information Entropy-Based Approach for Computationally Identifying Histone Lysine ButyrylationGuohua Huang0Yang Zheng1Yao-Qun Wu2Yao-Qun Wu3Guo-Sheng Han4Zu-Guo Yu5Zu-Guo Yu6Provincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang, ChinaProvincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang, ChinaProvincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang, ChinaKey Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, ChinaKey Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, ChinaKey Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, ChinaSchool of Electrical Engineering and Computer Science, Queensland University of Technology, Brisbane, QLD, AustraliaButyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone butyrylation sites. The proposed method achieves 0.92 of area under the receiver operating characteristic (ROC) curve over the training set by 3-fold cross validation and 0.80 over the testing set by independent test. Feature analysis implies that amino acid residues in the down/upstream of butyrylation sites would exhibit specific sequence motif to a certain extent. Functional analysis suggests that histone butyrylation was most possibly associated with four pathways (systemic lupus erythematosus, alcoholism, viral carcinogenesis and transcriptional misregulation in cancer), was involved in binding with other molecules, processes of biosynthesis, assembly, arrangement or disassembly and was located in such complex as consists of DNA, RNA, protein, etc. The proposed method is useful to predict histone butyrylation sites. Analysis of feature and function improves understanding of histone butyrylation and increases knowledge of functions of butyrylated histones.https://www.frontiersin.org/article/10.3389/fgene.2019.01325/fullbutyrylationrandom foresthistonepost-translational modificationinformation entropy
collection DOAJ
language English
format Article
sources DOAJ
author Guohua Huang
Yang Zheng
Yao-Qun Wu
Yao-Qun Wu
Guo-Sheng Han
Zu-Guo Yu
Zu-Guo Yu
spellingShingle Guohua Huang
Yang Zheng
Yao-Qun Wu
Yao-Qun Wu
Guo-Sheng Han
Zu-Guo Yu
Zu-Guo Yu
An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
Frontiers in Genetics
butyrylation
random forest
histone
post-translational modification
information entropy
author_facet Guohua Huang
Yang Zheng
Yao-Qun Wu
Yao-Qun Wu
Guo-Sheng Han
Zu-Guo Yu
Zu-Guo Yu
author_sort Guohua Huang
title An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_short An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_full An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_fullStr An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_full_unstemmed An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
title_sort information entropy-based approach for computationally identifying histone lysine butyrylation
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2020-02-01
description Butyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone butyrylation sites. The proposed method achieves 0.92 of area under the receiver operating characteristic (ROC) curve over the training set by 3-fold cross validation and 0.80 over the testing set by independent test. Feature analysis implies that amino acid residues in the down/upstream of butyrylation sites would exhibit specific sequence motif to a certain extent. Functional analysis suggests that histone butyrylation was most possibly associated with four pathways (systemic lupus erythematosus, alcoholism, viral carcinogenesis and transcriptional misregulation in cancer), was involved in binding with other molecules, processes of biosynthesis, assembly, arrangement or disassembly and was located in such complex as consists of DNA, RNA, protein, etc. The proposed method is useful to predict histone butyrylation sites. Analysis of feature and function improves understanding of histone butyrylation and increases knowledge of functions of butyrylated histones.
topic butyrylation
random forest
histone
post-translational modification
information entropy
url https://www.frontiersin.org/article/10.3389/fgene.2019.01325/full
work_keys_str_mv AT guohuahuang aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT yangzheng aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT yaoqunwu aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT yaoqunwu aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT guoshenghan aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT zuguoyu aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT zuguoyu aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT guohuahuang informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT yangzheng informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT yaoqunwu informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT yaoqunwu informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT guoshenghan informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT zuguoyu informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
AT zuguoyu informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation
_version_ 1725112792603688960