An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation
Butyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2020-02-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fgene.2019.01325/full |
id |
doaj-497475d4d8af4af98f07e7ed2d8457ff |
---|---|
record_format |
Article |
spelling |
doaj-497475d4d8af4af98f07e7ed2d8457ff2020-11-25T01:25:39ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-02-011010.3389/fgene.2019.01325497634An Information Entropy-Based Approach for Computationally Identifying Histone Lysine ButyrylationGuohua Huang0Yang Zheng1Yao-Qun Wu2Yao-Qun Wu3Guo-Sheng Han4Zu-Guo Yu5Zu-Guo Yu6Provincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang, ChinaProvincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang, ChinaProvincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang, ChinaKey Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, ChinaKey Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, ChinaKey Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, ChinaSchool of Electrical Engineering and Computer Science, Queensland University of Technology, Brisbane, QLD, AustraliaButyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone butyrylation sites. The proposed method achieves 0.92 of area under the receiver operating characteristic (ROC) curve over the training set by 3-fold cross validation and 0.80 over the testing set by independent test. Feature analysis implies that amino acid residues in the down/upstream of butyrylation sites would exhibit specific sequence motif to a certain extent. Functional analysis suggests that histone butyrylation was most possibly associated with four pathways (systemic lupus erythematosus, alcoholism, viral carcinogenesis and transcriptional misregulation in cancer), was involved in binding with other molecules, processes of biosynthesis, assembly, arrangement or disassembly and was located in such complex as consists of DNA, RNA, protein, etc. The proposed method is useful to predict histone butyrylation sites. Analysis of feature and function improves understanding of histone butyrylation and increases knowledge of functions of butyrylated histones.https://www.frontiersin.org/article/10.3389/fgene.2019.01325/fullbutyrylationrandom foresthistonepost-translational modificationinformation entropy |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Guohua Huang Yang Zheng Yao-Qun Wu Yao-Qun Wu Guo-Sheng Han Zu-Guo Yu Zu-Guo Yu |
spellingShingle |
Guohua Huang Yang Zheng Yao-Qun Wu Yao-Qun Wu Guo-Sheng Han Zu-Guo Yu Zu-Guo Yu An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation Frontiers in Genetics butyrylation random forest histone post-translational modification information entropy |
author_facet |
Guohua Huang Yang Zheng Yao-Qun Wu Yao-Qun Wu Guo-Sheng Han Zu-Guo Yu Zu-Guo Yu |
author_sort |
Guohua Huang |
title |
An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation |
title_short |
An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation |
title_full |
An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation |
title_fullStr |
An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation |
title_full_unstemmed |
An Information Entropy-Based Approach for Computationally Identifying Histone Lysine Butyrylation |
title_sort |
information entropy-based approach for computationally identifying histone lysine butyrylation |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Genetics |
issn |
1664-8021 |
publishDate |
2020-02-01 |
description |
Butyrylation plays a crucial role in the cellular processes. Due to limit of techniques, it is a challenging task to identify histone butyrylation sites on a large scale. To fill the gap, we propose an approach based on information entropy and machine learning for computationally identifying histone butyrylation sites. The proposed method achieves 0.92 of area under the receiver operating characteristic (ROC) curve over the training set by 3-fold cross validation and 0.80 over the testing set by independent test. Feature analysis implies that amino acid residues in the down/upstream of butyrylation sites would exhibit specific sequence motif to a certain extent. Functional analysis suggests that histone butyrylation was most possibly associated with four pathways (systemic lupus erythematosus, alcoholism, viral carcinogenesis and transcriptional misregulation in cancer), was involved in binding with other molecules, processes of biosynthesis, assembly, arrangement or disassembly and was located in such complex as consists of DNA, RNA, protein, etc. The proposed method is useful to predict histone butyrylation sites. Analysis of feature and function improves understanding of histone butyrylation and increases knowledge of functions of butyrylated histones. |
topic |
butyrylation random forest histone post-translational modification information entropy |
url |
https://www.frontiersin.org/article/10.3389/fgene.2019.01325/full |
work_keys_str_mv |
AT guohuahuang aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT yangzheng aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT yaoqunwu aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT yaoqunwu aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT guoshenghan aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT zuguoyu aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT zuguoyu aninformationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT guohuahuang informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT yangzheng informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT yaoqunwu informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT yaoqunwu informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT guoshenghan informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT zuguoyu informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation AT zuguoyu informationentropybasedapproachforcomputationallyidentifyinghistonelysinebutyrylation |
_version_ |
1725112792603688960 |