Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expression

<p>Abstract</p> <p>Background</p> <p>In the last decade, biochemical studies have revealed that epigenetic modifications including histone modifications, histone variants and DNA methylation form a complex network that regulate the state of chromatin and processes that...

Full description

Bibliographic Details
Main Authors: Xu Xiaojiang, Hoang Stephen, Mayo Marty W, Bekiranov Stefan
Format: Article
Language:English
Published: BMC 2010-07-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/396
id doaj-e4266f29f92f49a6919fddd77152c5f7
record_format Article
spelling doaj-e4266f29f92f49a6919fddd77152c5f72020-11-25T00:25:06ZengBMCBMC Bioinformatics1471-21052010-07-0111139610.1186/1471-2105-11-396Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expressionXu XiaojiangHoang StephenMayo Marty WBekiranov Stefan<p>Abstract</p> <p>Background</p> <p>In the last decade, biochemical studies have revealed that epigenetic modifications including histone modifications, histone variants and DNA methylation form a complex network that regulate the state of chromatin and processes that depend on it including transcription and DNA replication. Currently, a large number of these epigenetic modifications are being mapped in a variety of cell lines at different stages of development using high throughput sequencing by members of the ENCODE consortium, the NIH Roadmap Epigenomics Program and the Human Epigenome Project. An extremely promising and underexplored area of research is the application of machine learning methods, which are designed to construct predictive network models, to these large-scale epigenomic data sets.</p> <p>Results</p> <p>Using a ChIP-Seq data set of 20 histone lysine and arginine methylations and histone variant H2A.Z in human CD4<sup>+ </sup>T-cells, we built predictive models of gene expression as a function of histone modification/variant levels using Multilinear (ML) Regression and Multivariate Adaptive Regression Splines (MARS). Along with extensive crosstalk among the 20 histone methylations, we found H4R3me2 was the most and second most globally repressive histone methylation among the 20 studied in the ML and MARS models, respectively. In support of our finding, a number of experimental studies show that PRMT5-catalyzed symmetric dimethylation of H4R3 is associated with repression of gene expression. This includes a recent study, which demonstrated that H4R3me2 is required for DNMT3A-mediated DNA methylation--a known global repressor of gene expression.</p> <p>Conclusion</p> <p>In stark contrast to univariate analysis of the relationship between H4R3me2 and gene expression levels, our study showed that the regulatory role of some modifications like H4R3me2 is masked by confounding variables, but can be elucidated by multivariate/systems-level approaches.</p> http://www.biomedcentral.com/1471-2105/11/396
collection DOAJ
language English
format Article
sources DOAJ
author Xu Xiaojiang
Hoang Stephen
Mayo Marty W
Bekiranov Stefan
spellingShingle Xu Xiaojiang
Hoang Stephen
Mayo Marty W
Bekiranov Stefan
Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expression
BMC Bioinformatics
author_facet Xu Xiaojiang
Hoang Stephen
Mayo Marty W
Bekiranov Stefan
author_sort Xu Xiaojiang
title Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expression
title_short Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expression
title_full Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expression
title_fullStr Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expression
title_full_unstemmed Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expression
title_sort application of machine learning methods to histone methylation chip-seq data reveals h4r3me2 globally represses gene expression
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2010-07-01
description <p>Abstract</p> <p>Background</p> <p>In the last decade, biochemical studies have revealed that epigenetic modifications including histone modifications, histone variants and DNA methylation form a complex network that regulate the state of chromatin and processes that depend on it including transcription and DNA replication. Currently, a large number of these epigenetic modifications are being mapped in a variety of cell lines at different stages of development using high throughput sequencing by members of the ENCODE consortium, the NIH Roadmap Epigenomics Program and the Human Epigenome Project. An extremely promising and underexplored area of research is the application of machine learning methods, which are designed to construct predictive network models, to these large-scale epigenomic data sets.</p> <p>Results</p> <p>Using a ChIP-Seq data set of 20 histone lysine and arginine methylations and histone variant H2A.Z in human CD4<sup>+ </sup>T-cells, we built predictive models of gene expression as a function of histone modification/variant levels using Multilinear (ML) Regression and Multivariate Adaptive Regression Splines (MARS). Along with extensive crosstalk among the 20 histone methylations, we found H4R3me2 was the most and second most globally repressive histone methylation among the 20 studied in the ML and MARS models, respectively. In support of our finding, a number of experimental studies show that PRMT5-catalyzed symmetric dimethylation of H4R3 is associated with repression of gene expression. This includes a recent study, which demonstrated that H4R3me2 is required for DNMT3A-mediated DNA methylation--a known global repressor of gene expression.</p> <p>Conclusion</p> <p>In stark contrast to univariate analysis of the relationship between H4R3me2 and gene expression levels, our study showed that the regulatory role of some modifications like H4R3me2 is masked by confounding variables, but can be elucidated by multivariate/systems-level approaches.</p>
url http://www.biomedcentral.com/1471-2105/11/396
work_keys_str_mv AT xuxiaojiang applicationofmachinelearningmethodstohistonemethylationchipseqdatarevealsh4r3me2globallyrepressesgeneexpression
AT hoangstephen applicationofmachinelearningmethodstohistonemethylationchipseqdatarevealsh4r3me2globallyrepressesgeneexpression
AT mayomartyw applicationofmachinelearningmethodstohistonemethylationchipseqdatarevealsh4r3me2globallyrepressesgeneexpression
AT bekiranovstefan applicationofmachinelearningmethodstohistonemethylationchipseqdatarevealsh4r3me2globallyrepressesgeneexpression
_version_ 1725350126713569280