RFECS: a random-forest based algorithm for enhancer identification from chromatin state.
Transcriptional enhancers play critical roles in regulation of gene expression, but their identification in the eukaryotic genome has been challenging. Recently, it was shown that enhancers in the mammalian genome are associated with characteristic histone modification patterns, which have been incr...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2013-01-01
|
Series: | PLoS Computational Biology |
Online Access: | http://europepmc.org/articles/PMC3597546?pdf=render |
id |
doaj-74cd5187b5454c549eea23cac04c95fe |
---|---|
record_format |
Article |
spelling |
doaj-74cd5187b5454c549eea23cac04c95fe2020-11-24T21:51:48ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582013-01-0193e100296810.1371/journal.pcbi.1002968RFECS: a random-forest based algorithm for enhancer identification from chromatin state.Nisha RajagopalWei XieYan LiUli WagnerWei WangJohn StamatoyannopoulosJason ErnstManolis KellisBing RenTranscriptional enhancers play critical roles in regulation of gene expression, but their identification in the eukaryotic genome has been challenging. Recently, it was shown that enhancers in the mammalian genome are associated with characteristic histone modification patterns, which have been increasingly exploited for enhancer identification. However, only a limited number of cell types or chromatin marks have previously been investigated for this purpose, leaving the question unanswered whether there exists an optimal set of histone modifications for enhancer prediction in different cell types. Here, we address this issue by exploring genome-wide profiles of 24 histone modifications in two distinct human cell types, embryonic stem cells and lung fibroblasts. We developed a Random-Forest based algorithm, RFECS (Random Forest based Enhancer identification from Chromatin States) to integrate histone modification profiles for identification of enhancers, and used it to identify enhancers in a number of cell-types. We show that RFECS not only leads to more accurate and precise prediction of enhancers than previous methods, but also helps identify the most informative and robust set of three chromatin marks for enhancer prediction.http://europepmc.org/articles/PMC3597546?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Nisha Rajagopal Wei Xie Yan Li Uli Wagner Wei Wang John Stamatoyannopoulos Jason Ernst Manolis Kellis Bing Ren |
spellingShingle |
Nisha Rajagopal Wei Xie Yan Li Uli Wagner Wei Wang John Stamatoyannopoulos Jason Ernst Manolis Kellis Bing Ren RFECS: a random-forest based algorithm for enhancer identification from chromatin state. PLoS Computational Biology |
author_facet |
Nisha Rajagopal Wei Xie Yan Li Uli Wagner Wei Wang John Stamatoyannopoulos Jason Ernst Manolis Kellis Bing Ren |
author_sort |
Nisha Rajagopal |
title |
RFECS: a random-forest based algorithm for enhancer identification from chromatin state. |
title_short |
RFECS: a random-forest based algorithm for enhancer identification from chromatin state. |
title_full |
RFECS: a random-forest based algorithm for enhancer identification from chromatin state. |
title_fullStr |
RFECS: a random-forest based algorithm for enhancer identification from chromatin state. |
title_full_unstemmed |
RFECS: a random-forest based algorithm for enhancer identification from chromatin state. |
title_sort |
rfecs: a random-forest based algorithm for enhancer identification from chromatin state. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Computational Biology |
issn |
1553-734X 1553-7358 |
publishDate |
2013-01-01 |
description |
Transcriptional enhancers play critical roles in regulation of gene expression, but their identification in the eukaryotic genome has been challenging. Recently, it was shown that enhancers in the mammalian genome are associated with characteristic histone modification patterns, which have been increasingly exploited for enhancer identification. However, only a limited number of cell types or chromatin marks have previously been investigated for this purpose, leaving the question unanswered whether there exists an optimal set of histone modifications for enhancer prediction in different cell types. Here, we address this issue by exploring genome-wide profiles of 24 histone modifications in two distinct human cell types, embryonic stem cells and lung fibroblasts. We developed a Random-Forest based algorithm, RFECS (Random Forest based Enhancer identification from Chromatin States) to integrate histone modification profiles for identification of enhancers, and used it to identify enhancers in a number of cell-types. We show that RFECS not only leads to more accurate and precise prediction of enhancers than previous methods, but also helps identify the most informative and robust set of three chromatin marks for enhancer prediction. |
url |
http://europepmc.org/articles/PMC3597546?pdf=render |
work_keys_str_mv |
AT nisharajagopal rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate AT weixie rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate AT yanli rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate AT uliwagner rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate AT weiwang rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate AT johnstamatoyannopoulos rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate AT jasonernst rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate AT manoliskellis rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate AT bingren rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate |
_version_ |
1725878543380905984 |