RFECS: a random-forest based algorithm for enhancer identification from chromatin state.

Transcriptional enhancers play critical roles in regulation of gene expression, but their identification in the eukaryotic genome has been challenging. Recently, it was shown that enhancers in the mammalian genome are associated with characteristic histone modification patterns, which have been incr...

Full description

Bibliographic Details
Main Authors: Nisha Rajagopal, Wei Xie, Yan Li, Uli Wagner, Wei Wang, John Stamatoyannopoulos, Jason Ernst, Manolis Kellis, Bing Ren
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC3597546?pdf=render
id doaj-74cd5187b5454c549eea23cac04c95fe
record_format Article
spelling doaj-74cd5187b5454c549eea23cac04c95fe2020-11-24T21:51:48ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582013-01-0193e100296810.1371/journal.pcbi.1002968RFECS: a random-forest based algorithm for enhancer identification from chromatin state.Nisha RajagopalWei XieYan LiUli WagnerWei WangJohn StamatoyannopoulosJason ErnstManolis KellisBing RenTranscriptional enhancers play critical roles in regulation of gene expression, but their identification in the eukaryotic genome has been challenging. Recently, it was shown that enhancers in the mammalian genome are associated with characteristic histone modification patterns, which have been increasingly exploited for enhancer identification. However, only a limited number of cell types or chromatin marks have previously been investigated for this purpose, leaving the question unanswered whether there exists an optimal set of histone modifications for enhancer prediction in different cell types. Here, we address this issue by exploring genome-wide profiles of 24 histone modifications in two distinct human cell types, embryonic stem cells and lung fibroblasts. We developed a Random-Forest based algorithm, RFECS (Random Forest based Enhancer identification from Chromatin States) to integrate histone modification profiles for identification of enhancers, and used it to identify enhancers in a number of cell-types. We show that RFECS not only leads to more accurate and precise prediction of enhancers than previous methods, but also helps identify the most informative and robust set of three chromatin marks for enhancer prediction.http://europepmc.org/articles/PMC3597546?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Nisha Rajagopal
Wei Xie
Yan Li
Uli Wagner
Wei Wang
John Stamatoyannopoulos
Jason Ernst
Manolis Kellis
Bing Ren
spellingShingle Nisha Rajagopal
Wei Xie
Yan Li
Uli Wagner
Wei Wang
John Stamatoyannopoulos
Jason Ernst
Manolis Kellis
Bing Ren
RFECS: a random-forest based algorithm for enhancer identification from chromatin state.
PLoS Computational Biology
author_facet Nisha Rajagopal
Wei Xie
Yan Li
Uli Wagner
Wei Wang
John Stamatoyannopoulos
Jason Ernst
Manolis Kellis
Bing Ren
author_sort Nisha Rajagopal
title RFECS: a random-forest based algorithm for enhancer identification from chromatin state.
title_short RFECS: a random-forest based algorithm for enhancer identification from chromatin state.
title_full RFECS: a random-forest based algorithm for enhancer identification from chromatin state.
title_fullStr RFECS: a random-forest based algorithm for enhancer identification from chromatin state.
title_full_unstemmed RFECS: a random-forest based algorithm for enhancer identification from chromatin state.
title_sort rfecs: a random-forest based algorithm for enhancer identification from chromatin state.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2013-01-01
description Transcriptional enhancers play critical roles in regulation of gene expression, but their identification in the eukaryotic genome has been challenging. Recently, it was shown that enhancers in the mammalian genome are associated with characteristic histone modification patterns, which have been increasingly exploited for enhancer identification. However, only a limited number of cell types or chromatin marks have previously been investigated for this purpose, leaving the question unanswered whether there exists an optimal set of histone modifications for enhancer prediction in different cell types. Here, we address this issue by exploring genome-wide profiles of 24 histone modifications in two distinct human cell types, embryonic stem cells and lung fibroblasts. We developed a Random-Forest based algorithm, RFECS (Random Forest based Enhancer identification from Chromatin States) to integrate histone modification profiles for identification of enhancers, and used it to identify enhancers in a number of cell-types. We show that RFECS not only leads to more accurate and precise prediction of enhancers than previous methods, but also helps identify the most informative and robust set of three chromatin marks for enhancer prediction.
url http://europepmc.org/articles/PMC3597546?pdf=render
work_keys_str_mv AT nisharajagopal rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
AT weixie rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
AT yanli rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
AT uliwagner rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
AT weiwang rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
AT johnstamatoyannopoulos rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
AT jasonernst rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
AT manoliskellis rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
AT bingren rfecsarandomforestbasedalgorithmforenhanceridentificationfromchromatinstate
_version_ 1725878543380905984