WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.

BACKGROUND: Chromatin immunoprecipitation followed by next-generation sequencing is a genome-wide analysis technique that can be used to detect various epigenetic phenomena such as, transcription factor binding sites and histone modifications. Histone modification profiles can be either punctate or...

Full description

Bibliographic Details
Main Authors: Apratim Mitra, Jiuzhou Song
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3461018?pdf=render
id doaj-04b63e6ae71f435eb4f8efabc148e191
record_format Article
spelling doaj-04b63e6ae71f435eb4f8efabc148e1912020-11-25T02:42:27ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0179e4548610.1371/journal.pone.0045486WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.Apratim MitraJiuzhou SongBACKGROUND: Chromatin immunoprecipitation followed by next-generation sequencing is a genome-wide analysis technique that can be used to detect various epigenetic phenomena such as, transcription factor binding sites and histone modifications. Histone modification profiles can be either punctate or diffuse which makes it difficult to distinguish regions of enrichment from background noise. With the discovery of histone marks having a wide variety of enrichment patterns, there is an urgent need for analysis methods that are robust to various data characteristics and capable of detecting a broad range of enrichment patterns. RESULTS: To address these challenges we propose WaveSeq, a novel data-driven method of detecting regions of significant enrichment in ChIP-Seq data. Our approach utilizes the wavelet transform, is free of distributional assumptions and is robust to diverse data characteristics such as low signal-to-noise ratios and broad enrichment patterns. Using publicly available datasets we showed that WaveSeq compares favorably with other published methods, exhibiting high sensitivity and precision for both punctate and diffuse enrichment regions even in the absence of a control data set. The application of our algorithm to a complex histone modification data set helped make novel functional discoveries which further underlined its utility in such an experimental setup. CONCLUSIONS: WaveSeq is a highly sensitive method capable of accurate identification of enriched regions in a broad range of data sets. WaveSeq can detect both narrow and broad peaks with a high degree of accuracy even in low signal-to-noise ratio data sets. WaveSeq is also suited for application in complex experimental scenarios, helping make biologically relevant functional discoveries.http://europepmc.org/articles/PMC3461018?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Apratim Mitra
Jiuzhou Song
spellingShingle Apratim Mitra
Jiuzhou Song
WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.
PLoS ONE
author_facet Apratim Mitra
Jiuzhou Song
author_sort Apratim Mitra
title WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.
title_short WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.
title_full WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.
title_fullStr WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.
title_full_unstemmed WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.
title_sort waveseq: a novel data-driven method of detecting histone modification enrichments using wavelets.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2012-01-01
description BACKGROUND: Chromatin immunoprecipitation followed by next-generation sequencing is a genome-wide analysis technique that can be used to detect various epigenetic phenomena such as, transcription factor binding sites and histone modifications. Histone modification profiles can be either punctate or diffuse which makes it difficult to distinguish regions of enrichment from background noise. With the discovery of histone marks having a wide variety of enrichment patterns, there is an urgent need for analysis methods that are robust to various data characteristics and capable of detecting a broad range of enrichment patterns. RESULTS: To address these challenges we propose WaveSeq, a novel data-driven method of detecting regions of significant enrichment in ChIP-Seq data. Our approach utilizes the wavelet transform, is free of distributional assumptions and is robust to diverse data characteristics such as low signal-to-noise ratios and broad enrichment patterns. Using publicly available datasets we showed that WaveSeq compares favorably with other published methods, exhibiting high sensitivity and precision for both punctate and diffuse enrichment regions even in the absence of a control data set. The application of our algorithm to a complex histone modification data set helped make novel functional discoveries which further underlined its utility in such an experimental setup. CONCLUSIONS: WaveSeq is a highly sensitive method capable of accurate identification of enriched regions in a broad range of data sets. WaveSeq can detect both narrow and broad peaks with a high degree of accuracy even in low signal-to-noise ratio data sets. WaveSeq is also suited for application in complex experimental scenarios, helping make biologically relevant functional discoveries.
url http://europepmc.org/articles/PMC3461018?pdf=render
work_keys_str_mv AT apratimmitra waveseqanoveldatadrivenmethodofdetectinghistonemodificationenrichmentsusingwavelets
AT jiuzhousong waveseqanoveldatadrivenmethodofdetectinghistonemodificationenrichmentsusingwavelets
_version_ 1724773828267081728