Nonparametric Tests for Differential Histone Enrichment with ChIP-Seq Data

Chromatin immunoprecipitation sequencing (ChIP-seq) is a powerful method for analyzing protein interactions with DNA. It can be applied to identify the binding sites of transcription factors (TFs) and genomic landscape of histone modification marks (HMs). Previous research has largely focused on dev...

Full description

Bibliographic Details
Main Authors: Qian Wu, Kyoung-Jae Won, Hongzhe Li
Format: Article
Language:English
Published: SAGE Publishing 2015-01-01
Series:Cancer Informatics
Online Access:https://doi.org/10.4137/CIN.S13972
id doaj-18567f2bb7da4e4892c42078e0cf1292
record_format Article
spelling doaj-18567f2bb7da4e4892c42078e0cf12922020-11-25T03:20:53ZengSAGE PublishingCancer Informatics1176-93512015-01-0114s110.4137/CIN.S13972Nonparametric Tests for Differential Histone Enrichment with ChIP-Seq DataQian Wu0Kyoung-Jae Won1Hongzhe Li2Department of Biostatistics and Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.Department of Biostatistics and Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.Department of Biostatistics and Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.Chromatin immunoprecipitation sequencing (ChIP-seq) is a powerful method for analyzing protein interactions with DNA. It can be applied to identify the binding sites of transcription factors (TFs) and genomic landscape of histone modification marks (HMs). Previous research has largely focused on developing peak-calling procedures to detect the binding sites for TFs. However, these procedures may fail when applied to ChIP-seq data of HMs, which have diffuse signals and multiple local peaks. In addition, it is important to identify genes with differential histone enrichment regions between two experimental conditions, such as different cellular states or different time points. Parametric methods based on Poisson/negative binomial distribution have been proposed to address this differential enrichment problem and most of these methods require biological replications. However, many ChIP-seq data usually have a few or even no replicates. We propose a nonparametric method to identify the genes with differential histone enrichment regions even without replicates. Our method is based on nonparametric hypothesis testing and kernel smoothing in order to capture the spatial differences in histone-enriched profiles. We demonstrate the method using ChIP-seq data on a comparative epigenomic profiling of adipogenesis of murine adipose stromal cells and the Encyclopedia of DNA Elements (ENCODE) ChIP-seq data. Our method identifies many genes with differential H3K27ac histone enrichment profiles at gene promoter regions between proliferating preadipocytes and mature adipocytes in murine 3T3-L1 cells. The test statistics also correlate with the gene expression changes well and are predictive to gene expression changes, indicating that the identified differentially enriched regions are indeed biologically meaningful.https://doi.org/10.4137/CIN.S13972
collection DOAJ
language English
format Article
sources DOAJ
author Qian Wu
Kyoung-Jae Won
Hongzhe Li
spellingShingle Qian Wu
Kyoung-Jae Won
Hongzhe Li
Nonparametric Tests for Differential Histone Enrichment with ChIP-Seq Data
Cancer Informatics
author_facet Qian Wu
Kyoung-Jae Won
Hongzhe Li
author_sort Qian Wu
title Nonparametric Tests for Differential Histone Enrichment with ChIP-Seq Data
title_short Nonparametric Tests for Differential Histone Enrichment with ChIP-Seq Data
title_full Nonparametric Tests for Differential Histone Enrichment with ChIP-Seq Data
title_fullStr Nonparametric Tests for Differential Histone Enrichment with ChIP-Seq Data
title_full_unstemmed Nonparametric Tests for Differential Histone Enrichment with ChIP-Seq Data
title_sort nonparametric tests for differential histone enrichment with chip-seq data
publisher SAGE Publishing
series Cancer Informatics
issn 1176-9351
publishDate 2015-01-01
description Chromatin immunoprecipitation sequencing (ChIP-seq) is a powerful method for analyzing protein interactions with DNA. It can be applied to identify the binding sites of transcription factors (TFs) and genomic landscape of histone modification marks (HMs). Previous research has largely focused on developing peak-calling procedures to detect the binding sites for TFs. However, these procedures may fail when applied to ChIP-seq data of HMs, which have diffuse signals and multiple local peaks. In addition, it is important to identify genes with differential histone enrichment regions between two experimental conditions, such as different cellular states or different time points. Parametric methods based on Poisson/negative binomial distribution have been proposed to address this differential enrichment problem and most of these methods require biological replications. However, many ChIP-seq data usually have a few or even no replicates. We propose a nonparametric method to identify the genes with differential histone enrichment regions even without replicates. Our method is based on nonparametric hypothesis testing and kernel smoothing in order to capture the spatial differences in histone-enriched profiles. We demonstrate the method using ChIP-seq data on a comparative epigenomic profiling of adipogenesis of murine adipose stromal cells and the Encyclopedia of DNA Elements (ENCODE) ChIP-seq data. Our method identifies many genes with differential H3K27ac histone enrichment profiles at gene promoter regions between proliferating preadipocytes and mature adipocytes in murine 3T3-L1 cells. The test statistics also correlate with the gene expression changes well and are predictive to gene expression changes, indicating that the identified differentially enriched regions are indeed biologically meaningful.
url https://doi.org/10.4137/CIN.S13972
work_keys_str_mv AT qianwu nonparametrictestsfordifferentialhistoneenrichmentwithchipseqdata
AT kyoungjaewon nonparametrictestsfordifferentialhistoneenrichmentwithchipseqdata
AT hongzheli nonparametrictestsfordifferentialhistoneenrichmentwithchipseqdata
_version_ 1724615936185466880