A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
<p>Abstract</p> <p>We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statist...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2008-09-01
|
Series: | BMC Genomics |
id |
doaj-cd102b2567b24f09802e27b6db6e97e9 |
---|---|
record_format |
Article |
spelling |
doaj-cd102b2567b24f09802e27b6db6e97e92020-11-24T23:36:35ZengBMCBMC Genomics1471-21642008-09-019Suppl 2S2310.1186/1471-2164-9-S2-S23A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technologyLiu YunlongFeng WeixingWu JiejunNephew Kenneth PHuang Tim HMLi Lang<p>Abstract</p> <p>We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statistical model assumes that the number of Pol II-targeted sequences contained within each genomic region follows a <it>Poisson </it>distribution. A <it>Poisson </it>mixture model was then developed to distinguish Pol II binding changes in transcribed region using an empirical approach and an expectation-maximization (EM) algorithm developed for estimation and inference. In order to achieve a global maximum in the M-step, a particle swarm optimization (PSO) was implemented. We applied this model to Pol II binding data generated from hormone-dependent MCF7 breast cancer cells and antiestrogen-resistant MCF7 breast cancer cells before and after treatment with 17<it>β</it>-estradiol (E2). We determined that in the hormone-dependent cells, ~9.9% (2527) genes showed significant changes in Pol II binding after E2 treatment. However, only ~0.7% (172) genes displayed significant Pol II binding changes in E2-treated antiestrogen-resistant cells. These results show that a <it>Poisson </it>mixture model can be used to analyze ChIP-seq data.</p> |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Liu Yunlong Feng Weixing Wu Jiejun Nephew Kenneth P Huang Tim HM Li Lang |
spellingShingle |
Liu Yunlong Feng Weixing Wu Jiejun Nephew Kenneth P Huang Tim HM Li Lang A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology BMC Genomics |
author_facet |
Liu Yunlong Feng Weixing Wu Jiejun Nephew Kenneth P Huang Tim HM Li Lang |
author_sort |
Liu Yunlong |
title |
A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology |
title_short |
A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology |
title_full |
A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology |
title_fullStr |
A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology |
title_full_unstemmed |
A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology |
title_sort |
<it>poisson </it>mixture model to identify changes in rna polymerase ii binding quantity using high-throughput sequencing technology |
publisher |
BMC |
series |
BMC Genomics |
issn |
1471-2164 |
publishDate |
2008-09-01 |
description |
<p>Abstract</p> <p>We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statistical model assumes that the number of Pol II-targeted sequences contained within each genomic region follows a <it>Poisson </it>distribution. A <it>Poisson </it>mixture model was then developed to distinguish Pol II binding changes in transcribed region using an empirical approach and an expectation-maximization (EM) algorithm developed for estimation and inference. In order to achieve a global maximum in the M-step, a particle swarm optimization (PSO) was implemented. We applied this model to Pol II binding data generated from hormone-dependent MCF7 breast cancer cells and antiestrogen-resistant MCF7 breast cancer cells before and after treatment with 17<it>β</it>-estradiol (E2). We determined that in the hormone-dependent cells, ~9.9% (2527) genes showed significant changes in Pol II binding after E2 treatment. However, only ~0.7% (172) genes displayed significant Pol II binding changes in E2-treated antiestrogen-resistant cells. These results show that a <it>Poisson </it>mixture model can be used to analyze ChIP-seq data.</p> |
work_keys_str_mv |
AT liuyunlong aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT fengweixing aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT wujiejun aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT nephewkennethp aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT huangtimhm aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT lilang aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT liuyunlong itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT fengweixing itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT wujiejun itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT nephewkennethp itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT huangtimhm itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology AT lilang itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology |
_version_ |
1725522702797635584 |