A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology

<p>Abstract</p> <p>We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statist...

Full description

Bibliographic Details
Main Authors: Liu Yunlong, Feng Weixing, Wu Jiejun, Nephew Kenneth P, Huang Tim HM, Li Lang
Format: Article
Language:English
Published: BMC 2008-09-01
Series:BMC Genomics
id doaj-cd102b2567b24f09802e27b6db6e97e9
record_format Article
spelling doaj-cd102b2567b24f09802e27b6db6e97e92020-11-24T23:36:35ZengBMCBMC Genomics1471-21642008-09-019Suppl 2S2310.1186/1471-2164-9-S2-S23A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technologyLiu YunlongFeng WeixingWu JiejunNephew Kenneth PHuang Tim HMLi Lang<p>Abstract</p> <p>We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statistical model assumes that the number of Pol II-targeted sequences contained within each genomic region follows a <it>Poisson </it>distribution. A <it>Poisson </it>mixture model was then developed to distinguish Pol II binding changes in transcribed region using an empirical approach and an expectation-maximization (EM) algorithm developed for estimation and inference. In order to achieve a global maximum in the M-step, a particle swarm optimization (PSO) was implemented. We applied this model to Pol II binding data generated from hormone-dependent MCF7 breast cancer cells and antiestrogen-resistant MCF7 breast cancer cells before and after treatment with 17<it>β</it>-estradiol (E2). We determined that in the hormone-dependent cells, ~9.9% (2527) genes showed significant changes in Pol II binding after E2 treatment. However, only ~0.7% (172) genes displayed significant Pol II binding changes in E2-treated antiestrogen-resistant cells. These results show that a <it>Poisson </it>mixture model can be used to analyze ChIP-seq data.</p>
collection DOAJ
language English
format Article
sources DOAJ
author Liu Yunlong
Feng Weixing
Wu Jiejun
Nephew Kenneth P
Huang Tim HM
Li Lang
spellingShingle Liu Yunlong
Feng Weixing
Wu Jiejun
Nephew Kenneth P
Huang Tim HM
Li Lang
A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
BMC Genomics
author_facet Liu Yunlong
Feng Weixing
Wu Jiejun
Nephew Kenneth P
Huang Tim HM
Li Lang
author_sort Liu Yunlong
title A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_short A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_full A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_fullStr A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_full_unstemmed A <it>Poisson </it>mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology
title_sort <it>poisson </it>mixture model to identify changes in rna polymerase ii binding quantity using high-throughput sequencing technology
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2008-09-01
description <p>Abstract</p> <p>We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statistical model assumes that the number of Pol II-targeted sequences contained within each genomic region follows a <it>Poisson </it>distribution. A <it>Poisson </it>mixture model was then developed to distinguish Pol II binding changes in transcribed region using an empirical approach and an expectation-maximization (EM) algorithm developed for estimation and inference. In order to achieve a global maximum in the M-step, a particle swarm optimization (PSO) was implemented. We applied this model to Pol II binding data generated from hormone-dependent MCF7 breast cancer cells and antiestrogen-resistant MCF7 breast cancer cells before and after treatment with 17<it>β</it>-estradiol (E2). We determined that in the hormone-dependent cells, ~9.9% (2527) genes showed significant changes in Pol II binding after E2 treatment. However, only ~0.7% (172) genes displayed significant Pol II binding changes in E2-treated antiestrogen-resistant cells. These results show that a <it>Poisson </it>mixture model can be used to analyze ChIP-seq data.</p>
work_keys_str_mv AT liuyunlong aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT fengweixing aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT wujiejun aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT nephewkennethp aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT huangtimhm aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT lilang aitpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT liuyunlong itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT fengweixing itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT wujiejun itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT nephewkennethp itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT huangtimhm itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
AT lilang itpoissonitmixturemodeltoidentifychangesinrnapolymeraseiibindingquantityusinghighthroughputsequencingtechnology
_version_ 1725522702797635584