Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.

Identifying transcription factors (TF) involved in producing a genome-wide transcriptional profile is an essential step in building mechanistic model that can explain observed gene expression data. We developed a statistical framework for constructing genome-wide signatures of TF activity, and for u...

Full description

Bibliographic Details
Main Authors: Jing Chen, Zhen Hu, Mukta Phatak, John Reichard, Johannes M Freudenberg, Siva Sivaganesan, Mario Medvedovic
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS Computational Biology
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24039560/pdf/?tool=EBI
id doaj-c4c9bcad41d842abb5563bb115e9351a
record_format Article
spelling doaj-c4c9bcad41d842abb5563bb115e9351a2021-04-21T15:41:48ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582013-01-0199e100319810.1371/journal.pcbi.1003198Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.Jing ChenZhen HuMukta PhatakJohn ReichardJohannes M FreudenbergSiva SivaganesanMario MedvedovicIdentifying transcription factors (TF) involved in producing a genome-wide transcriptional profile is an essential step in building mechanistic model that can explain observed gene expression data. We developed a statistical framework for constructing genome-wide signatures of TF activity, and for using such signatures in the analysis of gene expression data produced by complex transcriptional regulatory programs. Our framework integrates ChIP-seq data and appropriately matched gene expression profiles to identify True REGulatory (TREG) TF-gene interactions. It provides genome-wide quantification of the likelihood of regulatory TF-gene interaction that can be used to either identify regulated genes, or as genome-wide signature of TF activity. To effectively use ChIP-seq data, we introduce a novel statistical model that integrates information from all binding "peaks" within 2 Mb window around a gene's transcription start site (TSS), and provides gene-level binding scores and probabilities of regulatory interaction. In the second step we integrate these binding scores and regulatory probabilities with gene expression data to assess the likelihood of True REGulatory (TREG) TF-gene interactions. We demonstrate the advantages of TREG framework in identifying genes regulated by two TFs with widely different distribution of functional binding events (ERα and E2f1). We also show that TREG signatures of TF activity vastly improve our ability to detect involvement of ERα in producing complex diseases-related transcriptional profiles. Through a large study of disease-related transcriptional signatures and transcriptional signatures of drug activity, we demonstrate that increase in statistical power associated with the use of TREG signatures makes the crucial difference in identifying key targets for treatment, and drugs to use for treatment. All methods are implemented in an open-source R package treg. The package also contains all data used in the analysis including 494 TREG binding profiles based on ENCODE ChIP-seq data. The treg package can be downloaded at http://GenomicsPortals.org.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24039560/pdf/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Jing Chen
Zhen Hu
Mukta Phatak
John Reichard
Johannes M Freudenberg
Siva Sivaganesan
Mario Medvedovic
spellingShingle Jing Chen
Zhen Hu
Mukta Phatak
John Reichard
Johannes M Freudenberg
Siva Sivaganesan
Mario Medvedovic
Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.
PLoS Computational Biology
author_facet Jing Chen
Zhen Hu
Mukta Phatak
John Reichard
Johannes M Freudenberg
Siva Sivaganesan
Mario Medvedovic
author_sort Jing Chen
title Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.
title_short Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.
title_full Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.
title_fullStr Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.
title_full_unstemmed Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.
title_sort genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2013-01-01
description Identifying transcription factors (TF) involved in producing a genome-wide transcriptional profile is an essential step in building mechanistic model that can explain observed gene expression data. We developed a statistical framework for constructing genome-wide signatures of TF activity, and for using such signatures in the analysis of gene expression data produced by complex transcriptional regulatory programs. Our framework integrates ChIP-seq data and appropriately matched gene expression profiles to identify True REGulatory (TREG) TF-gene interactions. It provides genome-wide quantification of the likelihood of regulatory TF-gene interaction that can be used to either identify regulated genes, or as genome-wide signature of TF activity. To effectively use ChIP-seq data, we introduce a novel statistical model that integrates information from all binding "peaks" within 2 Mb window around a gene's transcription start site (TSS), and provides gene-level binding scores and probabilities of regulatory interaction. In the second step we integrate these binding scores and regulatory probabilities with gene expression data to assess the likelihood of True REGulatory (TREG) TF-gene interactions. We demonstrate the advantages of TREG framework in identifying genes regulated by two TFs with widely different distribution of functional binding events (ERα and E2f1). We also show that TREG signatures of TF activity vastly improve our ability to detect involvement of ERα in producing complex diseases-related transcriptional profiles. Through a large study of disease-related transcriptional signatures and transcriptional signatures of drug activity, we demonstrate that increase in statistical power associated with the use of TREG signatures makes the crucial difference in identifying key targets for treatment, and drugs to use for treatment. All methods are implemented in an open-source R package treg. The package also contains all data used in the analysis including 494 TREG binding profiles based on ENCODE ChIP-seq data. The treg package can be downloaded at http://GenomicsPortals.org.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24039560/pdf/?tool=EBI
work_keys_str_mv AT jingchen genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT zhenhu genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT muktaphatak genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT johnreichard genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT johannesmfreudenberg genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT sivasivaganesan genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
AT mariomedvedovic genomewidesignaturesoftranscriptionfactoractivityconnectingtranscriptionfactorsdiseaseandsmallmolecules
_version_ 1714667190131621888