MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.

Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors...

Full description

Bibliographic Details
Main Author: Avtar Roopra
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-04-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1007800
id doaj-d459d4e5e2b64aef93feef5437e244b5
record_format Article
spelling doaj-d459d4e5e2b64aef93feef5437e244b52021-04-21T16:42:36ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582020-04-01164e100780010.1371/journal.pcbi.1007800MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.Avtar RoopraTranscriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of 'target', or 'non-target' followed by Fisher Exact Tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across a 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype 4) Single cell RNAseq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments.https://doi.org/10.1371/journal.pcbi.1007800
collection DOAJ
language English
format Article
sources DOAJ
author Avtar Roopra
spellingShingle Avtar Roopra
MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.
PLoS Computational Biology
author_facet Avtar Roopra
author_sort Avtar Roopra
title MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.
title_short MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.
title_full MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.
title_fullStr MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.
title_full_unstemmed MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.
title_sort magic: a tool for predicting transcription factors and cofactors driving gene sets using encode data.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2020-04-01
description Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of 'target', or 'non-target' followed by Fisher Exact Tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across a 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype 4) Single cell RNAseq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments.
url https://doi.org/10.1371/journal.pcbi.1007800
work_keys_str_mv AT avtarroopra magicatoolforpredictingtranscriptionfactorsandcofactorsdrivinggenesetsusingencodedata
_version_ 1714666798798864384