MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.
Transcriptomic profiling is an immensely powerful hypothesis-generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors...
Main Author: | Avtar Roopra |
---|---|
Format: | Article |
Language: | English |
Published: | Public Library of Science (PLoS), 2020-04-01 |
Series: | PLoS Computational Biology |
Online Access: | https://doi.org/10.1371/journal.pcbi.1007800 |
id |
doaj-d459d4e5e2b64aef93feef5437e244b5 |
---|---|
record_format |
Article |
spelling |
doaj-d459d4e5e2b64aef93feef5437e244b5; 2021-04-21T16:42:36Z; eng; Public Library of Science (PLoS); PLoS Computational Biology; 1553-734X; 1553-7358; 2020-04-01; 16(4): e1007800; 10.1371/journal.pcbi.1007800; MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.; Avtar Roopra; https://doi.org/10.1371/journal.pcbi.1007800 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Avtar Roopra |
title |
MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Computational Biology |
issn |
1553-734X 1553-7358 |
publishDate |
2020-04-01 |
description |
Transcriptomic profiling is an immensely powerful hypothesis-generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define the TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of 'target' or 'non-target', followed by Fisher's exact tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) a cell line expressing or lacking a single TF; 2) breast tumors divided along PAM50 designations; 3) whole-brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype; 4) single-cell RNA-seq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments. |
url |
https://doi.org/10.1371/journal.pcbi.1007800 |
work_keys_str_mv |
AT avtarroopra magicatoolforpredictingtranscriptionfactorsandcofactorsdrivinggenesetsusingencodedata |
_version_ |
1714666798798864384 |
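
The description in this record says that existing approaches label each gene as a 'target' or 'non-target' of a factor and then test for enrichment with Fisher exact tests. As a rough illustration of that baseline only (it is not MAGIC's method, which the record says avoids the binary classification), a one-sided test on such counts might be sketched as follows. The function name `fisher_enrichment_p` and all counts are hypothetical and are not taken from the article.

```python
from math import comb


def fisher_enrichment_p(hits_in_list, list_size, hits_in_background, background_size):
    """One-sided Fisher exact (hypergeometric) p-value.

    Returns the probability of seeing at least `hits_in_list` target genes
    in a gene list of `list_size` genes drawn at random from a background of
    `background_size` genes, of which `hits_in_background` are targets.
    """
    max_hits = min(list_size, hits_in_background)
    non_targets = background_size - hits_in_background
    # Number of ways to draw a list containing k targets, summed over k >= observed.
    tail = sum(
        comb(hits_in_background, k) * comb(non_targets, list_size - k)
        for k in range(hits_in_list, max_hits + 1)
    )
    # Total number of ways to draw any list of this size from the background.
    return tail / comb(background_size, list_size)


# Hypothetical counts: 30 of 100 genes in the list are called 'targets' of a
# factor, against 1000 'targets' among 20000 background genes.
p = fisher_enrichment_p(
    hits_in_list=30, list_size=100, hits_in_background=1000, background_size=20000
)
print(f"enrichment p-value: {p:.3g}")
```

The result depends entirely on where the target/non-target cutoff is drawn, which is the a priori binary step the description says MAGIC does without.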