PathCORE-T: identifying and visualizing globally co-occurring pathways in large transcriptomic compendia

Abstract Background Investigators often interpret genome-wide data by analyzing the expression levels of genes within pathways. While this within-pathway analysis is routine, the products of any one pathway can affect the activity of other pathways. Past efforts to identify relationships between bio...

Full description

Bibliographic Details
Main Authors: Kathleen M. Chen, Jie Tan, Gregory P. Way, Georgia Doing, Deborah A. Hogan, Casey S. Greene
Format: Article
Language:English
Published: BMC 2018-07-01
Series:BioData Mining
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13040-018-0175-7
id doaj-7e5f9bf8f9e149f7be157eaad8c52c55
record_format Article
spelling doaj-7e5f9bf8f9e149f7be157eaad8c52c552020-11-24T20:40:21ZengBMCBioData Mining1756-03812018-07-0111112010.1186/s13040-018-0175-7PathCORE-T: identifying and visualizing globally co-occurring pathways in large transcriptomic compendiaKathleen M. Chen0Jie Tan1Gregory P. Way2Georgia Doing3Deborah A. Hogan4Casey S. Greene5Department of Systems Pharmacology and Translational Therapeutics. Perelman School of Medicine, University of PennsylvaniaDepartment of Molecular and Systems Biology, Geisel School of Medicine at DartmouthDepartment of Systems Pharmacology and Translational Therapeutics. Perelman School of Medicine, University of PennsylvaniaDepartment of Microbiology and Immunology, Geisel School of Medicine at DartmouthDepartment of Microbiology and Immunology, Geisel School of Medicine at DartmouthDepartment of Systems Pharmacology and Translational Therapeutics. Perelman School of Medicine, University of PennsylvaniaAbstract Background Investigators often interpret genome-wide data by analyzing the expression levels of genes within pathways. While this within-pathway analysis is routine, the products of any one pathway can affect the activity of other pathways. Past efforts to identify relationships between biological processes have evaluated overlap in knowledge bases or evaluated changes that occur after specific treatments. Individual experiments can highlight condition-specific pathway-pathway relationships; however, constructing a complete network of such relationships across many conditions requires analyzing results from many studies. Results We developed PathCORE-T framework by implementing existing methods to identify pathway-pathway transcriptional relationships evident across a broad data compendium. PathCORE-T is applied to the output of feature construction algorithms; it identifies pairs of pathways observed in features more than expected by chance as functionally co-occurring. We demonstrate PathCORE-T by analyzing an existing eADAGE model of a microbial compendium and building and analyzing NMF features from the TCGA dataset of 33 cancer types. The PathCORE-T framework includes a demonstration web interface, with source code, that users can launch to (1) visualize the network and (2) review the expression levels of associated genes in the original data. PathCORE-T creates and displays the network of globally co-occurring pathways based on features observed in a machine learning analysis of gene expression data. Conclusions The PathCORE-T framework identifies transcriptionally co-occurring pathways from the results of unsupervised analysis of gene expression data and visualizes the relationships between pathways as a network. PathCORE-T recapitulated previously described pathway-pathway relationships and suggested experimentally testable additional hypotheses that remain to be explored.http://link.springer.com/article/10.1186/s13040-018-0175-7Gene expressionUnsupervised feature constructionCrosstalkPathway interactions
collection DOAJ
language English
format Article
sources DOAJ
author Kathleen M. Chen
Jie Tan
Gregory P. Way
Georgia Doing
Deborah A. Hogan
Casey S. Greene
spellingShingle Kathleen M. Chen
Jie Tan
Gregory P. Way
Georgia Doing
Deborah A. Hogan
Casey S. Greene
PathCORE-T: identifying and visualizing globally co-occurring pathways in large transcriptomic compendia
BioData Mining
Gene expression
Unsupervised feature construction
Crosstalk
Pathway interactions
author_facet Kathleen M. Chen
Jie Tan
Gregory P. Way
Georgia Doing
Deborah A. Hogan
Casey S. Greene
author_sort Kathleen M. Chen
title PathCORE-T: identifying and visualizing globally co-occurring pathways in large transcriptomic compendia
title_short PathCORE-T: identifying and visualizing globally co-occurring pathways in large transcriptomic compendia
title_full PathCORE-T: identifying and visualizing globally co-occurring pathways in large transcriptomic compendia
title_fullStr PathCORE-T: identifying and visualizing globally co-occurring pathways in large transcriptomic compendia
title_full_unstemmed PathCORE-T: identifying and visualizing globally co-occurring pathways in large transcriptomic compendia
title_sort pathcore-t: identifying and visualizing globally co-occurring pathways in large transcriptomic compendia
publisher BMC
series BioData Mining
issn 1756-0381
publishDate 2018-07-01
description Abstract Background Investigators often interpret genome-wide data by analyzing the expression levels of genes within pathways. While this within-pathway analysis is routine, the products of any one pathway can affect the activity of other pathways. Past efforts to identify relationships between biological processes have evaluated overlap in knowledge bases or evaluated changes that occur after specific treatments. Individual experiments can highlight condition-specific pathway-pathway relationships; however, constructing a complete network of such relationships across many conditions requires analyzing results from many studies. Results We developed PathCORE-T framework by implementing existing methods to identify pathway-pathway transcriptional relationships evident across a broad data compendium. PathCORE-T is applied to the output of feature construction algorithms; it identifies pairs of pathways observed in features more than expected by chance as functionally co-occurring. We demonstrate PathCORE-T by analyzing an existing eADAGE model of a microbial compendium and building and analyzing NMF features from the TCGA dataset of 33 cancer types. The PathCORE-T framework includes a demonstration web interface, with source code, that users can launch to (1) visualize the network and (2) review the expression levels of associated genes in the original data. PathCORE-T creates and displays the network of globally co-occurring pathways based on features observed in a machine learning analysis of gene expression data. Conclusions The PathCORE-T framework identifies transcriptionally co-occurring pathways from the results of unsupervised analysis of gene expression data and visualizes the relationships between pathways as a network. PathCORE-T recapitulated previously described pathway-pathway relationships and suggested experimentally testable additional hypotheses that remain to be explored.
topic Gene expression
Unsupervised feature construction
Crosstalk
Pathway interactions
url http://link.springer.com/article/10.1186/s13040-018-0175-7
work_keys_str_mv AT kathleenmchen pathcoretidentifyingandvisualizinggloballycooccurringpathwaysinlargetranscriptomiccompendia
AT jietan pathcoretidentifyingandvisualizinggloballycooccurringpathwaysinlargetranscriptomiccompendia
AT gregorypway pathcoretidentifyingandvisualizinggloballycooccurringpathwaysinlargetranscriptomiccompendia
AT georgiadoing pathcoretidentifyingandvisualizinggloballycooccurringpathwaysinlargetranscriptomiccompendia
AT deborahahogan pathcoretidentifyingandvisualizinggloballycooccurringpathwaysinlargetranscriptomiccompendia
AT caseysgreene pathcoretidentifyingandvisualizinggloballycooccurringpathwaysinlargetranscriptomiccompendia
_version_ 1716827203677716480