pathVar: a new method for pathway-based interpretation of gene expression variability

Identifying the pathways that control a cellular phenotype is the first step to building a mechanistic model. Recent examples in developmental biology, cancer genomics, and neurological disease have demonstrated how changes in the variability of gene expression can highlight important genes that are...

Full description

Bibliographic Details
Main Authors: Laurence de Torrente, Samuel Zimmerman, Deanne Taylor, Yu Hasegawa, Christine A. Wells, Jessica C. Mar
Format: Article
Language:English
Published: PeerJ Inc. 2017-05-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/3334.pdf
id doaj-3f8cd2117e2442a8b9b8a0c72a2bea26
record_format Article
spelling doaj-3f8cd2117e2442a8b9b8a0c72a2bea262020-11-24T23:15:50ZengPeerJ Inc.PeerJ2167-83592017-05-015e333410.7717/peerj.3334pathVar: a new method for pathway-based interpretation of gene expression variabilityLaurence de Torrente0Samuel Zimmerman1Deanne Taylor2Yu Hasegawa3Christine A. Wells4Jessica C. Mar5Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, United States of AmericaDepartment of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, United States of AmericaDepartment of Biomedical and Health Informatics, Children’s Hospital of Philadelphia, Philadelphia, PA, United States of AmericaDepartment of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, United States of AmericaDepartment of Anatomy and Neuroscience, University of Melbourne, Melbourne, Victoria, AustraliaDepartment of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, United States of AmericaIdentifying the pathways that control a cellular phenotype is the first step to building a mechanistic model. Recent examples in developmental biology, cancer genomics, and neurological disease have demonstrated how changes in the variability of gene expression can highlight important genes that are under different degrees of regulatory control. Simple statistical tests exist to identify differentially-variable genes; however, methods for investigating how changes in gene expression variability in the context of pathways and gene sets are under-explored. Here we present pathVar, a new method that provides functional interpretation of gene expression variability changes at the level of pathways and gene sets. pathVar is based on a multinomial exact test, or an asymptotic Chi-squared test as a more computationally-efficient alternative. The method can be used for gene expression studies from any technology platform in all biological settings either with a single phenotypic group, or two-group comparisons. To demonstrate its utility, we applied the method to a diverse set of diseases, species and samples. Results from pathVar are benchmarked against analyses based on average expression and two methods of GSEA, and demonstrate that analyses using both statistics are useful for understanding transcriptional regulation. We also provide recommendations for the choice of variability statistic that have been informed through analyses on simulations and real data. Based on the datasets selected, we show how pathVar can be used to gain insight into expression variability of single cell versus bulk samples, different stem cell populations, and cancer versus normal tissue comparisons.https://peerj.com/articles/3334.pdfTranscriptional regulationGene expression variabilitySingle cell analysisBioinformaticsFunctional genomicsCellular heterogeneity
collection DOAJ
language English
format Article
sources DOAJ
author Laurence de Torrente
Samuel Zimmerman
Deanne Taylor
Yu Hasegawa
Christine A. Wells
Jessica C. Mar
spellingShingle Laurence de Torrente
Samuel Zimmerman
Deanne Taylor
Yu Hasegawa
Christine A. Wells
Jessica C. Mar
pathVar: a new method for pathway-based interpretation of gene expression variability
PeerJ
Transcriptional regulation
Gene expression variability
Single cell analysis
Bioinformatics
Functional genomics
Cellular heterogeneity
author_facet Laurence de Torrente
Samuel Zimmerman
Deanne Taylor
Yu Hasegawa
Christine A. Wells
Jessica C. Mar
author_sort Laurence de Torrente
title pathVar: a new method for pathway-based interpretation of gene expression variability
title_short pathVar: a new method for pathway-based interpretation of gene expression variability
title_full pathVar: a new method for pathway-based interpretation of gene expression variability
title_fullStr pathVar: a new method for pathway-based interpretation of gene expression variability
title_full_unstemmed pathVar: a new method for pathway-based interpretation of gene expression variability
title_sort pathvar: a new method for pathway-based interpretation of gene expression variability
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2017-05-01
description Identifying the pathways that control a cellular phenotype is the first step to building a mechanistic model. Recent examples in developmental biology, cancer genomics, and neurological disease have demonstrated how changes in the variability of gene expression can highlight important genes that are under different degrees of regulatory control. Simple statistical tests exist to identify differentially-variable genes; however, methods for investigating how changes in gene expression variability in the context of pathways and gene sets are under-explored. Here we present pathVar, a new method that provides functional interpretation of gene expression variability changes at the level of pathways and gene sets. pathVar is based on a multinomial exact test, or an asymptotic Chi-squared test as a more computationally-efficient alternative. The method can be used for gene expression studies from any technology platform in all biological settings either with a single phenotypic group, or two-group comparisons. To demonstrate its utility, we applied the method to a diverse set of diseases, species and samples. Results from pathVar are benchmarked against analyses based on average expression and two methods of GSEA, and demonstrate that analyses using both statistics are useful for understanding transcriptional regulation. We also provide recommendations for the choice of variability statistic that have been informed through analyses on simulations and real data. Based on the datasets selected, we show how pathVar can be used to gain insight into expression variability of single cell versus bulk samples, different stem cell populations, and cancer versus normal tissue comparisons.
topic Transcriptional regulation
Gene expression variability
Single cell analysis
Bioinformatics
Functional genomics
Cellular heterogeneity
url https://peerj.com/articles/3334.pdf
work_keys_str_mv AT laurencedetorrente pathvaranewmethodforpathwaybasedinterpretationofgeneexpressionvariability
AT samuelzimmerman pathvaranewmethodforpathwaybasedinterpretationofgeneexpressionvariability
AT deannetaylor pathvaranewmethodforpathwaybasedinterpretationofgeneexpressionvariability
AT yuhasegawa pathvaranewmethodforpathwaybasedinterpretationofgeneexpressionvariability
AT christineawells pathvaranewmethodforpathwaybasedinterpretationofgeneexpressionvariability
AT jessicacmar pathvaranewmethodforpathwaybasedinterpretationofgeneexpressionvariability
_version_ 1725589302072573952