Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq

Identifying gene expression programs underlying both cell-type identity and cellular activities (e.g. life-cycle processes, responses to environmental cues) is crucial for understanding the organization of cells and tissues. Although single-cell RNA-Seq (scRNA-Seq) can quantify transcripts in indivi...

Full description

Bibliographic Details
Main Authors: Dylan Kotliar, Adrian Veres, M Aurel Nagy, Shervin Tabrizi, Eran Hodis, Douglas A Melton, Pardis C Sabeti
Format: Article
Language:English
Published: eLife Sciences Publications Ltd 2019-07-01
Series:eLife
Subjects:
Online Access:https://elifesciences.org/articles/43803
id doaj-53e0519e4b954b8bbbf464efc1b123ac
record_format Article
spelling doaj-53e0519e4b954b8bbbf464efc1b123ac2021-05-05T17:44:57ZengeLife Sciences Publications LtdeLife2050-084X2019-07-01810.7554/eLife.43803Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-SeqDylan Kotliar0https://orcid.org/0000-0002-7968-645XAdrian Veres1M Aurel Nagy2https://orcid.org/0000-0003-4608-1152Shervin Tabrizi3https://orcid.org/0000-0003-2780-8432Eran Hodis4Douglas A Melton5https://orcid.org/0000-0002-1623-5504Pardis C Sabeti6Department of Systems Biology, Harvard Medical School, Boston, United States; Broad Institute of MIT and Harvard, Cambridge, United States; Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, United StatesDepartment of Systems Biology, Harvard Medical School, Boston, United States; Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, United States; Harvard Stem Cell Institute, Harvard University, Cambridge, United StatesHarvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, United States; Department of Neurobiology, Harvard Medical School, Boston, United StatesBroad Institute of MIT and Harvard, Cambridge, United StatesHarvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, United States; Biophysics Program, Harvard University, Cambridge, United StatesHarvard Stem Cell Institute, Harvard University, Cambridge, United States; Howard Hughes Medical Institute, Chevy Chase, United StatesDepartment of Systems Biology, Harvard Medical School, Boston, United States; Broad Institute of MIT and Harvard, Cambridge, United States; Howard Hughes Medical Institute, Chevy Chase, United StatesIdentifying gene expression programs underlying both cell-type identity and cellular activities (e.g. life-cycle processes, responses to environmental cues) is crucial for understanding the organization of cells and tissues. Although single-cell RNA-Seq (scRNA-Seq) can quantify transcripts in individual cells, each cell’s expression profile may be a mixture of both types of programs, making them difficult to disentangle. Here, we benchmark and enhance the use of matrix factorization to solve this problem. We show with simulations that a method we call consensus non-negative matrix factorization (cNMF) accurately infers identity and activity programs, including their relative contributions in each cell. To illustrate the insights this approach enables, we apply it to published brain organoid and visual cortex scRNA-Seq datasets; cNMF refines cell types and identifies both expected (e.g. cell cycle and hypoxia) and novel activity programs, including programs that may underlie a neurosecretory phenotype and synaptogenesis.https://elifesciences.org/articles/43803gene expression programssingle-cell Rna-Seqmatrix factorizationvisual cortexbrain organoidssynaptogenesis
collection DOAJ
language English
format Article
sources DOAJ
author Dylan Kotliar
Adrian Veres
M Aurel Nagy
Shervin Tabrizi
Eran Hodis
Douglas A Melton
Pardis C Sabeti
spellingShingle Dylan Kotliar
Adrian Veres
M Aurel Nagy
Shervin Tabrizi
Eran Hodis
Douglas A Melton
Pardis C Sabeti
Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq
eLife
gene expression programs
single-cell Rna-Seq
matrix factorization
visual cortex
brain organoids
synaptogenesis
author_facet Dylan Kotliar
Adrian Veres
M Aurel Nagy
Shervin Tabrizi
Eran Hodis
Douglas A Melton
Pardis C Sabeti
author_sort Dylan Kotliar
title Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq
title_short Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq
title_full Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq
title_fullStr Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq
title_full_unstemmed Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq
title_sort identifying gene expression programs of cell-type identity and cellular activity with single-cell rna-seq
publisher eLife Sciences Publications Ltd
series eLife
issn 2050-084X
publishDate 2019-07-01
description Identifying gene expression programs underlying both cell-type identity and cellular activities (e.g. life-cycle processes, responses to environmental cues) is crucial for understanding the organization of cells and tissues. Although single-cell RNA-Seq (scRNA-Seq) can quantify transcripts in individual cells, each cell’s expression profile may be a mixture of both types of programs, making them difficult to disentangle. Here, we benchmark and enhance the use of matrix factorization to solve this problem. We show with simulations that a method we call consensus non-negative matrix factorization (cNMF) accurately infers identity and activity programs, including their relative contributions in each cell. To illustrate the insights this approach enables, we apply it to published brain organoid and visual cortex scRNA-Seq datasets; cNMF refines cell types and identifies both expected (e.g. cell cycle and hypoxia) and novel activity programs, including programs that may underlie a neurosecretory phenotype and synaptogenesis.
topic gene expression programs
single-cell Rna-Seq
matrix factorization
visual cortex
brain organoids
synaptogenesis
url https://elifesciences.org/articles/43803
work_keys_str_mv AT dylankotliar identifyinggeneexpressionprogramsofcelltypeidentityandcellularactivitywithsinglecellrnaseq
AT adrianveres identifyinggeneexpressionprogramsofcelltypeidentityandcellularactivitywithsinglecellrnaseq
AT maurelnagy identifyinggeneexpressionprogramsofcelltypeidentityandcellularactivitywithsinglecellrnaseq
AT shervintabrizi identifyinggeneexpressionprogramsofcelltypeidentityandcellularactivitywithsinglecellrnaseq
AT eranhodis identifyinggeneexpressionprogramsofcelltypeidentityandcellularactivitywithsinglecellrnaseq
AT douglasamelton identifyinggeneexpressionprogramsofcelltypeidentityandcellularactivitywithsinglecellrnaseq
AT pardiscsabeti identifyinggeneexpressionprogramsofcelltypeidentityandcellularactivitywithsinglecellrnaseq
_version_ 1721459077815468032