psupertime: supervised pseudotime analysis for time-series single-cell RNA-seq data

MOTIVATION: Improvements in single-cell RNA-seq technologies mean that studies measuring multiple experimental conditions, such as time series, have become more common. At present, few computational methods exist to infer time series-specific transcriptome changes, and such studies have therefore ty...

Full description

Bibliographic Details
Main Authors:	Claassen, M. (Author), Gupta, R. (Author), Macnair, W. (Author)
Format:	Article
Language:	English
Published:	NLM (Medline) 2022
Online Access:	View Fulltext in Publisher


LEADER	02444nam a2200157Ia 4500
001	10.1093-bioinformatics-btac227
008	220706s2022 CNT 000 0 und d
020			\|a 13674811 (ISSN)
245	1	0	\|a psupertime: supervised pseudotime analysis for time-series single-cell RNA-seq data
260		0	\|b NLM (Medline) \|c 2022
856			\|z View Fulltext in Publisher \|u https://doi.org/10.1093/bioinformatics/btac227
520	3		\|a MOTIVATION: Improvements in single-cell RNA-seq technologies mean that studies measuring multiple experimental conditions, such as time series, have become more common. At present, few computational methods exist to infer time series-specific transcriptome changes, and such studies have therefore typically used unsupervised pseudotime methods. While these methods identify cell subpopulations and the transitions between them, they are not appropriate for identifying the genes that vary coherently along the time series. In addition, the orderings they estimate are based only on the major sources of variation in the data, which may not correspond to the processes related to the time labels. RESULTS: We introduce psupertime, a supervised pseudotime approach based on a regression model, which explicitly uses time-series labels as input. It identifies genes that vary coherently along a time series, in addition to pseudotime values for individual cells, and a classifier that can be used to estimate labels for new data with unknown or differing labels. We show that psupertime outperforms benchmark classifiers in terms of identifying time-varying genes and provides better individual cell orderings than popular unsupervised pseudotime techniques. psupertime is applicable to any single-cell RNA-seq dataset with sequential labels (e.g. principally time series but also drug dosage and disease progression), derived from either experimental design and provides a fast, interpretable tool for targeted identification of genes varying along with specific biological processes. AVAILABILITY AND IMPLEMENTATION: R package available at github.com/wmacnair/psupertime and code for results reproduction at github.com/wmacnair/psupplementary. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. © The Author(s) 2022. Published by Oxford University Press.
700	1		\|a Claassen, M. \|e author
700	1		\|a Gupta, R. \|e author
700	1		\|a Macnair, W. \|e author
773			\|t Bioinformatics (Oxford, England)

psupertime: supervised pseudotime analysis for time-series single-cell RNA-seq data

Similar Items