M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data

Abstract Background Various statistical models have been developed to model the single cell RNA-seq expression profiles, capture its multimodality, and conduct differential gene expression test. However, for expression data generated by different experimental design and platforms, there is currently...

Full description

Bibliographic Details
Main Authors: Yu Zhang, Changlin Wan, Pengcheng Wang, Wennan Chang, Yan Huo, Jian Chen, Qin Ma, Sha Cao, Chi Zhang
Format: Article
Language:English
Published: BMC 2019-12-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-019-3243-1
id doaj-fa5c57a58ff64adaa20df927ac574689
record_format Article
spelling doaj-fa5c57a58ff64adaa20df927ac5746892020-12-20T12:42:26ZengBMCBMC Bioinformatics1471-21052019-12-0120S241510.1186/s12859-019-3243-1M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing dataYu Zhang0Changlin Wan1Pengcheng Wang2Wennan Chang3Yan Huo4Jian Chen5Qin Ma6Sha Cao7Chi Zhang8MOE Key Laboratory of Symbolic Computation and Knowledge Engineering, Colleges of Computer Science and Technology, Jilin UniversityCenter for Computational Biology and Bioinformatics, Indiana University, School of MedicineDepartment of Computer Science, Indiana University-Purdue University IndianapolisCenter for Computational Biology and Bioinformatics, Indiana University, School of MedicineCenter for Computational Biology and Bioinformatics, Indiana University, School of MedicineShanghai Pulmonary Hospital, Tongji University School of MedicineDepartment of Biomedical Informatics, The Ohio State UniversityCenter for Computational Biology and Bioinformatics, Indiana University, School of MedicineCenter for Computational Biology and Bioinformatics, Indiana University, School of MedicineAbstract Background Various statistical models have been developed to model the single cell RNA-seq expression profiles, capture its multimodality, and conduct differential gene expression test. However, for expression data generated by different experimental design and platforms, there is currently lack of capability to determine the most proper statistical model. Results We developed an R package, namely Multi-Modal Model Selection (M3S), for gene-wise selection of the most proper multi-modality statistical model and downstream analysis, useful in a single-cell or large scale bulk tissue transcriptomic data. M3S is featured with (1) gene-wise selection of the most parsimonious model among 11 most commonly utilized ones, that can best fit the expression distribution of the gene, (2) parameter estimation of a selected model, and (3) differential gene expression test based on the selected model. Conclusion A comprehensive evaluation suggested that M3S can accurately capture the multimodality on simulated and real single cell data. An open source package and is available through GitHub at https://github.com/zy26/M3S.https://doi.org/10.1186/s12859-019-3243-1Single cell RNA-seqMultimodalityDifferential gene expression analysisDrop-seqLeft truncated mixture Gaussian
collection DOAJ
language English
format Article
sources DOAJ
author Yu Zhang
Changlin Wan
Pengcheng Wang
Wennan Chang
Yan Huo
Jian Chen
Qin Ma
Sha Cao
Chi Zhang
spellingShingle Yu Zhang
Changlin Wan
Pengcheng Wang
Wennan Chang
Yan Huo
Jian Chen
Qin Ma
Sha Cao
Chi Zhang
M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data
BMC Bioinformatics
Single cell RNA-seq
Multimodality
Differential gene expression analysis
Drop-seq
Left truncated mixture Gaussian
author_facet Yu Zhang
Changlin Wan
Pengcheng Wang
Wennan Chang
Yan Huo
Jian Chen
Qin Ma
Sha Cao
Chi Zhang
author_sort Yu Zhang
title M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data
title_short M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data
title_full M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data
title_fullStr M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data
title_full_unstemmed M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data
title_sort m3s: a comprehensive model selection for multi-modal single-cell rna sequencing data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2019-12-01
description Abstract Background Various statistical models have been developed to model the single cell RNA-seq expression profiles, capture its multimodality, and conduct differential gene expression test. However, for expression data generated by different experimental design and platforms, there is currently lack of capability to determine the most proper statistical model. Results We developed an R package, namely Multi-Modal Model Selection (M3S), for gene-wise selection of the most proper multi-modality statistical model and downstream analysis, useful in a single-cell or large scale bulk tissue transcriptomic data. M3S is featured with (1) gene-wise selection of the most parsimonious model among 11 most commonly utilized ones, that can best fit the expression distribution of the gene, (2) parameter estimation of a selected model, and (3) differential gene expression test based on the selected model. Conclusion A comprehensive evaluation suggested that M3S can accurately capture the multimodality on simulated and real single cell data. An open source package and is available through GitHub at https://github.com/zy26/M3S.
topic Single cell RNA-seq
Multimodality
Differential gene expression analysis
Drop-seq
Left truncated mixture Gaussian
url https://doi.org/10.1186/s12859-019-3243-1
work_keys_str_mv AT yuzhang m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
AT changlinwan m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
AT pengchengwang m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
AT wennanchang m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
AT yanhuo m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
AT jianchen m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
AT qinma m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
AT shacao m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
AT chizhang m3sacomprehensivemodelselectionformultimodalsinglecellrnasequencingdata
_version_ 1724376211147194368