An Adaptive Sparse Subspace Clustering for Cell Type Identification

The rapid development of single-cell transcriptome sequencing technology has provided us with a cell-level perspective to study biological problems. Identification of cell types is one of the fundamental issues in computational analysis of single-cell data. Due to the large amount of noise from sing...

Full description

Bibliographic Details
Main Authors: Ruiqing Zheng, Zhenlan Liang, Xiang Chen, Yu Tian, Chen Cao, Min Li
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-04-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2020.00407/full
id doaj-b5c72016f4f246afaeb0ff822c9b933e
record_format Article
spelling doaj-b5c72016f4f246afaeb0ff822c9b933e2020-11-25T02:10:01ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-04-011110.3389/fgene.2020.00407506830An Adaptive Sparse Subspace Clustering for Cell Type IdentificationRuiqing Zheng0Zhenlan Liang1Xiang Chen2Yu Tian3Chen Cao4Min Li5School of Computer Science and Engineering, Central South University, Changsha, ChinaSchool of Computer Science and Engineering, Central South University, Changsha, ChinaSchool of Computer Science and Engineering, Central South University, Changsha, ChinaSchool of Computer Science and Engineering, Central South University, Changsha, ChinaDepartments of Biochemistry & Molecular Biology and Medical Genetics, Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, CanadaSchool of Computer Science and Engineering, Central South University, Changsha, ChinaThe rapid development of single-cell transcriptome sequencing technology has provided us with a cell-level perspective to study biological problems. Identification of cell types is one of the fundamental issues in computational analysis of single-cell data. Due to the large amount of noise from single-cell technologies and high dimension of expression profiles, traditional clustering methods are not so applicable to solve it. To address the problem, we have designed an adaptive sparse subspace clustering method, called AdaptiveSSC, to identify cell types. AdaptiveSSC is based on the assumption that the expression of cells with the same type lies in the same subspace; one cell can be expressed as a linear combination of the other cells. Moreover, it uses a data-driven adaptive sparse constraint to construct the similarity matrix. The comparison results of 10 scRNA-seq datasets show that AdaptiveSSC outperforms original subspace clustering and other state-of-art methods in most cases. Moreover, the learned similarity matrix can also be integrated with a modified t-SNE to obtain an improved visualization result.https://www.frontiersin.org/article/10.3389/fgene.2020.00407/fullsingle cell RNA-seqsubspace clusteringadaptive sparse strategysimilarity learningvisualization
collection DOAJ
language English
format Article
sources DOAJ
author Ruiqing Zheng
Zhenlan Liang
Xiang Chen
Yu Tian
Chen Cao
Min Li
spellingShingle Ruiqing Zheng
Zhenlan Liang
Xiang Chen
Yu Tian
Chen Cao
Min Li
An Adaptive Sparse Subspace Clustering for Cell Type Identification
Frontiers in Genetics
single cell RNA-seq
subspace clustering
adaptive sparse strategy
similarity learning
visualization
author_facet Ruiqing Zheng
Zhenlan Liang
Xiang Chen
Yu Tian
Chen Cao
Min Li
author_sort Ruiqing Zheng
title An Adaptive Sparse Subspace Clustering for Cell Type Identification
title_short An Adaptive Sparse Subspace Clustering for Cell Type Identification
title_full An Adaptive Sparse Subspace Clustering for Cell Type Identification
title_fullStr An Adaptive Sparse Subspace Clustering for Cell Type Identification
title_full_unstemmed An Adaptive Sparse Subspace Clustering for Cell Type Identification
title_sort adaptive sparse subspace clustering for cell type identification
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2020-04-01
description The rapid development of single-cell transcriptome sequencing technology has provided us with a cell-level perspective to study biological problems. Identification of cell types is one of the fundamental issues in computational analysis of single-cell data. Due to the large amount of noise from single-cell technologies and high dimension of expression profiles, traditional clustering methods are not so applicable to solve it. To address the problem, we have designed an adaptive sparse subspace clustering method, called AdaptiveSSC, to identify cell types. AdaptiveSSC is based on the assumption that the expression of cells with the same type lies in the same subspace; one cell can be expressed as a linear combination of the other cells. Moreover, it uses a data-driven adaptive sparse constraint to construct the similarity matrix. The comparison results of 10 scRNA-seq datasets show that AdaptiveSSC outperforms original subspace clustering and other state-of-art methods in most cases. Moreover, the learned similarity matrix can also be integrated with a modified t-SNE to obtain an improved visualization result.
topic single cell RNA-seq
subspace clustering
adaptive sparse strategy
similarity learning
visualization
url https://www.frontiersin.org/article/10.3389/fgene.2020.00407/full
work_keys_str_mv AT ruiqingzheng anadaptivesparsesubspaceclusteringforcelltypeidentification
AT zhenlanliang anadaptivesparsesubspaceclusteringforcelltypeidentification
AT xiangchen anadaptivesparsesubspaceclusteringforcelltypeidentification
AT yutian anadaptivesparsesubspaceclusteringforcelltypeidentification
AT chencao anadaptivesparsesubspaceclusteringforcelltypeidentification
AT minli anadaptivesparsesubspaceclusteringforcelltypeidentification
AT ruiqingzheng adaptivesparsesubspaceclusteringforcelltypeidentification
AT zhenlanliang adaptivesparsesubspaceclusteringforcelltypeidentification
AT xiangchen adaptivesparsesubspaceclusteringforcelltypeidentification
AT yutian adaptivesparsesubspaceclusteringforcelltypeidentification
AT chencao adaptivesparsesubspaceclusteringforcelltypeidentification
AT minli adaptivesparsesubspaceclusteringforcelltypeidentification
_version_ 1724921212442771456