Joint Nonnegative Matrix Factorization Based on Sparse and Graph Laplacian Regularization for Clustering and Co-Differential Expression Genes Analysis
The explosion of multiomics data poses new challenges to existing data mining methods. Joint analysis of multiomics data can make the best of the complementary information that is provided by different types of data. Therefore, they can more accurately explore the biological mechanism of diseases. I...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi-Wiley
2020-01-01
|
Series: | Complexity |
Online Access: | http://dx.doi.org/10.1155/2020/3917812 |
id |
doaj-619501b1c0b14f678c9dbdee77100b2b |
---|---|
record_format |
Article |
spelling |
doaj-619501b1c0b14f678c9dbdee77100b2b2020-11-30T09:11:23ZengHindawi-WileyComplexity1076-27871099-05262020-01-01202010.1155/2020/39178123917812Joint Nonnegative Matrix Factorization Based on Sparse and Graph Laplacian Regularization for Clustering and Co-Differential Expression Genes AnalysisLing-Yun Dai0Rong Zhu1Juan Wang2School of Computer Science, Qufu Normal University, Rizhao 276826, ChinaSchool of Computer Science, Qufu Normal University, Rizhao 276826, ChinaSchool of Computer Science, Qufu Normal University, Rizhao 276826, ChinaThe explosion of multiomics data poses new challenges to existing data mining methods. Joint analysis of multiomics data can make the best of the complementary information that is provided by different types of data. Therefore, they can more accurately explore the biological mechanism of diseases. In this article, two forms of joint nonnegative matrix factorization based on the sparse and graph Laplacian regularization (SG-jNMF) method are proposed. In the method, the graph regularization constraint can preserve the local geometric structure of data. L2,1-norm regularization can enhance the sparsity among the rows and remove redundant features in the data. First, SG-jNMF1 projects multiomics data into a common subspace and applies the multiomics fusion characteristic matrix to mine the important information closely related to diseases. Second, multiomics data of the same disease are mapped into the common sample space by SG-jNMF2, and the cluster structures are detected clearly. Experimental results show that SG-jNMF can achieve significant improvement in sample clustering compared with existing joint analysis frameworks. SG-jNMF also effectively integrates multiomics data to identify co-differentially expressed genes (Co-DEGs). SG-jNMF provides an efficient integrative analysis method for mining the biological information hidden in heterogeneous multiomics data.http://dx.doi.org/10.1155/2020/3917812 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ling-Yun Dai Rong Zhu Juan Wang |
spellingShingle |
Ling-Yun Dai Rong Zhu Juan Wang Joint Nonnegative Matrix Factorization Based on Sparse and Graph Laplacian Regularization for Clustering and Co-Differential Expression Genes Analysis Complexity |
author_facet |
Ling-Yun Dai Rong Zhu Juan Wang |
author_sort |
Ling-Yun Dai |
title |
Joint Nonnegative Matrix Factorization Based on Sparse and Graph Laplacian Regularization for Clustering and Co-Differential Expression Genes Analysis |
title_short |
Joint Nonnegative Matrix Factorization Based on Sparse and Graph Laplacian Regularization for Clustering and Co-Differential Expression Genes Analysis |
title_full |
Joint Nonnegative Matrix Factorization Based on Sparse and Graph Laplacian Regularization for Clustering and Co-Differential Expression Genes Analysis |
title_fullStr |
Joint Nonnegative Matrix Factorization Based on Sparse and Graph Laplacian Regularization for Clustering and Co-Differential Expression Genes Analysis |
title_full_unstemmed |
Joint Nonnegative Matrix Factorization Based on Sparse and Graph Laplacian Regularization for Clustering and Co-Differential Expression Genes Analysis |
title_sort |
joint nonnegative matrix factorization based on sparse and graph laplacian regularization for clustering and co-differential expression genes analysis |
publisher |
Hindawi-Wiley |
series |
Complexity |
issn |
1076-2787 1099-0526 |
publishDate |
2020-01-01 |
description |
The explosion of multiomics data poses new challenges to existing data mining methods. Joint analysis of multiomics data can make the best of the complementary information that is provided by different types of data. Therefore, they can more accurately explore the biological mechanism of diseases. In this article, two forms of joint nonnegative matrix factorization based on the sparse and graph Laplacian regularization (SG-jNMF) method are proposed. In the method, the graph regularization constraint can preserve the local geometric structure of data. L2,1-norm regularization can enhance the sparsity among the rows and remove redundant features in the data. First, SG-jNMF1 projects multiomics data into a common subspace and applies the multiomics fusion characteristic matrix to mine the important information closely related to diseases. Second, multiomics data of the same disease are mapped into the common sample space by SG-jNMF2, and the cluster structures are detected clearly. Experimental results show that SG-jNMF can achieve significant improvement in sample clustering compared with existing joint analysis frameworks. SG-jNMF also effectively integrates multiomics data to identify co-differentially expressed genes (Co-DEGs). SG-jNMF provides an efficient integrative analysis method for mining the biological information hidden in heterogeneous multiomics data. |
url |
http://dx.doi.org/10.1155/2020/3917812 |
work_keys_str_mv |
AT lingyundai jointnonnegativematrixfactorizationbasedonsparseandgraphlaplacianregularizationforclusteringandcodifferentialexpressiongenesanalysis AT rongzhu jointnonnegativematrixfactorizationbasedonsparseandgraphlaplacianregularizationforclusteringandcodifferentialexpressiongenesanalysis AT juanwang jointnonnegativematrixfactorizationbasedonsparseandgraphlaplacianregularizationforclusteringandcodifferentialexpressiongenesanalysis |
_version_ |
1715028045004275712 |