Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research s...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2020-02-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fgene.2020.00083/full |
id |
doaj-6e2d7ba08e8b454e922e8b1d39aca4e1 |
---|---|
record_format |
Article |
spelling |
doaj-6e2d7ba08e8b454e922e8b1d39aca4e12020-11-25T02:17:57ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-02-011110.3389/fgene.2020.00083492974Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic InformationYuanyuan Ma0Guoying Liu1Yingjun Ma2Qianjun Chen3Qianjun Chen4School of Computer and Information Engineering, Anyang Normal University, Anyang, ChinaSchool of Computer and Information Engineering, Anyang Normal University, Anyang, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaSchool of Life Science, Hubei University, Wuhan, ChinaMicrobe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of “one disease, one microbe,” rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications.https://www.frontiersin.org/article/10.3389/fgene.2020.00083/fullmicrobe-disease associationmatrix factorizationphylogenetic distancehuman microbiomeco-modules |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yuanyuan Ma Guoying Liu Yingjun Ma Qianjun Chen Qianjun Chen |
spellingShingle |
Yuanyuan Ma Guoying Liu Yingjun Ma Qianjun Chen Qianjun Chen Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information Frontiers in Genetics microbe-disease association matrix factorization phylogenetic distance human microbiome co-modules |
author_facet |
Yuanyuan Ma Guoying Liu Yingjun Ma Qianjun Chen Qianjun Chen |
author_sort |
Yuanyuan Ma |
title |
Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_short |
Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_full |
Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_fullStr |
Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_full_unstemmed |
Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information |
title_sort |
integrative analysis for identifying co-modules of microbe-disease data by matrix tri-factorization with phylogenetic information |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Genetics |
issn |
1664-8021 |
publishDate |
2020-02-01 |
description |
Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of “one disease, one microbe,” rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications. |
topic |
microbe-disease association matrix factorization phylogenetic distance human microbiome co-modules |
url |
https://www.frontiersin.org/article/10.3389/fgene.2020.00083/full |
work_keys_str_mv |
AT yuanyuanma integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation AT guoyingliu integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation AT yingjunma integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation AT qianjunchen integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation AT qianjunchen integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation |
_version_ |
1724884014836219904 |