Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information

Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research s...

Full description

Bibliographic Details
Main Authors: Yuanyuan Ma, Guoying Liu, Yingjun Ma, Qianjun Chen
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-02-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2020.00083/full
id doaj-6e2d7ba08e8b454e922e8b1d39aca4e1
record_format Article
spelling doaj-6e2d7ba08e8b454e922e8b1d39aca4e12020-11-25T02:17:57ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-02-011110.3389/fgene.2020.00083492974Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic InformationYuanyuan Ma0Guoying Liu1Yingjun Ma2Qianjun Chen3Qianjun Chen4School of Computer and Information Engineering, Anyang Normal University, Anyang, ChinaSchool of Computer and Information Engineering, Anyang Normal University, Anyang, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaSchool of Life Science, Hubei University, Wuhan, ChinaMicrobe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of “one disease, one microbe,” rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications.https://www.frontiersin.org/article/10.3389/fgene.2020.00083/fullmicrobe-disease associationmatrix factorizationphylogenetic distancehuman microbiomeco-modules
collection DOAJ
language English
format Article
sources DOAJ
author Yuanyuan Ma
Guoying Liu
Yingjun Ma
Qianjun Chen
Qianjun Chen
spellingShingle Yuanyuan Ma
Guoying Liu
Yingjun Ma
Qianjun Chen
Qianjun Chen
Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
Frontiers in Genetics
microbe-disease association
matrix factorization
phylogenetic distance
human microbiome
co-modules
author_facet Yuanyuan Ma
Guoying Liu
Yingjun Ma
Qianjun Chen
Qianjun Chen
author_sort Yuanyuan Ma
title Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_short Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_full Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_fullStr Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_full_unstemmed Integrative Analysis for Identifying Co-Modules of Microbe-Disease Data by Matrix Tri-Factorization With Phylogenetic Information
title_sort integrative analysis for identifying co-modules of microbe-disease data by matrix tri-factorization with phylogenetic information
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2020-02-01
description Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of “one disease, one microbe,” rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications.
topic microbe-disease association
matrix factorization
phylogenetic distance
human microbiome
co-modules
url https://www.frontiersin.org/article/10.3389/fgene.2020.00083/full
work_keys_str_mv AT yuanyuanma integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation
AT guoyingliu integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation
AT yingjunma integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation
AT qianjunchen integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation
AT qianjunchen integrativeanalysisforidentifyingcomodulesofmicrobediseasedatabymatrixtrifactorizationwithphylogeneticinformation
_version_ 1724884014836219904