BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion

Abstract Background Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of m...

Full description

Bibliographic Details
Main Authors: Jian-Yu Shi, Hua Huang, Yan-Ning Zhang, Jiang-Bo Cao, Siu-Ming Yiu
Format: Article
Language:English
Published: BMC 2018-08-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-018-2274-3
id doaj-78afb2168c2248cba6bc4d24907e4add
record_format Article
spelling doaj-78afb2168c2248cba6bc4d24907e4add2020-11-24T21:41:40ZengBMCBMC Bioinformatics1471-21052018-08-0119S9859210.1186/s12859-018-2274-3BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completionJian-Yu Shi0Hua Huang1Yan-Ning Zhang2Jiang-Bo Cao3Siu-Ming Yiu4School of Life Sciences, Northwestern Polytechnical UniversitySchool of Software and Microelectronics, Northwestern Polytechnical UniversitySchool of Computer Science, Northwestern Polytechnical UniversitySchool of Life Sciences, Northwestern Polytechnical UniversityDepartment of Computer Science, The University of Hong KongAbstract Background Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-noninfectious disease associations (MDAs) is still a challenging task, because of both the high cost and the limitation of microbe cultivation. Thus, there is a need to develop fast approaches to screen potential MDAs. The growing number of validated MDAs enables us to meet the demand in a new insight. Computational approaches, especially machine learning, are promising to predict MDA candidates rapidly among a large number of microbe-disease pairs with the advantage of no limitation on microbe cultivation. Nevertheless, a few computational efforts at predicting MDAs are made so far. Results In this paper, grouping a set of MDAs into a binary MDA matrix, we propose a novel predictive approach (BMCMDA) based on Binary Matrix Completion to predict potential MDAs. The proposed BMCMDA assumes that the incomplete observed MDA matrix is the summation of a latent parameterizing matrix and a noising matrix. It also assumes that the independently occurring subscripts of observed entries in the MDA matrix follows a binomial model. Adopting a standard mean-zero Gaussian distribution for the nosing matrix, we model the relationship between the parameterizing matrix and the MDA matrix under the observed microbe-disease pairs as a probit regression. With the recovered parameterizing matrix, BMCMDA deduces how likely a microbe would be associated with a particular disease. In the experiment under leave-one-out cross-validation, it exhibits the inspiring performance (AUC = 0.906, AUPR =0.526) and demonstrates its superiority by ~ 7% and ~ 5% improvements in terms of AUC and AUPR respectively in the comparison with the pioneering approach KATZHMDA. Conclusions Our BMCMDA provides an effective approach for predicting MDAs and can be also extended to other similar predicting tasks of binary relationship (e.g. protein-protein interaction, drug-target interaction).http://link.springer.com/article/10.1186/s12859-018-2274-3Microbe-disease associationMatrix completionPredictionMachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Jian-Yu Shi
Hua Huang
Yan-Ning Zhang
Jiang-Bo Cao
Siu-Ming Yiu
spellingShingle Jian-Yu Shi
Hua Huang
Yan-Ning Zhang
Jiang-Bo Cao
Siu-Ming Yiu
BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
BMC Bioinformatics
Microbe-disease association
Matrix completion
Prediction
Machine learning
author_facet Jian-Yu Shi
Hua Huang
Yan-Ning Zhang
Jiang-Bo Cao
Siu-Ming Yiu
author_sort Jian-Yu Shi
title BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_short BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_full BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_fullStr BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_full_unstemmed BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
title_sort bmcmda: a novel model for predicting human microbe-disease associations via binary matrix completion
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2018-08-01
description Abstract Background Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-noninfectious disease associations (MDAs) is still a challenging task, because of both the high cost and the limitation of microbe cultivation. Thus, there is a need to develop fast approaches to screen potential MDAs. The growing number of validated MDAs enables us to meet the demand in a new insight. Computational approaches, especially machine learning, are promising to predict MDA candidates rapidly among a large number of microbe-disease pairs with the advantage of no limitation on microbe cultivation. Nevertheless, a few computational efforts at predicting MDAs are made so far. Results In this paper, grouping a set of MDAs into a binary MDA matrix, we propose a novel predictive approach (BMCMDA) based on Binary Matrix Completion to predict potential MDAs. The proposed BMCMDA assumes that the incomplete observed MDA matrix is the summation of a latent parameterizing matrix and a noising matrix. It also assumes that the independently occurring subscripts of observed entries in the MDA matrix follows a binomial model. Adopting a standard mean-zero Gaussian distribution for the nosing matrix, we model the relationship between the parameterizing matrix and the MDA matrix under the observed microbe-disease pairs as a probit regression. With the recovered parameterizing matrix, BMCMDA deduces how likely a microbe would be associated with a particular disease. In the experiment under leave-one-out cross-validation, it exhibits the inspiring performance (AUC = 0.906, AUPR =0.526) and demonstrates its superiority by ~ 7% and ~ 5% improvements in terms of AUC and AUPR respectively in the comparison with the pioneering approach KATZHMDA. Conclusions Our BMCMDA provides an effective approach for predicting MDAs and can be also extended to other similar predicting tasks of binary relationship (e.g. protein-protein interaction, drug-target interaction).
topic Microbe-disease association
Matrix completion
Prediction
Machine learning
url http://link.springer.com/article/10.1186/s12859-018-2274-3
work_keys_str_mv AT jianyushi bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
AT huahuang bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
AT yanningzhang bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
AT jiangbocao bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
AT siumingyiu bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion
_version_ 1725920571364999168