BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion
Abstract Background Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of m...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-08-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-018-2274-3 |
id |
doaj-78afb2168c2248cba6bc4d24907e4add |
---|---|
record_format |
Article |
spelling |
doaj-78afb2168c2248cba6bc4d24907e4add2020-11-24T21:41:40ZengBMCBMC Bioinformatics1471-21052018-08-0119S9859210.1186/s12859-018-2274-3BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completionJian-Yu Shi0Hua Huang1Yan-Ning Zhang2Jiang-Bo Cao3Siu-Ming Yiu4School of Life Sciences, Northwestern Polytechnical UniversitySchool of Software and Microelectronics, Northwestern Polytechnical UniversitySchool of Computer Science, Northwestern Polytechnical UniversitySchool of Life Sciences, Northwestern Polytechnical UniversityDepartment of Computer Science, The University of Hong KongAbstract Background Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-noninfectious disease associations (MDAs) is still a challenging task, because of both the high cost and the limitation of microbe cultivation. Thus, there is a need to develop fast approaches to screen potential MDAs. The growing number of validated MDAs enables us to meet the demand in a new insight. Computational approaches, especially machine learning, are promising to predict MDA candidates rapidly among a large number of microbe-disease pairs with the advantage of no limitation on microbe cultivation. Nevertheless, a few computational efforts at predicting MDAs are made so far. Results In this paper, grouping a set of MDAs into a binary MDA matrix, we propose a novel predictive approach (BMCMDA) based on Binary Matrix Completion to predict potential MDAs. The proposed BMCMDA assumes that the incomplete observed MDA matrix is the summation of a latent parameterizing matrix and a noising matrix. It also assumes that the independently occurring subscripts of observed entries in the MDA matrix follows a binomial model. Adopting a standard mean-zero Gaussian distribution for the nosing matrix, we model the relationship between the parameterizing matrix and the MDA matrix under the observed microbe-disease pairs as a probit regression. With the recovered parameterizing matrix, BMCMDA deduces how likely a microbe would be associated with a particular disease. In the experiment under leave-one-out cross-validation, it exhibits the inspiring performance (AUC = 0.906, AUPR =0.526) and demonstrates its superiority by ~ 7% and ~ 5% improvements in terms of AUC and AUPR respectively in the comparison with the pioneering approach KATZHMDA. Conclusions Our BMCMDA provides an effective approach for predicting MDAs and can be also extended to other similar predicting tasks of binary relationship (e.g. protein-protein interaction, drug-target interaction).http://link.springer.com/article/10.1186/s12859-018-2274-3Microbe-disease associationMatrix completionPredictionMachine learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jian-Yu Shi Hua Huang Yan-Ning Zhang Jiang-Bo Cao Siu-Ming Yiu |
spellingShingle |
Jian-Yu Shi Hua Huang Yan-Ning Zhang Jiang-Bo Cao Siu-Ming Yiu BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion BMC Bioinformatics Microbe-disease association Matrix completion Prediction Machine learning |
author_facet |
Jian-Yu Shi Hua Huang Yan-Ning Zhang Jiang-Bo Cao Siu-Ming Yiu |
author_sort |
Jian-Yu Shi |
title |
BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_short |
BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_full |
BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_fullStr |
BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_full_unstemmed |
BMCMDA: a novel model for predicting human microbe-disease associations via binary matrix completion |
title_sort |
bmcmda: a novel model for predicting human microbe-disease associations via binary matrix completion |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2018-08-01 |
description |
Abstract Background Human Microbiome Project reveals the significant mutualistic influence between human body and microbes living in it. Such an influence lead to an interesting phenomenon that many noninfectious diseases are closely associated with diverse microbes. However, the identification of microbe-noninfectious disease associations (MDAs) is still a challenging task, because of both the high cost and the limitation of microbe cultivation. Thus, there is a need to develop fast approaches to screen potential MDAs. The growing number of validated MDAs enables us to meet the demand in a new insight. Computational approaches, especially machine learning, are promising to predict MDA candidates rapidly among a large number of microbe-disease pairs with the advantage of no limitation on microbe cultivation. Nevertheless, a few computational efforts at predicting MDAs are made so far. Results In this paper, grouping a set of MDAs into a binary MDA matrix, we propose a novel predictive approach (BMCMDA) based on Binary Matrix Completion to predict potential MDAs. The proposed BMCMDA assumes that the incomplete observed MDA matrix is the summation of a latent parameterizing matrix and a noising matrix. It also assumes that the independently occurring subscripts of observed entries in the MDA matrix follows a binomial model. Adopting a standard mean-zero Gaussian distribution for the nosing matrix, we model the relationship between the parameterizing matrix and the MDA matrix under the observed microbe-disease pairs as a probit regression. With the recovered parameterizing matrix, BMCMDA deduces how likely a microbe would be associated with a particular disease. In the experiment under leave-one-out cross-validation, it exhibits the inspiring performance (AUC = 0.906, AUPR =0.526) and demonstrates its superiority by ~ 7% and ~ 5% improvements in terms of AUC and AUPR respectively in the comparison with the pioneering approach KATZHMDA. Conclusions Our BMCMDA provides an effective approach for predicting MDAs and can be also extended to other similar predicting tasks of binary relationship (e.g. protein-protein interaction, drug-target interaction). |
topic |
Microbe-disease association Matrix completion Prediction Machine learning |
url |
http://link.springer.com/article/10.1186/s12859-018-2274-3 |
work_keys_str_mv |
AT jianyushi bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion AT huahuang bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion AT yanningzhang bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion AT jiangbocao bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion AT siumingyiu bmcmdaanovelmodelforpredictinghumanmicrobediseaseassociationsviabinarymatrixcompletion |
_version_ |
1725920571364999168 |