New statistical method identifies cytokines that distinguish stool microbiomes
Abstract Regressing an outcome or dependent variable onto a set of input or independent variables allows the analyst to measure associations between the two so that changes in the outcome can be described by and predicted by changes in the inputs. While there are many ways of doing this in classical...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2019-12-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-019-56397-9 |
id |
doaj-563746d07b9c472e85efc44c58ae80d8 |
---|---|
record_format |
Article |
spelling |
doaj-563746d07b9c472e85efc44c58ae80d82020-12-27T12:15:49ZengNature Publishing GroupScientific Reports2045-23222019-12-019111110.1038/s41598-019-56397-9New statistical method identifies cytokines that distinguish stool microbiomesDake Yang0Jethro Johnson1Xin Zhou2Elena Deych3Berkley Shands4Blake Hanson5Erica Sodergren6George Weinstock7William D. Shannon8BioRankingsJackson Laboratory for Genomic MedicineJackson Laboratory for Genomic MedicineBioRankingsBioRankingsUniversity of Texas Health Sciences CenterJackson Laboratory for Genomic MedicineJackson Laboratory for Genomic MedicineBioRankingsAbstract Regressing an outcome or dependent variable onto a set of input or independent variables allows the analyst to measure associations between the two so that changes in the outcome can be described by and predicted by changes in the inputs. While there are many ways of doing this in classical statistics, where the dependent variable has certain properties (e.g., a scalar, survival time, count), little progress on regression where the dependent variable are microbiome taxa counts has been made that do not impose extremely strict conditions on the data. In this paper, we propose and apply a new regression model combining the Dirichlet-multinomial distribution with recursive partitioning providing a fully non-parametric regression model. This model, called DM-RPart, is applied to cytokine data and microbiome taxa count data and is applicable to any microbiome taxa count/metadata, is automatically fit, and intuitively interpretable. This is a model which can be applied to any microbiome or other compositional data and software (R package HMP) available through the R CRAN website.https://doi.org/10.1038/s41598-019-56397-9 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Dake Yang Jethro Johnson Xin Zhou Elena Deych Berkley Shands Blake Hanson Erica Sodergren George Weinstock William D. Shannon |
spellingShingle |
Dake Yang Jethro Johnson Xin Zhou Elena Deych Berkley Shands Blake Hanson Erica Sodergren George Weinstock William D. Shannon New statistical method identifies cytokines that distinguish stool microbiomes Scientific Reports |
author_facet |
Dake Yang Jethro Johnson Xin Zhou Elena Deych Berkley Shands Blake Hanson Erica Sodergren George Weinstock William D. Shannon |
author_sort |
Dake Yang |
title |
New statistical method identifies cytokines that distinguish stool microbiomes |
title_short |
New statistical method identifies cytokines that distinguish stool microbiomes |
title_full |
New statistical method identifies cytokines that distinguish stool microbiomes |
title_fullStr |
New statistical method identifies cytokines that distinguish stool microbiomes |
title_full_unstemmed |
New statistical method identifies cytokines that distinguish stool microbiomes |
title_sort |
new statistical method identifies cytokines that distinguish stool microbiomes |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2019-12-01 |
description |
Abstract Regressing an outcome or dependent variable onto a set of input or independent variables allows the analyst to measure associations between the two so that changes in the outcome can be described by and predicted by changes in the inputs. While there are many ways of doing this in classical statistics, where the dependent variable has certain properties (e.g., a scalar, survival time, count), little progress on regression where the dependent variable are microbiome taxa counts has been made that do not impose extremely strict conditions on the data. In this paper, we propose and apply a new regression model combining the Dirichlet-multinomial distribution with recursive partitioning providing a fully non-parametric regression model. This model, called DM-RPart, is applied to cytokine data and microbiome taxa count data and is applicable to any microbiome taxa count/metadata, is automatically fit, and intuitively interpretable. This is a model which can be applied to any microbiome or other compositional data and software (R package HMP) available through the R CRAN website. |
url |
https://doi.org/10.1038/s41598-019-56397-9 |
work_keys_str_mv |
AT dakeyang newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes AT jethrojohnson newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes AT xinzhou newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes AT elenadeych newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes AT berkleyshands newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes AT blakehanson newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes AT ericasodergren newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes AT georgeweinstock newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes AT williamdshannon newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes |
_version_ |
1724369156874174464 |