New statistical method identifies cytokines that distinguish stool microbiomes

Abstract Regressing an outcome or dependent variable onto a set of input or independent variables allows the analyst to measure associations between the two so that changes in the outcome can be described by and predicted by changes in the inputs. While there are many ways of doing this in classical...

Full description

Bibliographic Details
Main Authors: Dake Yang, Jethro Johnson, Xin Zhou, Elena Deych, Berkley Shands, Blake Hanson, Erica Sodergren, George Weinstock, William D. Shannon
Format: Article
Language:English
Published: Nature Publishing Group 2019-12-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-019-56397-9
id doaj-563746d07b9c472e85efc44c58ae80d8
record_format Article
spelling doaj-563746d07b9c472e85efc44c58ae80d82020-12-27T12:15:49ZengNature Publishing GroupScientific Reports2045-23222019-12-019111110.1038/s41598-019-56397-9New statistical method identifies cytokines that distinguish stool microbiomesDake Yang0Jethro Johnson1Xin Zhou2Elena Deych3Berkley Shands4Blake Hanson5Erica Sodergren6George Weinstock7William D. Shannon8BioRankingsJackson Laboratory for Genomic MedicineJackson Laboratory for Genomic MedicineBioRankingsBioRankingsUniversity of Texas Health Sciences CenterJackson Laboratory for Genomic MedicineJackson Laboratory for Genomic MedicineBioRankingsAbstract Regressing an outcome or dependent variable onto a set of input or independent variables allows the analyst to measure associations between the two so that changes in the outcome can be described by and predicted by changes in the inputs. While there are many ways of doing this in classical statistics, where the dependent variable has certain properties (e.g., a scalar, survival time, count), little progress on regression where the dependent variable are microbiome taxa counts has been made that do not impose extremely strict conditions on the data. In this paper, we propose and apply a new regression model combining the Dirichlet-multinomial distribution with recursive partitioning providing a fully non-parametric regression model. This model, called DM-RPart, is applied to cytokine data and microbiome taxa count data and is applicable to any microbiome taxa count/metadata, is automatically fit, and intuitively interpretable. This is a model which can be applied to any microbiome or other compositional data and software (R package HMP) available through the R CRAN website.https://doi.org/10.1038/s41598-019-56397-9
collection DOAJ
language English
format Article
sources DOAJ
author Dake Yang
Jethro Johnson
Xin Zhou
Elena Deych
Berkley Shands
Blake Hanson
Erica Sodergren
George Weinstock
William D. Shannon
spellingShingle Dake Yang
Jethro Johnson
Xin Zhou
Elena Deych
Berkley Shands
Blake Hanson
Erica Sodergren
George Weinstock
William D. Shannon
New statistical method identifies cytokines that distinguish stool microbiomes
Scientific Reports
author_facet Dake Yang
Jethro Johnson
Xin Zhou
Elena Deych
Berkley Shands
Blake Hanson
Erica Sodergren
George Weinstock
William D. Shannon
author_sort Dake Yang
title New statistical method identifies cytokines that distinguish stool microbiomes
title_short New statistical method identifies cytokines that distinguish stool microbiomes
title_full New statistical method identifies cytokines that distinguish stool microbiomes
title_fullStr New statistical method identifies cytokines that distinguish stool microbiomes
title_full_unstemmed New statistical method identifies cytokines that distinguish stool microbiomes
title_sort new statistical method identifies cytokines that distinguish stool microbiomes
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2019-12-01
description Abstract Regressing an outcome or dependent variable onto a set of input or independent variables allows the analyst to measure associations between the two so that changes in the outcome can be described by and predicted by changes in the inputs. While there are many ways of doing this in classical statistics, where the dependent variable has certain properties (e.g., a scalar, survival time, count), little progress on regression where the dependent variable are microbiome taxa counts has been made that do not impose extremely strict conditions on the data. In this paper, we propose and apply a new regression model combining the Dirichlet-multinomial distribution with recursive partitioning providing a fully non-parametric regression model. This model, called DM-RPart, is applied to cytokine data and microbiome taxa count data and is applicable to any microbiome taxa count/metadata, is automatically fit, and intuitively interpretable. This is a model which can be applied to any microbiome or other compositional data and software (R package HMP) available through the R CRAN website.
url https://doi.org/10.1038/s41598-019-56397-9
work_keys_str_mv AT dakeyang newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
AT jethrojohnson newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
AT xinzhou newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
AT elenadeych newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
AT berkleyshands newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
AT blakehanson newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
AT ericasodergren newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
AT georgeweinstock newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
AT williamdshannon newstatisticalmethodidentifiescytokinesthatdistinguishstoolmicrobiomes
_version_ 1724369156874174464