Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome Data

Recent discoveries suggest that our gut microbiome plays an important role in our health and wellbeing. However, the gut microbiome data are intricate; for example, the microbial diversity in the gut makes the data high-dimensional. While there are dedicated high-dimensional methods, such as the las...

Full description

Bibliographic Details
Main Authors: Fang Xie, Johannes Lederer
Format: Article
Language:English
Published: MDPI AG 2021-02-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/23/2/230
id doaj-75f937f7f3244c05a73424109e7ffa98
record_format Article
spelling doaj-75f937f7f3244c05a73424109e7ffa982021-02-17T00:05:05ZengMDPI AGEntropy1099-43002021-02-012323023010.3390/e23020230Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome DataFang Xie0Johannes Lederer1Department of Mathematics, Ruhr-University Bochum, Universitätsstraße 150, 44801 Bochum, GermanyDepartment of Mathematics, Ruhr-University Bochum, Universitätsstraße 150, 44801 Bochum, GermanyRecent discoveries suggest that our gut microbiome plays an important role in our health and wellbeing. However, the gut microbiome data are intricate; for example, the microbial diversity in the gut makes the data high-dimensional. While there are dedicated high-dimensional methods, such as the lasso estimator, they always come with the risk of false discoveries. Knockoffs are a recent approach to control the number of false discoveries. In this paper, we show that knockoffs can be aggregated to increase power while retaining sharp control over the false discoveries. We support our method both in theory and simulations, and we show that it can lead to new discoveries on microbiome data from the American Gut Project. In particular, our results indicate that several phyla that have been overlooked so far are associated with obesity.https://www.mdpi.com/1099-4300/23/2/230false discovery rate controlknockoffsvariable selectiongut microbiome
collection DOAJ
language English
format Article
sources DOAJ
author Fang Xie
Johannes Lederer
spellingShingle Fang Xie
Johannes Lederer
Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome Data
Entropy
false discovery rate control
knockoffs
variable selection
gut microbiome
author_facet Fang Xie
Johannes Lederer
author_sort Fang Xie
title Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome Data
title_short Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome Data
title_full Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome Data
title_fullStr Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome Data
title_full_unstemmed Aggregating Knockoffs for False Discovery Rate Control with an Application to Gut Microbiome Data
title_sort aggregating knockoffs for false discovery rate control with an application to gut microbiome data
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2021-02-01
description Recent discoveries suggest that our gut microbiome plays an important role in our health and wellbeing. However, the gut microbiome data are intricate; for example, the microbial diversity in the gut makes the data high-dimensional. While there are dedicated high-dimensional methods, such as the lasso estimator, they always come with the risk of false discoveries. Knockoffs are a recent approach to control the number of false discoveries. In this paper, we show that knockoffs can be aggregated to increase power while retaining sharp control over the false discoveries. We support our method both in theory and simulations, and we show that it can lead to new discoveries on microbiome data from the American Gut Project. In particular, our results indicate that several phyla that have been overlooked so far are associated with obesity.
topic false discovery rate control
knockoffs
variable selection
gut microbiome
url https://www.mdpi.com/1099-4300/23/2/230
work_keys_str_mv AT fangxie aggregatingknockoffsforfalsediscoveryratecontrolwithanapplicationtogutmicrobiomedata
AT johanneslederer aggregatingknockoffsforfalsediscoveryratecontrolwithanapplicationtogutmicrobiomedata
_version_ 1724265794212200448