An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data

The human microbiome is comprised of thousands of microbial species. These species will substantially influence the normal physiology of humans and cause numerous diseases. Microbiome data can be measured by sequencing, microarray, or other technologies. With the fast development of these technologi...

Full description

Bibliographic Details
Main Authors: Xiang Lin, Jie Zhang, Zhi Wei, Turki Turki
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9466845/
id doaj-763ce47076f145dbaca98e9351468610
record_format Article
spelling doaj-763ce47076f145dbaca98e93514686102021-07-20T23:00:20ZengIEEEIEEE Access2169-35362021-01-01910002910003910.1109/ACCESS.2021.30930459466845An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome DataXiang Lin0https://orcid.org/0000-0002-7634-5780Jie Zhang1https://orcid.org/0000-0003-0242-8812Zhi Wei2https://orcid.org/0000-0001-6059-4267Turki Turki3https://orcid.org/0000-0002-9491-2435Department of Computer Science, New Jersey Institute of Technology, Newark, NJ, USAAdobe Inc., San Jose, CA, USADepartment of Computer Science, New Jersey Institute of Technology, Newark, NJ, USADepartment of Computer Science, King Abdulaziz University, Jeddah, Saudi ArabiaThe human microbiome is comprised of thousands of microbial species. These species will substantially influence the normal physiology of humans and cause numerous diseases. Microbiome data can be measured by sequencing, microarray, or other technologies. With the fast development of these technologies, downstream analysis methods should also be designed to effectively and accurately discover the valuable information that is hidden in the data. Many methods have been designed for the count data of microbiome. However, to our knowledge, there are only a few methods developed for the continuous microbiome data. Many microbiome data have an over-dispersed and zero-inflated data structure. Traditional methods rarely characterize this data structure and only focus on the differences in the abundance between different samples. In this study, we introduce a novel method, the zero-inflated gamma (ZIG) omnibus test, to specifically test the continuous and zero-inflated microbiome data. In this test, abundance will be tested along with zero prevalence and dispersion. We compared this method with five other popular methods. We found that ZIG omnibus test has significantly higher power and a similar or lower false-positive rate than the competing methods in the tests of simulated data. It also found more proved microbiomes in the real data with tonsil cancer. So, we conclude that ZIG omnibus test is a robust method across various biological conditions in the differential expression test of microbiome data.https://ieeexplore.ieee.org/document/9466845/Microbiomezero-inflationomnibus teststatistical modelingmicroarray
collection DOAJ
language English
format Article
sources DOAJ
author Xiang Lin
Jie Zhang
Zhi Wei
Turki Turki
spellingShingle Xiang Lin
Jie Zhang
Zhi Wei
Turki Turki
An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data
IEEE Access
Microbiome
zero-inflation
omnibus test
statistical modeling
microarray
author_facet Xiang Lin
Jie Zhang
Zhi Wei
Turki Turki
author_sort Xiang Lin
title An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data
title_short An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data
title_full An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data
title_fullStr An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data
title_full_unstemmed An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data
title_sort omnibus test for differential distribution analysis of continuous microbiome data
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description The human microbiome is comprised of thousands of microbial species. These species will substantially influence the normal physiology of humans and cause numerous diseases. Microbiome data can be measured by sequencing, microarray, or other technologies. With the fast development of these technologies, downstream analysis methods should also be designed to effectively and accurately discover the valuable information that is hidden in the data. Many methods have been designed for the count data of microbiome. However, to our knowledge, there are only a few methods developed for the continuous microbiome data. Many microbiome data have an over-dispersed and zero-inflated data structure. Traditional methods rarely characterize this data structure and only focus on the differences in the abundance between different samples. In this study, we introduce a novel method, the zero-inflated gamma (ZIG) omnibus test, to specifically test the continuous and zero-inflated microbiome data. In this test, abundance will be tested along with zero prevalence and dispersion. We compared this method with five other popular methods. We found that ZIG omnibus test has significantly higher power and a similar or lower false-positive rate than the competing methods in the tests of simulated data. It also found more proved microbiomes in the real data with tonsil cancer. So, we conclude that ZIG omnibus test is a robust method across various biological conditions in the differential expression test of microbiome data.
topic Microbiome
zero-inflation
omnibus test
statistical modeling
microarray
url https://ieeexplore.ieee.org/document/9466845/
work_keys_str_mv AT xianglin anomnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata
AT jiezhang anomnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata
AT zhiwei anomnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata
AT turkiturki anomnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata
AT xianglin omnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata
AT jiezhang omnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata
AT zhiwei omnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata
AT turkiturki omnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata
_version_ 1721293268489076736