An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data
The human microbiome is comprised of thousands of microbial species. These species will substantially influence the normal physiology of humans and cause numerous diseases. Microbiome data can be measured by sequencing, microarray, or other technologies. With the fast development of these technologi...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9466845/ |
id |
doaj-763ce47076f145dbaca98e9351468610 |
---|---|
record_format |
Article |
spelling |
doaj-763ce47076f145dbaca98e93514686102021-07-20T23:00:20ZengIEEEIEEE Access2169-35362021-01-01910002910003910.1109/ACCESS.2021.30930459466845An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome DataXiang Lin0https://orcid.org/0000-0002-7634-5780Jie Zhang1https://orcid.org/0000-0003-0242-8812Zhi Wei2https://orcid.org/0000-0001-6059-4267Turki Turki3https://orcid.org/0000-0002-9491-2435Department of Computer Science, New Jersey Institute of Technology, Newark, NJ, USAAdobe Inc., San Jose, CA, USADepartment of Computer Science, New Jersey Institute of Technology, Newark, NJ, USADepartment of Computer Science, King Abdulaziz University, Jeddah, Saudi ArabiaThe human microbiome is comprised of thousands of microbial species. These species will substantially influence the normal physiology of humans and cause numerous diseases. Microbiome data can be measured by sequencing, microarray, or other technologies. With the fast development of these technologies, downstream analysis methods should also be designed to effectively and accurately discover the valuable information that is hidden in the data. Many methods have been designed for the count data of microbiome. However, to our knowledge, there are only a few methods developed for the continuous microbiome data. Many microbiome data have an over-dispersed and zero-inflated data structure. Traditional methods rarely characterize this data structure and only focus on the differences in the abundance between different samples. In this study, we introduce a novel method, the zero-inflated gamma (ZIG) omnibus test, to specifically test the continuous and zero-inflated microbiome data. In this test, abundance will be tested along with zero prevalence and dispersion. We compared this method with five other popular methods. We found that ZIG omnibus test has significantly higher power and a similar or lower false-positive rate than the competing methods in the tests of simulated data. It also found more proved microbiomes in the real data with tonsil cancer. So, we conclude that ZIG omnibus test is a robust method across various biological conditions in the differential expression test of microbiome data.https://ieeexplore.ieee.org/document/9466845/Microbiomezero-inflationomnibus teststatistical modelingmicroarray |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiang Lin Jie Zhang Zhi Wei Turki Turki |
spellingShingle |
Xiang Lin Jie Zhang Zhi Wei Turki Turki An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data IEEE Access Microbiome zero-inflation omnibus test statistical modeling microarray |
author_facet |
Xiang Lin Jie Zhang Zhi Wei Turki Turki |
author_sort |
Xiang Lin |
title |
An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data |
title_short |
An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data |
title_full |
An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data |
title_fullStr |
An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data |
title_full_unstemmed |
An Omnibus Test for Differential Distribution Analysis of Continuous Microbiome Data |
title_sort |
omnibus test for differential distribution analysis of continuous microbiome data |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
The human microbiome is comprised of thousands of microbial species. These species will substantially influence the normal physiology of humans and cause numerous diseases. Microbiome data can be measured by sequencing, microarray, or other technologies. With the fast development of these technologies, downstream analysis methods should also be designed to effectively and accurately discover the valuable information that is hidden in the data. Many methods have been designed for the count data of microbiome. However, to our knowledge, there are only a few methods developed for the continuous microbiome data. Many microbiome data have an over-dispersed and zero-inflated data structure. Traditional methods rarely characterize this data structure and only focus on the differences in the abundance between different samples. In this study, we introduce a novel method, the zero-inflated gamma (ZIG) omnibus test, to specifically test the continuous and zero-inflated microbiome data. In this test, abundance will be tested along with zero prevalence and dispersion. We compared this method with five other popular methods. We found that ZIG omnibus test has significantly higher power and a similar or lower false-positive rate than the competing methods in the tests of simulated data. It also found more proved microbiomes in the real data with tonsil cancer. So, we conclude that ZIG omnibus test is a robust method across various biological conditions in the differential expression test of microbiome data. |
topic |
Microbiome zero-inflation omnibus test statistical modeling microarray |
url |
https://ieeexplore.ieee.org/document/9466845/ |
work_keys_str_mv |
AT xianglin anomnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata AT jiezhang anomnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata AT zhiwei anomnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata AT turkiturki anomnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata AT xianglin omnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata AT jiezhang omnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata AT zhiwei omnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata AT turkiturki omnibustestfordifferentialdistributionanalysisofcontinuousmicrobiomedata |
_version_ |
1721293268489076736 |