A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns.

Identifying genes that are differentially expressed (DE) between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA), are used to identify DE gene...


Bibliographic Details
Main Authors: Mohammad Manir Hossain Mollah, Rahman Jamal, Norfilza Mohd Mokhtar, Roslan Harun, Md Nurul Haque Mollah
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2015-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC4587675?pdf=render
id doaj-0ed9fb9f92174bdd88ed254dae0d237c
record_format Article
doi 10.1371/journal.pone.0138810
citation PLoS ONE 10(9): e0138810 (2015-01-01)
collection DOAJ
issn 1932-6203
description Identifying genes that are differentially expressed (DE) between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA), are used to identify DE genes. However, most of these methods give misleading results for two or more conditions with multiple patterns of expression in the presence of outlying genes. In this paper, an attempt is made to develop a hybrid one-way ANOVA approach that unifies the robustness and efficiency of estimation using the minimum β-divergence method, in order to overcome some problems that arise in the existing robust methods for both small- and large-sample cases with multiple patterns of expression.

The proposed method relies on a β-weight function, which produces values between 0 and 1. The β-weight function with β = 0.2 is used as a measure for outlier detection. It assigns smaller weights (≥ 0) to outlying expressions and larger weights (≤ 1) to typical expressions. The distribution of the β-weights is used to calculate a cut-off point, which is compared with the observed β-weight of an expression to determine whether that gene expression is an outlier. This weight function plays a key role in unifying the robustness and efficiency of estimation in one-way ANOVA.

Analyses of simulated gene expression profiles revealed that all eight methods (ANOVA, SAM, LIMMA, EBarrays, eLNN, KW, robust BetaEB and the proposed method) perform almost identically for m = 2 conditions in the absence of outliers. In the presence of outliers, however, the robust BetaEB method and the proposed method performed considerably better than the other six methods. In this case, the BetaEB method performed slightly better than the proposed method for the small-sample cases, but the proposed method performed much better than the BetaEB method for both the small- and large-sample cases in the presence of more than 50% outlying genes. The proposed method also performed better than the other methods for m > 2 conditions with multiple patterns of expression, a setting to which BetaEB has not been extended. Therefore, the proposed approach would be more suitable and reliable on average for the identification of DE genes between two or more conditions with multiple patterns of expression.
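The β-weighting idea in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a Gaussian-form weight, exp(-β(x - μ)² / (2σ²)), and iteratively reweighted mean and variance updates, which is one common way to write a minimum β-divergence estimator for normal data. The function names, the median/MAD starting values, and the fixed `cutoff` are hypothetical; the abstract says the real cut-off is derived from the distribution of the β-weights, which is not reproduced here.

```python
import math

BETA = 0.2  # tuning value quoted in the abstract


def beta_weight(x, mu, sigma2, beta=BETA):
    """Assumed Gaussian beta-weight: 1 at x == mu, decaying toward 0 far from mu."""
    return math.exp(-0.5 * beta * (x - mu) ** 2 / sigma2)


def robust_mean_var(xs, beta=BETA, iters=50):
    """Iteratively reweighted mean and variance (assumed update rules)."""
    n = len(xs)
    mu = sorted(xs)[n // 2]                          # median-like start
    mad = sorted(abs(x - mu) for x in xs)[n // 2]    # median absolute deviation
    sigma2 = max((1.4826 * mad) ** 2, 1e-8)
    for _ in range(iters):
        w = [beta_weight(x, mu, sigma2, beta) for x in xs]
        total = sum(w)
        mu = sum(wi * x for wi, x in zip(w, xs)) / total
        weighted_ss = sum(wi * (x - mu) ** 2 for wi, x in zip(w, xs))
        # The (1 + beta) factor offsets the shrinkage caused by down-weighting.
        sigma2 = max((1 + beta) * weighted_ss / total, 1e-8)
    return mu, sigma2


def flag_outliers(xs, cutoff, beta=BETA):
    """Return each beta-weight and whether it falls below the (hypothetical) cutoff."""
    mu, sigma2 = robust_mean_var(xs, beta)
    weights = [beta_weight(x, mu, sigma2, beta) for x in xs]
    return weights, [w < cutoff for w in weights]


# Made-up expression values for one gene in one condition; the last is aberrant.
expressions = [5.0, 5.2, 4.9, 5.1, 4.8, 5.3, 12.0]
weights, flags = flag_outliers(expressions, cutoff=0.2)
```

In this toy input the typical values keep weights close to 1 while the aberrant value receives a weight near 0, so it contributes almost nothing to the estimated mean and variance that a subsequent one-way ANOVA statistic would be built from.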