A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns.
Identifying genes that are differentially expressed (DE) between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA), are used to identify DE gene...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2015-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC4587675?pdf=render |
id |
doaj-0ed9fb9f92174bdd88ed254dae0d237c |
---|---|
record_format |
Article |
spelling |
doaj-0ed9fb9f92174bdd88ed254dae0d237c2020-11-24T21:26:35ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-01109e013881010.1371/journal.pone.0138810A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns.Mohammad Manir Hossain MollahRahman JamalNorfilza Mohd MokhtarRoslan HarunMd Nurul Haque MollahIdentifying genes that are differentially expressed (DE) between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA), are used to identify DE genes. However, most of these methods provide misleading results for two or more conditions with multiple patterns of expression in the presence of outlying genes. In this paper, an attempt is made to develop a hybrid one-way ANOVA approach that unifies the robustness and efficiency of estimation using the minimum β-divergence method to overcome some problems that arise in the existing robust methods for both small- and large-sample cases with multiple patterns of expression.The proposed method relies on a β-weight function, which produces values between 0 and 1. The β-weight function with β = 0.2 is used as a measure of outlier detection. It assigns smaller weights (≥ 0) to outlying expressions and larger weights (≤ 1) to typical expressions. The distribution of the β-weights is used to calculate the cut-off point, which is compared to the observed β-weight of an expression to determine whether that gene expression is an outlier. This weight function plays a key role in unifying the robustness and efficiency of estimation in one-way ANOVA.Analyses of simulated gene expression profiles revealed that all eight methods (ANOVA, SAM, LIMMA, EBarrays, eLNN, KW, robust BetaEB and proposed) perform almost identically for m = 2 conditions in the absence of outliers. However, the robust BetaEB method and the proposed method exhibited considerably better performance than the other six methods in the presence of outliers. In this case, the BetaEB method exhibited slightly better performance than the proposed method for the small-sample cases, but the the proposed method exhibited much better performance than the BetaEB method for both the small- and large-sample cases in the presence of more than 50% outlying genes. The proposed method also exhibited better performance than the other methods for m > 2 conditions with multiple patterns of expression, where the BetaEB was not extended for this condition. Therefore, the proposed approach would be more suitable and reliable on average for the identification of DE genes between two or more conditions with multiple patterns of expression.http://europepmc.org/articles/PMC4587675?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mohammad Manir Hossain Mollah Rahman Jamal Norfilza Mohd Mokhtar Roslan Harun Md Nurul Haque Mollah |
spellingShingle |
Mohammad Manir Hossain Mollah Rahman Jamal Norfilza Mohd Mokhtar Roslan Harun Md Nurul Haque Mollah A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns. PLoS ONE |
author_facet |
Mohammad Manir Hossain Mollah Rahman Jamal Norfilza Mohd Mokhtar Roslan Harun Md Nurul Haque Mollah |
author_sort |
Mohammad Manir Hossain Mollah |
title |
A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns. |
title_short |
A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns. |
title_full |
A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns. |
title_fullStr |
A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns. |
title_full_unstemmed |
A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns. |
title_sort |
hybrid one-way anova approach for the robust and efficient estimation of differential gene expression with multiple patterns. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2015-01-01 |
description |
Identifying genes that are differentially expressed (DE) between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA), are used to identify DE genes. However, most of these methods provide misleading results for two or more conditions with multiple patterns of expression in the presence of outlying genes. In this paper, an attempt is made to develop a hybrid one-way ANOVA approach that unifies the robustness and efficiency of estimation using the minimum β-divergence method to overcome some problems that arise in the existing robust methods for both small- and large-sample cases with multiple patterns of expression.The proposed method relies on a β-weight function, which produces values between 0 and 1. The β-weight function with β = 0.2 is used as a measure of outlier detection. It assigns smaller weights (≥ 0) to outlying expressions and larger weights (≤ 1) to typical expressions. The distribution of the β-weights is used to calculate the cut-off point, which is compared to the observed β-weight of an expression to determine whether that gene expression is an outlier. This weight function plays a key role in unifying the robustness and efficiency of estimation in one-way ANOVA.Analyses of simulated gene expression profiles revealed that all eight methods (ANOVA, SAM, LIMMA, EBarrays, eLNN, KW, robust BetaEB and proposed) perform almost identically for m = 2 conditions in the absence of outliers. However, the robust BetaEB method and the proposed method exhibited considerably better performance than the other six methods in the presence of outliers. In this case, the BetaEB method exhibited slightly better performance than the proposed method for the small-sample cases, but the the proposed method exhibited much better performance than the BetaEB method for both the small- and large-sample cases in the presence of more than 50% outlying genes. The proposed method also exhibited better performance than the other methods for m > 2 conditions with multiple patterns of expression, where the BetaEB was not extended for this condition. Therefore, the proposed approach would be more suitable and reliable on average for the identification of DE genes between two or more conditions with multiple patterns of expression. |
url |
http://europepmc.org/articles/PMC4587675?pdf=render |
work_keys_str_mv |
AT mohammadmanirhossainmollah ahybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT rahmanjamal ahybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT norfilzamohdmokhtar ahybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT roslanharun ahybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT mdnurulhaquemollah ahybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT mohammadmanirhossainmollah hybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT rahmanjamal hybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT norfilzamohdmokhtar hybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT roslanharun hybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns AT mdnurulhaquemollah hybridonewayanovaapproachfortherobustandefficientestimationofdifferentialgeneexpressionwithmultiplepatterns |
_version_ |
1725978780432859136 |