A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability
<p>Abstract</p> <p>Background</p> <p>Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biolo...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2009-11-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/10/389 |
id |
doaj-b11d247f455a4cac9ab4688890e29c12 |
---|---|
record_format |
Article |
spelling |
doaj-b11d247f455a4cac9ab4688890e29c122020-11-24T20:51:48ZengBMCBMC Bioinformatics1471-21052009-11-0110138910.1186/1471-2105-10-389A comprehensive sensitivity analysis of microarray breast cancer classification under feature variabilityReinders Marcel JTvan den Ham RenéMoerland Perry DSontrop Herman MJVerhaegh Wim FJ<p>Abstract</p> <p>Background</p> <p>Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear.</p> <p>Results</p> <p>We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical.</p> <p>Conclusion</p> <p>Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments.</p> http://www.biomedcentral.com/1471-2105/10/389 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Reinders Marcel JT van den Ham René Moerland Perry D Sontrop Herman MJ Verhaegh Wim FJ |
spellingShingle |
Reinders Marcel JT van den Ham René Moerland Perry D Sontrop Herman MJ Verhaegh Wim FJ A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability BMC Bioinformatics |
author_facet |
Reinders Marcel JT van den Ham René Moerland Perry D Sontrop Herman MJ Verhaegh Wim FJ |
author_sort |
Reinders Marcel JT |
title |
A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability |
title_short |
A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability |
title_full |
A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability |
title_fullStr |
A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability |
title_full_unstemmed |
A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability |
title_sort |
comprehensive sensitivity analysis of microarray breast cancer classification under feature variability |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2009-11-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear.</p> <p>Results</p> <p>We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical.</p> <p>Conclusion</p> <p>Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments.</p> |
url |
http://www.biomedcentral.com/1471-2105/10/389 |
work_keys_str_mv |
AT reindersmarceljt acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT vandenhamrene acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT moerlandperryd acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT sontrophermanmj acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT verhaeghwimfj acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT reindersmarceljt comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT vandenhamrene comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT moerlandperryd comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT sontrophermanmj comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability AT verhaeghwimfj comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability |
_version_ |
1716801134554775552 |