Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips

<p>Abstract</p> <p>Background</p> <p>Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, no...

Full description

Bibliographic Details
Main Authors: Johnston Caroline E, Harrison Andrew P, Orengo Christine A
Format: Article
Language:English
Published: BMC 2007-06-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/8/195
id doaj-a77a43234d7d44b88554aaa591999936
record_format Article
spelling doaj-a77a43234d7d44b88554aaa5919999362020-11-24T21:39:30ZengBMCBMC Bioinformatics1471-21052007-06-018119510.1186/1471-2105-8-195Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChipsJohnston Caroline EHarrison Andrew POrengo Christine A<p>Abstract</p> <p>Background</p> <p>Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which of the calibration steps resulted in the biggest uncertainty in the sets of genes reported to be differentially expressed.</p> <p>Results</p> <p>Our results indicate that the sets of genes identified as being most significantly differentially expressed, as estimated by the z-score of fold change, is relatively insensitive to the choice of background subtraction and normalisation. However, the contents of the gene list are most sensitive to the choice of expression measure. This is irrespective of whether the experiment uses a rat, mouse or human chip and whether the chip definition is made using probe mappings from Unigene, RefSeq, Entrez Gene or the original Affymetrix definitions. It is also irrespective of whether both Present and Absent, or just Present, Calls from the MAS5 algorithm are used to filter genelists, and this conclusion holds for genes of differing intensities. We also reach the same conclusion after assigning genes to be differentially expressed using t-statistics, although this approach results in a large amount of false positives in the sets of genes identified due to the small numbers of replicates typically used in microarray experiments.</p> <p>Conclusion</p> <p>The major calibration uncertainty that biologists need to consider when analysing Affymetrix data is how their multiple probe values are condensed into one expression measure.</p> http://www.biomedcentral.com/1471-2105/8/195
collection DOAJ
language English
format Article
sources DOAJ
author Johnston Caroline E
Harrison Andrew P
Orengo Christine A
spellingShingle Johnston Caroline E
Harrison Andrew P
Orengo Christine A
Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips
BMC Bioinformatics
author_facet Johnston Caroline E
Harrison Andrew P
Orengo Christine A
author_sort Johnston Caroline E
title Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips
title_short Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips
title_full Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips
title_fullStr Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips
title_full_unstemmed Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips
title_sort establishing a major cause of discrepancy in the calibration of affymetrix genechips
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2007-06-01
description <p>Abstract</p> <p>Background</p> <p>Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which of the calibration steps resulted in the biggest uncertainty in the sets of genes reported to be differentially expressed.</p> <p>Results</p> <p>Our results indicate that the sets of genes identified as being most significantly differentially expressed, as estimated by the z-score of fold change, is relatively insensitive to the choice of background subtraction and normalisation. However, the contents of the gene list are most sensitive to the choice of expression measure. This is irrespective of whether the experiment uses a rat, mouse or human chip and whether the chip definition is made using probe mappings from Unigene, RefSeq, Entrez Gene or the original Affymetrix definitions. It is also irrespective of whether both Present and Absent, or just Present, Calls from the MAS5 algorithm are used to filter genelists, and this conclusion holds for genes of differing intensities. We also reach the same conclusion after assigning genes to be differentially expressed using t-statistics, although this approach results in a large amount of false positives in the sets of genes identified due to the small numbers of replicates typically used in microarray experiments.</p> <p>Conclusion</p> <p>The major calibration uncertainty that biologists need to consider when analysing Affymetrix data is how their multiple probe values are condensed into one expression measure.</p>
url http://www.biomedcentral.com/1471-2105/8/195
work_keys_str_mv AT johnstoncarolinee establishingamajorcauseofdiscrepancyinthecalibrationofaffymetrixgenechips
AT harrisonandrewp establishingamajorcauseofdiscrepancyinthecalibrationofaffymetrixgenechips
AT orengochristinea establishingamajorcauseofdiscrepancyinthecalibrationofaffymetrixgenechips
_version_ 1725930963743014912