A simple method for statistical analysis of intensity differences in microarray-derived gene expression data

BACKGROUND:Microarray experiments offer a potent solution to the problem of making and comparing large numbers of gene expression measurements either in different cell types or in the same cell type under different conditions. Inferences about the biological relevance of observed changes in expressi...

Full description

Bibliographic Details
Main Authors: Kamb, Alexander, Ramaswami, Mani
Other Authors: Arcaris, Inc. (Currently Deltagen Proteomics, Inc.) Salt Lake City, UT USA
Language:en
Published: BioMed Central 2001
Online Access:BMC Biotechnology 2001, 1:8 http://www.biomedcentral.com/1472-6750/1/8
http://hdl.handle.net/10150/610339
http://arizona.openrepository.com/arizona/handle/10150/610339
Description
Summary:BACKGROUND:Microarray experiments offer a potent solution to the problem of making and comparing large numbers of gene expression measurements either in different cell types or in the same cell type under different conditions. Inferences about the biological relevance of observed changes in expression depend on the statistical significance of the changes. In lieu of many replicates with which to determine accurate intensity means and variances, reliable estimates of statistical significance remain problematic. Without such estimates, overly conservative choices for significance must be enforced.RESULTS:A simple statistical method for estimating variances from microarray control data which does not require multiple replicates is presented. Comparison of datasets from two commercial entities using this difference-averaging method demonstrates that the standard deviation of the signal scales at a level intermediate between the signal intensity and its square root. Application of the method to a dataset related to the beta-catenin pathway yields a larger number of biologically reasonable genes whose expression is altered than the ratio method.CONCLUSIONS:The difference-averaging method enables determination of variances as a function of signal intensities by averaging over the entire dataset. The method also provides a platform-independent view of important statistical properties of microarray data.