Simple and flexible classification of gene expression microarrays via Swirls and Ripples

<p>Abstract</p> <p>Background</p> <p>A simple classification rule with few genes and parameters is desirable when applying a classification rule to new data. One popular simple classification rule, diagonal discriminant analysis, yields linear or curved classification b...

Full description

Bibliographic Details
Main Author: Baker Stuart G
Format: Article
Language:English
Published: BMC 2010-09-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/452
id doaj-431705e4e0f245049241c4f07c310d56
record_format Article
spelling doaj-431705e4e0f245049241c4f07c310d562020-11-25T01:03:12ZengBMCBMC Bioinformatics1471-21052010-09-0111145210.1186/1471-2105-11-452Simple and flexible classification of gene expression microarrays via Swirls and RipplesBaker Stuart G<p>Abstract</p> <p>Background</p> <p>A simple classification rule with few genes and parameters is desirable when applying a classification rule to new data. One popular simple classification rule, diagonal discriminant analysis, yields linear or curved classification boundaries, called Ripples, that are optimal when gene expression levels are normally distributed with the appropriate variance, but may yield poor classification in other situations.</p> <p>Results</p> <p>A simple modification of diagonal discriminant analysis yields smooth highly nonlinear classification boundaries, called Swirls, that sometimes outperforms Ripples. In particular, if the data are normally distributed with different variances in each class, Swirls substantially outperforms Ripples when using a pooled variance to reduce the number of parameters. The proposed classification rule for two classes selects either Swirls or Ripples after parsimoniously selecting the number of genes and distance measures. Applications to five cancer microarray data sets identified predictive genes related to the tissue organization theory of carcinogenesis.</p> <p>Conclusion</p> <p>The parsimonious selection of classifiers coupled with the selection of either Swirls or Ripples provides a good basis for formulating a simple, yet flexible, classification rule. Open source software is available for download.</p> http://www.biomedcentral.com/1471-2105/11/452
collection DOAJ
language English
format Article
sources DOAJ
author Baker Stuart G
spellingShingle Baker Stuart G
Simple and flexible classification of gene expression microarrays via Swirls and Ripples
BMC Bioinformatics
author_facet Baker Stuart G
author_sort Baker Stuart G
title Simple and flexible classification of gene expression microarrays via Swirls and Ripples
title_short Simple and flexible classification of gene expression microarrays via Swirls and Ripples
title_full Simple and flexible classification of gene expression microarrays via Swirls and Ripples
title_fullStr Simple and flexible classification of gene expression microarrays via Swirls and Ripples
title_full_unstemmed Simple and flexible classification of gene expression microarrays via Swirls and Ripples
title_sort simple and flexible classification of gene expression microarrays via swirls and ripples
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2010-09-01
description <p>Abstract</p> <p>Background</p> <p>A simple classification rule with few genes and parameters is desirable when applying a classification rule to new data. One popular simple classification rule, diagonal discriminant analysis, yields linear or curved classification boundaries, called Ripples, that are optimal when gene expression levels are normally distributed with the appropriate variance, but may yield poor classification in other situations.</p> <p>Results</p> <p>A simple modification of diagonal discriminant analysis yields smooth highly nonlinear classification boundaries, called Swirls, that sometimes outperforms Ripples. In particular, if the data are normally distributed with different variances in each class, Swirls substantially outperforms Ripples when using a pooled variance to reduce the number of parameters. The proposed classification rule for two classes selects either Swirls or Ripples after parsimoniously selecting the number of genes and distance measures. Applications to five cancer microarray data sets identified predictive genes related to the tissue organization theory of carcinogenesis.</p> <p>Conclusion</p> <p>The parsimonious selection of classifiers coupled with the selection of either Swirls or Ripples provides a good basis for formulating a simple, yet flexible, classification rule. Open source software is available for download.</p>
url http://www.biomedcentral.com/1471-2105/11/452
work_keys_str_mv AT bakerstuartg simpleandflexibleclassificationofgeneexpressionmicroarraysviaswirlsandripples
_version_ 1725201818765492224