Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data

<p>Abstract</p> <p>Background</p> <p>More studies based on gene expression data have been reported in great detail, however, one major challenge for the methodologists is the choice of classification methods. The main purpose of this research was to compare the performa...

Full description

Bibliographic Details
Main Authors: He Miao, Quan Yu, Huang Desheng, Zhou Baosen
Format: Article
Language:English
Published: BMC 2009-12-01
Series:Journal of Experimental & Clinical Cancer Research
Online Access:http://www.jeccr.com/content/28/1/149
id doaj-a7157418c4eb4d78b04a57bf57931d27
record_format Article
spelling doaj-a7157418c4eb4d78b04a57bf57931d272020-11-25T00:04:48ZengBMCJournal of Experimental & Clinical Cancer Research1756-99662009-12-0128114910.1186/1756-9966-28-149Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression dataHe MiaoQuan YuHuang DeshengZhou Baosen<p>Abstract</p> <p>Background</p> <p>More studies based on gene expression data have been reported in great detail, however, one major challenge for the methodologists is the choice of classification methods. The main purpose of this research was to compare the performance of linear discriminant analysis (LDA) and its modification methods for the classification of cancer based on gene expression data.</p> <p>Methods</p> <p>The classification performance of linear discriminant analysis (LDA) and its modification methods was evaluated by applying these methods to six public cancer gene expression datasets. These methods included linear discriminant analysis (LDA), prediction analysis for microarrays (PAM), shrinkage centroid regularized discriminant analysis (SCRDA), shrinkage linear discriminant analysis (SLDA) and shrinkage diagonal discriminant analysis (SDDA). The procedures were performed by software R 2.80.</p> <p>Results</p> <p>PAM picked out fewer feature genes than other methods from most datasets except from Brain dataset. For the two methods of shrinkage discriminant analysis, SLDA selected more genes than SDDA from most datasets except from 2-class lung cancer dataset. When comparing SLDA with SCRDA, SLDA selected more genes than SCRDA from 2-class lung cancer, SRBCT and Brain dataset, the result was opposite for the rest datasets. The average test error of LDA modification methods was lower than LDA method.</p> <p>Conclusions</p> <p>The classification performance of LDA modification methods was superior to that of traditional LDA with respect to the average error and there was no significant difference between theses modification methods.</p> http://www.jeccr.com/content/28/1/149
collection DOAJ
language English
format Article
sources DOAJ
author He Miao
Quan Yu
Huang Desheng
Zhou Baosen
spellingShingle He Miao
Quan Yu
Huang Desheng
Zhou Baosen
Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data
Journal of Experimental & Clinical Cancer Research
author_facet He Miao
Quan Yu
Huang Desheng
Zhou Baosen
author_sort He Miao
title Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data
title_short Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data
title_full Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data
title_fullStr Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data
title_full_unstemmed Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data
title_sort comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data
publisher BMC
series Journal of Experimental & Clinical Cancer Research
issn 1756-9966
publishDate 2009-12-01
description <p>Abstract</p> <p>Background</p> <p>More studies based on gene expression data have been reported in great detail, however, one major challenge for the methodologists is the choice of classification methods. The main purpose of this research was to compare the performance of linear discriminant analysis (LDA) and its modification methods for the classification of cancer based on gene expression data.</p> <p>Methods</p> <p>The classification performance of linear discriminant analysis (LDA) and its modification methods was evaluated by applying these methods to six public cancer gene expression datasets. These methods included linear discriminant analysis (LDA), prediction analysis for microarrays (PAM), shrinkage centroid regularized discriminant analysis (SCRDA), shrinkage linear discriminant analysis (SLDA) and shrinkage diagonal discriminant analysis (SDDA). The procedures were performed by software R 2.80.</p> <p>Results</p> <p>PAM picked out fewer feature genes than other methods from most datasets except from Brain dataset. For the two methods of shrinkage discriminant analysis, SLDA selected more genes than SDDA from most datasets except from 2-class lung cancer dataset. When comparing SLDA with SCRDA, SLDA selected more genes than SCRDA from 2-class lung cancer, SRBCT and Brain dataset, the result was opposite for the rest datasets. The average test error of LDA modification methods was lower than LDA method.</p> <p>Conclusions</p> <p>The classification performance of LDA modification methods was superior to that of traditional LDA with respect to the average error and there was no significant difference between theses modification methods.</p>
url http://www.jeccr.com/content/28/1/149
work_keys_str_mv AT hemiao comparisonoflineardiscriminantanalysismethodsfortheclassificationofcancerbasedongeneexpressiondata
AT quanyu comparisonoflineardiscriminantanalysismethodsfortheclassificationofcancerbasedongeneexpressiondata
AT huangdesheng comparisonoflineardiscriminantanalysismethodsfortheclassificationofcancerbasedongeneexpressiondata
AT zhoubaosen comparisonoflineardiscriminantanalysismethodsfortheclassificationofcancerbasedongeneexpressiondata
_version_ 1725427921511776256