Neighborhood Rough Set Reduction-Based Gene Selection and Prioritization for Gene Expression Profile Analysis and Molecular Cancer Classification

Selection of reliable cancer biomarkers is crucial for gene expression profile-based precise diagnosis of cancer type and successful treatment. However, current studies are confronted with overfitting and dimensionality curse in tumor classification and false positives in the identification of cance...

Full description

Bibliographic Details
Main Authors: Mei-Ling Hou, Shu-Lin Wang, Xue-Ling Li, Ying-Ke Lei
Format: Article
Language:English
Published: Hindawi Limited 2010-01-01
Series:Journal of Biomedicine and Biotechnology
Online Access:http://dx.doi.org/10.1155/2010/726413
id doaj-dfb90d4d6476434aafdf2399c80120ee
record_format Article
spelling doaj-dfb90d4d6476434aafdf2399c80120ee2020-11-25T01:39:57ZengHindawi LimitedJournal of Biomedicine and Biotechnology1110-72431110-72512010-01-01201010.1155/2010/726413726413Neighborhood Rough Set Reduction-Based Gene Selection and Prioritization for Gene Expression Profile Analysis and Molecular Cancer ClassificationMei-Ling Hou0Shu-Lin Wang1Xue-Ling Li2Ying-Ke Lei3Intelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei, Anhui 230031, ChinaIntelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei, Anhui 230031, ChinaIntelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei, Anhui 230031, ChinaIntelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei, Anhui 230031, ChinaSelection of reliable cancer biomarkers is crucial for gene expression profile-based precise diagnosis of cancer type and successful treatment. However, current studies are confronted with overfitting and dimensionality curse in tumor classification and false positives in the identification of cancer biomarkers. Here, we developed a novel gene-ranking method based on neighborhood rough set reduction for molecular cancer classification based on gene expression profile. Comparison with other methods such as PAM, ClaNC, Kruskal-Wallis rank sum test, and Relief-F, our method shows that only few top-ranked genes could achieve higher tumor classification accuracy. Moreover, although the selected genes are not typical of known oncogenes, they are found to play a crucial role in the occurrence of tumor through searching the scientific literature and analyzing protein interaction partners, which may be used as candidate cancer biomarkers.http://dx.doi.org/10.1155/2010/726413
collection DOAJ
language English
format Article
sources DOAJ
author Mei-Ling Hou
Shu-Lin Wang
Xue-Ling Li
Ying-Ke Lei
spellingShingle Mei-Ling Hou
Shu-Lin Wang
Xue-Ling Li
Ying-Ke Lei
Neighborhood Rough Set Reduction-Based Gene Selection and Prioritization for Gene Expression Profile Analysis and Molecular Cancer Classification
Journal of Biomedicine and Biotechnology
author_facet Mei-Ling Hou
Shu-Lin Wang
Xue-Ling Li
Ying-Ke Lei
author_sort Mei-Ling Hou
title Neighborhood Rough Set Reduction-Based Gene Selection and Prioritization for Gene Expression Profile Analysis and Molecular Cancer Classification
title_short Neighborhood Rough Set Reduction-Based Gene Selection and Prioritization for Gene Expression Profile Analysis and Molecular Cancer Classification
title_full Neighborhood Rough Set Reduction-Based Gene Selection and Prioritization for Gene Expression Profile Analysis and Molecular Cancer Classification
title_fullStr Neighborhood Rough Set Reduction-Based Gene Selection and Prioritization for Gene Expression Profile Analysis and Molecular Cancer Classification
title_full_unstemmed Neighborhood Rough Set Reduction-Based Gene Selection and Prioritization for Gene Expression Profile Analysis and Molecular Cancer Classification
title_sort neighborhood rough set reduction-based gene selection and prioritization for gene expression profile analysis and molecular cancer classification
publisher Hindawi Limited
series Journal of Biomedicine and Biotechnology
issn 1110-7243
1110-7251
publishDate 2010-01-01
description Selection of reliable cancer biomarkers is crucial for gene expression profile-based precise diagnosis of cancer type and successful treatment. However, current studies are confronted with overfitting and dimensionality curse in tumor classification and false positives in the identification of cancer biomarkers. Here, we developed a novel gene-ranking method based on neighborhood rough set reduction for molecular cancer classification based on gene expression profile. Comparison with other methods such as PAM, ClaNC, Kruskal-Wallis rank sum test, and Relief-F, our method shows that only few top-ranked genes could achieve higher tumor classification accuracy. Moreover, although the selected genes are not typical of known oncogenes, they are found to play a crucial role in the occurrence of tumor through searching the scientific literature and analyzing protein interaction partners, which may be used as candidate cancer biomarkers.
url http://dx.doi.org/10.1155/2010/726413
work_keys_str_mv AT meilinghou neighborhoodroughsetreductionbasedgeneselectionandprioritizationforgeneexpressionprofileanalysisandmolecularcancerclassification
AT shulinwang neighborhoodroughsetreductionbasedgeneselectionandprioritizationforgeneexpressionprofileanalysisandmolecularcancerclassification
AT xuelingli neighborhoodroughsetreductionbasedgeneselectionandprioritizationforgeneexpressionprofileanalysisandmolecularcancerclassification
AT yingkelei neighborhoodroughsetreductionbasedgeneselectionandprioritizationforgeneexpressionprofileanalysisandmolecularcancerclassification
_version_ 1725048173585498112