Information gain and wheel based simplified swarm optimization for gene selection from gene expression data
碩士 === 國立清華大學 === 工業工程與工程管理學系 === 103 === Recently, feature selection has been an important issue in data mining problems. The object of feature selection is to find the most distinguished features among datasets which have enormous number of features and then improve the classification accuracy. Fe...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/81564909715943106056 |
id |
ndltd-TW-103NTHU5031057 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NTHU50310572017-02-26T04:27:51Z http://ndltd.ncl.edu.tw/handle/81564909715943106056 Information gain and wheel based simplified swarm optimization for gene selection from gene expression data 應用資訊增益、簡化群體演算法及輪式搜尋策略於基因選取之研究 Chang, Chung Yi 張中議 碩士 國立清華大學 工業工程與工程管理學系 103 Recently, feature selection has been an important issue in data mining problems. The object of feature selection is to find the most distinguished features among datasets which have enormous number of features and then improve the classification accuracy. Feature selection can reduce the noise and save lots of time and costs for researchers, especially when the volume of data is huge. Feature selection has wide applications for high dimensional real world situations such as cancer research in medical field. When feature selection is being used in cancer research to find cancerous genes, it is called “gene selection”. With gene selection, doctors can find the symptoms or signs of cancer at early stage and enhance the survival rate. In this paper, we try to develop an effective gene selection model for ten benchmark gene expression datasets. We proposed an information gain and wheel-based simplified swarm optimization (IG-WSSO) to solve the problem. Initially, we used information gain (IG) to remove irrelevant genes. Then, we conducted simplified swarm optimization with the wheel based search strategy for gene selection (WSSO). Support vector machine (SVM) with leave one out cross validation (LOOCV) was adopted to evaluate the accuracy. We compared our algorithm, IG-WSSO, with previous research by running ten benchmark datasets of gene expression data, which can be downloaded on: http://www.gems-system.org/. The results show IG-WSSO can achieve higher classification accuracy by selecting less number of genes. Yeh, Wei Chang 葉維彰 2015 學位論文 ; thesis 42 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 工業工程與工程管理學系 === 103 === Recently, feature selection has been an important issue in data mining problems. The object of feature selection is to find the most distinguished features among datasets which have enormous number of features and then improve the classification accuracy. Feature selection can reduce the noise and save lots of time and costs for researchers, especially when the volume of data is huge. Feature selection has wide applications for high dimensional real world situations such as cancer research in medical field. When feature selection is being used in cancer research to find cancerous genes, it is called “gene selection”. With gene selection, doctors can find the symptoms or signs of cancer at early stage and enhance the survival rate. In this paper, we try to develop an effective gene selection model for ten benchmark gene expression datasets. We proposed an information gain and wheel-based simplified swarm optimization (IG-WSSO) to solve the problem. Initially, we used information gain (IG) to remove irrelevant genes. Then, we conducted simplified swarm optimization with the wheel based search strategy for gene selection (WSSO). Support vector machine (SVM) with leave one out cross validation (LOOCV) was adopted to evaluate the accuracy. We compared our algorithm, IG-WSSO, with previous research by running ten benchmark datasets of gene expression data, which can be downloaded on: http://www.gems-system.org/. The results show IG-WSSO can achieve higher classification accuracy by selecting less number of genes.
|
author2 |
Yeh, Wei Chang |
author_facet |
Yeh, Wei Chang Chang, Chung Yi 張中議 |
author |
Chang, Chung Yi 張中議 |
spellingShingle |
Chang, Chung Yi 張中議 Information gain and wheel based simplified swarm optimization for gene selection from gene expression data |
author_sort |
Chang, Chung Yi |
title |
Information gain and wheel based simplified swarm optimization for gene selection from gene expression data |
title_short |
Information gain and wheel based simplified swarm optimization for gene selection from gene expression data |
title_full |
Information gain and wheel based simplified swarm optimization for gene selection from gene expression data |
title_fullStr |
Information gain and wheel based simplified swarm optimization for gene selection from gene expression data |
title_full_unstemmed |
Information gain and wheel based simplified swarm optimization for gene selection from gene expression data |
title_sort |
information gain and wheel based simplified swarm optimization for gene selection from gene expression data |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/81564909715943106056 |
work_keys_str_mv |
AT changchungyi informationgainandwheelbasedsimplifiedswarmoptimizationforgeneselectionfromgeneexpressiondata AT zhāngzhōngyì informationgainandwheelbasedsimplifiedswarmoptimizationforgeneselectionfromgeneexpressiondata AT changchungyi yīngyòngzīxùnzēngyìjiǎnhuàqúntǐyǎnsuànfǎjílúnshìsōuxúncèlüèyújīyīnxuǎnqǔzhīyánjiū AT zhāngzhōngyì yīngyòngzīxùnzēngyìjiǎnhuàqúntǐyǎnsuànfǎjílúnshìsōuxúncèlüèyújīyīnxuǎnqǔzhīyánjiū |
_version_ |
1718416815850258432 |