Redundancy-Reducing Feature Selection from Microarray Data Based on Gene-Grouping
Master's thesis === National Chiao Tung University === Institute of Statistics === 92 === A microarray dataset typically contains thousands of genes but only tens of subjects. This so-called “large (gene), small (subject)” property complicates statistical analysis. Gene selection is a typical approach to this problem...
Main Authors: | Bao-wen Chang, 張寶文 |
---|---|
Other Authors: | Jyh-Jen Horng Shiau |
Format: | Others |
Language: | en_US |
Published: | 2004 |
Online Access: | http://ndltd.ncl.edu.tw/handle/ka3d5b |
id |
ndltd-TW-092NCTU5337005 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-092NCTU5337005 2019-05-15T19:38:00Z http://ndltd.ncl.edu.tw/handle/ka3d5b Redundancy-Reducing Feature Selection from Microarray Data Based on Gene-Grouping 在微陣列資料上利用基因分群以減少冗贅之基因選取方法 Bao-wen Chang 張寶文 Master's thesis National Chiao Tung University Institute of Statistics 92 A microarray dataset typically contains thousands of genes but only tens of subjects. This so-called “large (gene), small (subject)” property complicates statistical analysis. Gene selection is a typical approach to this problem. There are two conventional kinds of gene selection methods: filters and wrappers. Filters decide whether to select a gene based on a ranking criterion; they are therefore very fast to compute but may select highly correlated genes, which gives rise to redundancy. Wrappers, on the other hand, usually select a small set of non-redundant genes but require extensive computation. This study adopts a combination of the two. We first filter out irrelevant genes according to a ranking criterion and then group the remaining genes via the K-means clustering algorithm to avoid redundancy. Then, the SVM-RFE gene selection method proposed by Guyon et al. (2002) is applied to a list of candidate genes selected from each cluster. Three popular cancer data sets are analyzed with the proposed method. The results show that the proposed method performs better than the three filter methods under study when the number of selected genes is small. Jyh-Jen Horng Shiau Hui-Nien Hung 洪志真 洪慧念 2004 thesis 42 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's thesis === National Chiao Tung University === Institute of Statistics === 92 === A microarray dataset typically contains thousands of genes but only tens of subjects. This so-called “large (gene), small (subject)” property complicates statistical analysis. Gene selection is a typical approach to this problem. There are two conventional kinds of gene selection methods: filters and wrappers. Filters decide whether to select a gene based on a ranking criterion; they are therefore very fast to compute but may select highly correlated genes, which gives rise to redundancy. Wrappers, on the other hand, usually select a small set of non-redundant genes but require extensive computation. This study adopts a combination of the two. We first filter out irrelevant genes according to a ranking criterion and then group the remaining genes via the K-means clustering algorithm to avoid redundancy. Then, the SVM-RFE gene selection method proposed by Guyon et al. (2002) is applied to a list of candidate genes selected from each cluster. Three popular cancer data sets are analyzed with the proposed method. The results show that the proposed method performs better than the three filter methods under study when the number of selected genes is small.
|
author2 |
Jyh-Jen Horng Shiau |
author_facet |
Jyh-Jen Horng Shiau Bao-wen Chang 張寶文 |
author |
Bao-wen Chang 張寶文 |
author_sort |
Bao-wen Chang |
title |
Redundancy-Reducing Feature Selection from Microarray Data Based on Gene-Grouping |
title_sort |
redundancy-reducing feature selection from microarray data based on gene-grouping |
publishDate |
2004 |
url |
http://ndltd.ncl.edu.tw/handle/ka3d5b |
_version_ |
1719091673418432512 |
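
The three stages named in the abstract (filter, K-means gene grouping, SVM-RFE) can be sketched in plain Python as below. This is only an illustration under stated assumptions, not the thesis's implementation: the record does not say which ranking criterion is used, how many candidates are taken per cluster, or how the SVM is trained. The sketch assumes a signal-to-noise ratio as the filter score, one top-scoring candidate per cluster, and a Pegasos-style linear SVM without a bias term. All function names (`snr_score`, `kmeans`, `linear_svm`, `select_genes`, `make_toy_data`) and the toy data are hypothetical.

```python
import random
from statistics import mean, pstdev

def snr_score(values, labels):
    """Assumed filter criterion: |mean(+1) - mean(-1)| / (sd(+1) + sd(-1))."""
    pos = [v for v, y in zip(values, labels) if y == 1]
    neg = [v for v, y in zip(values, labels) if y == -1]
    return abs(mean(pos) - mean(neg)) / (pstdev(pos) + pstdev(neg) + 1e-12)

def kmeans(points, k, iters=25, seed=0):
    """Plain Lloyd's K-means; returns one cluster index per point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [min(range(k),
                      key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
                  for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # an emptied cluster keeps its previous center
                centers[c] = [mean(col) for col in zip(*members)]
    return assign

def linear_svm(X, y, lam=0.01, epochs=100, seed=0):
    """Pegasos-style subgradient descent for a linear SVM (no bias); returns weights."""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    order = list(range(len(X)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            w = [(1.0 - eta * lam) * wj for wj in w]
            if margin < 1:  # hinge loss is active for this sample
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w

def select_genes(X, y, n_keep, k, n_final):
    """X: samples x genes (list of rows); y: labels in {+1, -1}."""
    genes = [list(col) for col in zip(*X)]  # one expression profile per gene
    scores = [snr_score(g, y) for g in genes]
    # Stage 1 (filter): keep the n_keep top-ranked genes.
    kept = sorted(range(len(genes)), key=lambda j: -scores[j])[:n_keep]
    # Stage 2 (grouping): cluster the kept genes' profiles, so that
    # similar (redundant) genes land in the same group.
    assign = kmeans([genes[j] for j in kept], k)
    candidates = []
    for c in range(k):
        members = [j for j, a in zip(kept, assign) if a == c]
        if members:  # assumption: one top-scoring candidate per cluster
            candidates.append(max(members, key=lambda j: scores[j]))
    # Stage 3 (SVM-RFE): repeatedly drop the candidate with the smallest w_i^2.
    while len(candidates) > n_final:
        w = linear_svm([[row[j] for j in candidates] for row in X], y)
        worst = min(range(len(candidates)), key=lambda i: w[i] ** 2)
        candidates.pop(worst)
    return sorted(candidates)

def make_toy_data(seed=1):
    """12 subjects, 20 genes: two redundant informative groups plus noise genes."""
    rng = random.Random(seed)
    y = [1] * 6 + [-1] * 6
    X = []
    for label in y:
        group_a = [2.0 * label + rng.gauss(0, 0.3) for _ in range(3)]
        group_b = [-2.0 * label + rng.gauss(0, 0.3) for _ in range(3)]
        noise = [rng.gauss(0, 1.0) for _ in range(14)]
        X.append(group_a + group_b + noise)
    return X, y

X, y = make_toy_data()
print(select_genes(X, y, n_keep=8, k=3, n_final=2))
```

Taking a single candidate per cluster is the simplest way to keep highly correlated genes from entering the wrapper stage together; a larger candidate list per cluster would trade some redundancy for a wider search.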