Memetic Computation Based SVM for Cancer Classification
碩士 === 大葉大學 === 電機工程學系 === 100 === Cancer is one of the dreadful diseases found in most of the living being, which is one of the challenging studies for scientist towards 21th century. In cancer diagnosis and treatment, cancer classification plays a very important role. With the advent of DNA microa...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2012
|
Online Access: | http://ndltd.ncl.edu.tw/handle/9u3urc |
id |
ndltd-TW-100DYU00442008 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100DYU004420082019-05-15T20:43:08Z http://ndltd.ncl.edu.tw/handle/9u3urc Memetic Computation Based SVM for Cancer Classification 基於文化計算的SVM技術於癌症分類 Nguyen Thi Nga Nguyen Thi Nga(阮氏娥) 碩士 大葉大學 電機工程學系 100 Cancer is one of the dreadful diseases found in most of the living being, which is one of the challenging studies for scientist towards 21th century. In cancer diagnosis and treatment, cancer classification plays a very important role. With the advent of DNA microarrays technology, constructing gene expression profiles for different cancer types has already become a promising means for cancer classification. However, it offers a challenge for current machine learning research. Microarray datasets are characterized by high dimension and small sample size. Over-fitting is a major problem due to the high dimension, while the small data size makes it worse. Support vector machine (SVM) is statistical classification algorithm that classifies data by separating two classes with the help of a functional hyper plane. SVM is known for good performance on noisy and high dimensional data such as microarray. One main disadvantage of using SVMs is that the performance of classifier depends on setting of parameters. In this thesis, we do classify cancer using gene expression data with a SVM classifier. A hybrid approach of particle swarm optimization (PSO) and simulated annealing (SA) is proposed to determine proper setting of SVM parameters which can improve the quality of SVM model. Our approach is a combination of methods. The motivation is to bring out an effective classification method for cancer by utilizing the strength of various techniques and compensating for their weaknesses. The proposed approach is tested on six benchmark cancer gene expression data sets, namely, colon, leukemia, lung, ovarian, prostate and breast. The experimental results show that the classification accuracy rates of the proposed method are competitive to that of other existing methods. It can be used as an efficient computational tool for microarray data analysis. Keywords: cancer classification, support vector machine, parameter optimization. Shinq-Jen Wu 吳幸珍 2012 學位論文 ; thesis 30 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 大葉大學 === 電機工程學系 === 100 === Cancer is one of the dreadful diseases found in most of the living being, which is one of the challenging studies for scientist towards 21th century. In cancer diagnosis and treatment, cancer classification plays a very important role. With the advent of DNA microarrays technology, constructing gene expression profiles for different cancer types has already become a promising means for cancer classification. However, it offers a challenge for current machine learning research. Microarray datasets are characterized by high dimension and small sample size. Over-fitting is a major problem due to the high dimension, while the small data size makes it worse. Support vector machine (SVM) is statistical classification algorithm that classifies data by separating two classes with the help of a functional hyper plane. SVM is known for good performance on noisy and high dimensional data such as microarray. One main disadvantage of using SVMs is that the performance of classifier depends on setting of parameters.
In this thesis, we do classify cancer using gene expression data with a SVM classifier. A hybrid approach of particle swarm optimization (PSO) and simulated annealing (SA) is proposed to determine proper setting of SVM parameters which can improve the quality of SVM model. Our approach is a combination of methods. The motivation is to bring out an effective classification method for cancer by utilizing the strength of various techniques and compensating for their weaknesses. The proposed approach is tested on six benchmark cancer gene expression data sets, namely, colon, leukemia, lung, ovarian, prostate and breast. The experimental results show that the classification accuracy rates of the proposed method are competitive to that of other existing methods. It can be used as an efficient computational tool for microarray data analysis.
Keywords: cancer classification, support vector machine, parameter optimization.
|
author2 |
Shinq-Jen Wu |
author_facet |
Shinq-Jen Wu Nguyen Thi Nga Nguyen Thi Nga(阮氏娥) |
author |
Nguyen Thi Nga Nguyen Thi Nga(阮氏娥) |
spellingShingle |
Nguyen Thi Nga Nguyen Thi Nga(阮氏娥) Memetic Computation Based SVM for Cancer Classification |
author_sort |
Nguyen Thi Nga |
title |
Memetic Computation Based SVM for Cancer Classification |
title_short |
Memetic Computation Based SVM for Cancer Classification |
title_full |
Memetic Computation Based SVM for Cancer Classification |
title_fullStr |
Memetic Computation Based SVM for Cancer Classification |
title_full_unstemmed |
Memetic Computation Based SVM for Cancer Classification |
title_sort |
memetic computation based svm for cancer classification |
publishDate |
2012 |
url |
http://ndltd.ncl.edu.tw/handle/9u3urc |
work_keys_str_mv |
AT nguyenthinga memeticcomputationbasedsvmforcancerclassification AT nguyenthingaruǎnshìé memeticcomputationbasedsvmforcancerclassification AT nguyenthinga jīyúwénhuàjìsuàndesvmjìshùyúáizhèngfēnlèi AT nguyenthingaruǎnshìé jīyúwénhuàjìsuàndesvmjìshùyúáizhèngfēnlèi |
_version_ |
1719104192733249536 |