Memetic Computation Based SVM for Cancer Classification

碩士 === 大葉大學 === 電機工程學系 === 100 === Cancer is one of the dreadful diseases found in most of the living being, which is one of the challenging studies for scientist towards 21th century. In cancer diagnosis and treatment, cancer classification plays a very important role. With the advent of DNA microa...

Full description

Bibliographic Details
Main Authors:	Nguyen Thi Nga, Nguyen Thi Nga(阮氏娥)
Other Authors:	Shinq-Jen Wu
Format:	Others
Language:	en_US
Published:	2012
Online Access:	http://ndltd.ncl.edu.tw/handle/9u3urc

id	ndltd-TW-100DYU00442008
record_format	oai_dc
spelling	ndltd-TW-100DYU004420082019-05-15T20:43:08Z http://ndltd.ncl.edu.tw/handle/9u3urc Memetic Computation Based SVM for Cancer Classification 基於文化計算的SVM技術於癌症分類 Nguyen Thi Nga Nguyen Thi Nga(阮氏娥) 碩士大葉大學電機工程學系 100 Cancer is one of the dreadful diseases found in most of the living being, which is one of the challenging studies for scientist towards 21th century. In cancer diagnosis and treatment, cancer classification plays a very important role. With the advent of DNA microarrays technology, constructing gene expression profiles for different cancer types has already become a promising means for cancer classification. However, it offers a challenge for current machine learning research. Microarray datasets are characterized by high dimension and small sample size. Over-fitting is a major problem due to the high dimension, while the small data size makes it worse. Support vector machine (SVM) is statistical classification algorithm that classifies data by separating two classes with the help of a functional hyper plane. SVM is known for good performance on noisy and high dimensional data such as microarray. One main disadvantage of using SVMs is that the performance of classifier depends on setting of parameters. In this thesis, we do classify cancer using gene expression data with a SVM classifier. A hybrid approach of particle swarm optimization (PSO) and simulated annealing (SA) is proposed to determine proper setting of SVM parameters which can improve the quality of SVM model. Our approach is a combination of methods. The motivation is to bring out an effective classification method for cancer by utilizing the strength of various techniques and compensating for their weaknesses. The proposed approach is tested on six benchmark cancer gene expression data sets, namely, colon, leukemia, lung, ovarian, prostate and breast. The experimental results show that the classification accuracy rates of the proposed method are competitive to that of other existing methods. It can be used as an efficient computational tool for microarray data analysis. Keywords: cancer classification, support vector machine, parameter optimization. Shinq-Jen Wu 吳幸珍 2012 學位論文 ; thesis 30 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 大葉大學 === 電機工程學系 === 100 === Cancer is one of the dreadful diseases found in most of the living being, which is one of the challenging studies for scientist towards 21th century. In cancer diagnosis and treatment, cancer classification plays a very important role. With the advent of DNA microarrays technology, constructing gene expression profiles for different cancer types has already become a promising means for cancer classification. However, it offers a challenge for current machine learning research. Microarray datasets are characterized by high dimension and small sample size. Over-fitting is a major problem due to the high dimension, while the small data size makes it worse. Support vector machine (SVM) is statistical classification algorithm that classifies data by separating two classes with the help of a functional hyper plane. SVM is known for good performance on noisy and high dimensional data such as microarray. One main disadvantage of using SVMs is that the performance of classifier depends on setting of parameters. In this thesis, we do classify cancer using gene expression data with a SVM classifier. A hybrid approach of particle swarm optimization (PSO) and simulated annealing (SA) is proposed to determine proper setting of SVM parameters which can improve the quality of SVM model. Our approach is a combination of methods. The motivation is to bring out an effective classification method for cancer by utilizing the strength of various techniques and compensating for their weaknesses. The proposed approach is tested on six benchmark cancer gene expression data sets, namely, colon, leukemia, lung, ovarian, prostate and breast. The experimental results show that the classification accuracy rates of the proposed method are competitive to that of other existing methods. It can be used as an efficient computational tool for microarray data analysis. Keywords: cancer classification, support vector machine, parameter optimization.
author2	Shinq-Jen Wu
author_facet	Shinq-Jen Wu Nguyen Thi Nga Nguyen Thi Nga(阮氏娥)
author	Nguyen Thi Nga Nguyen Thi Nga(阮氏娥)
spellingShingle	Nguyen Thi Nga Nguyen Thi Nga(阮氏娥) Memetic Computation Based SVM for Cancer Classification
author_sort	Nguyen Thi Nga
title	Memetic Computation Based SVM for Cancer Classification
title_short	Memetic Computation Based SVM for Cancer Classification
title_full	Memetic Computation Based SVM for Cancer Classification
title_fullStr	Memetic Computation Based SVM for Cancer Classification
title_full_unstemmed	Memetic Computation Based SVM for Cancer Classification
title_sort	memetic computation based svm for cancer classification
publishDate	2012
url	http://ndltd.ncl.edu.tw/handle/9u3urc
work_keys_str_mv	AT nguyenthinga memeticcomputationbasedsvmforcancerclassification AT nguyenthingaruǎnshìé memeticcomputationbasedsvmforcancerclassification AT nguyenthinga jīyúwénhuàjìsuàndesvmjìshùyúáizhèngfēnlèi AT nguyenthingaruǎnshìé jīyúwénhuàjìsuàndesvmjìshùyúáizhèngfēnlèi
_version_	1719104192733249536

Memetic Computation Based SVM for Cancer Classification

Similar Items