A Proposed Frequency-Based Feature Selection Method for Cancer Classification
Feature selection method is becoming an essential procedure in data preprocessing step. The feature selection problem can affect the efficiency and accuracy of classification models. Therefore, it also relates to whether a classification model can have a reliable performance. In this study, we compa...
Main Author: | |
---|---|
Format: | Others |
Published: |
TopSCHOLAR®
2017
|
Subjects: | |
Online Access: | http://digitalcommons.wku.edu/theses/1954 http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2956&context=theses |
id |
ndltd-WKU-oai-digitalcommons.wku.edu-theses-2956 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-WKU-oai-digitalcommons.wku.edu-theses-29562017-04-28T05:37:49Z A Proposed Frequency-Based Feature Selection Method for Cancer Classification Pan, Yi Feature selection method is becoming an essential procedure in data preprocessing step. The feature selection problem can affect the efficiency and accuracy of classification models. Therefore, it also relates to whether a classification model can have a reliable performance. In this study, we compared an original feature selection method and a proposed frequency-based feature selection method with four classification models and three filter-based ranking techniques using a cancer dataset. The proposed method was implemented in WEKA which is an open source software. The performance is evaluated by two evaluation methods: Recall and Receiver Operating Characteristic (ROC). Finally, we found the frequency-based feature selection method performed better than the original ranking method. 2017-04-01T07:00:00Z text application/pdf http://digitalcommons.wku.edu/theses/1954 http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2956&context=theses Masters Theses & Specialist Projects TopSCHOLAR® frequency-based feature selection ranking method WEKA ROC Bioinformatics Computer Sciences Databases and Information Systems |
collection |
NDLTD |
format |
Others
|
sources |
NDLTD |
topic |
frequency-based feature selection ranking method WEKA ROC Bioinformatics Computer Sciences Databases and Information Systems |
spellingShingle |
frequency-based feature selection ranking method WEKA ROC Bioinformatics Computer Sciences Databases and Information Systems Pan, Yi A Proposed Frequency-Based Feature Selection Method for Cancer Classification |
description |
Feature selection method is becoming an essential procedure in data preprocessing step. The feature selection problem can affect the efficiency and accuracy of classification models. Therefore, it also relates to whether a classification model can have a reliable performance. In this study, we compared an original feature selection method and a proposed frequency-based feature selection method with four classification models and three filter-based ranking techniques using a cancer dataset. The proposed method was implemented in WEKA which is an open source software. The performance is evaluated by two evaluation methods: Recall and Receiver Operating Characteristic (ROC). Finally, we found the frequency-based feature selection method performed better than the original ranking method. |
author |
Pan, Yi |
author_facet |
Pan, Yi |
author_sort |
Pan, Yi |
title |
A Proposed Frequency-Based Feature Selection Method for Cancer Classification |
title_short |
A Proposed Frequency-Based Feature Selection Method for Cancer Classification |
title_full |
A Proposed Frequency-Based Feature Selection Method for Cancer Classification |
title_fullStr |
A Proposed Frequency-Based Feature Selection Method for Cancer Classification |
title_full_unstemmed |
A Proposed Frequency-Based Feature Selection Method for Cancer Classification |
title_sort |
proposed frequency-based feature selection method for cancer classification |
publisher |
TopSCHOLAR® |
publishDate |
2017 |
url |
http://digitalcommons.wku.edu/theses/1954 http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2956&context=theses |
work_keys_str_mv |
AT panyi aproposedfrequencybasedfeatureselectionmethodforcancerclassification AT panyi proposedfrequencybasedfeatureselectionmethodforcancerclassification |
_version_ |
1718445435289337856 |