A Proposed Frequency-Based Feature Selection Method for Cancer Classification

Feature selection method is becoming an essential procedure in data preprocessing step. The feature selection problem can affect the efficiency and accuracy of classification models. Therefore, it also relates to whether a classification model can have a reliable performance. In this study, we compa...

Full description

Bibliographic Details
Main Author: Pan, Yi
Format: Others
Published: TopSCHOLAR® 2017
Subjects:
ROC
Online Access:http://digitalcommons.wku.edu/theses/1954
http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2956&context=theses
id ndltd-WKU-oai-digitalcommons.wku.edu-theses-2956
record_format oai_dc
spelling ndltd-WKU-oai-digitalcommons.wku.edu-theses-29562017-04-28T05:37:49Z A Proposed Frequency-Based Feature Selection Method for Cancer Classification Pan, Yi Feature selection method is becoming an essential procedure in data preprocessing step. The feature selection problem can affect the efficiency and accuracy of classification models. Therefore, it also relates to whether a classification model can have a reliable performance. In this study, we compared an original feature selection method and a proposed frequency-based feature selection method with four classification models and three filter-based ranking techniques using a cancer dataset. The proposed method was implemented in WEKA which is an open source software. The performance is evaluated by two evaluation methods: Recall and Receiver Operating Characteristic (ROC). Finally, we found the frequency-based feature selection method performed better than the original ranking method. 2017-04-01T07:00:00Z text application/pdf http://digitalcommons.wku.edu/theses/1954 http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2956&context=theses Masters Theses & Specialist Projects TopSCHOLAR® frequency-based feature selection ranking method WEKA ROC Bioinformatics Computer Sciences Databases and Information Systems
collection NDLTD
format Others
sources NDLTD
topic frequency-based
feature selection
ranking method
WEKA
ROC
Bioinformatics
Computer Sciences
Databases and Information Systems
spellingShingle frequency-based
feature selection
ranking method
WEKA
ROC
Bioinformatics
Computer Sciences
Databases and Information Systems
Pan, Yi
A Proposed Frequency-Based Feature Selection Method for Cancer Classification
description Feature selection method is becoming an essential procedure in data preprocessing step. The feature selection problem can affect the efficiency and accuracy of classification models. Therefore, it also relates to whether a classification model can have a reliable performance. In this study, we compared an original feature selection method and a proposed frequency-based feature selection method with four classification models and three filter-based ranking techniques using a cancer dataset. The proposed method was implemented in WEKA which is an open source software. The performance is evaluated by two evaluation methods: Recall and Receiver Operating Characteristic (ROC). Finally, we found the frequency-based feature selection method performed better than the original ranking method.
author Pan, Yi
author_facet Pan, Yi
author_sort Pan, Yi
title A Proposed Frequency-Based Feature Selection Method for Cancer Classification
title_short A Proposed Frequency-Based Feature Selection Method for Cancer Classification
title_full A Proposed Frequency-Based Feature Selection Method for Cancer Classification
title_fullStr A Proposed Frequency-Based Feature Selection Method for Cancer Classification
title_full_unstemmed A Proposed Frequency-Based Feature Selection Method for Cancer Classification
title_sort proposed frequency-based feature selection method for cancer classification
publisher TopSCHOLAR®
publishDate 2017
url http://digitalcommons.wku.edu/theses/1954
http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2956&context=theses
work_keys_str_mv AT panyi aproposedfrequencybasedfeatureselectionmethodforcancerclassification
AT panyi proposedfrequencybasedfeatureselectionmethodforcancerclassification
_version_ 1718445435289337856