Sensitivity analysis of predictive data analytic models to attributes

Classification algorithms represent a rich set of tools, which train a classification model from a given training and test set, to classify previously unseen test instances. Although existing methods have studied classification algorithm performance with respect to feature selection, noise condition...

Full description

Bibliographic Details
Other Authors: Chiou, James (author)
Format: Others
Language:English
Published: Florida Atlantic University
Subjects:
Online Access:http://purl.flvc.org/fau/fd/FA00004274
http://purl.flvc.org/fau/fd/FA00004274
id ndltd-fau.edu-oai-fau.digital.flvc.org-fau_30763
record_format oai_dc
spelling ndltd-fau.edu-oai-fau.digital.flvc.org-fau_307632019-07-04T03:53:01Z Sensitivity analysis of predictive data analytic models to attributes FA00004274 Chiou, James (author) Zhu, Xingquan (Thesis advisor) Florida Atlantic University (Degree grantor) College of Engineering and Computer Science Department of Computer and Electrical Engineering and Computer Science 115 p. application/pdf Electronic Thesis or Dissertation Text English Classification algorithms represent a rich set of tools, which train a classification model from a given training and test set, to classify previously unseen test instances. Although existing methods have studied classification algorithm performance with respect to feature selection, noise condition, and sample distributions, our existing studies have not addressed an important issue on the classification algorithm performance relating to feature deletion and addition. In this thesis, we carry out sensitive study of classification algorithms by using feature deletion and addition. Three types of classifiers: (1) weak classifiers; (2) generic and strong classifiers; and (3) ensemble classifiers are validated on three types of data (1) feature dimension data, (2) gene expression data and (3) biomedical document data. In the experiments, we continuously add redundant features to the training and test set in order to observe the classification algorithm performance, and also continuously remove features to find the performance of the underlying classifiers. Our studies draw a number of important findings, which will help data mining and machine learning community under the genuine performance of common classification algorithms on real-world data. Florida Atlantic University Data mining Forecasting -- Mathematical models Social sciences -- Statistical methods Ubiquitous computing Includes bibliography. Thesis (M.S.)--Florida Atlantic University, 2014. FAU Electronic Theses and Dissertations Collection Copyright © is held by the author, with permission granted to Florida Atlantic University to digitize, archive and distribute this item for non-profit research and educational purposes. Any reuse of this item in excess of fair use or other copyright exemptions requires permission of the copyright holder. http://purl.flvc.org/fau/fd/FA00004274 http://purl.flvc.org/fau/fd/FA00004274 http://rightsstatements.org/vocab/InC/1.0/ https://fau.digital.flvc.org/islandora/object/fau%3A30763/datastream/TN/view/Sensitivity%20analysis%20of%20predictive%20data%20analytic%20models%20to%20attributes.jpg
collection NDLTD
language English
format Others
sources NDLTD
topic Data mining
Forecasting -- Mathematical models
Social sciences -- Statistical methods
Ubiquitous computing
spellingShingle Data mining
Forecasting -- Mathematical models
Social sciences -- Statistical methods
Ubiquitous computing
Sensitivity analysis of predictive data analytic models to attributes
description Classification algorithms represent a rich set of tools, which train a classification model from a given training and test set, to classify previously unseen test instances. Although existing methods have studied classification algorithm performance with respect to feature selection, noise condition, and sample distributions, our existing studies have not addressed an important issue on the classification algorithm performance relating to feature deletion and addition. In this thesis, we carry out sensitive study of classification algorithms by using feature deletion and addition. Three types of classifiers: (1) weak classifiers; (2) generic and strong classifiers; and (3) ensemble classifiers are validated on three types of data (1) feature dimension data, (2) gene expression data and (3) biomedical document data. In the experiments, we continuously add redundant features to the training and test set in order to observe the classification algorithm performance, and also continuously remove features to find the performance of the underlying classifiers. Our studies draw a number of important findings, which will help data mining and machine learning community under the genuine performance of common classification algorithms on real-world data. === Includes bibliography. === Thesis (M.S.)--Florida Atlantic University, 2014. === FAU Electronic Theses and Dissertations Collection
author2 Chiou, James (author)
author_facet Chiou, James (author)
title Sensitivity analysis of predictive data analytic models to attributes
title_short Sensitivity analysis of predictive data analytic models to attributes
title_full Sensitivity analysis of predictive data analytic models to attributes
title_fullStr Sensitivity analysis of predictive data analytic models to attributes
title_full_unstemmed Sensitivity analysis of predictive data analytic models to attributes
title_sort sensitivity analysis of predictive data analytic models to attributes
publisher Florida Atlantic University
url http://purl.flvc.org/fau/fd/FA00004274
http://purl.flvc.org/fau/fd/FA00004274
_version_ 1719219039297863680