Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm

Data splitting is an important step in artificial neural network (ANN) models, which is found in the form of training and testing subsets. In general, a random data splitting method is favored to divide a pool of samples into subsets, without considering the quality of data for the training step of...

Full description

Bibliographic Details
Main Authors: Wirote Apinantanakon, Khamron Sunat, Joel Alan Kinmond
Format: Article
Language:English
Published: Khon Kaen University 2019-12-01
Series:Engineering and Applied Science Research
Subjects:
Online Access:https://www.tci-thaijo.org/index.php/easr/article/download/184408/155804/
id doaj-4119f0c8a3ba4d34bfbfb032057a2148
record_format Article
spelling doaj-4119f0c8a3ba4d34bfbfb032057a21482020-11-25T01:54:16ZengKhon Kaen UniversityEngineering and Applied Science Research2539-61612539-62182019-12-0146427628410.14456/easr.2019.31Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithmWirote ApinantanakonKhamron SunatJoel Alan KinmondData splitting is an important step in artificial neural network (ANN) models, which is found in the form of training and testing subsets. In general, a random data splitting method is favored to divide a pool of samples into subsets, without considering the quality of data for the training step of a neural network. The drawback of poor data splitting methods is that they poses ill effects to the performance of the neural network when the data involves complex matrices or multivariate modeling. In order to overcome this drawback, the current paper presents our proposed M-SPXY method. It is based on a modified version of Sample Set Partitioning, which relies on a joint X-y distances (SPXY) method. The proposed method has resulted in better performance, compared to the modified Kennard-Stone (KS) method, using Mahalanobis distances (MDKS). In our experiments, the proposed approach was employed to compare various data splitting methods using data sets from the repository of the University of California in Irvine (UCI), processed through an Extreme Learning Machine (ELM) neural network. Performance was measured in terms of classification accuracy. The results indicate that the classification accuracy of the proposed M-SPXY process is superior to that of the MDKS data splitting method.https://www.tci-thaijo.org/index.php/easr/article/download/184408/155804/extreme learning machinespxyneural networksubset selectionmahalanobis distanceclassification
collection DOAJ
language English
format Article
sources DOAJ
author Wirote Apinantanakon
Khamron Sunat
Joel Alan Kinmond
spellingShingle Wirote Apinantanakon
Khamron Sunat
Joel Alan Kinmond
Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm
Engineering and Applied Science Research
extreme learning machine
spxy
neural network
subset selection
mahalanobis distance
classification
author_facet Wirote Apinantanakon
Khamron Sunat
Joel Alan Kinmond
author_sort Wirote Apinantanakon
title Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm
title_short Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm
title_full Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm
title_fullStr Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm
title_full_unstemmed Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm
title_sort optimal data division for empowering artificial neural network models employing a modified m-spxy algorithm
publisher Khon Kaen University
series Engineering and Applied Science Research
issn 2539-6161
2539-6218
publishDate 2019-12-01
description Data splitting is an important step in artificial neural network (ANN) models, which is found in the form of training and testing subsets. In general, a random data splitting method is favored to divide a pool of samples into subsets, without considering the quality of data for the training step of a neural network. The drawback of poor data splitting methods is that they poses ill effects to the performance of the neural network when the data involves complex matrices or multivariate modeling. In order to overcome this drawback, the current paper presents our proposed M-SPXY method. It is based on a modified version of Sample Set Partitioning, which relies on a joint X-y distances (SPXY) method. The proposed method has resulted in better performance, compared to the modified Kennard-Stone (KS) method, using Mahalanobis distances (MDKS). In our experiments, the proposed approach was employed to compare various data splitting methods using data sets from the repository of the University of California in Irvine (UCI), processed through an Extreme Learning Machine (ELM) neural network. Performance was measured in terms of classification accuracy. The results indicate that the classification accuracy of the proposed M-SPXY process is superior to that of the MDKS data splitting method.
topic extreme learning machine
spxy
neural network
subset selection
mahalanobis distance
classification
url https://www.tci-thaijo.org/index.php/easr/article/download/184408/155804/
work_keys_str_mv AT wiroteapinantanakon optimaldatadivisionforempoweringartificialneuralnetworkmodelsemployingamodifiedmspxyalgorithm
AT khamronsunat optimaldatadivisionforempoweringartificialneuralnetworkmodelsemployingamodifiedmspxyalgorithm
AT joelalankinmond optimaldatadivisionforempoweringartificialneuralnetworkmodelsemployingamodifiedmspxyalgorithm
_version_ 1724988206859943936