Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm
Data splitting is an important step in artificial neural network (ANN) models, which is found in the form of training and testing subsets. In general, a random data splitting method is favored to divide a pool of samples into subsets, without considering the quality of data for the training step of...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Khon Kaen University
2019-12-01
|
Series: | Engineering and Applied Science Research |
Subjects: | |
Online Access: | https://www.tci-thaijo.org/index.php/easr/article/download/184408/155804/ |
id |
doaj-4119f0c8a3ba4d34bfbfb032057a2148 |
---|---|
record_format |
Article |
spelling |
doaj-4119f0c8a3ba4d34bfbfb032057a21482020-11-25T01:54:16ZengKhon Kaen UniversityEngineering and Applied Science Research2539-61612539-62182019-12-0146427628410.14456/easr.2019.31Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithmWirote ApinantanakonKhamron SunatJoel Alan KinmondData splitting is an important step in artificial neural network (ANN) models, which is found in the form of training and testing subsets. In general, a random data splitting method is favored to divide a pool of samples into subsets, without considering the quality of data for the training step of a neural network. The drawback of poor data splitting methods is that they poses ill effects to the performance of the neural network when the data involves complex matrices or multivariate modeling. In order to overcome this drawback, the current paper presents our proposed M-SPXY method. It is based on a modified version of Sample Set Partitioning, which relies on a joint X-y distances (SPXY) method. The proposed method has resulted in better performance, compared to the modified Kennard-Stone (KS) method, using Mahalanobis distances (MDKS). In our experiments, the proposed approach was employed to compare various data splitting methods using data sets from the repository of the University of California in Irvine (UCI), processed through an Extreme Learning Machine (ELM) neural network. Performance was measured in terms of classification accuracy. The results indicate that the classification accuracy of the proposed M-SPXY process is superior to that of the MDKS data splitting method.https://www.tci-thaijo.org/index.php/easr/article/download/184408/155804/extreme learning machinespxyneural networksubset selectionmahalanobis distanceclassification |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wirote Apinantanakon Khamron Sunat Joel Alan Kinmond |
spellingShingle |
Wirote Apinantanakon Khamron Sunat Joel Alan Kinmond Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm Engineering and Applied Science Research extreme learning machine spxy neural network subset selection mahalanobis distance classification |
author_facet |
Wirote Apinantanakon Khamron Sunat Joel Alan Kinmond |
author_sort |
Wirote Apinantanakon |
title |
Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm |
title_short |
Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm |
title_full |
Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm |
title_fullStr |
Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm |
title_full_unstemmed |
Optimal data division for empowering artificial neural network models employing a modified M-SPXY algorithm |
title_sort |
optimal data division for empowering artificial neural network models employing a modified m-spxy algorithm |
publisher |
Khon Kaen University |
series |
Engineering and Applied Science Research |
issn |
2539-6161 2539-6218 |
publishDate |
2019-12-01 |
description |
Data splitting is an important step in artificial neural network (ANN) models, which is found in the form of training and testing subsets. In general, a random data splitting method is favored to divide a pool of samples into subsets, without considering the quality of data for the training step of a neural network. The drawback of poor data splitting methods is that they poses ill effects to the performance of the neural network when the data involves complex matrices or multivariate modeling. In order to overcome this drawback, the current paper presents our proposed M-SPXY method. It is based on a modified version of Sample Set Partitioning, which relies on a joint X-y distances (SPXY) method. The proposed method has resulted in better performance, compared to the modified Kennard-Stone (KS) method, using Mahalanobis distances (MDKS). In our experiments, the proposed approach was employed to compare various data splitting methods using data sets from the repository of the University of California in Irvine (UCI), processed through an Extreme Learning Machine (ELM) neural network. Performance was measured in terms of classification accuracy. The results indicate that the classification accuracy of the proposed M-SPXY process is superior to that of the MDKS data splitting method. |
topic |
extreme learning machine spxy neural network subset selection mahalanobis distance classification |
url |
https://www.tci-thaijo.org/index.php/easr/article/download/184408/155804/ |
work_keys_str_mv |
AT wiroteapinantanakon optimaldatadivisionforempoweringartificialneuralnetworkmodelsemployingamodifiedmspxyalgorithm AT khamronsunat optimaldatadivisionforempoweringartificialneuralnetworkmodelsemployingamodifiedmspxyalgorithm AT joelalankinmond optimaldatadivisionforempoweringartificialneuralnetworkmodelsemployingamodifiedmspxyalgorithm |
_version_ |
1724988206859943936 |