The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
Training and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally t...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2020-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2020/2836236 |
id |
doaj-7c6ca7a3141343baaf659688564d8d55 |
---|---|
record_format |
Article |
spelling |
doaj-7c6ca7a3141343baaf659688564d8d552020-11-25T03:09:20ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472020-01-01202010.1155/2020/28362362836236The Effect of Training and Testing Process on Machine Learning in Biomedical DatasetsMuhammed Kürşad Uçar0Majid Nour1Hatem Sindi2Kemal Polat3Electrical and Electronics Engineering, Faculty of Engineering, Sakarya University, Sakarya 54187, TurkeyDepartment of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah 21589, Saudi ArabiaKing Abdulaziz University, Knowledge-Economy, and Technology Transfer Center, Jeddah 21589, Saudi ArabiaDepartment of Electrical and Electronics Engineering, Faculty of Engineering, Bolu Abant Izzet Baysal University, Bolu 14280, TurkeyTraining and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally theoretical. Besides, there is no useful model for how to select samples in the training and testing process. Therefore, there is a need for resources in machine learning that discuss the training and testing process in detail and offer new recommendations. This article provides a detailed analysis of the training and testing process in machine learning. The article has the following sections. The third section describes how to prepare the datasets. Four balanced datasets were used for the application. The fourth section describes the rate and how to select samples at the training and testing stage. The fundamental sampling theorem is the subject of statistics. It shows how to select samples. In this article, it has been proposed to use sampling methods in machine learning training and testing process. The fourth section covers the theoretic expression of four different sampling theorems. Besides, the results section has the results of the performance of sampling theorems. The fifth section describes the methods by which training and pretest features can be selected. In the study, three different classifiers control the performance. The results section describes how the results should be analyzed. Additionally, this article proposes performance evaluation methods to evaluate its results. This article examines the effect of the training and testing process on performance in machine learning in detail and proposes the use of sampling theorems for the training and testing process. According to the results, datasets, feature selection algorithms, classifiers, training, and test ratio are the criteria that directly affect performance. However, the methods of selecting samples at the training and testing stages are vital for the system to work correctly. In order to design a stable system, it is recommended that samples should be selected with a stratified systematic sampling theorem.http://dx.doi.org/10.1155/2020/2836236 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Muhammed Kürşad Uçar Majid Nour Hatem Sindi Kemal Polat |
spellingShingle |
Muhammed Kürşad Uçar Majid Nour Hatem Sindi Kemal Polat The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets Mathematical Problems in Engineering |
author_facet |
Muhammed Kürşad Uçar Majid Nour Hatem Sindi Kemal Polat |
author_sort |
Muhammed Kürşad Uçar |
title |
The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets |
title_short |
The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets |
title_full |
The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets |
title_fullStr |
The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets |
title_full_unstemmed |
The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets |
title_sort |
effect of training and testing process on machine learning in biomedical datasets |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1024-123X 1563-5147 |
publishDate |
2020-01-01 |
description |
Training and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally theoretical. Besides, there is no useful model for how to select samples in the training and testing process. Therefore, there is a need for resources in machine learning that discuss the training and testing process in detail and offer new recommendations. This article provides a detailed analysis of the training and testing process in machine learning. The article has the following sections. The third section describes how to prepare the datasets. Four balanced datasets were used for the application. The fourth section describes the rate and how to select samples at the training and testing stage. The fundamental sampling theorem is the subject of statistics. It shows how to select samples. In this article, it has been proposed to use sampling methods in machine learning training and testing process. The fourth section covers the theoretic expression of four different sampling theorems. Besides, the results section has the results of the performance of sampling theorems. The fifth section describes the methods by which training and pretest features can be selected. In the study, three different classifiers control the performance. The results section describes how the results should be analyzed. Additionally, this article proposes performance evaluation methods to evaluate its results. This article examines the effect of the training and testing process on performance in machine learning in detail and proposes the use of sampling theorems for the training and testing process. According to the results, datasets, feature selection algorithms, classifiers, training, and test ratio are the criteria that directly affect performance. However, the methods of selecting samples at the training and testing stages are vital for the system to work correctly. In order to design a stable system, it is recommended that samples should be selected with a stratified systematic sampling theorem. |
url |
http://dx.doi.org/10.1155/2020/2836236 |
work_keys_str_mv |
AT muhammedkursaducar theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT majidnour theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT hatemsindi theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT kemalpolat theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT muhammedkursaducar effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT majidnour effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT hatemsindi effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT kemalpolat effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets |
_version_ |
1715292128569982976 |