The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets

Training and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally t...

Full description

Bibliographic Details
Main Authors: Muhammed Kürşad Uçar, Majid Nour, Hatem Sindi, Kemal Polat
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2020/2836236
id doaj-7c6ca7a3141343baaf659688564d8d55
record_format Article
spelling doaj-7c6ca7a3141343baaf659688564d8d552020-11-25T03:09:20ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472020-01-01202010.1155/2020/28362362836236The Effect of Training and Testing Process on Machine Learning in Biomedical DatasetsMuhammed Kürşad Uçar0Majid Nour1Hatem Sindi2Kemal Polat3Electrical and Electronics Engineering, Faculty of Engineering, Sakarya University, Sakarya 54187, TurkeyDepartment of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah 21589, Saudi ArabiaKing Abdulaziz University, Knowledge-Economy, and Technology Transfer Center, Jeddah 21589, Saudi ArabiaDepartment of Electrical and Electronics Engineering, Faculty of Engineering, Bolu Abant Izzet Baysal University, Bolu 14280, TurkeyTraining and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally theoretical. Besides, there is no useful model for how to select samples in the training and testing process. Therefore, there is a need for resources in machine learning that discuss the training and testing process in detail and offer new recommendations. This article provides a detailed analysis of the training and testing process in machine learning. The article has the following sections. The third section describes how to prepare the datasets. Four balanced datasets were used for the application. The fourth section describes the rate and how to select samples at the training and testing stage. The fundamental sampling theorem is the subject of statistics. It shows how to select samples. In this article, it has been proposed to use sampling methods in machine learning training and testing process. The fourth section covers the theoretic expression of four different sampling theorems. Besides, the results section has the results of the performance of sampling theorems. The fifth section describes the methods by which training and pretest features can be selected. In the study, three different classifiers control the performance. The results section describes how the results should be analyzed. Additionally, this article proposes performance evaluation methods to evaluate its results. This article examines the effect of the training and testing process on performance in machine learning in detail and proposes the use of sampling theorems for the training and testing process. According to the results, datasets, feature selection algorithms, classifiers, training, and test ratio are the criteria that directly affect performance. However, the methods of selecting samples at the training and testing stages are vital for the system to work correctly. In order to design a stable system, it is recommended that samples should be selected with a stratified systematic sampling theorem.http://dx.doi.org/10.1155/2020/2836236
collection DOAJ
language English
format Article
sources DOAJ
author Muhammed Kürşad Uçar
Majid Nour
Hatem Sindi
Kemal Polat
spellingShingle Muhammed Kürşad Uçar
Majid Nour
Hatem Sindi
Kemal Polat
The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
Mathematical Problems in Engineering
author_facet Muhammed Kürşad Uçar
Majid Nour
Hatem Sindi
Kemal Polat
author_sort Muhammed Kürşad Uçar
title The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_short The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_full The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_fullStr The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_full_unstemmed The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_sort effect of training and testing process on machine learning in biomedical datasets
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1024-123X
1563-5147
publishDate 2020-01-01
description Training and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally theoretical. Besides, there is no useful model for how to select samples in the training and testing process. Therefore, there is a need for resources in machine learning that discuss the training and testing process in detail and offer new recommendations. This article provides a detailed analysis of the training and testing process in machine learning. The article has the following sections. The third section describes how to prepare the datasets. Four balanced datasets were used for the application. The fourth section describes the rate and how to select samples at the training and testing stage. The fundamental sampling theorem is the subject of statistics. It shows how to select samples. In this article, it has been proposed to use sampling methods in machine learning training and testing process. The fourth section covers the theoretic expression of four different sampling theorems. Besides, the results section has the results of the performance of sampling theorems. The fifth section describes the methods by which training and pretest features can be selected. In the study, three different classifiers control the performance. The results section describes how the results should be analyzed. Additionally, this article proposes performance evaluation methods to evaluate its results. This article examines the effect of the training and testing process on performance in machine learning in detail and proposes the use of sampling theorems for the training and testing process. According to the results, datasets, feature selection algorithms, classifiers, training, and test ratio are the criteria that directly affect performance. However, the methods of selecting samples at the training and testing stages are vital for the system to work correctly. In order to design a stable system, it is recommended that samples should be selected with a stratified systematic sampling theorem.
url http://dx.doi.org/10.1155/2020/2836236
work_keys_str_mv AT muhammedkursaducar theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
AT majidnour theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
AT hatemsindi theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
AT kemalpolat theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
AT muhammedkursaducar effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
AT majidnour effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
AT hatemsindi effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
AT kemalpolat effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
_version_ 1715292128569982976