The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets

Training and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally t...

Full description

Bibliographic Details
Main Authors:	Muhammed Kürşad Uçar, Majid Nour, Hatem Sindi, Kemal Polat
Format:	Article
Language:	English
Published:	Hindawi Limited 2020-01-01
Series:	Mathematical Problems in Engineering
Online Access:	http://dx.doi.org/10.1155/2020/2836236

id	doaj-7c6ca7a3141343baaf659688564d8d55
record_format	Article
spelling	doaj-7c6ca7a3141343baaf659688564d8d552020-11-25T03:09:20ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472020-01-01202010.1155/2020/28362362836236The Effect of Training and Testing Process on Machine Learning in Biomedical DatasetsMuhammed Kürşad Uçar0Majid Nour1Hatem Sindi2Kemal Polat3Electrical and Electronics Engineering, Faculty of Engineering, Sakarya University, Sakarya 54187, TurkeyDepartment of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah 21589, Saudi ArabiaKing Abdulaziz University, Knowledge-Economy, and Technology Transfer Center, Jeddah 21589, Saudi ArabiaDepartment of Electrical and Electronics Engineering, Faculty of Engineering, Bolu Abant Izzet Baysal University, Bolu 14280, TurkeyTraining and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally theoretical. Besides, there is no useful model for how to select samples in the training and testing process. Therefore, there is a need for resources in machine learning that discuss the training and testing process in detail and offer new recommendations. This article provides a detailed analysis of the training and testing process in machine learning. The article has the following sections. The third section describes how to prepare the datasets. Four balanced datasets were used for the application. The fourth section describes the rate and how to select samples at the training and testing stage. The fundamental sampling theorem is the subject of statistics. It shows how to select samples. In this article, it has been proposed to use sampling methods in machine learning training and testing process. The fourth section covers the theoretic expression of four different sampling theorems. Besides, the results section has the results of the performance of sampling theorems. The fifth section describes the methods by which training and pretest features can be selected. In the study, three different classifiers control the performance. The results section describes how the results should be analyzed. Additionally, this article proposes performance evaluation methods to evaluate its results. This article examines the effect of the training and testing process on performance in machine learning in detail and proposes the use of sampling theorems for the training and testing process. According to the results, datasets, feature selection algorithms, classifiers, training, and test ratio are the criteria that directly affect performance. However, the methods of selecting samples at the training and testing stages are vital for the system to work correctly. In order to design a stable system, it is recommended that samples should be selected with a stratified systematic sampling theorem.http://dx.doi.org/10.1155/2020/2836236
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Muhammed Kürşad Uçar Majid Nour Hatem Sindi Kemal Polat
spellingShingle	Muhammed Kürşad Uçar Majid Nour Hatem Sindi Kemal Polat The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets Mathematical Problems in Engineering
author_facet	Muhammed Kürşad Uçar Majid Nour Hatem Sindi Kemal Polat
author_sort	Muhammed Kürşad Uçar
title	The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_short	The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_full	The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_fullStr	The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_full_unstemmed	The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets
title_sort	effect of training and testing process on machine learning in biomedical datasets
publisher	Hindawi Limited
series	Mathematical Problems in Engineering
issn	1024-123X 1563-5147
publishDate	2020-01-01
description	Training and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally theoretical. Besides, there is no useful model for how to select samples in the training and testing process. Therefore, there is a need for resources in machine learning that discuss the training and testing process in detail and offer new recommendations. This article provides a detailed analysis of the training and testing process in machine learning. The article has the following sections. The third section describes how to prepare the datasets. Four balanced datasets were used for the application. The fourth section describes the rate and how to select samples at the training and testing stage. The fundamental sampling theorem is the subject of statistics. It shows how to select samples. In this article, it has been proposed to use sampling methods in machine learning training and testing process. The fourth section covers the theoretic expression of four different sampling theorems. Besides, the results section has the results of the performance of sampling theorems. The fifth section describes the methods by which training and pretest features can be selected. In the study, three different classifiers control the performance. The results section describes how the results should be analyzed. Additionally, this article proposes performance evaluation methods to evaluate its results. This article examines the effect of the training and testing process on performance in machine learning in detail and proposes the use of sampling theorems for the training and testing process. According to the results, datasets, feature selection algorithms, classifiers, training, and test ratio are the criteria that directly affect performance. However, the methods of selecting samples at the training and testing stages are vital for the system to work correctly. In order to design a stable system, it is recommended that samples should be selected with a stratified systematic sampling theorem.
url	http://dx.doi.org/10.1155/2020/2836236
work_keys_str_mv	AT muhammedkursaducar theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT majidnour theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT hatemsindi theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT kemalpolat theeffectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT muhammedkursaducar effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT majidnour effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT hatemsindi effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets AT kemalpolat effectoftrainingandtestingprocessonmachinelearninginbiomedicaldatasets
_version_	1715292128569982976

The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets

Similar Items