The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation

Regression analysis makes up a large part of supervised machine learning, and consists of the prediction of a continuous independent target from a set of other predictor variables. The difference between binary classification and regression is in the target range: in binary classification, the targe...

Full description

Bibliographic Details
Main Authors: Davide Chicco, Matthijs J. Warrens, Giuseppe Jurman
Format: Article
Language:English
Published: PeerJ Inc. 2021-07-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-623.pdf
id doaj-1ba8d51ba4ae44c89e5b4495000d4eec
record_format Article
spelling doaj-1ba8d51ba4ae44c89e5b4495000d4eec2021-07-07T15:05:13ZengPeerJ Inc.PeerJ Computer Science2376-59922021-07-017e62310.7717/peerj-cs.623The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluationDavide Chicco0Matthijs J. Warrens1Giuseppe Jurman2Institute of Health Policy Management and Evaluation, University of Toronto, Toronto, CanadaGroningen Institute for Educational Research, University of Groningen, Groningen, NetherlandsData Science for Health Unit, Fondazione Bruno Kessler, Trento, ItalyRegression analysis makes up a large part of supervised machine learning, and consists of the prediction of a continuous independent target from a set of other predictor variables. The difference between binary classification and regression is in the target range: in binary classification, the target can have only two values (usually encoded as 0 and 1), while in regression the target can have multiple values. Even if regression analysis has been employed in a huge number of machine learning studies, no consensus has been reached on a single, unified, standard metric to assess the results of the regression itself. Many studies employ the mean square error (MSE) and its rooted variant (RMSE), or the mean absolute error (MAE) and its percentage variant (MAPE). Although useful, these rates share a common drawback: since their values can range between zero and +infinity, a single value of them does not say much about the performance of the regression with respect to the distribution of the ground truth elements. In this study, we focus on two rates that actually generate a high score only if the majority of the elements of a ground truth group has been correctly predicted: the coefficient of determination (also known as R-squared or R2) and the symmetric mean absolute percentage error (SMAPE). After showing their mathematical properties, we report a comparison between R2 and SMAPE in several use cases and in two real medical scenarios. Our results demonstrate that the coefficient of determination (R-squared) is more informative and truthful than SMAPE, and does not have the interpretability limitations of MSE, RMSE, MAE and MAPE. We therefore suggest the usage of R-squared as standard metric to evaluate regression analyses in any scientific domain.https://peerj.com/articles/cs-623.pdfRegressionRegression evaluationRegression evaluation ratesCoefficient of determinationMean square errorMean absolute error
collection DOAJ
language English
format Article
sources DOAJ
author Davide Chicco
Matthijs J. Warrens
Giuseppe Jurman
spellingShingle Davide Chicco
Matthijs J. Warrens
Giuseppe Jurman
The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation
PeerJ Computer Science
Regression
Regression evaluation
Regression evaluation rates
Coefficient of determination
Mean square error
Mean absolute error
author_facet Davide Chicco
Matthijs J. Warrens
Giuseppe Jurman
author_sort Davide Chicco
title The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation
title_short The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation
title_full The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation
title_fullStr The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation
title_full_unstemmed The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation
title_sort coefficient of determination r-squared is more informative than smape, mae, mape, mse and rmse in regression analysis evaluation
publisher PeerJ Inc.
series PeerJ Computer Science
issn 2376-5992
publishDate 2021-07-01
description Regression analysis makes up a large part of supervised machine learning, and consists of the prediction of a continuous independent target from a set of other predictor variables. The difference between binary classification and regression is in the target range: in binary classification, the target can have only two values (usually encoded as 0 and 1), while in regression the target can have multiple values. Even if regression analysis has been employed in a huge number of machine learning studies, no consensus has been reached on a single, unified, standard metric to assess the results of the regression itself. Many studies employ the mean square error (MSE) and its rooted variant (RMSE), or the mean absolute error (MAE) and its percentage variant (MAPE). Although useful, these rates share a common drawback: since their values can range between zero and +infinity, a single value of them does not say much about the performance of the regression with respect to the distribution of the ground truth elements. In this study, we focus on two rates that actually generate a high score only if the majority of the elements of a ground truth group has been correctly predicted: the coefficient of determination (also known as R-squared or R2) and the symmetric mean absolute percentage error (SMAPE). After showing their mathematical properties, we report a comparison between R2 and SMAPE in several use cases and in two real medical scenarios. Our results demonstrate that the coefficient of determination (R-squared) is more informative and truthful than SMAPE, and does not have the interpretability limitations of MSE, RMSE, MAE and MAPE. We therefore suggest the usage of R-squared as standard metric to evaluate regression analyses in any scientific domain.
topic Regression
Regression evaluation
Regression evaluation rates
Coefficient of determination
Mean square error
Mean absolute error
url https://peerj.com/articles/cs-623.pdf
work_keys_str_mv AT davidechicco thecoefficientofdeterminationrsquaredismoreinformativethansmapemaemapemseandrmseinregressionanalysisevaluation
AT matthijsjwarrens thecoefficientofdeterminationrsquaredismoreinformativethansmapemaemapemseandrmseinregressionanalysisevaluation
AT giuseppejurman thecoefficientofdeterminationrsquaredismoreinformativethansmapemaemapemseandrmseinregressionanalysisevaluation
AT davidechicco coefficientofdeterminationrsquaredismoreinformativethansmapemaemapemseandrmseinregressionanalysisevaluation
AT matthijsjwarrens coefficientofdeterminationrsquaredismoreinformativethansmapemaemapemseandrmseinregressionanalysisevaluation
AT giuseppejurman coefficientofdeterminationrsquaredismoreinformativethansmapemaemapemseandrmseinregressionanalysisevaluation
_version_ 1721315106997927936