The evaluation of binary classification tasks in economical prediction

In the area of economical classification tasks, the accuracy maximization is often used to evaluate classifier performance. Accuracy maximization (or error rate minimization) suffers from the assumption of equal false positive and false negative error costs. Furthermore, accuracy is not able to expr...

Full description

Bibliographic Details
Main Author:	Martin Pokorný
Format:	Article
Language:	English
Published:	Mendel University Press 2010-01-01
Series:	Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis
Subjects:	binary classification bankruptcy prediction classifier performance evaluation accuracy maximization receiver operating characteristic (ROC)
Online Access:	https://acta.mendelu.cz/58/6/0369/

id	doaj-94d1a3f5db1b4a67a32850f088ec126b
record_format	Article
spelling	doaj-94d1a3f5db1b4a67a32850f088ec126b2020-11-24T22:50:47ZengMendel University PressActa Universitatis Agriculturae et Silviculturae Mendelianae Brunensis1211-85162464-83102010-01-0158636937810.11118/actaun201058060369The evaluation of binary classification tasks in economical predictionMartin Pokorný0Ústav informatiky, Mendelova univerzita v Brně, Zemědělská 1, 613 00, Brno, Česká republikaIn the area of economical classification tasks, the accuracy maximization is often used to evaluate classifier performance. Accuracy maximization (or error rate minimization) suffers from the assumption of equal false positive and false negative error costs. Furthermore, accuracy is not able to express true classifier performance under skewed class distribution. Due to these limitations, the use of accuracy on real tasks is questionable. In a real binary classification task, the difference between the costs of false positive and false negative error is usually critical. To overcome this issue, the Receiver Operating Characteristic (ROC) method in relation to decision-analytic principles can be used. One essential advantage of this method is the possibility of classifier performance visualization by means of a ROC graph. This paper presents concrete examples of binary classification, where the inadequacy of accuracy as the evaluation metric is shown, and on the same examples the ROC method is applied. From the set of possible classification models, the probabilistic classifier with continuous output is under consideration. Mainly two questions are solved. Firstly, the selection of the best classifier from a set of possible classifiers. For example, accuracy metric rates two classifiers almost equivalently (87.7 % and 89.3 %), whereas decision analysis (via costs minimization) or ROC analysis reveal different performance according to target conditions of unequal error costs of false positives and false negatives. Secondly, the setting of an optimal decision threshold at classifier’s output. For example, accuracy maximization finds the optimal threshold at classifier’s output in value of 0.597, but the optimal threshold respecting higher costs of false negatives is discovered by costs minimization or ROC analysis in a value substantially lower (0.477).https://acta.mendelu.cz/58/6/0369/binary classificationbankruptcy predictionclassifier performance evaluationaccuracy maximizationreceiver operating characteristic (ROC)
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Martin Pokorný
spellingShingle	Martin Pokorný The evaluation of binary classification tasks in economical prediction Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis binary classification bankruptcy prediction classifier performance evaluation accuracy maximization receiver operating characteristic (ROC)
author_facet	Martin Pokorný
author_sort	Martin Pokorný
title	The evaluation of binary classification tasks in economical prediction
title_short	The evaluation of binary classification tasks in economical prediction
title_full	The evaluation of binary classification tasks in economical prediction
title_fullStr	The evaluation of binary classification tasks in economical prediction
title_full_unstemmed	The evaluation of binary classification tasks in economical prediction
title_sort	evaluation of binary classification tasks in economical prediction
publisher	Mendel University Press
series	Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis
issn	1211-8516 2464-8310
publishDate	2010-01-01
description	In the area of economical classification tasks, the accuracy maximization is often used to evaluate classifier performance. Accuracy maximization (or error rate minimization) suffers from the assumption of equal false positive and false negative error costs. Furthermore, accuracy is not able to express true classifier performance under skewed class distribution. Due to these limitations, the use of accuracy on real tasks is questionable. In a real binary classification task, the difference between the costs of false positive and false negative error is usually critical. To overcome this issue, the Receiver Operating Characteristic (ROC) method in relation to decision-analytic principles can be used. One essential advantage of this method is the possibility of classifier performance visualization by means of a ROC graph. This paper presents concrete examples of binary classification, where the inadequacy of accuracy as the evaluation metric is shown, and on the same examples the ROC method is applied. From the set of possible classification models, the probabilistic classifier with continuous output is under consideration. Mainly two questions are solved. Firstly, the selection of the best classifier from a set of possible classifiers. For example, accuracy metric rates two classifiers almost equivalently (87.7 % and 89.3 %), whereas decision analysis (via costs minimization) or ROC analysis reveal different performance according to target conditions of unequal error costs of false positives and false negatives. Secondly, the setting of an optimal decision threshold at classifier’s output. For example, accuracy maximization finds the optimal threshold at classifier’s output in value of 0.597, but the optimal threshold respecting higher costs of false negatives is discovered by costs minimization or ROC analysis in a value substantially lower (0.477).
topic	binary classification bankruptcy prediction classifier performance evaluation accuracy maximization receiver operating characteristic (ROC)
url	https://acta.mendelu.cz/58/6/0369/
work_keys_str_mv	AT martinpokorny theevaluationofbinaryclassificationtasksineconomicalprediction AT martinpokorny evaluationofbinaryclassificationtasksineconomicalprediction
_version_	1725671432792309760

The evaluation of binary classification tasks in economical prediction

Similar Items