The data representativeness criterion: Predicting the performance of supervised classification based on data set similarity.

In a broad range of fields it may be desirable to reuse a supervised classification algorithm and apply it to a new data set. However, generalization of such an algorithm and thus achieving a similar classification performance is only possible when the training data used to build the algorithm is si...

Full description

Bibliographic Details
Main Authors: Evelien Schat, Rens van de Schoot, Wouter M Kouw, Duco Veen, Adriënne M Mendrik
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0237009