Predicting the Unobserved : A statistical analysis of missing data techniques for binary classification

The aim of the thesis is to investigate how the classification performance of random forest and logistic regression differ, given an imbalanced data set with MCAR missing data. The performance is measured in terms of accuracy and sensitivity. Two analyses are performed: one with a simulated data set...

Full description

Bibliographic Details
Main Author: Säfström, Stella
Format: Others
Language:English
Published: Uppsala universitet, Statistiska institutionen 2019
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-388581