Predicting the Unobserved : A statistical analysis of missing data techniques for binary classification
The aim of the thesis is to investigate how the classification performance of random forest and logistic regression differ, given an imbalanced data set with MCAR missing data. The performance is measured in terms of accuracy and sensitivity. Two analyses are performed: one with a simulated data set...
Main Author: | Säfström, Stella |
---|---|
Format: | Others |
Language: | English |
Published: |
Uppsala universitet, Statistiska institutionen
2019
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-388581 |
Similar Items
-
Missing Data - A Gentle Introduction
by: Österlund, Vilgot
Published: (2020) -
Missing data and multiple imputation in clinical epidemiological research
by: Pedersen AB, et al.
Published: (2017-03-01) -
A Benchmark for Data Imputation Methods
by: Sebastian Jäger, et al.
Published: (2021-07-01) -
Planned Missing Data in Mediation Analysis
Published: (2015) -
CAN STATISTICAL MODELS BEAT BENCHMARK PREDICTIONS BASED ON RANKINGS IN TENNIS?
by: Svensson, William
Published: (2021)