A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness

Imputation of missing data in datasets with high seasonality plays an important role in data analysis and prediction. Failure to appropriately account for missing data may lead to erroneous findings, false conclusions, and inaccurate predictions. The essence of a good imputation method is its missin...

Full description

Bibliographic Details
Main Authors: Aizaz Chaudhry, Wei Li, Amir Basri, François Patenaude
Format: Article
Language:English
Published: Hindawi-Wiley 2019-01-01
Series:Wireless Communications and Mobile Computing
Online Access:http://dx.doi.org/10.1155/2019/4039758
id doaj-ce4663c3425a4be59c525d8f50bbb696
record_format Article
spelling doaj-ce4663c3425a4be59c525d8f50bbb6962020-11-24T22:21:18ZengHindawi-WileyWireless Communications and Mobile Computing1530-86691530-86772019-01-01201910.1155/2019/40397584039758A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of MissingnessAizaz Chaudhry0Wei Li1Amir Basri2François Patenaude3Communications Research Centre Canada, Ottawa, ON K2H 8S2, CanadaCommunications Research Centre Canada, Ottawa, ON K2H 8S2, CanadaCommunications Research Centre Canada, Ottawa, ON K2H 8S2, CanadaCommunications Research Centre Canada, Ottawa, ON K2H 8S2, CanadaImputation of missing data in datasets with high seasonality plays an important role in data analysis and prediction. Failure to appropriately account for missing data may lead to erroneous findings, false conclusions, and inaccurate predictions. The essence of a good imputation method is its missingness-recovery-ability, i.e., the ability to deal with large periods of missing data in the dataset and the ability to extract the right characteristics (e.g., seasonality pattern) buried under the dataset to be analyzed. Univariate imputation is usually incapable of providing a reasonable imputation for a variable when periods of missing values are large. On the other hand, the default multivariate imputation approach cannot provide an accurate imputation for a variable when missing values of other correlated variables used for imputation occur at exactly the same time intervals. To deal with these drawbacks and to provide feasible imputations in such scenarios, we propose a novel method that converts a single variable into a multivariate form by exploiting the high seasonality and random missingness of this variable. After this conversion, multivariate imputation can then be applied. We then test the proposed method on an LTE spectrum dataset for imputing a single variable, such as the average cell throughput. We compare the performance of our proposed method with Kalman filtering and default method for multivariate imputation. The performance evaluation results clearly show that the proposed method significantly outperforms Kalman filtering and default method in terms of imputation and prediction accuracy.http://dx.doi.org/10.1155/2019/4039758
collection DOAJ
language English
format Article
sources DOAJ
author Aizaz Chaudhry
Wei Li
Amir Basri
François Patenaude
spellingShingle Aizaz Chaudhry
Wei Li
Amir Basri
François Patenaude
A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness
Wireless Communications and Mobile Computing
author_facet Aizaz Chaudhry
Wei Li
Amir Basri
François Patenaude
author_sort Aizaz Chaudhry
title A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness
title_short A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness
title_full A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness
title_fullStr A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness
title_full_unstemmed A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness
title_sort method for improving imputation and prediction accuracy of highly seasonal univariate data with large periods of missingness
publisher Hindawi-Wiley
series Wireless Communications and Mobile Computing
issn 1530-8669
1530-8677
publishDate 2019-01-01
description Imputation of missing data in datasets with high seasonality plays an important role in data analysis and prediction. Failure to appropriately account for missing data may lead to erroneous findings, false conclusions, and inaccurate predictions. The essence of a good imputation method is its missingness-recovery-ability, i.e., the ability to deal with large periods of missing data in the dataset and the ability to extract the right characteristics (e.g., seasonality pattern) buried under the dataset to be analyzed. Univariate imputation is usually incapable of providing a reasonable imputation for a variable when periods of missing values are large. On the other hand, the default multivariate imputation approach cannot provide an accurate imputation for a variable when missing values of other correlated variables used for imputation occur at exactly the same time intervals. To deal with these drawbacks and to provide feasible imputations in such scenarios, we propose a novel method that converts a single variable into a multivariate form by exploiting the high seasonality and random missingness of this variable. After this conversion, multivariate imputation can then be applied. We then test the proposed method on an LTE spectrum dataset for imputing a single variable, such as the average cell throughput. We compare the performance of our proposed method with Kalman filtering and default method for multivariate imputation. The performance evaluation results clearly show that the proposed method significantly outperforms Kalman filtering and default method in terms of imputation and prediction accuracy.
url http://dx.doi.org/10.1155/2019/4039758
work_keys_str_mv AT aizazchaudhry amethodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness
AT weili amethodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness
AT amirbasri amethodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness
AT francoispatenaude amethodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness
AT aizazchaudhry methodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness
AT weili methodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness
AT amirbasri methodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness
AT francoispatenaude methodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness
_version_ 1725771786899947520