A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness
Imputation of missing data in datasets with high seasonality plays an important role in data analysis and prediction. Failure to appropriately account for missing data may lead to erroneous findings, false conclusions, and inaccurate predictions. The essence of a good imputation method is its missin...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi-Wiley
2019-01-01
|
Series: | Wireless Communications and Mobile Computing |
Online Access: | http://dx.doi.org/10.1155/2019/4039758 |
id |
doaj-ce4663c3425a4be59c525d8f50bbb696 |
---|---|
record_format |
Article |
spelling |
doaj-ce4663c3425a4be59c525d8f50bbb6962020-11-24T22:21:18ZengHindawi-WileyWireless Communications and Mobile Computing1530-86691530-86772019-01-01201910.1155/2019/40397584039758A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of MissingnessAizaz Chaudhry0Wei Li1Amir Basri2François Patenaude3Communications Research Centre Canada, Ottawa, ON K2H 8S2, CanadaCommunications Research Centre Canada, Ottawa, ON K2H 8S2, CanadaCommunications Research Centre Canada, Ottawa, ON K2H 8S2, CanadaCommunications Research Centre Canada, Ottawa, ON K2H 8S2, CanadaImputation of missing data in datasets with high seasonality plays an important role in data analysis and prediction. Failure to appropriately account for missing data may lead to erroneous findings, false conclusions, and inaccurate predictions. The essence of a good imputation method is its missingness-recovery-ability, i.e., the ability to deal with large periods of missing data in the dataset and the ability to extract the right characteristics (e.g., seasonality pattern) buried under the dataset to be analyzed. Univariate imputation is usually incapable of providing a reasonable imputation for a variable when periods of missing values are large. On the other hand, the default multivariate imputation approach cannot provide an accurate imputation for a variable when missing values of other correlated variables used for imputation occur at exactly the same time intervals. To deal with these drawbacks and to provide feasible imputations in such scenarios, we propose a novel method that converts a single variable into a multivariate form by exploiting the high seasonality and random missingness of this variable. After this conversion, multivariate imputation can then be applied. We then test the proposed method on an LTE spectrum dataset for imputing a single variable, such as the average cell throughput. We compare the performance of our proposed method with Kalman filtering and default method for multivariate imputation. The performance evaluation results clearly show that the proposed method significantly outperforms Kalman filtering and default method in terms of imputation and prediction accuracy.http://dx.doi.org/10.1155/2019/4039758 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Aizaz Chaudhry Wei Li Amir Basri François Patenaude |
spellingShingle |
Aizaz Chaudhry Wei Li Amir Basri François Patenaude A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness Wireless Communications and Mobile Computing |
author_facet |
Aizaz Chaudhry Wei Li Amir Basri François Patenaude |
author_sort |
Aizaz Chaudhry |
title |
A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness |
title_short |
A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness |
title_full |
A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness |
title_fullStr |
A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness |
title_full_unstemmed |
A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness |
title_sort |
method for improving imputation and prediction accuracy of highly seasonal univariate data with large periods of missingness |
publisher |
Hindawi-Wiley |
series |
Wireless Communications and Mobile Computing |
issn |
1530-8669 1530-8677 |
publishDate |
2019-01-01 |
description |
Imputation of missing data in datasets with high seasonality plays an important role in data analysis and prediction. Failure to appropriately account for missing data may lead to erroneous findings, false conclusions, and inaccurate predictions. The essence of a good imputation method is its missingness-recovery-ability, i.e., the ability to deal with large periods of missing data in the dataset and the ability to extract the right characteristics (e.g., seasonality pattern) buried under the dataset to be analyzed. Univariate imputation is usually incapable of providing a reasonable imputation for a variable when periods of missing values are large. On the other hand, the default multivariate imputation approach cannot provide an accurate imputation for a variable when missing values of other correlated variables used for imputation occur at exactly the same time intervals. To deal with these drawbacks and to provide feasible imputations in such scenarios, we propose a novel method that converts a single variable into a multivariate form by exploiting the high seasonality and random missingness of this variable. After this conversion, multivariate imputation can then be applied. We then test the proposed method on an LTE spectrum dataset for imputing a single variable, such as the average cell throughput. We compare the performance of our proposed method with Kalman filtering and default method for multivariate imputation. The performance evaluation results clearly show that the proposed method significantly outperforms Kalman filtering and default method in terms of imputation and prediction accuracy. |
url |
http://dx.doi.org/10.1155/2019/4039758 |
work_keys_str_mv |
AT aizazchaudhry amethodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness AT weili amethodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness AT amirbasri amethodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness AT francoispatenaude amethodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness AT aizazchaudhry methodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness AT weili methodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness AT amirbasri methodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness AT francoispatenaude methodforimprovingimputationandpredictionaccuracyofhighlyseasonalunivariatedatawithlargeperiodsofmissingness |
_version_ |
1725771786899947520 |