A novel model for malaria prediction based on ensemble algorithms.

BACKGROUND AND OBJECTIVE:Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms...

Full description

Bibliographic Details
Main Authors: Mengyang Wang, Hui Wang, Jiao Wang, Hongwei Liu, Rui Lu, Tongqing Duan, Xiaowen Gong, Siyuan Feng, Yuanyuan Liu, Zhuang Cui, Changping Li, Jun Ma
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0226910
id doaj-4768565100c44f23aedfb15fdd993df4
record_format Article
spelling doaj-4768565100c44f23aedfb15fdd993df42021-03-03T21:20:56ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-011412e022691010.1371/journal.pone.0226910A novel model for malaria prediction based on ensemble algorithms.Mengyang WangHui WangJiao WangHongwei LiuRui LuTongqing DuanXiaowen GongSiyuan FengYuanyuan LiuZhuang CuiChangping LiJun MaBACKGROUND AND OBJECTIVE:Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms and models. This study compares the performance of traditional time series models and deep learning algorithms in malaria case prediction and explores the application value of stacking methods in the field of infectious disease prediction. METHODS:The ARIMA, STL+ARIMA, BP-ANN and LSTM network models were separately applied in simulations using malaria data and meteorological data in Yunnan Province from 2011 to 2017. We compared the predictive performance of each model through evaluation measures: RMSE, MASE, MAD. In addition, gradient-boosting regression trees (GBRTs) were used to combine the above four models. We also determined whether stacking structure improved the model prediction performance. RESULTS:The root mean square errors (RMSEs) of the four sub-models were 13.176, 14.543, 9.571 and 7.208; the mean absolute scaled errors (MASEs) were 0.469, 0.472, 0.296 and 0.266 and the mean absolute deviation (MAD) were 6.403, 7.658, 5.871 and 5.691. After using the stacking architecture combined with the above four models, the RMSE, MASE and MAD values of the ensemble model decreased to 6.810, 0.224 and 4.625, respectively. CONCLUSIONS:A novel ensemble model based on the robustness of structured prediction and model combination through stacking was developed. The findings suggest that the predictive performance of the final model is superior to that of the other four sub-models, indicating that stacking architecture may have significant implications in infectious disease prediction.https://doi.org/10.1371/journal.pone.0226910
collection DOAJ
language English
format Article
sources DOAJ
author Mengyang Wang
Hui Wang
Jiao Wang
Hongwei Liu
Rui Lu
Tongqing Duan
Xiaowen Gong
Siyuan Feng
Yuanyuan Liu
Zhuang Cui
Changping Li
Jun Ma
spellingShingle Mengyang Wang
Hui Wang
Jiao Wang
Hongwei Liu
Rui Lu
Tongqing Duan
Xiaowen Gong
Siyuan Feng
Yuanyuan Liu
Zhuang Cui
Changping Li
Jun Ma
A novel model for malaria prediction based on ensemble algorithms.
PLoS ONE
author_facet Mengyang Wang
Hui Wang
Jiao Wang
Hongwei Liu
Rui Lu
Tongqing Duan
Xiaowen Gong
Siyuan Feng
Yuanyuan Liu
Zhuang Cui
Changping Li
Jun Ma
author_sort Mengyang Wang
title A novel model for malaria prediction based on ensemble algorithms.
title_short A novel model for malaria prediction based on ensemble algorithms.
title_full A novel model for malaria prediction based on ensemble algorithms.
title_fullStr A novel model for malaria prediction based on ensemble algorithms.
title_full_unstemmed A novel model for malaria prediction based on ensemble algorithms.
title_sort novel model for malaria prediction based on ensemble algorithms.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2019-01-01
description BACKGROUND AND OBJECTIVE:Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms and models. This study compares the performance of traditional time series models and deep learning algorithms in malaria case prediction and explores the application value of stacking methods in the field of infectious disease prediction. METHODS:The ARIMA, STL+ARIMA, BP-ANN and LSTM network models were separately applied in simulations using malaria data and meteorological data in Yunnan Province from 2011 to 2017. We compared the predictive performance of each model through evaluation measures: RMSE, MASE, MAD. In addition, gradient-boosting regression trees (GBRTs) were used to combine the above four models. We also determined whether stacking structure improved the model prediction performance. RESULTS:The root mean square errors (RMSEs) of the four sub-models were 13.176, 14.543, 9.571 and 7.208; the mean absolute scaled errors (MASEs) were 0.469, 0.472, 0.296 and 0.266 and the mean absolute deviation (MAD) were 6.403, 7.658, 5.871 and 5.691. After using the stacking architecture combined with the above four models, the RMSE, MASE and MAD values of the ensemble model decreased to 6.810, 0.224 and 4.625, respectively. CONCLUSIONS:A novel ensemble model based on the robustness of structured prediction and model combination through stacking was developed. The findings suggest that the predictive performance of the final model is superior to that of the other four sub-models, indicating that stacking architecture may have significant implications in infectious disease prediction.
url https://doi.org/10.1371/journal.pone.0226910
work_keys_str_mv AT mengyangwang anovelmodelformalariapredictionbasedonensemblealgorithms
AT huiwang anovelmodelformalariapredictionbasedonensemblealgorithms
AT jiaowang anovelmodelformalariapredictionbasedonensemblealgorithms
AT hongweiliu anovelmodelformalariapredictionbasedonensemblealgorithms
AT ruilu anovelmodelformalariapredictionbasedonensemblealgorithms
AT tongqingduan anovelmodelformalariapredictionbasedonensemblealgorithms
AT xiaowengong anovelmodelformalariapredictionbasedonensemblealgorithms
AT siyuanfeng anovelmodelformalariapredictionbasedonensemblealgorithms
AT yuanyuanliu anovelmodelformalariapredictionbasedonensemblealgorithms
AT zhuangcui anovelmodelformalariapredictionbasedonensemblealgorithms
AT changpingli anovelmodelformalariapredictionbasedonensemblealgorithms
AT junma anovelmodelformalariapredictionbasedonensemblealgorithms
AT mengyangwang novelmodelformalariapredictionbasedonensemblealgorithms
AT huiwang novelmodelformalariapredictionbasedonensemblealgorithms
AT jiaowang novelmodelformalariapredictionbasedonensemblealgorithms
AT hongweiliu novelmodelformalariapredictionbasedonensemblealgorithms
AT ruilu novelmodelformalariapredictionbasedonensemblealgorithms
AT tongqingduan novelmodelformalariapredictionbasedonensemblealgorithms
AT xiaowengong novelmodelformalariapredictionbasedonensemblealgorithms
AT siyuanfeng novelmodelformalariapredictionbasedonensemblealgorithms
AT yuanyuanliu novelmodelformalariapredictionbasedonensemblealgorithms
AT zhuangcui novelmodelformalariapredictionbasedonensemblealgorithms
AT changpingli novelmodelformalariapredictionbasedonensemblealgorithms
AT junma novelmodelformalariapredictionbasedonensemblealgorithms
_version_ 1714817328406855680