A novel model for malaria prediction based on ensemble algorithms.
BACKGROUND AND OBJECTIVE:Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms...
Main Authors: | , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2019-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0226910 |
id |
doaj-4768565100c44f23aedfb15fdd993df4 |
---|---|
record_format |
Article |
spelling |
doaj-4768565100c44f23aedfb15fdd993df42021-03-03T21:20:56ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-011412e022691010.1371/journal.pone.0226910A novel model for malaria prediction based on ensemble algorithms.Mengyang WangHui WangJiao WangHongwei LiuRui LuTongqing DuanXiaowen GongSiyuan FengYuanyuan LiuZhuang CuiChangping LiJun MaBACKGROUND AND OBJECTIVE:Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms and models. This study compares the performance of traditional time series models and deep learning algorithms in malaria case prediction and explores the application value of stacking methods in the field of infectious disease prediction. METHODS:The ARIMA, STL+ARIMA, BP-ANN and LSTM network models were separately applied in simulations using malaria data and meteorological data in Yunnan Province from 2011 to 2017. We compared the predictive performance of each model through evaluation measures: RMSE, MASE, MAD. In addition, gradient-boosting regression trees (GBRTs) were used to combine the above four models. We also determined whether stacking structure improved the model prediction performance. RESULTS:The root mean square errors (RMSEs) of the four sub-models were 13.176, 14.543, 9.571 and 7.208; the mean absolute scaled errors (MASEs) were 0.469, 0.472, 0.296 and 0.266 and the mean absolute deviation (MAD) were 6.403, 7.658, 5.871 and 5.691. After using the stacking architecture combined with the above four models, the RMSE, MASE and MAD values of the ensemble model decreased to 6.810, 0.224 and 4.625, respectively. CONCLUSIONS:A novel ensemble model based on the robustness of structured prediction and model combination through stacking was developed. The findings suggest that the predictive performance of the final model is superior to that of the other four sub-models, indicating that stacking architecture may have significant implications in infectious disease prediction.https://doi.org/10.1371/journal.pone.0226910 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mengyang Wang Hui Wang Jiao Wang Hongwei Liu Rui Lu Tongqing Duan Xiaowen Gong Siyuan Feng Yuanyuan Liu Zhuang Cui Changping Li Jun Ma |
spellingShingle |
Mengyang Wang Hui Wang Jiao Wang Hongwei Liu Rui Lu Tongqing Duan Xiaowen Gong Siyuan Feng Yuanyuan Liu Zhuang Cui Changping Li Jun Ma A novel model for malaria prediction based on ensemble algorithms. PLoS ONE |
author_facet |
Mengyang Wang Hui Wang Jiao Wang Hongwei Liu Rui Lu Tongqing Duan Xiaowen Gong Siyuan Feng Yuanyuan Liu Zhuang Cui Changping Li Jun Ma |
author_sort |
Mengyang Wang |
title |
A novel model for malaria prediction based on ensemble algorithms. |
title_short |
A novel model for malaria prediction based on ensemble algorithms. |
title_full |
A novel model for malaria prediction based on ensemble algorithms. |
title_fullStr |
A novel model for malaria prediction based on ensemble algorithms. |
title_full_unstemmed |
A novel model for malaria prediction based on ensemble algorithms. |
title_sort |
novel model for malaria prediction based on ensemble algorithms. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2019-01-01 |
description |
BACKGROUND AND OBJECTIVE:Most previous studies adopted single traditional time series models to predict incidences of malaria. A single model cannot effectively capture all the properties of the data structure. However, a stacking architecture can solve this problem by combining distinct algorithms and models. This study compares the performance of traditional time series models and deep learning algorithms in malaria case prediction and explores the application value of stacking methods in the field of infectious disease prediction. METHODS:The ARIMA, STL+ARIMA, BP-ANN and LSTM network models were separately applied in simulations using malaria data and meteorological data in Yunnan Province from 2011 to 2017. We compared the predictive performance of each model through evaluation measures: RMSE, MASE, MAD. In addition, gradient-boosting regression trees (GBRTs) were used to combine the above four models. We also determined whether stacking structure improved the model prediction performance. RESULTS:The root mean square errors (RMSEs) of the four sub-models were 13.176, 14.543, 9.571 and 7.208; the mean absolute scaled errors (MASEs) were 0.469, 0.472, 0.296 and 0.266 and the mean absolute deviation (MAD) were 6.403, 7.658, 5.871 and 5.691. After using the stacking architecture combined with the above four models, the RMSE, MASE and MAD values of the ensemble model decreased to 6.810, 0.224 and 4.625, respectively. CONCLUSIONS:A novel ensemble model based on the robustness of structured prediction and model combination through stacking was developed. The findings suggest that the predictive performance of the final model is superior to that of the other four sub-models, indicating that stacking architecture may have significant implications in infectious disease prediction. |
url |
https://doi.org/10.1371/journal.pone.0226910 |
work_keys_str_mv |
AT mengyangwang anovelmodelformalariapredictionbasedonensemblealgorithms AT huiwang anovelmodelformalariapredictionbasedonensemblealgorithms AT jiaowang anovelmodelformalariapredictionbasedonensemblealgorithms AT hongweiliu anovelmodelformalariapredictionbasedonensemblealgorithms AT ruilu anovelmodelformalariapredictionbasedonensemblealgorithms AT tongqingduan anovelmodelformalariapredictionbasedonensemblealgorithms AT xiaowengong anovelmodelformalariapredictionbasedonensemblealgorithms AT siyuanfeng anovelmodelformalariapredictionbasedonensemblealgorithms AT yuanyuanliu anovelmodelformalariapredictionbasedonensemblealgorithms AT zhuangcui anovelmodelformalariapredictionbasedonensemblealgorithms AT changpingli anovelmodelformalariapredictionbasedonensemblealgorithms AT junma anovelmodelformalariapredictionbasedonensemblealgorithms AT mengyangwang novelmodelformalariapredictionbasedonensemblealgorithms AT huiwang novelmodelformalariapredictionbasedonensemblealgorithms AT jiaowang novelmodelformalariapredictionbasedonensemblealgorithms AT hongweiliu novelmodelformalariapredictionbasedonensemblealgorithms AT ruilu novelmodelformalariapredictionbasedonensemblealgorithms AT tongqingduan novelmodelformalariapredictionbasedonensemblealgorithms AT xiaowengong novelmodelformalariapredictionbasedonensemblealgorithms AT siyuanfeng novelmodelformalariapredictionbasedonensemblealgorithms AT yuanyuanliu novelmodelformalariapredictionbasedonensemblealgorithms AT zhuangcui novelmodelformalariapredictionbasedonensemblealgorithms AT changpingli novelmodelformalariapredictionbasedonensemblealgorithms AT junma novelmodelformalariapredictionbasedonensemblealgorithms |
_version_ |
1714817328406855680 |