Predicting diarrhoea outbreak with climate change

Climate change is expected to exacerbate diarrhoea outbreak in South Africa, a leading cause of morbidity and mortality in the region. In this study, we modelled the impacts of climate change on diarrhoea with machine learning methods. We applied two deep learning techniques, convolutional neural ne...

Full description

Bibliographic Details
Main Author: Abdullahi, Tassallah Amina
Other Authors: Nitschke, Geoff
Format: Dissertation
Language:English
Published: Faculty of Science 2021
Subjects:
Online Access:http://hdl.handle.net/11427/33615
Description
Summary:Climate change is expected to exacerbate diarrhoea outbreak in South Africa, a leading cause of morbidity and mortality in the region. In this study, we modelled the impacts of climate change on diarrhoea with machine learning methods. We applied two deep learning techniques, convolutional neural networks (CNNs) and long-short term memory networks (LSTMs); and a support vector machine to predict daily diarrhoea cases over the different South African provinces by incorporating climate information. Generative Adversarial Networks (GANs) was used to generate synthetic data which was used to augment the available dataset. Furthermore, relevance estimation and value calibration (REVAC) was used to tune the parameters of the machine learning algorithms to optimize the accuracy of their predictions. Sensitivity analysis was also performed to investigate the contribution of the different climate factors to the diarrhoea prediction model. The results of the study showed that all three ML methods were appropriate for predicting daily diarrhoea cases with respect to the selected climate variables in each South African province. The ML methods were all able to yield low and similar RMSE. However, the level of accuracy for each model varied across different experiments, with the deep learning models outperforming the SVM model. Among the deep learning techniques, the CNN model performed best when only real-world dataset was used, while the LSTM model outperformed the other models when the real dataset was augmented with synthetic data. Across the provinces, the accuracy of all three ML algorithms improved by at least 30% when data augmentation was implemented. In addition, REVAC improved the accuracy of the CNN model by more than 12% in KwaZulu Natal province. However, the percentage increase in accuracy of the LSTM model was less than 4% in Western Cape province when REVAC was used. Our sensitivity analysis revealed that the most influential climate variables to be considered when predicting outbreak of diarrhoea in South Africa are precipitation, humidity, evaporation and temperature conditions. The result of this study is important for the development of an early warning system for diarrhoea outbreak over South Africa.