Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne <i>Ambrosia</i> Pollen

Allergies to airborne pollen are a significant issue affecting millions of Americans. Consequently, accurately predicting the daily concentration of airborne pollen is of significant public benefit in providing timely alerts. This study presents a method for the robust estimation of the concentratio...

Full description

Bibliographic Details
Main Authors: Gebreab K. Zewdie, David J. Lary, Estelle Levetin, Gemechu F. Garuma
Format: Article
Language:English
Published: MDPI AG 2019-06-01
Series:International Journal of Environmental Research and Public Health
Subjects:
Online Access:https://www.mdpi.com/1660-4601/16/11/1992
id doaj-7f3a3fc6058948d680c7ce42c7c85802
record_format Article
spelling doaj-7f3a3fc6058948d680c7ce42c7c858022020-11-25T01:14:03ZengMDPI AGInternational Journal of Environmental Research and Public Health1660-46012019-06-011611199210.3390/ijerph16111992ijerph16111992Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne <i>Ambrosia</i> PollenGebreab K. Zewdie0David J. Lary1Estelle Levetin2Gemechu F. Garuma3William B. Hanson Center for Space Sciences, The University of Texas at Dallas, Richardson, TX 75080, USAWilliam B. Hanson Center for Space Sciences, The University of Texas at Dallas, Richardson, TX 75080, USADepartment of Biological Science, The University of Tulsa, Tulsa, OK 74104, USAInstitute of Earth and Environmental Sciences, University of Quebec at Montreal, Montreal, QC H2L 2C4, CanadaAllergies to airborne pollen are a significant issue affecting millions of Americans. Consequently, accurately predicting the daily concentration of airborne pollen is of significant public benefit in providing timely alerts. This study presents a method for the robust estimation of the concentration of airborne <i>Ambrosia</i> pollen using a suite of machine learning approaches including deep learning and ensemble learners. Each of these machine learning approaches utilize data from the European Centre for Medium-Range Weather Forecasts (ECMWF) atmospheric weather and land surface reanalysis. The machine learning approaches used for developing a suite of empirical models are deep neural networks, extreme gradient boosting, random forests and Bayesian ridge regression methods for developing our predictive model. The training data included twenty-four years of daily pollen concentration measurements together with ECMWF weather and land surface reanalysis data from 1987 to 2011 is used to develop the machine learning predictive models. The last six years of the dataset from 2012 to 2017 is used to independently test the performance of the machine learning models. The correlation coefficients between the estimated and actual pollen abundance for the independent validation datasets for the deep neural networks, random forest, extreme gradient boosting and Bayesian ridge were 0.82, 0.81, 0.81 and 0.75 respectively, showing that machine learning can be used to effectively forecast the concentrations of airborne pollen.https://www.mdpi.com/1660-4601/16/11/1992<i>Ambrosia</i> pollenrandom forestextreme gradient boostingdeep neural networksmachine learningECMWFpollen allergy
collection DOAJ
language English
format Article
sources DOAJ
author Gebreab K. Zewdie
David J. Lary
Estelle Levetin
Gemechu F. Garuma
spellingShingle Gebreab K. Zewdie
David J. Lary
Estelle Levetin
Gemechu F. Garuma
Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne <i>Ambrosia</i> Pollen
International Journal of Environmental Research and Public Health
<i>Ambrosia</i> pollen
random forest
extreme gradient boosting
deep neural networks
machine learning
ECMWF
pollen allergy
author_facet Gebreab K. Zewdie
David J. Lary
Estelle Levetin
Gemechu F. Garuma
author_sort Gebreab K. Zewdie
title Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne <i>Ambrosia</i> Pollen
title_short Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne <i>Ambrosia</i> Pollen
title_full Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne <i>Ambrosia</i> Pollen
title_fullStr Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne <i>Ambrosia</i> Pollen
title_full_unstemmed Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne <i>Ambrosia</i> Pollen
title_sort applying deep neural networks and ensemble machine learning methods to forecast airborne <i>ambrosia</i> pollen
publisher MDPI AG
series International Journal of Environmental Research and Public Health
issn 1660-4601
publishDate 2019-06-01
description Allergies to airborne pollen are a significant issue affecting millions of Americans. Consequently, accurately predicting the daily concentration of airborne pollen is of significant public benefit in providing timely alerts. This study presents a method for the robust estimation of the concentration of airborne <i>Ambrosia</i> pollen using a suite of machine learning approaches including deep learning and ensemble learners. Each of these machine learning approaches utilize data from the European Centre for Medium-Range Weather Forecasts (ECMWF) atmospheric weather and land surface reanalysis. The machine learning approaches used for developing a suite of empirical models are deep neural networks, extreme gradient boosting, random forests and Bayesian ridge regression methods for developing our predictive model. The training data included twenty-four years of daily pollen concentration measurements together with ECMWF weather and land surface reanalysis data from 1987 to 2011 is used to develop the machine learning predictive models. The last six years of the dataset from 2012 to 2017 is used to independently test the performance of the machine learning models. The correlation coefficients between the estimated and actual pollen abundance for the independent validation datasets for the deep neural networks, random forest, extreme gradient boosting and Bayesian ridge were 0.82, 0.81, 0.81 and 0.75 respectively, showing that machine learning can be used to effectively forecast the concentrations of airborne pollen.
topic <i>Ambrosia</i> pollen
random forest
extreme gradient boosting
deep neural networks
machine learning
ECMWF
pollen allergy
url https://www.mdpi.com/1660-4601/16/11/1992
work_keys_str_mv AT gebreabkzewdie applyingdeepneuralnetworksandensemblemachinelearningmethodstoforecastairborneiambrosiaipollen
AT davidjlary applyingdeepneuralnetworksandensemblemachinelearningmethodstoforecastairborneiambrosiaipollen
AT estellelevetin applyingdeepneuralnetworksandensemblemachinelearningmethodstoforecastairborneiambrosiaipollen
AT gemechufgaruma applyingdeepneuralnetworksandensemblemachinelearningmethodstoforecastairborneiambrosiaipollen
_version_ 1725159215415164928