Adaptive nowcasting of influenza outbreaks using Google searches

Seasonal influenza outbreaks and pandemics of new strains of the influenza virus affect humans around the globe. However, traditional systems for measuring the spread of flu infections deliver results with one or two weeks delay. Recent research suggests that data on queries made to the search engin...

Full description

Bibliographic Details
Main Authors: Tobias Preis, Helen Susannah Moat
Format: Article
Language:English
Published: The Royal Society 2014-01-01
Series:Royal Society Open Science
Subjects:
Online Access:https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.140095
id doaj-483c691e54ac437297829f809f9747a9
record_format Article
spelling doaj-483c691e54ac437297829f809f9747a92020-11-25T04:04:32ZengThe Royal SocietyRoyal Society Open Science2054-57032014-01-011210.1098/rsos.140095140095Adaptive nowcasting of influenza outbreaks using Google searchesTobias PreisHelen Susannah MoatSeasonal influenza outbreaks and pandemics of new strains of the influenza virus affect humans around the globe. However, traditional systems for measuring the spread of flu infections deliver results with one or two weeks delay. Recent research suggests that data on queries made to the search engine Google can be used to address this problem, providing real-time estimates of levels of influenza-like illness in a population. Others have however argued that equally good estimates of current flu levels can be forecast using historic flu measurements. Here, we build dynamic ‘nowcasting’ models; in other words, forecasting models that estimate current levels of influenza, before the release of official data one week later. We find that when using Google Flu Trends data in combination with historic flu levels, the mean absolute error (MAE) of in-sample ‘nowcasts’ can be significantly reduced by 14.4%, compared with a baseline model that uses historic data on flu levels only. We further demonstrate that the MAE of out-of-sample nowcasts can also be significantly reduced by between 16.0% and 52.7%, depending on the length of the sliding training interval. We conclude that, using adaptive models, Google Flu Trends data can indeed be used to improve real-time influenza monitoring, even when official reports of flu infections are available with only one week's delay.https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.140095data sciencecomputational social sciencecomplex systems
collection DOAJ
language English
format Article
sources DOAJ
author Tobias Preis
Helen Susannah Moat
spellingShingle Tobias Preis
Helen Susannah Moat
Adaptive nowcasting of influenza outbreaks using Google searches
Royal Society Open Science
data science
computational social science
complex systems
author_facet Tobias Preis
Helen Susannah Moat
author_sort Tobias Preis
title Adaptive nowcasting of influenza outbreaks using Google searches
title_short Adaptive nowcasting of influenza outbreaks using Google searches
title_full Adaptive nowcasting of influenza outbreaks using Google searches
title_fullStr Adaptive nowcasting of influenza outbreaks using Google searches
title_full_unstemmed Adaptive nowcasting of influenza outbreaks using Google searches
title_sort adaptive nowcasting of influenza outbreaks using google searches
publisher The Royal Society
series Royal Society Open Science
issn 2054-5703
publishDate 2014-01-01
description Seasonal influenza outbreaks and pandemics of new strains of the influenza virus affect humans around the globe. However, traditional systems for measuring the spread of flu infections deliver results with one or two weeks delay. Recent research suggests that data on queries made to the search engine Google can be used to address this problem, providing real-time estimates of levels of influenza-like illness in a population. Others have however argued that equally good estimates of current flu levels can be forecast using historic flu measurements. Here, we build dynamic ‘nowcasting’ models; in other words, forecasting models that estimate current levels of influenza, before the release of official data one week later. We find that when using Google Flu Trends data in combination with historic flu levels, the mean absolute error (MAE) of in-sample ‘nowcasts’ can be significantly reduced by 14.4%, compared with a baseline model that uses historic data on flu levels only. We further demonstrate that the MAE of out-of-sample nowcasts can also be significantly reduced by between 16.0% and 52.7%, depending on the length of the sliding training interval. We conclude that, using adaptive models, Google Flu Trends data can indeed be used to improve real-time influenza monitoring, even when official reports of flu infections are available with only one week's delay.
topic data science
computational social science
complex systems
url https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.140095
work_keys_str_mv AT tobiaspreis adaptivenowcastingofinfluenzaoutbreaksusinggooglesearches
AT helensusannahmoat adaptivenowcastingofinfluenzaoutbreaksusinggooglesearches
_version_ 1724436293078745088