Large-scale forecasting of information spreading

Abstract This research proposes a system based on a combination of various components for parallel modelling and forecasting the processes in networks with data assimilation from the real network. The main novelty of this work consists of the assimilation of data for forecasting the processes in soc...

Full description

Bibliographic Details
Main Authors: Oksana Severiukhina, Sergey Kesarev, Klavdiya Bochenina, Alexander Boukhanovsky, Michael H. Lees, Peter M. A. Sloot
Format: Article
Language:English
Published: SpringerOpen 2020-09-01
Series:Journal of Big Data
Subjects:
Online Access:http://link.springer.com/article/10.1186/s40537-020-00350-5
id doaj-95c13b7d0e3444799fa0c2396c7ee535
record_format Article
spelling doaj-95c13b7d0e3444799fa0c2396c7ee5352020-11-25T03:02:40ZengSpringerOpenJournal of Big Data2196-11152020-09-017111710.1186/s40537-020-00350-5Large-scale forecasting of information spreadingOksana Severiukhina0Sergey Kesarev1Klavdiya Bochenina2Alexander Boukhanovsky3Michael H. Lees4Peter M. A. Sloot5ITMO UniversityITMO UniversityITMO UniversityITMO UniversityITMO UniversityITMO UniversityAbstract This research proposes a system based on a combination of various components for parallel modelling and forecasting the processes in networks with data assimilation from the real network. The main novelty of this work consists of the assimilation of data for forecasting the processes in social networks which allows improving the quality of the forecast. The social network VK was considered as a source of information for determining types of entities and the parameters of the model. The main component is the model based on a combination of internal sub-models for more realistic reproduction of processes on micro (for single information message) and meso (for series of messages) levels. Moreover, the results of the forecast must not lose their relevance during the calculations. In order to get the result of the forecast for networks with millions of nodes in reasonable time, the process of simulation has been parallelized. The accuracy of the forecast is estimated by MAPE, MAE metrics for micro-scale, the Kolmogorov–Smirnov criterion for aggregated dynamics. The quality in the operational regime is also estimated by the number of batches with assimilated data to achieve the required accuracy and the ratio of calculation time in the frames of the forecasting period. In addition, the results include experimental studies of functional characteristics, scalability, as well as the performance of the system.http://link.springer.com/article/10.1186/s40537-020-00350-5Agent-based modellingModel of information spreadParallel simulationForecasting modelData-driven model
collection DOAJ
language English
format Article
sources DOAJ
author Oksana Severiukhina
Sergey Kesarev
Klavdiya Bochenina
Alexander Boukhanovsky
Michael H. Lees
Peter M. A. Sloot
spellingShingle Oksana Severiukhina
Sergey Kesarev
Klavdiya Bochenina
Alexander Boukhanovsky
Michael H. Lees
Peter M. A. Sloot
Large-scale forecasting of information spreading
Journal of Big Data
Agent-based modelling
Model of information spread
Parallel simulation
Forecasting model
Data-driven model
author_facet Oksana Severiukhina
Sergey Kesarev
Klavdiya Bochenina
Alexander Boukhanovsky
Michael H. Lees
Peter M. A. Sloot
author_sort Oksana Severiukhina
title Large-scale forecasting of information spreading
title_short Large-scale forecasting of information spreading
title_full Large-scale forecasting of information spreading
title_fullStr Large-scale forecasting of information spreading
title_full_unstemmed Large-scale forecasting of information spreading
title_sort large-scale forecasting of information spreading
publisher SpringerOpen
series Journal of Big Data
issn 2196-1115
publishDate 2020-09-01
description Abstract This research proposes a system based on a combination of various components for parallel modelling and forecasting the processes in networks with data assimilation from the real network. The main novelty of this work consists of the assimilation of data for forecasting the processes in social networks which allows improving the quality of the forecast. The social network VK was considered as a source of information for determining types of entities and the parameters of the model. The main component is the model based on a combination of internal sub-models for more realistic reproduction of processes on micro (for single information message) and meso (for series of messages) levels. Moreover, the results of the forecast must not lose their relevance during the calculations. In order to get the result of the forecast for networks with millions of nodes in reasonable time, the process of simulation has been parallelized. The accuracy of the forecast is estimated by MAPE, MAE metrics for micro-scale, the Kolmogorov–Smirnov criterion for aggregated dynamics. The quality in the operational regime is also estimated by the number of batches with assimilated data to achieve the required accuracy and the ratio of calculation time in the frames of the forecasting period. In addition, the results include experimental studies of functional characteristics, scalability, as well as the performance of the system.
topic Agent-based modelling
Model of information spread
Parallel simulation
Forecasting model
Data-driven model
url http://link.springer.com/article/10.1186/s40537-020-00350-5
work_keys_str_mv AT oksanaseveriukhina largescaleforecastingofinformationspreading
AT sergeykesarev largescaleforecastingofinformationspreading
AT klavdiyabochenina largescaleforecastingofinformationspreading
AT alexanderboukhanovsky largescaleforecastingofinformationspreading
AT michaelhlees largescaleforecastingofinformationspreading
AT petermasloot largescaleforecastingofinformationspreading
_version_ 1724688988704342016