Large-scale forecasting of information spreading
Abstract This research proposes a system based on a combination of various components for parallel modelling and forecasting the processes in networks with data assimilation from the real network. The main novelty of this work consists of the assimilation of data for forecasting the processes in soc...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2020-09-01
|
Series: | Journal of Big Data |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s40537-020-00350-5 |
id |
doaj-95c13b7d0e3444799fa0c2396c7ee535 |
---|---|
record_format |
Article |
spelling |
doaj-95c13b7d0e3444799fa0c2396c7ee5352020-11-25T03:02:40ZengSpringerOpenJournal of Big Data2196-11152020-09-017111710.1186/s40537-020-00350-5Large-scale forecasting of information spreadingOksana Severiukhina0Sergey Kesarev1Klavdiya Bochenina2Alexander Boukhanovsky3Michael H. Lees4Peter M. A. Sloot5ITMO UniversityITMO UniversityITMO UniversityITMO UniversityITMO UniversityITMO UniversityAbstract This research proposes a system based on a combination of various components for parallel modelling and forecasting the processes in networks with data assimilation from the real network. The main novelty of this work consists of the assimilation of data for forecasting the processes in social networks which allows improving the quality of the forecast. The social network VK was considered as a source of information for determining types of entities and the parameters of the model. The main component is the model based on a combination of internal sub-models for more realistic reproduction of processes on micro (for single information message) and meso (for series of messages) levels. Moreover, the results of the forecast must not lose their relevance during the calculations. In order to get the result of the forecast for networks with millions of nodes in reasonable time, the process of simulation has been parallelized. The accuracy of the forecast is estimated by MAPE, MAE metrics for micro-scale, the Kolmogorov–Smirnov criterion for aggregated dynamics. The quality in the operational regime is also estimated by the number of batches with assimilated data to achieve the required accuracy and the ratio of calculation time in the frames of the forecasting period. In addition, the results include experimental studies of functional characteristics, scalability, as well as the performance of the system.http://link.springer.com/article/10.1186/s40537-020-00350-5Agent-based modellingModel of information spreadParallel simulationForecasting modelData-driven model |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Oksana Severiukhina Sergey Kesarev Klavdiya Bochenina Alexander Boukhanovsky Michael H. Lees Peter M. A. Sloot |
spellingShingle |
Oksana Severiukhina Sergey Kesarev Klavdiya Bochenina Alexander Boukhanovsky Michael H. Lees Peter M. A. Sloot Large-scale forecasting of information spreading Journal of Big Data Agent-based modelling Model of information spread Parallel simulation Forecasting model Data-driven model |
author_facet |
Oksana Severiukhina Sergey Kesarev Klavdiya Bochenina Alexander Boukhanovsky Michael H. Lees Peter M. A. Sloot |
author_sort |
Oksana Severiukhina |
title |
Large-scale forecasting of information spreading |
title_short |
Large-scale forecasting of information spreading |
title_full |
Large-scale forecasting of information spreading |
title_fullStr |
Large-scale forecasting of information spreading |
title_full_unstemmed |
Large-scale forecasting of information spreading |
title_sort |
large-scale forecasting of information spreading |
publisher |
SpringerOpen |
series |
Journal of Big Data |
issn |
2196-1115 |
publishDate |
2020-09-01 |
description |
Abstract This research proposes a system based on a combination of various components for parallel modelling and forecasting the processes in networks with data assimilation from the real network. The main novelty of this work consists of the assimilation of data for forecasting the processes in social networks which allows improving the quality of the forecast. The social network VK was considered as a source of information for determining types of entities and the parameters of the model. The main component is the model based on a combination of internal sub-models for more realistic reproduction of processes on micro (for single information message) and meso (for series of messages) levels. Moreover, the results of the forecast must not lose their relevance during the calculations. In order to get the result of the forecast for networks with millions of nodes in reasonable time, the process of simulation has been parallelized. The accuracy of the forecast is estimated by MAPE, MAE metrics for micro-scale, the Kolmogorov–Smirnov criterion for aggregated dynamics. The quality in the operational regime is also estimated by the number of batches with assimilated data to achieve the required accuracy and the ratio of calculation time in the frames of the forecasting period. In addition, the results include experimental studies of functional characteristics, scalability, as well as the performance of the system. |
topic |
Agent-based modelling Model of information spread Parallel simulation Forecasting model Data-driven model |
url |
http://link.springer.com/article/10.1186/s40537-020-00350-5 |
work_keys_str_mv |
AT oksanaseveriukhina largescaleforecastingofinformationspreading AT sergeykesarev largescaleforecastingofinformationspreading AT klavdiyabochenina largescaleforecastingofinformationspreading AT alexanderboukhanovsky largescaleforecastingofinformationspreading AT michaelhlees largescaleforecastingofinformationspreading AT petermasloot largescaleforecastingofinformationspreading |
_version_ |
1724688988704342016 |