A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data

In the current paper, we propose a machine learning forecasting model for the accurate prediction of qualitative weather information on winter precipitation types, utilized in Apache Spark Streaming distributed framework. The proposed model receives storage and processes data in real-time, in order...

Full description

Bibliographic Details
Main Authors: Andreas Kanavos, Maria Trigka, Elias Dritsas, Gerasimos Vonitsanos, Phivos Mylonas
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/10/16/1872
id doaj-10fb5b3695814dcfa6bcec20c21fec4e
record_format Article
spelling doaj-10fb5b3695814dcfa6bcec20c21fec4e2021-08-26T13:41:21ZengMDPI AGElectronics2079-92922021-08-01101872187210.3390/electronics10161872A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming DataAndreas Kanavos0Maria Trigka1Elias Dritsas2Gerasimos Vonitsanos3Phivos Mylonas4Department of Digital Media and Communication, Ionian University, 28100 Corfu, GreeceComputer Engineering and Informatics Department, University of Patras, 26504 Patras, GreeceComputer Engineering and Informatics Department, University of Patras, 26504 Patras, GreeceComputer Engineering and Informatics Department, University of Patras, 26504 Patras, GreeceDepartment of Informatics, Ionian University, 49100 Corfu, GreeceIn the current paper, we propose a machine learning forecasting model for the accurate prediction of qualitative weather information on winter precipitation types, utilized in Apache Spark Streaming distributed framework. The proposed model receives storage and processes data in real-time, in order to extract useful knowledge from different sensors related to weather data. In following, the numerical weather prediction model aims at forecasting the weather type given three precipitation classes namely rain, freezing rain, and snow as recorded in the Automated Surface Observing System (ASOS) network. For depicting the effectiveness of our proposed schema, a regularization technique for feature selection so as to avoid overfitting is implemented. Several classification models covering three different categorization methods namely the Bayesian, decision trees, and meta/ensemble methods, have been investigated in a real dataset. The experimental analysis illustrates that the utilization of the regularization technique could offer a significant boost in forecasting performance.https://www.mdpi.com/2079-9292/10/16/1872Apache CassandraApache KafkaApache Spark Streamingbig dataclassificationknowledge discovery
collection DOAJ
language English
format Article
sources DOAJ
author Andreas Kanavos
Maria Trigka
Elias Dritsas
Gerasimos Vonitsanos
Phivos Mylonas
spellingShingle Andreas Kanavos
Maria Trigka
Elias Dritsas
Gerasimos Vonitsanos
Phivos Mylonas
A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data
Electronics
Apache Cassandra
Apache Kafka
Apache Spark Streaming
big data
classification
knowledge discovery
author_facet Andreas Kanavos
Maria Trigka
Elias Dritsas
Gerasimos Vonitsanos
Phivos Mylonas
author_sort Andreas Kanavos
title A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data
title_short A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data
title_full A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data
title_fullStr A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data
title_full_unstemmed A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data
title_sort regularization-based big data framework for winter precipitation forecasting on streaming data
publisher MDPI AG
series Electronics
issn 2079-9292
publishDate 2021-08-01
description In the current paper, we propose a machine learning forecasting model for the accurate prediction of qualitative weather information on winter precipitation types, utilized in Apache Spark Streaming distributed framework. The proposed model receives storage and processes data in real-time, in order to extract useful knowledge from different sensors related to weather data. In following, the numerical weather prediction model aims at forecasting the weather type given three precipitation classes namely rain, freezing rain, and snow as recorded in the Automated Surface Observing System (ASOS) network. For depicting the effectiveness of our proposed schema, a regularization technique for feature selection so as to avoid overfitting is implemented. Several classification models covering three different categorization methods namely the Bayesian, decision trees, and meta/ensemble methods, have been investigated in a real dataset. The experimental analysis illustrates that the utilization of the regularization technique could offer a significant boost in forecasting performance.
topic Apache Cassandra
Apache Kafka
Apache Spark Streaming
big data
classification
knowledge discovery
url https://www.mdpi.com/2079-9292/10/16/1872
work_keys_str_mv AT andreaskanavos aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT mariatrigka aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT eliasdritsas aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT gerasimosvonitsanos aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT phivosmylonas aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT andreaskanavos regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT mariatrigka regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT eliasdritsas regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT gerasimosvonitsanos regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
AT phivosmylonas regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata
_version_ 1721193876464599040