A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data
In the current paper, we propose a machine learning forecasting model for the accurate prediction of qualitative weather information on winter precipitation types, utilized in Apache Spark Streaming distributed framework. The proposed model receives storage and processes data in real-time, in order...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-08-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/10/16/1872 |
id |
doaj-10fb5b3695814dcfa6bcec20c21fec4e |
---|---|
record_format |
Article |
spelling |
doaj-10fb5b3695814dcfa6bcec20c21fec4e2021-08-26T13:41:21ZengMDPI AGElectronics2079-92922021-08-01101872187210.3390/electronics10161872A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming DataAndreas Kanavos0Maria Trigka1Elias Dritsas2Gerasimos Vonitsanos3Phivos Mylonas4Department of Digital Media and Communication, Ionian University, 28100 Corfu, GreeceComputer Engineering and Informatics Department, University of Patras, 26504 Patras, GreeceComputer Engineering and Informatics Department, University of Patras, 26504 Patras, GreeceComputer Engineering and Informatics Department, University of Patras, 26504 Patras, GreeceDepartment of Informatics, Ionian University, 49100 Corfu, GreeceIn the current paper, we propose a machine learning forecasting model for the accurate prediction of qualitative weather information on winter precipitation types, utilized in Apache Spark Streaming distributed framework. The proposed model receives storage and processes data in real-time, in order to extract useful knowledge from different sensors related to weather data. In following, the numerical weather prediction model aims at forecasting the weather type given three precipitation classes namely rain, freezing rain, and snow as recorded in the Automated Surface Observing System (ASOS) network. For depicting the effectiveness of our proposed schema, a regularization technique for feature selection so as to avoid overfitting is implemented. Several classification models covering three different categorization methods namely the Bayesian, decision trees, and meta/ensemble methods, have been investigated in a real dataset. The experimental analysis illustrates that the utilization of the regularization technique could offer a significant boost in forecasting performance.https://www.mdpi.com/2079-9292/10/16/1872Apache CassandraApache KafkaApache Spark Streamingbig dataclassificationknowledge discovery |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Andreas Kanavos Maria Trigka Elias Dritsas Gerasimos Vonitsanos Phivos Mylonas |
spellingShingle |
Andreas Kanavos Maria Trigka Elias Dritsas Gerasimos Vonitsanos Phivos Mylonas A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data Electronics Apache Cassandra Apache Kafka Apache Spark Streaming big data classification knowledge discovery |
author_facet |
Andreas Kanavos Maria Trigka Elias Dritsas Gerasimos Vonitsanos Phivos Mylonas |
author_sort |
Andreas Kanavos |
title |
A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data |
title_short |
A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data |
title_full |
A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data |
title_fullStr |
A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data |
title_full_unstemmed |
A Regularization-Based Big Data Framework for Winter Precipitation Forecasting on Streaming Data |
title_sort |
regularization-based big data framework for winter precipitation forecasting on streaming data |
publisher |
MDPI AG |
series |
Electronics |
issn |
2079-9292 |
publishDate |
2021-08-01 |
description |
In the current paper, we propose a machine learning forecasting model for the accurate prediction of qualitative weather information on winter precipitation types, utilized in Apache Spark Streaming distributed framework. The proposed model receives storage and processes data in real-time, in order to extract useful knowledge from different sensors related to weather data. In following, the numerical weather prediction model aims at forecasting the weather type given three precipitation classes namely rain, freezing rain, and snow as recorded in the Automated Surface Observing System (ASOS) network. For depicting the effectiveness of our proposed schema, a regularization technique for feature selection so as to avoid overfitting is implemented. Several classification models covering three different categorization methods namely the Bayesian, decision trees, and meta/ensemble methods, have been investigated in a real dataset. The experimental analysis illustrates that the utilization of the regularization technique could offer a significant boost in forecasting performance. |
topic |
Apache Cassandra Apache Kafka Apache Spark Streaming big data classification knowledge discovery |
url |
https://www.mdpi.com/2079-9292/10/16/1872 |
work_keys_str_mv |
AT andreaskanavos aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT mariatrigka aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT eliasdritsas aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT gerasimosvonitsanos aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT phivosmylonas aregularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT andreaskanavos regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT mariatrigka regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT eliasdritsas regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT gerasimosvonitsanos regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata AT phivosmylonas regularizationbasedbigdataframeworkforwinterprecipitationforecastingonstreamingdata |
_version_ |
1721193876464599040 |