On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning

Machine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log trans...

Full description

Bibliographic Details
Main Authors: Zheng Li, Jhon Galdames-Retamal
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/16/7461
id doaj-288ed2496f994906bd6fddb5d163b7e5
record_format Article
spelling doaj-288ed2496f994906bd6fddb5d163b7e52021-08-26T13:30:06ZengMDPI AGApplied Sciences2076-34172021-08-01117461746110.3390/app11167461On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge LearningZheng Li0Jhon Galdames-Retamal1Department of Computer Science, University of Concepción, Concepción 4030000, ChileDepartment of Computer Science, University of Concepción, Concepción 4030000, ChileMachine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log transformation) before applying machine learning to deal with real-world problems. Nevertheless, this pre-processing strategy cannot be employed for online machine learning, especially in the context of edge computing, because it is barely possible to foresee and store the continuous data flow on IoT devices on the edge. Thus, it will be crucial and valuable to enable skewness monitoring in real time. Unfortunately, there exists a surprising gap between practitioners’ needs and scientific research in running statistics for monitoring real-time skewness, not to mention the lack of suitable remedies for skewed data at runtime. Inspired by Welford’s algorithm, which is the most efficient approach to calculating running variance, this research developed efficient calculation methods for three versions of running skewness. These methods can conveniently be implemented as skewness monitoring modules that are affordable for IoT devices in different edge learning scenarios. Such an IoT-friendly skewness monitoring eventually acts a cornerstone for developing the research field of skewness-aware online edge learning. By initially validating the usefulness and significance of skewness awareness in edge learning implementations, we also argue that conjoint research efforts from relevant communities are needed to boost this promising research field.https://www.mdpi.com/2076-3417/11/16/7461edge computingonline edge learningopen methodsrunning skewnessWelford’s algorithm
collection DOAJ
language English
format Article
sources DOAJ
author Zheng Li
Jhon Galdames-Retamal
spellingShingle Zheng Li
Jhon Galdames-Retamal
On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
Applied Sciences
edge computing
online edge learning
open methods
running skewness
Welford’s algorithm
author_facet Zheng Li
Jhon Galdames-Retamal
author_sort Zheng Li
title On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_short On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_full On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_fullStr On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_full_unstemmed On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_sort on iot-friendly skewness monitoring for skewness-aware online edge learning
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2021-08-01
description Machine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log transformation) before applying machine learning to deal with real-world problems. Nevertheless, this pre-processing strategy cannot be employed for online machine learning, especially in the context of edge computing, because it is barely possible to foresee and store the continuous data flow on IoT devices on the edge. Thus, it will be crucial and valuable to enable skewness monitoring in real time. Unfortunately, there exists a surprising gap between practitioners’ needs and scientific research in running statistics for monitoring real-time skewness, not to mention the lack of suitable remedies for skewed data at runtime. Inspired by Welford’s algorithm, which is the most efficient approach to calculating running variance, this research developed efficient calculation methods for three versions of running skewness. These methods can conveniently be implemented as skewness monitoring modules that are affordable for IoT devices in different edge learning scenarios. Such an IoT-friendly skewness monitoring eventually acts a cornerstone for developing the research field of skewness-aware online edge learning. By initially validating the usefulness and significance of skewness awareness in edge learning implementations, we also argue that conjoint research efforts from relevant communities are needed to boost this promising research field.
topic edge computing
online edge learning
open methods
running skewness
Welford’s algorithm
url https://www.mdpi.com/2076-3417/11/16/7461
work_keys_str_mv AT zhengli oniotfriendlyskewnessmonitoringforskewnessawareonlineedgelearning
AT jhongaldamesretamal oniotfriendlyskewnessmonitoringforskewnessawareonlineedgelearning
_version_ 1721195002350010368