On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
Machine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log trans...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-08-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/11/16/7461 |
id |
doaj-288ed2496f994906bd6fddb5d163b7e5 |
---|---|
record_format |
Article |
spelling |
doaj-288ed2496f994906bd6fddb5d163b7e52021-08-26T13:30:06ZengMDPI AGApplied Sciences2076-34172021-08-01117461746110.3390/app11167461On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge LearningZheng Li0Jhon Galdames-Retamal1Department of Computer Science, University of Concepción, Concepción 4030000, ChileDepartment of Computer Science, University of Concepción, Concepción 4030000, ChileMachine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log transformation) before applying machine learning to deal with real-world problems. Nevertheless, this pre-processing strategy cannot be employed for online machine learning, especially in the context of edge computing, because it is barely possible to foresee and store the continuous data flow on IoT devices on the edge. Thus, it will be crucial and valuable to enable skewness monitoring in real time. Unfortunately, there exists a surprising gap between practitioners’ needs and scientific research in running statistics for monitoring real-time skewness, not to mention the lack of suitable remedies for skewed data at runtime. Inspired by Welford’s algorithm, which is the most efficient approach to calculating running variance, this research developed efficient calculation methods for three versions of running skewness. These methods can conveniently be implemented as skewness monitoring modules that are affordable for IoT devices in different edge learning scenarios. Such an IoT-friendly skewness monitoring eventually acts a cornerstone for developing the research field of skewness-aware online edge learning. By initially validating the usefulness and significance of skewness awareness in edge learning implementations, we also argue that conjoint research efforts from relevant communities are needed to boost this promising research field.https://www.mdpi.com/2076-3417/11/16/7461edge computingonline edge learningopen methodsrunning skewnessWelford’s algorithm |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zheng Li Jhon Galdames-Retamal |
spellingShingle |
Zheng Li Jhon Galdames-Retamal On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning Applied Sciences edge computing online edge learning open methods running skewness Welford’s algorithm |
author_facet |
Zheng Li Jhon Galdames-Retamal |
author_sort |
Zheng Li |
title |
On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning |
title_short |
On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning |
title_full |
On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning |
title_fullStr |
On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning |
title_full_unstemmed |
On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning |
title_sort |
on iot-friendly skewness monitoring for skewness-aware online edge learning |
publisher |
MDPI AG |
series |
Applied Sciences |
issn |
2076-3417 |
publishDate |
2021-08-01 |
description |
Machine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log transformation) before applying machine learning to deal with real-world problems. Nevertheless, this pre-processing strategy cannot be employed for online machine learning, especially in the context of edge computing, because it is barely possible to foresee and store the continuous data flow on IoT devices on the edge. Thus, it will be crucial and valuable to enable skewness monitoring in real time. Unfortunately, there exists a surprising gap between practitioners’ needs and scientific research in running statistics for monitoring real-time skewness, not to mention the lack of suitable remedies for skewed data at runtime. Inspired by Welford’s algorithm, which is the most efficient approach to calculating running variance, this research developed efficient calculation methods for three versions of running skewness. These methods can conveniently be implemented as skewness monitoring modules that are affordable for IoT devices in different edge learning scenarios. Such an IoT-friendly skewness monitoring eventually acts a cornerstone for developing the research field of skewness-aware online edge learning. By initially validating the usefulness and significance of skewness awareness in edge learning implementations, we also argue that conjoint research efforts from relevant communities are needed to boost this promising research field. |
topic |
edge computing online edge learning open methods running skewness Welford’s algorithm |
url |
https://www.mdpi.com/2076-3417/11/16/7461 |
work_keys_str_mv |
AT zhengli oniotfriendlyskewnessmonitoringforskewnessawareonlineedgelearning AT jhongaldamesretamal oniotfriendlyskewnessmonitoringforskewnessawareonlineedgelearning |
_version_ |
1721195002350010368 |