On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning

Machine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log trans...

Full description

Bibliographic Details
Main Authors:	Zheng Li, Jhon Galdames-Retamal
Format:	Article
Language:	English
Published:	MDPI AG 2021-08-01
Series:	Applied Sciences
Subjects:	edge computing online edge learning open methods running skewness Welford’s algorithm
Online Access:	https://www.mdpi.com/2076-3417/11/16/7461

id	doaj-288ed2496f994906bd6fddb5d163b7e5
record_format	Article
spelling	doaj-288ed2496f994906bd6fddb5d163b7e52021-08-26T13:30:06ZengMDPI AGApplied Sciences2076-34172021-08-01117461746110.3390/app11167461On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge LearningZheng Li0Jhon Galdames-Retamal1Department of Computer Science, University of Concepción, Concepción 4030000, ChileDepartment of Computer Science, University of Concepción, Concepción 4030000, ChileMachine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log transformation) before applying machine learning to deal with real-world problems. Nevertheless, this pre-processing strategy cannot be employed for online machine learning, especially in the context of edge computing, because it is barely possible to foresee and store the continuous data flow on IoT devices on the edge. Thus, it will be crucial and valuable to enable skewness monitoring in real time. Unfortunately, there exists a surprising gap between practitioners’ needs and scientific research in running statistics for monitoring real-time skewness, not to mention the lack of suitable remedies for skewed data at runtime. Inspired by Welford’s algorithm, which is the most efficient approach to calculating running variance, this research developed efficient calculation methods for three versions of running skewness. These methods can conveniently be implemented as skewness monitoring modules that are affordable for IoT devices in different edge learning scenarios. Such an IoT-friendly skewness monitoring eventually acts a cornerstone for developing the research field of skewness-aware online edge learning. By initially validating the usefulness and significance of skewness awareness in edge learning implementations, we also argue that conjoint research efforts from relevant communities are needed to boost this promising research field.https://www.mdpi.com/2076-3417/11/16/7461edge computingonline edge learningopen methodsrunning skewnessWelford’s algorithm
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Zheng Li Jhon Galdames-Retamal
spellingShingle	Zheng Li Jhon Galdames-Retamal On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning Applied Sciences edge computing online edge learning open methods running skewness Welford’s algorithm
author_facet	Zheng Li Jhon Galdames-Retamal
author_sort	Zheng Li
title	On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_short	On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_full	On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_fullStr	On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_full_unstemmed	On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning
title_sort	on iot-friendly skewness monitoring for skewness-aware online edge learning
publisher	MDPI AG
series	Applied Sciences
issn	2076-3417
publishDate	2021-08-01
description	Machine learning techniques generally require or assume balanced datasets. Skewed data can make machine learning systems never function properly, no matter how carefully the parameter tuning is conducted. Thus, a common solution to the problem of high skewness is to pre-process data (e.g., log transformation) before applying machine learning to deal with real-world problems. Nevertheless, this pre-processing strategy cannot be employed for online machine learning, especially in the context of edge computing, because it is barely possible to foresee and store the continuous data flow on IoT devices on the edge. Thus, it will be crucial and valuable to enable skewness monitoring in real time. Unfortunately, there exists a surprising gap between practitioners’ needs and scientific research in running statistics for monitoring real-time skewness, not to mention the lack of suitable remedies for skewed data at runtime. Inspired by Welford’s algorithm, which is the most efficient approach to calculating running variance, this research developed efficient calculation methods for three versions of running skewness. These methods can conveniently be implemented as skewness monitoring modules that are affordable for IoT devices in different edge learning scenarios. Such an IoT-friendly skewness monitoring eventually acts a cornerstone for developing the research field of skewness-aware online edge learning. By initially validating the usefulness and significance of skewness awareness in edge learning implementations, we also argue that conjoint research efforts from relevant communities are needed to boost this promising research field.
topic	edge computing online edge learning open methods running skewness Welford’s algorithm
url	https://www.mdpi.com/2076-3417/11/16/7461
work_keys_str_mv	AT zhengli oniotfriendlyskewnessmonitoringforskewnessawareonlineedgelearning AT jhongaldamesretamal oniotfriendlyskewnessmonitoringforskewnessawareonlineedgelearning
_version_	1721195002350010368

On IoT-Friendly Skewness Monitoring for Skewness-Aware Online Edge Learning

Similar Items