An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data
The intelligent environment monitoring network, as the foundation of ecosystem research, has rapidly developed with the ever-growing Internet of Things (IoT). IoT-networked sensors deployed to monitor ecosystems generate copious sensor data characterized by nonstationarity and nonlinearity such that...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8922712/ |
id |
doaj-d293b9a2250f4d299e4489e7d59446ca |
---|---|
record_format |
Article |
spelling |
doaj-d293b9a2250f4d299e4489e7d59446ca2021-03-30T00:25:36ZengIEEEIEEE Access2169-35362019-01-01717519217521210.1109/ACCESS.2019.29576028922712An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor DataMinghu Zhang0https://orcid.org/0000-0002-5178-586XXin Li1https://orcid.org/0000-0003-2999-9818Lili Wang2https://orcid.org/0000-0001-6401-264XKey Laboratory of Remote Sensing of Gansu Province, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou, ChinaNational Tibetan Plateau Data Center, Institute of Tibetan Plateau Research, Chinese Academy of Sciences, Beijing, ChinaCollege of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, ChinaThe intelligent environment monitoring network, as the foundation of ecosystem research, has rapidly developed with the ever-growing Internet of Things (IoT). IoT-networked sensors deployed to monitor ecosystems generate copious sensor data characterized by nonstationarity and nonlinearity such that outlier detection remains a source of concern. Most outlier detection models involve hypothesis tests based on setting outlier threshold values. However, signal decomposition describes stationary and nonstationary relationships sensor data. Therefore, this paper proposes a three-level hybrid model based on the median filter (MF), empirical mode decomposition (EMD), classification and regression tree (CART), autoregression (AR) and exponential weighted moving average (EWMA) methods called MF-EMD-CART-AR-EWMA to detect outliers in sensor data. The first-level performance is compared to that of the Butterworth filter, FIR filter, moving average filter, wavelet filter and Wiener filter. The second-level prediction performance is compared to support vector regression (SVR), K-nearest neighbor (KNN), CART, complementary ensemble EEMD with CART and AR (EEMD-CART-AR) and ensemble CEEMD with CART and AR (CEEMD-CART-AR) methods. Finally, EWMA is compared to Cumulative Sum Control Chart (CUSUM) and Shewhart control charts. The proposed hybrid model was evaluated with a real dataset from the hydrometeorological observation network in the Heihe River Basin, yielding experimental results with better generalization ability and higher accuracy than the compared models, and providing extremely effective detection of minor outliers in predicted values. This paper provides valuable insight and a promising reference for outlier detection involving sensor data and presents a new perspective for detecting outliers.https://ieeexplore.ieee.org/document/8922712/Environmental monitoringsensor dataoutlier detectionintegrated modelstatistical analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Minghu Zhang Xin Li Lili Wang |
spellingShingle |
Minghu Zhang Xin Li Lili Wang An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data IEEE Access Environmental monitoring sensor data outlier detection integrated model statistical analysis |
author_facet |
Minghu Zhang Xin Li Lili Wang |
author_sort |
Minghu Zhang |
title |
An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data |
title_short |
An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data |
title_full |
An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data |
title_fullStr |
An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data |
title_full_unstemmed |
An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data |
title_sort |
adaptive outlier detection and processing approach towards time series sensor data |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
The intelligent environment monitoring network, as the foundation of ecosystem research, has rapidly developed with the ever-growing Internet of Things (IoT). IoT-networked sensors deployed to monitor ecosystems generate copious sensor data characterized by nonstationarity and nonlinearity such that outlier detection remains a source of concern. Most outlier detection models involve hypothesis tests based on setting outlier threshold values. However, signal decomposition describes stationary and nonstationary relationships sensor data. Therefore, this paper proposes a three-level hybrid model based on the median filter (MF), empirical mode decomposition (EMD), classification and regression tree (CART), autoregression (AR) and exponential weighted moving average (EWMA) methods called MF-EMD-CART-AR-EWMA to detect outliers in sensor data. The first-level performance is compared to that of the Butterworth filter, FIR filter, moving average filter, wavelet filter and Wiener filter. The second-level prediction performance is compared to support vector regression (SVR), K-nearest neighbor (KNN), CART, complementary ensemble EEMD with CART and AR (EEMD-CART-AR) and ensemble CEEMD with CART and AR (CEEMD-CART-AR) methods. Finally, EWMA is compared to Cumulative Sum Control Chart (CUSUM) and Shewhart control charts. The proposed hybrid model was evaluated with a real dataset from the hydrometeorological observation network in the Heihe River Basin, yielding experimental results with better generalization ability and higher accuracy than the compared models, and providing extremely effective detection of minor outliers in predicted values. This paper provides valuable insight and a promising reference for outlier detection involving sensor data and presents a new perspective for detecting outliers. |
topic |
Environmental monitoring sensor data outlier detection integrated model statistical analysis |
url |
https://ieeexplore.ieee.org/document/8922712/ |
work_keys_str_mv |
AT minghuzhang anadaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata AT xinli anadaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata AT liliwang anadaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata AT minghuzhang adaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata AT xinli adaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata AT liliwang adaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata |
_version_ |
1724188390178422784 |