An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data

The intelligent environment monitoring network, as the foundation of ecosystem research, has rapidly developed with the ever-growing Internet of Things (IoT). IoT-networked sensors deployed to monitor ecosystems generate copious sensor data characterized by nonstationarity and nonlinearity such that...

Full description

Bibliographic Details
Main Authors: Minghu Zhang, Xin Li, Lili Wang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8922712/
id doaj-d293b9a2250f4d299e4489e7d59446ca
record_format Article
spelling doaj-d293b9a2250f4d299e4489e7d59446ca2021-03-30T00:25:36ZengIEEEIEEE Access2169-35362019-01-01717519217521210.1109/ACCESS.2019.29576028922712An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor DataMinghu Zhang0https://orcid.org/0000-0002-5178-586XXin Li1https://orcid.org/0000-0003-2999-9818Lili Wang2https://orcid.org/0000-0001-6401-264XKey Laboratory of Remote Sensing of Gansu Province, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou, ChinaNational Tibetan Plateau Data Center, Institute of Tibetan Plateau Research, Chinese Academy of Sciences, Beijing, ChinaCollege of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, ChinaThe intelligent environment monitoring network, as the foundation of ecosystem research, has rapidly developed with the ever-growing Internet of Things (IoT). IoT-networked sensors deployed to monitor ecosystems generate copious sensor data characterized by nonstationarity and nonlinearity such that outlier detection remains a source of concern. Most outlier detection models involve hypothesis tests based on setting outlier threshold values. However, signal decomposition describes stationary and nonstationary relationships sensor data. Therefore, this paper proposes a three-level hybrid model based on the median filter (MF), empirical mode decomposition (EMD), classification and regression tree (CART), autoregression (AR) and exponential weighted moving average (EWMA) methods called MF-EMD-CART-AR-EWMA to detect outliers in sensor data. The first-level performance is compared to that of the Butterworth filter, FIR filter, moving average filter, wavelet filter and Wiener filter. The second-level prediction performance is compared to support vector regression (SVR), K-nearest neighbor (KNN), CART, complementary ensemble EEMD with CART and AR (EEMD-CART-AR) and ensemble CEEMD with CART and AR (CEEMD-CART-AR) methods. Finally, EWMA is compared to Cumulative Sum Control Chart (CUSUM) and Shewhart control charts. The proposed hybrid model was evaluated with a real dataset from the hydrometeorological observation network in the Heihe River Basin, yielding experimental results with better generalization ability and higher accuracy than the compared models, and providing extremely effective detection of minor outliers in predicted values. This paper provides valuable insight and a promising reference for outlier detection involving sensor data and presents a new perspective for detecting outliers.https://ieeexplore.ieee.org/document/8922712/Environmental monitoringsensor dataoutlier detectionintegrated modelstatistical analysis
collection DOAJ
language English
format Article
sources DOAJ
author Minghu Zhang
Xin Li
Lili Wang
spellingShingle Minghu Zhang
Xin Li
Lili Wang
An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data
IEEE Access
Environmental monitoring
sensor data
outlier detection
integrated model
statistical analysis
author_facet Minghu Zhang
Xin Li
Lili Wang
author_sort Minghu Zhang
title An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data
title_short An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data
title_full An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data
title_fullStr An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data
title_full_unstemmed An Adaptive Outlier Detection and Processing Approach Towards Time Series Sensor Data
title_sort adaptive outlier detection and processing approach towards time series sensor data
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description The intelligent environment monitoring network, as the foundation of ecosystem research, has rapidly developed with the ever-growing Internet of Things (IoT). IoT-networked sensors deployed to monitor ecosystems generate copious sensor data characterized by nonstationarity and nonlinearity such that outlier detection remains a source of concern. Most outlier detection models involve hypothesis tests based on setting outlier threshold values. However, signal decomposition describes stationary and nonstationary relationships sensor data. Therefore, this paper proposes a three-level hybrid model based on the median filter (MF), empirical mode decomposition (EMD), classification and regression tree (CART), autoregression (AR) and exponential weighted moving average (EWMA) methods called MF-EMD-CART-AR-EWMA to detect outliers in sensor data. The first-level performance is compared to that of the Butterworth filter, FIR filter, moving average filter, wavelet filter and Wiener filter. The second-level prediction performance is compared to support vector regression (SVR), K-nearest neighbor (KNN), CART, complementary ensemble EEMD with CART and AR (EEMD-CART-AR) and ensemble CEEMD with CART and AR (CEEMD-CART-AR) methods. Finally, EWMA is compared to Cumulative Sum Control Chart (CUSUM) and Shewhart control charts. The proposed hybrid model was evaluated with a real dataset from the hydrometeorological observation network in the Heihe River Basin, yielding experimental results with better generalization ability and higher accuracy than the compared models, and providing extremely effective detection of minor outliers in predicted values. This paper provides valuable insight and a promising reference for outlier detection involving sensor data and presents a new perspective for detecting outliers.
topic Environmental monitoring
sensor data
outlier detection
integrated model
statistical analysis
url https://ieeexplore.ieee.org/document/8922712/
work_keys_str_mv AT minghuzhang anadaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata
AT xinli anadaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata
AT liliwang anadaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata
AT minghuzhang adaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata
AT xinli adaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata
AT liliwang adaptiveoutlierdetectionandprocessingapproachtowardstimeseriessensordata
_version_ 1724188390178422784