Outlier Detection for Multivariate Time Series Using Dynamic Bayesian Networks

Outliers are observations suspected of not having been generated by the underlying process of the remaining data. Many applications require a way of identifying interesting or unusual patterns in multivariate time series (MTS), now ubiquitous in many applications; however, most outlier detection met...

Full description

Bibliographic Details
Main Authors: Jorge L. Serras, Susana Vinga, Alexandra M. Carvalho
Format: Article
Language:English
Published: MDPI AG 2021-02-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/4/1955
id doaj-135355869a4b4c3ea9129e4de3017fee
record_format Article
spelling doaj-135355869a4b4c3ea9129e4de3017fee2021-02-24T00:01:40ZengMDPI AGApplied Sciences2076-34172021-02-01111955195510.3390/app11041955Outlier Detection for Multivariate Time Series Using Dynamic Bayesian NetworksJorge L. Serras0Susana Vinga1Alexandra M. Carvalho2Instituto de Telecomunicações, Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisboa, PortugalINESC-ID, Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisboa, PortugalInstituto de Telecomunicações, Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisboa, PortugalOutliers are observations suspected of not having been generated by the underlying process of the remaining data. Many applications require a way of identifying interesting or unusual patterns in multivariate time series (MTS), now ubiquitous in many applications; however, most outlier detection methods focus solely on univariate series. We propose a complete and automatic outlier detection system covering the pre-processing of MTS data that adopts a dynamic Bayesian network (DBN) modeling algorithm. The latter encodes optimal inter and intra-time slice connectivity of transition networks capable of capturing conditional dependencies in MTS datasets. A sliding window mechanism is employed to score each MTS transition gradually, given the DBN model. Two score-analysis strategies are studied to assure an automatic classification of anomalous data. The proposed approach is first validated in simulated data, demonstrating the performance of the system. Further experiments are made on real data, by uncovering anomalies in distinct scenarios such as electrocardiogram series, mortality rate data, and written pen digits. The developed system proved beneficial in capturing unusual data resulting from temporal contexts, being suitable for any MTS scenario. A widely accessible web application employing the complete system is publicly available jointly with a tutorial.https://www.mdpi.com/2076-3417/11/4/1955multivariate time seriesoutlier detectiondynamic bayesian networkssliding window algorithmscore analysisweb application
collection DOAJ
language English
format Article
sources DOAJ
author Jorge L. Serras
Susana Vinga
Alexandra M. Carvalho
spellingShingle Jorge L. Serras
Susana Vinga
Alexandra M. Carvalho
Outlier Detection for Multivariate Time Series Using Dynamic Bayesian Networks
Applied Sciences
multivariate time series
outlier detection
dynamic bayesian networks
sliding window algorithm
score analysis
web application
author_facet Jorge L. Serras
Susana Vinga
Alexandra M. Carvalho
author_sort Jorge L. Serras
title Outlier Detection for Multivariate Time Series Using Dynamic Bayesian Networks
title_short Outlier Detection for Multivariate Time Series Using Dynamic Bayesian Networks
title_full Outlier Detection for Multivariate Time Series Using Dynamic Bayesian Networks
title_fullStr Outlier Detection for Multivariate Time Series Using Dynamic Bayesian Networks
title_full_unstemmed Outlier Detection for Multivariate Time Series Using Dynamic Bayesian Networks
title_sort outlier detection for multivariate time series using dynamic bayesian networks
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2021-02-01
description Outliers are observations suspected of not having been generated by the underlying process of the remaining data. Many applications require a way of identifying interesting or unusual patterns in multivariate time series (MTS), now ubiquitous in many applications; however, most outlier detection methods focus solely on univariate series. We propose a complete and automatic outlier detection system covering the pre-processing of MTS data that adopts a dynamic Bayesian network (DBN) modeling algorithm. The latter encodes optimal inter and intra-time slice connectivity of transition networks capable of capturing conditional dependencies in MTS datasets. A sliding window mechanism is employed to score each MTS transition gradually, given the DBN model. Two score-analysis strategies are studied to assure an automatic classification of anomalous data. The proposed approach is first validated in simulated data, demonstrating the performance of the system. Further experiments are made on real data, by uncovering anomalies in distinct scenarios such as electrocardiogram series, mortality rate data, and written pen digits. The developed system proved beneficial in capturing unusual data resulting from temporal contexts, being suitable for any MTS scenario. A widely accessible web application employing the complete system is publicly available jointly with a tutorial.
topic multivariate time series
outlier detection
dynamic bayesian networks
sliding window algorithm
score analysis
web application
url https://www.mdpi.com/2076-3417/11/4/1955
work_keys_str_mv AT jorgelserras outlierdetectionformultivariatetimeseriesusingdynamicbayesiannetworks
AT susanavinga outlierdetectionformultivariatetimeseriesusingdynamicbayesiannetworks
AT alexandramcarvalho outlierdetectionformultivariatetimeseriesusingdynamicbayesiannetworks
_version_ 1724253685220900864