An Efficient Anomaly Detection System for Crowded Scenes Using Variational Autoencoders

Anomaly detection in crowded scenes is an important and challenging part of the intelligent video surveillance system. As the deep neural networks make success in feature representation, the features extracted by a deep neural network represent the appearance and motion patterns in different scenes...

Full description

Bibliographic Details
Main Authors: Ming Xu, Xiaosheng Yu, Dongyue Chen, Chengdong Wu, Yang Jiang
Format: Article
Language:English
Published: MDPI AG 2019-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/9/16/3337
id doaj-1c0acc4208d149db9d02d141c8d68ba0
record_format Article
spelling doaj-1c0acc4208d149db9d02d141c8d68ba02020-11-24T21:22:11ZengMDPI AGApplied Sciences2076-34172019-08-01916333710.3390/app9163337app9163337An Efficient Anomaly Detection System for Crowded Scenes Using Variational AutoencodersMing Xu0Xiaosheng Yu1Dongyue Chen2Chengdong Wu3Yang Jiang4College of Information Science and Engineering, Northeastern University, Shenyang 110819, Liaoning, ChinaFaculty of Robot Science and Engineering, Northeastern University, Shenyang 110819, Liaoning, ChinaCollege of Information Science and Engineering, Northeastern University, Shenyang 110819, Liaoning, ChinaFaculty of Robot Science and Engineering, Northeastern University, Shenyang 110819, Liaoning, ChinaFaculty of Robot Science and Engineering, Northeastern University, Shenyang 110819, Liaoning, ChinaAnomaly detection in crowded scenes is an important and challenging part of the intelligent video surveillance system. As the deep neural networks make success in feature representation, the features extracted by a deep neural network represent the appearance and motion patterns in different scenes more specifically, comparing with the hand-crafted features typically used in the traditional anomaly detection approaches. In this paper, we propose a new baseline framework of anomaly detection for complex surveillance scenes based on a variational auto-encoder with convolution kernels to learn feature representations. Firstly, the raw frames series are provided as input to our variational auto-encoder without any preprocessing to learn the appearance and motion features of the receptive fields. Then, multiple Gaussian models are used to predict the anomaly scores of the corresponding receptive fields. Our proposed two-stage anomaly detection system is evaluated on the video surveillance dataset for a large scene, UCSD pedestrian datasets, and yields competitive performance compared with state-of-the-art methods.https://www.mdpi.com/2076-3417/9/16/3337video surveillance systemanomaly detectionunsupervised learningconvolutional auto-encodervariational auto-encoder
collection DOAJ
language English
format Article
sources DOAJ
author Ming Xu
Xiaosheng Yu
Dongyue Chen
Chengdong Wu
Yang Jiang
spellingShingle Ming Xu
Xiaosheng Yu
Dongyue Chen
Chengdong Wu
Yang Jiang
An Efficient Anomaly Detection System for Crowded Scenes Using Variational Autoencoders
Applied Sciences
video surveillance system
anomaly detection
unsupervised learning
convolutional auto-encoder
variational auto-encoder
author_facet Ming Xu
Xiaosheng Yu
Dongyue Chen
Chengdong Wu
Yang Jiang
author_sort Ming Xu
title An Efficient Anomaly Detection System for Crowded Scenes Using Variational Autoencoders
title_short An Efficient Anomaly Detection System for Crowded Scenes Using Variational Autoencoders
title_full An Efficient Anomaly Detection System for Crowded Scenes Using Variational Autoencoders
title_fullStr An Efficient Anomaly Detection System for Crowded Scenes Using Variational Autoencoders
title_full_unstemmed An Efficient Anomaly Detection System for Crowded Scenes Using Variational Autoencoders
title_sort efficient anomaly detection system for crowded scenes using variational autoencoders
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2019-08-01
description Anomaly detection in crowded scenes is an important and challenging part of the intelligent video surveillance system. As the deep neural networks make success in feature representation, the features extracted by a deep neural network represent the appearance and motion patterns in different scenes more specifically, comparing with the hand-crafted features typically used in the traditional anomaly detection approaches. In this paper, we propose a new baseline framework of anomaly detection for complex surveillance scenes based on a variational auto-encoder with convolution kernels to learn feature representations. Firstly, the raw frames series are provided as input to our variational auto-encoder without any preprocessing to learn the appearance and motion features of the receptive fields. Then, multiple Gaussian models are used to predict the anomaly scores of the corresponding receptive fields. Our proposed two-stage anomaly detection system is evaluated on the video surveillance dataset for a large scene, UCSD pedestrian datasets, and yields competitive performance compared with state-of-the-art methods.
topic video surveillance system
anomaly detection
unsupervised learning
convolutional auto-encoder
variational auto-encoder
url https://www.mdpi.com/2076-3417/9/16/3337
work_keys_str_mv AT mingxu anefficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT xiaoshengyu anefficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT dongyuechen anefficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT chengdongwu anefficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT yangjiang anefficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT mingxu efficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT xiaoshengyu efficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT dongyuechen efficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT chengdongwu efficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
AT yangjiang efficientanomalydetectionsystemforcrowdedscenesusingvariationalautoencoders
_version_ 1725997075273875456