Training Data Extraction and Object Detection in Surveillance Scenario

Police and various security services use video analysis for securing public space, mass events, and when investigating criminal activity. Due to a huge amount of data supplied to surveillance systems, some automatic data processing is a necessity. In one typical scenario, an operator marks an object...

Full description

Bibliographic Details
Main Authors: Artur Wilkowski, Maciej Stefańczyk, Włodzimierz Kasprzak
Format: Article
Language:English
Published: MDPI AG 2020-05-01
Series:Sensors
Subjects:
SVM
CNN
Online Access:https://www.mdpi.com/1424-8220/20/9/2689
id doaj-d770e8dbc375495290eeea1ddace58c7
record_format Article
spelling doaj-d770e8dbc375495290eeea1ddace58c72020-11-25T02:20:13ZengMDPI AGSensors1424-82202020-05-01202689268910.3390/s20092689Training Data Extraction and Object Detection in Surveillance ScenarioArtur Wilkowski0Maciej Stefańczyk1Włodzimierz Kasprzak2Institute of Control and Computation Engineering, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warszawa, PolandInstitute of Control and Computation Engineering, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warszawa, PolandInstitute of Control and Computation Engineering, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warszawa, PolandPolice and various security services use video analysis for securing public space, mass events, and when investigating criminal activity. Due to a huge amount of data supplied to surveillance systems, some automatic data processing is a necessity. In one typical scenario, an operator marks an object in an image frame and searches for all occurrences of the object in other frames or even image sequences. This problem is hard in general. Algorithms supporting this scenario must reconcile several seemingly contradicting factors: training and detection speed, detection reliability, and learning from small data sets. In the system proposed here, we use a two-stage detector. The first region proposal stage is based on a Cascade Classifier while the second classification stage is based either on a Support Vector Machines (SVMs) or Convolutional Neural Networks (CNNs). The proposed configuration ensures both speed and detection reliability. In addition to this, an object tracking and background-foreground separation algorithm is used, supported by the GrabCut algorithm and a sample synthesis procedure, in order to collect rich training data for the detector. Experiments show that the system is effective, useful, and applicable to practical surveillance tasks.https://www.mdpi.com/1424-8220/20/9/2689object detectionfew shot learningSVMCNNcascade classifiervideo surveillance
collection DOAJ
language English
format Article
sources DOAJ
author Artur Wilkowski
Maciej Stefańczyk
Włodzimierz Kasprzak
spellingShingle Artur Wilkowski
Maciej Stefańczyk
Włodzimierz Kasprzak
Training Data Extraction and Object Detection in Surveillance Scenario
Sensors
object detection
few shot learning
SVM
CNN
cascade classifier
video surveillance
author_facet Artur Wilkowski
Maciej Stefańczyk
Włodzimierz Kasprzak
author_sort Artur Wilkowski
title Training Data Extraction and Object Detection in Surveillance Scenario
title_short Training Data Extraction and Object Detection in Surveillance Scenario
title_full Training Data Extraction and Object Detection in Surveillance Scenario
title_fullStr Training Data Extraction and Object Detection in Surveillance Scenario
title_full_unstemmed Training Data Extraction and Object Detection in Surveillance Scenario
title_sort training data extraction and object detection in surveillance scenario
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2020-05-01
description Police and various security services use video analysis for securing public space, mass events, and when investigating criminal activity. Due to a huge amount of data supplied to surveillance systems, some automatic data processing is a necessity. In one typical scenario, an operator marks an object in an image frame and searches for all occurrences of the object in other frames or even image sequences. This problem is hard in general. Algorithms supporting this scenario must reconcile several seemingly contradicting factors: training and detection speed, detection reliability, and learning from small data sets. In the system proposed here, we use a two-stage detector. The first region proposal stage is based on a Cascade Classifier while the second classification stage is based either on a Support Vector Machines (SVMs) or Convolutional Neural Networks (CNNs). The proposed configuration ensures both speed and detection reliability. In addition to this, an object tracking and background-foreground separation algorithm is used, supported by the GrabCut algorithm and a sample synthesis procedure, in order to collect rich training data for the detector. Experiments show that the system is effective, useful, and applicable to practical surveillance tasks.
topic object detection
few shot learning
SVM
CNN
cascade classifier
video surveillance
url https://www.mdpi.com/1424-8220/20/9/2689
work_keys_str_mv AT arturwilkowski trainingdataextractionandobjectdetectioninsurveillancescenario
AT maciejstefanczyk trainingdataextractionandobjectdetectioninsurveillancescenario
AT włodzimierzkasprzak trainingdataextractionandobjectdetectioninsurveillancescenario
_version_ 1724872909798768640