Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes
Single sensor systems and standard optical—usually RGB CCTV video cameras—fail to provide adequate observations, or the amount of spectral information required to build rich, expressive, discriminative features for object detection and tracking tasks in challenging outdoor and in...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019-02-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/11/4/446 |
id |
doaj-7342d7102397493885a9bc0b0a64a180 |
---|---|
record_format |
Article |
spelling |
doaj-7342d7102397493885a9bc0b0a64a1802020-11-25T00:03:31ZengMDPI AGRemote Sensing2072-42922019-02-0111444610.3390/rs11040446rs11040446Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor ScenesZacharias Kandylakis0Konstantinos Vasili1Konstantinos Karantzalos2Remote Sensing Laboratory, National Technical University of Athens, 15780 Zographos, GreeceRemote Sensing Laboratory, National Technical University of Athens, 15780 Zographos, GreeceRemote Sensing Laboratory, National Technical University of Athens, 15780 Zographos, GreeceSingle sensor systems and standard optical—usually RGB CCTV video cameras—fail to provide adequate observations, or the amount of spectral information required to build rich, expressive, discriminative features for object detection and tracking tasks in challenging outdoor and indoor scenes under various environmental/illumination conditions. Towards this direction, we have designed a multisensor system based on thermal, shortwave infrared, and hyperspectral video sensors and propose a processing pipeline able to perform in real-time object detection tasks despite the huge amount of the concurrently acquired video streams. In particular, in order to avoid the computationally intensive coregistration of the hyperspectral data with other imaging modalities, the initially detected targets are projected through a local coordinate system on the hypercube image plane. Regarding the object detection, a detector-agnostic procedure has been developed, integrating both unsupervised (background subtraction) and supervised (deep learning convolutional neural networks) techniques for validation purposes. The detected and verified targets are extracted through the fusion and data association steps based on temporal spectral signatures of both target and background. The quite promising experimental results in challenging indoor and outdoor scenes indicated the robust and efficient performance of the developed methodology under different conditions like fog, smoke, and illumination changes.https://www.mdpi.com/2072-4292/11/4/446hyperspectralSWIRthermalvideomultisensordetectiontrackingmoving object |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zacharias Kandylakis Konstantinos Vasili Konstantinos Karantzalos |
spellingShingle |
Zacharias Kandylakis Konstantinos Vasili Konstantinos Karantzalos Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes Remote Sensing hyperspectral SWIR thermal video multisensor detection tracking moving object |
author_facet |
Zacharias Kandylakis Konstantinos Vasili Konstantinos Karantzalos |
author_sort |
Zacharias Kandylakis |
title |
Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes |
title_short |
Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes |
title_full |
Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes |
title_fullStr |
Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes |
title_full_unstemmed |
Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes |
title_sort |
fusing multimodal video data for detecting moving objects/targets in challenging indoor and outdoor scenes |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2019-02-01 |
description |
Single sensor systems and standard optical—usually RGB CCTV video cameras—fail to provide adequate observations, or the amount of spectral information required to build rich, expressive, discriminative features for object detection and tracking tasks in challenging outdoor and indoor scenes under various environmental/illumination conditions. Towards this direction, we have designed a multisensor system based on thermal, shortwave infrared, and hyperspectral video sensors and propose a processing pipeline able to perform in real-time object detection tasks despite the huge amount of the concurrently acquired video streams. In particular, in order to avoid the computationally intensive coregistration of the hyperspectral data with other imaging modalities, the initially detected targets are projected through a local coordinate system on the hypercube image plane. Regarding the object detection, a detector-agnostic procedure has been developed, integrating both unsupervised (background subtraction) and supervised (deep learning convolutional neural networks) techniques for validation purposes. The detected and verified targets are extracted through the fusion and data association steps based on temporal spectral signatures of both target and background. The quite promising experimental results in challenging indoor and outdoor scenes indicated the robust and efficient performance of the developed methodology under different conditions like fog, smoke, and illumination changes. |
topic |
hyperspectral SWIR thermal video multisensor detection tracking moving object |
url |
https://www.mdpi.com/2072-4292/11/4/446 |
work_keys_str_mv |
AT zachariaskandylakis fusingmultimodalvideodatafordetectingmovingobjectstargetsinchallengingindoorandoutdoorscenes AT konstantinosvasili fusingmultimodalvideodatafordetectingmovingobjectstargetsinchallengingindoorandoutdoorscenes AT konstantinoskarantzalos fusingmultimodalvideodatafordetectingmovingobjectstargetsinchallengingindoorandoutdoorscenes |
_version_ |
1725433461768978432 |