Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes

Single sensor systems and standard optical—usually RGB CCTV video cameras—fail to provide adequate observations, or the amount of spectral information required to build rich, expressive, discriminative features for object detection and tracking tasks in challenging outdoor and in...

Full description

Bibliographic Details
Main Authors: Zacharias Kandylakis, Konstantinos Vasili, Konstantinos Karantzalos
Format: Article
Language:English
Published: MDPI AG 2019-02-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/11/4/446
id doaj-7342d7102397493885a9bc0b0a64a180
record_format Article
spelling doaj-7342d7102397493885a9bc0b0a64a1802020-11-25T00:03:31ZengMDPI AGRemote Sensing2072-42922019-02-0111444610.3390/rs11040446rs11040446Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor ScenesZacharias Kandylakis0Konstantinos Vasili1Konstantinos Karantzalos2Remote Sensing Laboratory, National Technical University of Athens, 15780 Zographos, GreeceRemote Sensing Laboratory, National Technical University of Athens, 15780 Zographos, GreeceRemote Sensing Laboratory, National Technical University of Athens, 15780 Zographos, GreeceSingle sensor systems and standard optical—usually RGB CCTV video cameras—fail to provide adequate observations, or the amount of spectral information required to build rich, expressive, discriminative features for object detection and tracking tasks in challenging outdoor and indoor scenes under various environmental/illumination conditions. Towards this direction, we have designed a multisensor system based on thermal, shortwave infrared, and hyperspectral video sensors and propose a processing pipeline able to perform in real-time object detection tasks despite the huge amount of the concurrently acquired video streams. In particular, in order to avoid the computationally intensive coregistration of the hyperspectral data with other imaging modalities, the initially detected targets are projected through a local coordinate system on the hypercube image plane. Regarding the object detection, a detector-agnostic procedure has been developed, integrating both unsupervised (background subtraction) and supervised (deep learning convolutional neural networks) techniques for validation purposes. The detected and verified targets are extracted through the fusion and data association steps based on temporal spectral signatures of both target and background. The quite promising experimental results in challenging indoor and outdoor scenes indicated the robust and efficient performance of the developed methodology under different conditions like fog, smoke, and illumination changes.https://www.mdpi.com/2072-4292/11/4/446hyperspectralSWIRthermalvideomultisensordetectiontrackingmoving object
collection DOAJ
language English
format Article
sources DOAJ
author Zacharias Kandylakis
Konstantinos Vasili
Konstantinos Karantzalos
spellingShingle Zacharias Kandylakis
Konstantinos Vasili
Konstantinos Karantzalos
Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes
Remote Sensing
hyperspectral
SWIR
thermal
video
multisensor
detection
tracking
moving object
author_facet Zacharias Kandylakis
Konstantinos Vasili
Konstantinos Karantzalos
author_sort Zacharias Kandylakis
title Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes
title_short Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes
title_full Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes
title_fullStr Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes
title_full_unstemmed Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes
title_sort fusing multimodal video data for detecting moving objects/targets in challenging indoor and outdoor scenes
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2019-02-01
description Single sensor systems and standard optical—usually RGB CCTV video cameras—fail to provide adequate observations, or the amount of spectral information required to build rich, expressive, discriminative features for object detection and tracking tasks in challenging outdoor and indoor scenes under various environmental/illumination conditions. Towards this direction, we have designed a multisensor system based on thermal, shortwave infrared, and hyperspectral video sensors and propose a processing pipeline able to perform in real-time object detection tasks despite the huge amount of the concurrently acquired video streams. In particular, in order to avoid the computationally intensive coregistration of the hyperspectral data with other imaging modalities, the initially detected targets are projected through a local coordinate system on the hypercube image plane. Regarding the object detection, a detector-agnostic procedure has been developed, integrating both unsupervised (background subtraction) and supervised (deep learning convolutional neural networks) techniques for validation purposes. The detected and verified targets are extracted through the fusion and data association steps based on temporal spectral signatures of both target and background. The quite promising experimental results in challenging indoor and outdoor scenes indicated the robust and efficient performance of the developed methodology under different conditions like fog, smoke, and illumination changes.
topic hyperspectral
SWIR
thermal
video
multisensor
detection
tracking
moving object
url https://www.mdpi.com/2072-4292/11/4/446
work_keys_str_mv AT zachariaskandylakis fusingmultimodalvideodatafordetectingmovingobjectstargetsinchallengingindoorandoutdoorscenes
AT konstantinosvasili fusingmultimodalvideodatafordetectingmovingobjectstargetsinchallengingindoorandoutdoorscenes
AT konstantinoskarantzalos fusingmultimodalvideodatafordetectingmovingobjectstargetsinchallengingindoorandoutdoorscenes
_version_ 1725433461768978432