Temporal Learning in Video Data Using Deep Learning and Gaussian Processes

This paper presents an approach for data-driven modeling of hidden, stationary temporal dynamics in sequential images or videos using deep learning and Bayesian non-parametric techniques. In particular, a deep Convolutional Neural Network (CNN) is used to extract spatial features in an unsupervised...

Full description

Bibliographic Details
Main Authors: Devesh K. Jha, Abhishek Srivastav, Asok Ray
Format: Article
Language:English
Published: The Prognostics and Health Management Society 2016-12-01
Series:International Journal of Prognostics and Health Management
Subjects:
Online Access:https://papers.phmsociety.org/index.php/ijphm/article/view/2460
id doaj-884f834b048548259a3bd06c619f0cec
record_format Article
spelling doaj-884f834b048548259a3bd06c619f0cec2021-07-02T20:43:04ZengThe Prognostics and Health Management SocietyInternational Journal of Prognostics and Health Management2153-26482153-26482016-12-0174doi:10.36001/ijphm.2016.v7i4.2460Temporal Learning in Video Data Using Deep Learning and Gaussian ProcessesDevesh K. Jha0Abhishek Srivastav1Asok Ray2Mechanical and Nuclear Engineering Department, Pennsylvania State University, PA, USAMachine Learning Lab, GE Global Research, San Ramon, CA, USAMechanical and Nuclear Engineering Department, Pennsylvania State University, PA, USAThis paper presents an approach for data-driven modeling of hidden, stationary temporal dynamics in sequential images or videos using deep learning and Bayesian non-parametric techniques. In particular, a deep Convolutional Neural Network (CNN) is used to extract spatial features in an unsupervised fashion from individual images and then, a Gaussian process is used to model the temporal dynamics of the spatial features extracted by the deep CNN. By decomposing the spatial and temporal components and utilizing the strengths of deep learning and Gaussian processes for the respective sub-problems, we are able to construct a model that is able to capture complex spatio-temporal phenomena while using relatively small number of free parameters. The proposed approach is tested on high-speed grey-scale video data obtained of combustion flames in a swirl-stabilized combustor, where certain protocols are used to induce instability in combustion process. The proposed approach is then used to detect and predict the transition of the combustion process from stable to unstable regime. It is demonstrated that the proposed approach is able to detect unstable flame conditions using very few frames from high-speed video. This is useful as early detection of unstable combustion can lead to better control strategies to mitigate instability. Results from the proposed approach are compared and contrasted with several baselines and recent work in this area. The performance of the proposed approach is found to be significantly better in terms of detection accuracy, model complexity and lead-time to detection.https://papers.phmsociety.org/index.php/ijphm/article/view/2460gaussian processesdeep learningcombustion instability
collection DOAJ
language English
format Article
sources DOAJ
author Devesh K. Jha
Abhishek Srivastav
Asok Ray
spellingShingle Devesh K. Jha
Abhishek Srivastav
Asok Ray
Temporal Learning in Video Data Using Deep Learning and Gaussian Processes
International Journal of Prognostics and Health Management
gaussian processes
deep learning
combustion instability
author_facet Devesh K. Jha
Abhishek Srivastav
Asok Ray
author_sort Devesh K. Jha
title Temporal Learning in Video Data Using Deep Learning and Gaussian Processes
title_short Temporal Learning in Video Data Using Deep Learning and Gaussian Processes
title_full Temporal Learning in Video Data Using Deep Learning and Gaussian Processes
title_fullStr Temporal Learning in Video Data Using Deep Learning and Gaussian Processes
title_full_unstemmed Temporal Learning in Video Data Using Deep Learning and Gaussian Processes
title_sort temporal learning in video data using deep learning and gaussian processes
publisher The Prognostics and Health Management Society
series International Journal of Prognostics and Health Management
issn 2153-2648
2153-2648
publishDate 2016-12-01
description This paper presents an approach for data-driven modeling of hidden, stationary temporal dynamics in sequential images or videos using deep learning and Bayesian non-parametric techniques. In particular, a deep Convolutional Neural Network (CNN) is used to extract spatial features in an unsupervised fashion from individual images and then, a Gaussian process is used to model the temporal dynamics of the spatial features extracted by the deep CNN. By decomposing the spatial and temporal components and utilizing the strengths of deep learning and Gaussian processes for the respective sub-problems, we are able to construct a model that is able to capture complex spatio-temporal phenomena while using relatively small number of free parameters. The proposed approach is tested on high-speed grey-scale video data obtained of combustion flames in a swirl-stabilized combustor, where certain protocols are used to induce instability in combustion process. The proposed approach is then used to detect and predict the transition of the combustion process from stable to unstable regime. It is demonstrated that the proposed approach is able to detect unstable flame conditions using very few frames from high-speed video. This is useful as early detection of unstable combustion can lead to better control strategies to mitigate instability. Results from the proposed approach are compared and contrasted with several baselines and recent work in this area. The performance of the proposed approach is found to be significantly better in terms of detection accuracy, model complexity and lead-time to detection.
topic gaussian processes
deep learning
combustion instability
url https://papers.phmsociety.org/index.php/ijphm/article/view/2460
work_keys_str_mv AT deveshkjha temporallearninginvideodatausingdeeplearningandgaussianprocesses
AT abhisheksrivastav temporallearninginvideodatausingdeeplearningandgaussianprocesses
AT asokray temporallearninginvideodatausingdeeplearningandgaussianprocesses
_version_ 1721322731291541504