Deep neural network model for group activity recognition using contextual relationship
In this paper, we present contextual relationship-based learning model using deep neural network for recognizing the activities performed by a group of people in a video sequence. The proposed model comprises of the context learning using a bottom-up approach, learning from individual human actions...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2019-02-01
|
Series: | Engineering Science and Technology, an International Journal |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2215098618306232 |
id |
doaj-96653f61d8d142959bd4e90479e6edf7 |
---|---|
record_format |
Article |
spelling |
doaj-96653f61d8d142959bd4e90479e6edf72020-11-25T00:05:31ZengElsevierEngineering Science and Technology, an International Journal2215-09862019-02-012214754Deep neural network model for group activity recognition using contextual relationshipS.A. Vahora0N.C. Chauhan1Charusat University, Changa, Gujarat, India; Corresponding author.A.D. Patel Institute of Technology, New V.V. Nagar, Gujarat, IndiaIn this paper, we present contextual relationship-based learning model using deep neural network for recognizing the activities performed by a group of people in a video sequence. The proposed model comprises of the context learning using a bottom-up approach, learning from individual human actions to group level activity as well as learning from the scene information. We build deep convolutional neural network model to capture human action-pose feature for a given input video sequence. To capture group level temporal flow changes, aggregated action-pose feature of persons within the context area are fed to deep recurrent neural network, which provides spatio-temporal group descriptor. Together with this, we build a scene level convolutional neural network, to extract scene level feature which improves the performance of group activity recognition. The probabilistic inference model, as an additional layer in deep neural network, added to ensemble the models and provide a unified deep learning framework. Experimental results show the efficiency of the proposed model on standard benchmark collective activity dataset in group activity recognition. We also present the evaluated results by varying different learning parameters, optimizers, especially recurrent neural network models long short-term memory and gated recurrent unit on the benchmark collective activity dataset. Keywords: Group activity recognition, Convolutional neural network, Long short-term memory, Gated recurrent unit, Context learninghttp://www.sciencedirect.com/science/article/pii/S2215098618306232 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
S.A. Vahora N.C. Chauhan |
spellingShingle |
S.A. Vahora N.C. Chauhan Deep neural network model for group activity recognition using contextual relationship Engineering Science and Technology, an International Journal |
author_facet |
S.A. Vahora N.C. Chauhan |
author_sort |
S.A. Vahora |
title |
Deep neural network model for group activity recognition using contextual relationship |
title_short |
Deep neural network model for group activity recognition using contextual relationship |
title_full |
Deep neural network model for group activity recognition using contextual relationship |
title_fullStr |
Deep neural network model for group activity recognition using contextual relationship |
title_full_unstemmed |
Deep neural network model for group activity recognition using contextual relationship |
title_sort |
deep neural network model for group activity recognition using contextual relationship |
publisher |
Elsevier |
series |
Engineering Science and Technology, an International Journal |
issn |
2215-0986 |
publishDate |
2019-02-01 |
description |
In this paper, we present contextual relationship-based learning model using deep neural network for recognizing the activities performed by a group of people in a video sequence. The proposed model comprises of the context learning using a bottom-up approach, learning from individual human actions to group level activity as well as learning from the scene information. We build deep convolutional neural network model to capture human action-pose feature for a given input video sequence. To capture group level temporal flow changes, aggregated action-pose feature of persons within the context area are fed to deep recurrent neural network, which provides spatio-temporal group descriptor. Together with this, we build a scene level convolutional neural network, to extract scene level feature which improves the performance of group activity recognition. The probabilistic inference model, as an additional layer in deep neural network, added to ensemble the models and provide a unified deep learning framework. Experimental results show the efficiency of the proposed model on standard benchmark collective activity dataset in group activity recognition. We also present the evaluated results by varying different learning parameters, optimizers, especially recurrent neural network models long short-term memory and gated recurrent unit on the benchmark collective activity dataset. Keywords: Group activity recognition, Convolutional neural network, Long short-term memory, Gated recurrent unit, Context learning |
url |
http://www.sciencedirect.com/science/article/pii/S2215098618306232 |
work_keys_str_mv |
AT savahora deepneuralnetworkmodelforgroupactivityrecognitionusingcontextualrelationship AT ncchauhan deepneuralnetworkmodelforgroupactivityrecognitionusingcontextualrelationship |
_version_ |
1725424948814544896 |