Deep neural network model for group activity recognition using contextual relationship

In this paper, we present contextual relationship-based learning model using deep neural network for recognizing the activities performed by a group of people in a video sequence. The proposed model comprises of the context learning using a bottom-up approach, learning from individual human actions...

Full description

Bibliographic Details
Main Authors: S.A. Vahora, N.C. Chauhan
Format: Article
Language:English
Published: Elsevier 2019-02-01
Series:Engineering Science and Technology, an International Journal
Online Access:http://www.sciencedirect.com/science/article/pii/S2215098618306232
id doaj-96653f61d8d142959bd4e90479e6edf7
record_format Article
spelling doaj-96653f61d8d142959bd4e90479e6edf72020-11-25T00:05:31ZengElsevierEngineering Science and Technology, an International Journal2215-09862019-02-012214754Deep neural network model for group activity recognition using contextual relationshipS.A. Vahora0N.C. Chauhan1Charusat University, Changa, Gujarat, India; Corresponding author.A.D. Patel Institute of Technology, New V.V. Nagar, Gujarat, IndiaIn this paper, we present contextual relationship-based learning model using deep neural network for recognizing the activities performed by a group of people in a video sequence. The proposed model comprises of the context learning using a bottom-up approach, learning from individual human actions to group level activity as well as learning from the scene information. We build deep convolutional neural network model to capture human action-pose feature for a given input video sequence. To capture group level temporal flow changes, aggregated action-pose feature of persons within the context area are fed to deep recurrent neural network, which provides spatio-temporal group descriptor. Together with this, we build a scene level convolutional neural network, to extract scene level feature which improves the performance of group activity recognition. The probabilistic inference model, as an additional layer in deep neural network, added to ensemble the models and provide a unified deep learning framework. Experimental results show the efficiency of the proposed model on standard benchmark collective activity dataset in group activity recognition. We also present the evaluated results by varying different learning parameters, optimizers, especially recurrent neural network models long short-term memory and gated recurrent unit on the benchmark collective activity dataset. Keywords: Group activity recognition, Convolutional neural network, Long short-term memory, Gated recurrent unit, Context learninghttp://www.sciencedirect.com/science/article/pii/S2215098618306232
collection DOAJ
language English
format Article
sources DOAJ
author S.A. Vahora
N.C. Chauhan
spellingShingle S.A. Vahora
N.C. Chauhan
Deep neural network model for group activity recognition using contextual relationship
Engineering Science and Technology, an International Journal
author_facet S.A. Vahora
N.C. Chauhan
author_sort S.A. Vahora
title Deep neural network model for group activity recognition using contextual relationship
title_short Deep neural network model for group activity recognition using contextual relationship
title_full Deep neural network model for group activity recognition using contextual relationship
title_fullStr Deep neural network model for group activity recognition using contextual relationship
title_full_unstemmed Deep neural network model for group activity recognition using contextual relationship
title_sort deep neural network model for group activity recognition using contextual relationship
publisher Elsevier
series Engineering Science and Technology, an International Journal
issn 2215-0986
publishDate 2019-02-01
description In this paper, we present contextual relationship-based learning model using deep neural network for recognizing the activities performed by a group of people in a video sequence. The proposed model comprises of the context learning using a bottom-up approach, learning from individual human actions to group level activity as well as learning from the scene information. We build deep convolutional neural network model to capture human action-pose feature for a given input video sequence. To capture group level temporal flow changes, aggregated action-pose feature of persons within the context area are fed to deep recurrent neural network, which provides spatio-temporal group descriptor. Together with this, we build a scene level convolutional neural network, to extract scene level feature which improves the performance of group activity recognition. The probabilistic inference model, as an additional layer in deep neural network, added to ensemble the models and provide a unified deep learning framework. Experimental results show the efficiency of the proposed model on standard benchmark collective activity dataset in group activity recognition. We also present the evaluated results by varying different learning parameters, optimizers, especially recurrent neural network models long short-term memory and gated recurrent unit on the benchmark collective activity dataset. Keywords: Group activity recognition, Convolutional neural network, Long short-term memory, Gated recurrent unit, Context learning
url http://www.sciencedirect.com/science/article/pii/S2215098618306232
work_keys_str_mv AT savahora deepneuralnetworkmodelforgroupactivityrecognitionusingcontextualrelationship
AT ncchauhan deepneuralnetworkmodelforgroupactivityrecognitionusingcontextualrelationship
_version_ 1725424948814544896