Robust Suspicious Action Recognition Approach Using Pose Descriptor
In the current era of technological development, human actions can be recorded in public places like airports, shopping malls, and educational institutes, etc., to monitor suspicious activities like terrorism, fighting, theft, and vandalism. Surveillance videos contain adequate visual and motion inf...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2021-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2021/2449603 |
id |
doaj-632bf2b5fec14d8badf10e1cfa98aaa7 |
---|---|
record_format |
Article |
spelling |
doaj-632bf2b5fec14d8badf10e1cfa98aaa72021-08-23T01:33:19ZengHindawi LimitedMathematical Problems in Engineering1563-51472021-01-01202110.1155/2021/2449603Robust Suspicious Action Recognition Approach Using Pose DescriptorWaqas Ahmed0Muhammad Haroon Yousaf1Amanullah Yasin2Department of Telecommunication EngineeringDepartment of Computer EngineeringSwarm Robotic Lab-National Centre for Robotics and Automation (NCRA)In the current era of technological development, human actions can be recorded in public places like airports, shopping malls, and educational institutes, etc., to monitor suspicious activities like terrorism, fighting, theft, and vandalism. Surveillance videos contain adequate visual and motion information for events that occur within a camera’s view. Our study focuses on the concept that actions are a sequence of moving body parts. In this paper, a new descriptor is proposed that formulates human poses and tracks the relative motion of human body parts along with the video frames, and extracts the position and orientation of body parts. We used Part Affinity Fields (PAFs) to acquire the associated body parts of the people present in the frame. The architecture jointly learns the body parts and their associations with other body parts in a sequential process, such that a pose can be formulated step by step. We can obtain the complete pose with a limited number of points as it moves along the video and we can conclude with a defined action. Later, these feature points are classified with a Support Vector Machine (SVM). The proposed work was evaluated on the benchmark datasets, namely, UT-interaction, UCF11, CASIA, and HCA datasets. Our proposed scheme was evaluated on the aforementioned datasets, which contained criminal/suspicious actions, such as kick, punch, push, gun shooting, and sword-fighting, and achieved an accuracy of 96.4% on UT-interaction, 99% on UCF11, 98% on CASIA and 88.72% on HCA.http://dx.doi.org/10.1155/2021/2449603 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Waqas Ahmed Muhammad Haroon Yousaf Amanullah Yasin |
spellingShingle |
Waqas Ahmed Muhammad Haroon Yousaf Amanullah Yasin Robust Suspicious Action Recognition Approach Using Pose Descriptor Mathematical Problems in Engineering |
author_facet |
Waqas Ahmed Muhammad Haroon Yousaf Amanullah Yasin |
author_sort |
Waqas Ahmed |
title |
Robust Suspicious Action Recognition Approach Using Pose Descriptor |
title_short |
Robust Suspicious Action Recognition Approach Using Pose Descriptor |
title_full |
Robust Suspicious Action Recognition Approach Using Pose Descriptor |
title_fullStr |
Robust Suspicious Action Recognition Approach Using Pose Descriptor |
title_full_unstemmed |
Robust Suspicious Action Recognition Approach Using Pose Descriptor |
title_sort |
robust suspicious action recognition approach using pose descriptor |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1563-5147 |
publishDate |
2021-01-01 |
description |
In the current era of technological development, human actions can be recorded in public places like airports, shopping malls, and educational institutes, etc., to monitor suspicious activities like terrorism, fighting, theft, and vandalism. Surveillance videos contain adequate visual and motion information for events that occur within a camera’s view. Our study focuses on the concept that actions are a sequence of moving body parts. In this paper, a new descriptor is proposed that formulates human poses and tracks the relative motion of human body parts along with the video frames, and extracts the position and orientation of body parts. We used Part Affinity Fields (PAFs) to acquire the associated body parts of the people present in the frame. The architecture jointly learns the body parts and their associations with other body parts in a sequential process, such that a pose can be formulated step by step. We can obtain the complete pose with a limited number of points as it moves along the video and we can conclude with a defined action. Later, these feature points are classified with a Support Vector Machine (SVM). The proposed work was evaluated on the benchmark datasets, namely, UT-interaction, UCF11, CASIA, and HCA datasets. Our proposed scheme was evaluated on the aforementioned datasets, which contained criminal/suspicious actions, such as kick, punch, push, gun shooting, and sword-fighting, and achieved an accuracy of 96.4% on UT-interaction, 99% on UCF11, 98% on CASIA and 88.72% on HCA. |
url |
http://dx.doi.org/10.1155/2021/2449603 |
work_keys_str_mv |
AT waqasahmed robustsuspiciousactionrecognitionapproachusingposedescriptor AT muhammadharoonyousaf robustsuspiciousactionrecognitionapproachusingposedescriptor AT amanullahyasin robustsuspiciousactionrecognitionapproachusingposedescriptor |
_version_ |
1721198867586744320 |