Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet

For the purpose of improving the service quality, video surveillance systems are widely used to standardize the service process in power supply business halls. If the employers check surveillance video to ensure predefined process of staff behaviours, it will be characterized as time-consuming. In r...

Full description

Bibliographic Details
Main Authors: Tongyao Lin, Li Ouyang, He Wen, Dezhi Xiong, Janusz Smulko
Format: Article
Language:English
Published: VSB-Technical University of Ostrava 2021-01-01
Series:Advances in Electrical and Electronic Engineering
Subjects:
Online Access:http://advances.utc.sk/index.php/AEEE/article/view/3950
id doaj-484871a05cb84021aa3ba6ee4e33221c
record_format Article
spelling doaj-484871a05cb84021aa3ba6ee4e33221c2021-10-11T08:03:08ZengVSB-Technical University of OstravaAdvances in Electrical and Electronic Engineering1336-13761804-31192021-01-01191909910.15598/aeee.v19i1.39501111Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNetTongyao Lin0Li Ouyang1He Wen2Dezhi Xiong3Janusz Smulko4College of Electrical and Information Engineering, Hunan University, Lushan S Road 2, 410007 Changsha, ChinaHunan Province Key Laboratory of Intelligent Electrical Measurement and Application Technology, State Grid Hunan Electric Power Company Power Supply Service Center (Metrology Center), Lushan North Road 388, 410007 Changsha, ChinaCollege of Electrical and Information Engineering, Hunan University, Lushan S Road 2, 410007 Changsha, ChinaHunan Province Key Laboratory of Intelligent Electrical Measurement and Application Technology, State Grid Hunan Electric Power Company Power Supply Service Center (Metrology Center), Lushan North Road 388, 410007 Changsha, ChinaDepartment of Optoelectronics and Electronics Systems, Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Gabriela Narutowicza 11/12, 80-233 Gdansk, PolandFor the purpose of improving the service quality, video surveillance systems are widely used to standardize the service process in power supply business halls. If the employers check surveillance video to ensure predefined process of staff behaviours, it will be characterized as time-consuming. In recent years, great progress has been made in intelligent action recognition using Convolution Neural Networks (CNNs). However, due to the small range of staffs' motion and similar scene information of power supply business halls, the performance of using traditional CNNs to recognize service actions, e.g. bowing, standing and sitting, is general. For improving the recognition rate, this paper proposes a 3D-fused Convolutional Network (ConvNet) for service actions recognition, which focuses on detecting the actions in the typical scene of one staff person and one customer with a well-segmented video clip. The well-segmented video clips are sent as input to the 3D-fused ConvNet for action recognition. The 3D-fused ConvNet consists of two base learners, optical flow base learner and RGB base learner. Both learners use the Convolutional 3D (C3D) architecture. Specifically, the RGB learner can be used to capture the features of small staffs' motion while the optical flow base learner can be viewed as the key part to eliminate the influence of the background, especially in a similar scene. Furthermore, prediction scores of two base learners can be weighted by the softmax function according to the performance of each base learner. Finally, the prediction scores of the two base learners are fused to obtain the prediction result, namely the specific actions of the staffs in the videos. The experiment result shows that the proposed method achieves 92.41% accuracy on the service action dataset of the power supply business hall.http://advances.utc.sk/index.php/AEEE/article/view/39503d convolutionaction recognitionensemblepower supply business hall.
collection DOAJ
language English
format Article
sources DOAJ
author Tongyao Lin
Li Ouyang
He Wen
Dezhi Xiong
Janusz Smulko
spellingShingle Tongyao Lin
Li Ouyang
He Wen
Dezhi Xiong
Janusz Smulko
Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet
Advances in Electrical and Electronic Engineering
3d convolution
action recognition
ensemble
power supply business hall.
author_facet Tongyao Lin
Li Ouyang
He Wen
Dezhi Xiong
Janusz Smulko
author_sort Tongyao Lin
title Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet
title_short Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet
title_full Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet
title_fullStr Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet
title_full_unstemmed Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet
title_sort service action recognition in power supply business hall with 3d-fused convnet
publisher VSB-Technical University of Ostrava
series Advances in Electrical and Electronic Engineering
issn 1336-1376
1804-3119
publishDate 2021-01-01
description For the purpose of improving the service quality, video surveillance systems are widely used to standardize the service process in power supply business halls. If the employers check surveillance video to ensure predefined process of staff behaviours, it will be characterized as time-consuming. In recent years, great progress has been made in intelligent action recognition using Convolution Neural Networks (CNNs). However, due to the small range of staffs' motion and similar scene information of power supply business halls, the performance of using traditional CNNs to recognize service actions, e.g. bowing, standing and sitting, is general. For improving the recognition rate, this paper proposes a 3D-fused Convolutional Network (ConvNet) for service actions recognition, which focuses on detecting the actions in the typical scene of one staff person and one customer with a well-segmented video clip. The well-segmented video clips are sent as input to the 3D-fused ConvNet for action recognition. The 3D-fused ConvNet consists of two base learners, optical flow base learner and RGB base learner. Both learners use the Convolutional 3D (C3D) architecture. Specifically, the RGB learner can be used to capture the features of small staffs' motion while the optical flow base learner can be viewed as the key part to eliminate the influence of the background, especially in a similar scene. Furthermore, prediction scores of two base learners can be weighted by the softmax function according to the performance of each base learner. Finally, the prediction scores of the two base learners are fused to obtain the prediction result, namely the specific actions of the staffs in the videos. The experiment result shows that the proposed method achieves 92.41% accuracy on the service action dataset of the power supply business hall.
topic 3d convolution
action recognition
ensemble
power supply business hall.
url http://advances.utc.sk/index.php/AEEE/article/view/3950
work_keys_str_mv AT tongyaolin serviceactionrecognitioninpowersupplybusinesshallwith3dfusedconvnet
AT liouyang serviceactionrecognitioninpowersupplybusinesshallwith3dfusedconvnet
AT hewen serviceactionrecognitioninpowersupplybusinesshallwith3dfusedconvnet
AT dezhixiong serviceactionrecognitioninpowersupplybusinesshallwith3dfusedconvnet
AT januszsmulko serviceactionrecognitioninpowersupplybusinesshallwith3dfusedconvnet
_version_ 1716828016565288960