Complete Video-Level Representations for Action Recognition

Complete Video-Level Representations for Action Recognition

In most of the existing work for activity recognition, 3D ConvNets show promising performance for learning spatiotemporal features of videos. However, most methods sample fixed-length frames from the original video, which are cropped to a fixed size and fed into the model for training. In this manne...

Full description

Bibliographic Details
Main Authors:	Min Li, Ruwen Bai, Bo Meng, Junxing Ren, Miao Jiang, Yang Yang, Linghan Li, Hong Du
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	3D ConvNets activity recognition video-level feature representation
Online Access:	https://ieeexplore.ieee.org/document/9353486/

Similar Items

Dynamic Sign Language Recognition Based on Video Sequence With BLSTM-3D Residual Networks
by: Yanqiu Liao, et al.
Published: (2019-01-01)

Spatially and Temporally Structured Global to Local Aggregation of Dynamic Depth Information for Action Recognition
by: Yonghong Hou, et al.
Published: (2018-01-01)

A Novel Violent Video Detection Scheme Based on Modified 3D Convolutional Neural Networks
by: Wei Song, et al.
Published: (2019-01-01)

ALBERTC-CNN Based Aspect Level Sentiment Analysis
by: Xingxin Ye, et al.
Published: (2021-01-01)

Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation
by: Le Wang, et al.
Published: (2018-05-01)

Deep Full-Body HPE for Activity Recognition from RGB Frames Only
by: Sameh Neili Boualia, et al.
Published: (2021-01-01)

Robust ConvNet Landmark-Based Visual Place Recognition by Optimizing Landmark Matching
by: Yaguang Kong, et al.
Published: (2019-01-01)

The Image Game: Exploit Kit Detection Based on Recursive Convolutional Neural Networks
by: Suyeon Yoo, et al.
Published: (2020-01-01)

Unsupervised Multi-Scale-Stage Content-Aware Homography Estimation
by: Hou, B., et al.
Published: (2023)

Self-Supervised Learning to Detect Key Frames in Videos
by: Xiang Yan, et al.
Published: (2020-12-01)

Fault Diagnosis Based on Space Mapping and Deformable Convolution Networks
by: Yunji Zhao, et al.
Published: (2020-01-01)

Fall Detection in Videos With Trajectory-Weighted Deep-Convolutional Rank-Pooling Descriptor
by: Zhimeng Zhang, et al.
Published: (2019-01-01)

Face Recognition Based on CSGF(2D)<sup>2</sup>PCANet
by: Jun Kong, et al.
Published: (2018-01-01)

Vehicle Speed Estimation Based on 3D ConvNets and Non-Local Blocks
by: Huanan Dong, et al.
Published: (2019-05-01)

Feature Fusion-Based Multi-Task ConvNet for Simultaneous Optical Performance Monitoring and Bit-Rate/Modulation Format Identification
by: Xiaojie Fan, et al.
Published: (2019-01-01)

Spatio-Temporal Unity Networking for Video Anomaly Detection
by: Yuanyuan Li, et al.
Published: (2019-01-01)

Comparison Between Frame-Constrained Fix-Pixel-Value and Frame-Free Spiking-Dynamic-Pixel ConvNets for Visual Processing
by: Clément eFarabet, et al.
Published: (2012-04-01)

Semisupervised Center Loss for Remote Sensing Image Scene Classification
by: Jun Zhang, et al.
Published: (2020-01-01)

Deformable ConvNet with Aspect Ratio Constrained NMS for Object Detection in Remote Sensing Imagery
by: Zhaozhuo Xu, et al.
Published: (2017-12-01)

Blended Multi-Modal Deep ConvNet Features for Diabetic Retinopathy Severity Prediction
by: Jyostna Devi Bodapati, et al.
Published: (2020-05-01)

AIKYATAN: mapping distal regulatory elements using convolutional learning on GPU
by: Chih-Hao Fang, et al.
Published: (2019-10-01)

Local Attention Sequence Model for Video Object Detection
by: Zhenhui Li, et al.
Published: (2021-05-01)

Localisation Absolue par Mono-caméra d'un Véhicule en Milieu Urbain via l'utilisation de Street View
by: Yu, Li
Published: (2018)

Video Object Detection With Two-Path Convolutional LSTM Pyramid
by: Chen Zhang, et al.
Published: (2020-01-01)

Temporal and Fine-Grained Pedestrian Action Recognition on Driving Recorder Database
by: Hirokatsu Kataoka, et al.
Published: (2018-02-01)

Dynamic gesture recognition based on feature fusion network and variant ConvLSTM
by: Yuqing Peng, et al.
Published: (2020-09-01)

High-Level Video Event Modeling, Recognition, and Reasoning via Petri Net
by: Zhijiao Xiao, et al.
Published: (2019-01-01)

Exploiting textures for better action recognition in low-quality videos
by: Saimunur Rahman, et al.
Published: (2017-11-01)

Automated Video Behavior Recognition of Pigs Using Two-Stream Convolutional Networks
by: Kaifeng Zhang, et al.
Published: (2020-02-01)

PyConvU-Net: a lightweight and multiscale network for biomedical image segmentation
by: Changyong Li, et al.
Published: (2021-01-01)

Video Object Detection Using Event-Aware Convolutional Lstm and Object Relation Networks
by: Chen Zhang, et al.
Published: (2021-08-01)

The study of security application of LOGO recognition technology in sports video
by: Zhi Li
Published: (2019-02-01)

Murine Motion Behavior Recognition Based on DeepLabCut and Convolutional Long Short-Term Memory Network
by: Liu, R., et al.
Published: (2022)

Bidirectional ConvLSTMXNet for Brain Tumor Segmentation of MR Images
by: M. Ravikumar*, et al.
Published: (2021-01-01)

Fusion of Video and Inertial Sensing for Deep Learning–Based Human Action Recognition
by: Haoran Wei, et al.
Published: (2019-08-01)

Face Recognition Attendance System Based on Real-Time Video Processing
by: Hao Yang, et al.
Published: (2020-01-01)

A Deep Learning Approach To Coarse Robot Localization
by: Bettaieb, Luc Alexandre
Published: (2017)

Learning Attention-Enhanced Spatiotemporal Representation for Action Recognition
by: Zhensheng Shi, et al.
Published: (2020-01-01)

Im2Vid: Future Video Prediction for Static Image Action Recognition
by: AlBahar, Badour A Sh A.
Published: (2018)

Temporal Memory Network Towards Real-Time Video Understanding
by: Ziming Liu, et al.
Published: (2020-01-01)