Deep learning-based action detection in untrimmed videos: A survey

E Vahdani, Y Tian - IEEE Transactions on Pattern Analysis and …, 2022 - ieeexplore.ieee.org
Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …

Every pixel counts++: Joint learning of geometry and motion with 3d holistic understanding

C Luo, Z Yang, P Wang, Y Wang, W Xu… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Learning to estimate 3D geometry in a single frame and optical flow from consecutive frames
by watching unlabeled videos via deep convolutional network has made significant progress …

Remote photoplethysmograph signal measurement from facial videos using spatio-temporal networks

Z Yu, X Li, G Zhao - arxiv preprint arxiv:1905.02419, 2019 - arxiv.org
Recent studies demonstrated that the average heart rate (HR) can be measured from facial
videos based on non-contact remote photoplethysmography (rPPG). However for many …

Learning spatio-temporal representation with local and global diffusion

Z Qiu, T Yao, CW Ngo, X Tian… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Abstract Convolutional Neural Networks (CNN) have been regarded as a powerful class of
models for visual recognition problems. Nevertheless, the convolutional filters in these …

Video action understanding

MS Hutchinson, VN Gadepally - IEEE Access, 2021 - ieeexplore.ieee.org
Many believe that the successes of deep learning on image understanding problems can be
replicated in the realm of video understanding. However, due to the scale and temporal …

Videocapsulenet: A simplified network for action detection

K Duarte, Y Rawat, M Shah - Advances in neural …, 2018 - proceedings.neurips.cc
The recent advances in Deep Convolutional Neural Networks (DCNNs) have shown
extremely good results for video human action classification, however, action detection is …

Step: Spatio-temporal progressive learning for video action detection

X Yang, X Yang, MY Liu, F **ao… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this paper, we propose Spatio-TEmporal Progressive (STEP) action detector--a
progressive learning framework for spatio-temporal action detection in videos. Starting from …

Recurrent tubelet proposal and recognition networks for action detection

D Li, Z Qiu, Q Dai, T Yao, T Mei - Proceedings of the …, 2018 - openaccess.thecvf.com
Detecting actions in videos is a challenging task as video is an information intensive media
with complex variations. Existing approaches predominantly generate action proposals for …

Dance with flow: Two-in-one stream action detection

J Zhao, CGM Snoek - … of the ieee/cvf conference on …, 2019 - openaccess.thecvf.com
The goal of this paper is to detect the spatio-temporal extent of an action. The two-stream
detection network based on RGB and flow provides state-of-the-art accuracy at the expense …

Every pixel counts: Unsupervised geometry learning with holistic 3d motion understanding

Z Yang, P Wang, Y Wang, W Xu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Learning to estimate 3D geometry in a single image by watching unlabeled videos via deep
convolutional network has made significant process recently. Current state-of-the-art (SOTA) …