Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Diffusion action segmentation

D Liu, Q Li, AD Dinh, T Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Temporal action segmentation is crucial for understanding long-form videos. Previous works
on this task commonly adopt an iterative refinement paradigm by using multi-stage models …

Continuous human action recognition for human-machine interaction: a review

H Gammulle, D Ahmedt-Aristizabal, S Denman… - ACM Computing …, 2023 - dl.acm.org
With advances in data-driven machine learning research, a wide variety of prediction
models have been proposed to capture spatio-temporal features for the analysis of video …

Boundary-aware cascade networks for temporal action segmentation

Z Wang, Z Gao, L Wang, Z Li, G Wu - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Identifying human action segments in an untrimmed video is still challenging due to
boundary ambiguity and over-segmentation issues. To address these problems, we present …

APNet: Adversarial learning assistance and perceived importance fusion network for all-day RGB-T salient object detection

W Zhou, Y Zhu, J Lei, J Wan… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
To improve the performance of salient object detection (SOD) in scenes with low-light
conditions (e.g., nighttime) and cluttered backgrounds, infrared thermal images are used to …

Global2Local: Efficient structure search for video action segmentation

SH Gao, Q Han, ZY Li, P Peng… - Proceedings of the …, 2021 - openaccess.thecvf.com
Temporal receptive fields of models play an important role in action segmentation. Large
receptive fields facilitate the long-term relations among video clips while small receptive …

RF-Next: Efficient receptive field search for convolutional neural networks

S Gao, ZY Li, Q Han, MM Cheng… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Temporal/spatial receptive fields of models play an important role in sequential/spatial tasks.
Large receptive fields facilitate long-term relations, while small receptive fields help to …

Predicting the future: A jointly learnt model for action anticipation

H Gammulle, S Denman… - Proceedings of the …, 2019 - openaccess.thecvf.com
Inspired by human neurological structures for action anticipation, we present an action
anticipation model that enables the prediction of plausible future actions by forecasting both …

CmSalGAN: RGB-D salient object detection with cross-view generative adversarial networks

B Jiang, Z Zhou, X Wang, J Tang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Image salient object detection (SOD) is an active research topic in computer vision and
multimedia area. Fusing complementary information of RGB and depth has been …

TMMF: Temporal multi-modal fusion for single-stage continuous gesture recognition

H Gammulle, S Denman, S Sridharan… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Gesture recognition is a much studied research area which has myriad real-world
applications including robotics and human-machine interaction. Current gesture recognition …