Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Vision-based human activity recognition: a survey

DR Beddiar, B Nini, M Sabokrou, A Hadid - Multimedia Tools and …, 2020 - Springer
Human activity recognition (HAR) systems attempt to automatically identify and analyze
human activities using acquired information from various types of sensors. Although several …

Asformer: Transformer for action segmentation

F Yi, H Wen, T Jiang - arxiv preprint arxiv:2110.08568, 2021 - arxiv.org
Algorithms for the action segmentation task typically use temporal models to predict what
action is occurring at each frame for a minute-long daily activity. Recent studies have shown …

Ms-tcn: Multi-stage temporal convolutional network for action segmentation

YA Farha, J Gall - Proceedings of the IEEE/CVF conference …, 2019 - openaccess.thecvf.com
Temporally locating and classifying action segments in long untrimmed videos is of
particular interest to many applications like surveillance and robotics. While traditional …

Unified fully and timestamp supervised temporal action segmentation via sequence to sequence translation

N Behrmann, SA Golestaneh, Z Kolter, J Gall… - European conference on …, 2022 - Springer
This paper introduces a unified framework for video action segmentation via sequence to
sequence (seq2seq) translation in a fully and timestamp supervised setup. In contrast to …

Temporal convolutional networks for action segmentation and detection

C Lea, MD Flynn, R Vidal, A Reiter… - proceedings of the …, 2017 - openaccess.thecvf.com
The ability to identify and temporally segment fine-grained human actions throughout a
video is crucial for robotics, surveillance, education, and beyond. Typical approaches …

Towards automatic learning of procedures from web instructional videos

L Zhou, C Xu, J Corso - Proceedings of the AAAI conference on artificial …, 2018 - ojs.aaai.org
The potential for agents, whether embodied or software, to learn by observing other agents
performing procedures involving objects and actions is rich. Current research on automatic …

Fact: Frame-action cross-attention temporal modeling for efficient action segmentation

Z Lu, E Elhamifar - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We study supervised action segmentation whose goal is to predict framewise action labels
of a video. To capture temporal dependencies over long horizons prior works either improve …

Alleviating over-segmentation errors by detecting action boundaries

Y Ishikawa, S Kasai, Y Aoki… - Proceedings of the …, 2021 - openaccess.thecvf.com
We propose an effective framework for the temporal action segmentation task, namely an
Action Segment Refinement Framework (ASRF). Our model architecture consists of a long …

How much temporal long-term context is needed for action segmentation?

E Bahrami, G Francesca, J Gall - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Modeling long-term context in videos is crucial for many fine-grained tasks including
temporal action segmentation. An interesting question that is still open is how much long …