Stmixer: A one-stage sparse action detector

T Wu, M Cao, Z Gao, G Wu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Traditional video action detectors typically adopt the two-stage pipeline, where a person
detector is first employed to yield actor boxes and then 3D RoIAlign is used to extract actor …

A survey on deep learning-based spatio-temporal action detection

P Wang, F Zeng, Y Qian - International Journal of Wavelets …, 2024 - World Scientific
Spatio-temporal action detection (STAD) aims to classify the actions present in a video and
localize them in space and time. It has become a particularly active area of research in …

A semantic and motion-aware spatiotemporal transformer network for action detection

M Korban, P Youngs, ST Acton - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
This paper presents a novel spatiotemporal transformer network that introduces several
original components to detect actions in untrimmed videos. First, the multi-feature selective …

A review work: human action recognition in video surveillance using deep learning techniques

NS Gupta, KR Ramya, R Karnati - Информатика и автоматизация, 2024 - ia.spcras.ru
Despite being extensively used in numerous uses, precise and effective human activity
identification continues to be an interesting research issue in the area of vision for …

TQRFormer: Tubelet query recollection transformer for action detection

X Wang, K Yang, Q Ding, R Wang, J Sun - Image and Vision Computing, 2024 - Elsevier
Spatial and temporal action detection aims to precisely locate actions while predicting their
respective categories. The existing solution, TubeR (Zhao et al., 2022), is designed to …

IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic

C Parikh, R Saluja, CV Jawahar… - … on Robotics and …, 2024 - ieeexplore.ieee.org
Intelligent vehicle systems require a deep understanding of the interplay between road
conditions, surrounding entities, and the ego vehicle's driving behavior for safe and efficient …

Action Progression Networks for Temporal Action Detection in Videos

C Lu, M Mak, R Li, Z Chi, H Fu - IEEE Access, 2024 - ieeexplore.ieee.org
This study introduces an innovative Temporal Action Detection (TAD) model that is
distinguished by its lightweight structure and capability for end-to-end training, delivering …

Game State and Spatio-temporal Action Detection in Soccer using Graph Neural Networks and 3D Convolutional Networks

J Ochin, G Devineau, B Stanciulescu… - arxiv preprint arxiv …, 2025 - arxiv.org
Soccer analytics rely on two data sources: the player positions on the pitch and the
sequences of events they perform. With around 2000 ball events per game, their precise and …

Query matching for spatio-temporal action detection with query-based object detector

S Hori, K Omi, T Tamaki - arxiv preprint arxiv:2409.18408, 2024 - arxiv.org
In this paper, we propose a method that extends the query-based object detection model,
DETR, to spatio-temporal action detection, which requires maintaining temporal consistency …

A Tracking-Based Two-Stage Framework for Spatio-Temporal Action Detection

J Luo, Y Yang, R Liu, L Chen, H Fei, C Hu, R Shi, Y Zou - Electronics, 2024 - mdpi.com
Spatio-temporal action detection (STAD) is a task receiving widespread attention and has
numerous application scenarios, such as video surveillance and smart education. Current …