ROAD: The road event awareness dataset for autonomous driving

G Singh, S Akrigg, M Di Maio, V Fontana… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Humans drive in a holistic fashion which entails, in particular, understanding dynamic road
events and their evolution. Injecting these capabilities in autonomous vehicles can thus take …

Dance with flow: Two-in-one stream action detection

J Zhao, CGM Snoek - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com
The goal of this paper is to detect the spatio-temporal extent of an action. The two-stream
detection network based on RGB and flow provides state-of-the-art accuracy at the expense …

A survey on deep learning-based spatio-temporal action detection

P Wang, F Zeng, Y Qian - International Journal of Wavelets …, 2024 - World Scientific
Spatio-temporal action detection (STAD) aims to classify the actions present in a video and
localize them in space and time. It has become a particularly active area of research in …

Spatio-temporal action detection under large motion

G Singh, V Choutas, S Saha, F Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Current methods for spatiotemporal action tube detection often extend a bounding box
proposal at a given key-frame into a 3D temporal cuboid and pool features from nearby …

Uncertainty-aware weakly supervised action detection from untrimmed videos

A Arnab, C Sun, A Nagrani, C Schmid - … Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Despite the recent advances in video classification, progress in spatio-temporal action
recognition has lagged behind. A major contributing factor has been the prohibitive cost of …

TQRFormer: Tubelet query recollection transformer for action detection

X Wang, K Yang, Q Ding, R Wang, J Sun - Image and Vision Computing, 2024 - Elsevier
Spatial and temporal action detection aims to precisely locate actions while predicting their
respective categories. The existing solution, TubeR (Zhao et al., 2022), is designed to …

Exploiting instance-based mixed sampling via auxiliary source domain supervision for domain-adaptive action detection

Y Lu, G Singh, S Saha… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We propose a novel domain adaptive action detection approach and a new adaptation
protocol that leverages the recent advancements in image-level unsupervised domain …

Spatiotemporal Event Graphs for Dynamic Scene Understanding

S Khan - arXiv preprint arXiv:2312.07621, 2023 - arxiv.org
Dynamic scene understanding is the ability of a computer system to interpret and make
sense of the visual information present in a video of a real-world scene. In this thesis, we …

RADNet: A deep neural network model for robust perception in moving autonomous systems

BA Mudassar, S Ko, M Li, P Saha… - arXiv preprint arXiv …, 2022 - arxiv.org
Interactive autonomous applications require robustness of the perception engine to artifacts
in unconstrained videos. In this paper, we examine the effect of camera motion on the task of …

Spatio-temporal instance learning: Action tubes from class supervision

P Mettes, CGM Snoek - arXiv preprint arXiv:1807.02800, 2018 - arxiv.org
The goal of this work is spatio-temporal action localization in videos, using only the
supervision from video-level class labels. The state-of-the-art casts this weakly-supervised …