A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org
The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

Anticipative video transformer

R Girdhar, K Grauman - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Abstract We propose Anticipative Video Transformer (AVT), an end-to-end attention-based
video modeling architecture that attends to the previously observed video in order to …

Overcoming limitations of mixture density networks: A sampling and fitting framework for multimodal future prediction

O Makansi, E Ilg, O Cicek… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Future prediction is a fundamental principle of intelligence that helps plan actions and avoid
possible dangers. As the future is uncertain to a large extent, modeling the uncertainty and …

A stochastic conditioning scheme for diverse human motion prediction

S Aliakbarian, FS Saleh, M Salzmann… - Proceedings of the …, 2020 - openaccess.thecvf.com
Human motion prediction, the task of predicting future 3D human poses given a sequence of
observed ones, has been mostly treated as a deterministic problem. However, human …

Human-Human Interaction Recognition using Mask R-CNN and Multi-class SVM

MH Azhar, A Jalal - … on Emerging Trends in Electrical, Control …, 2024 - ieeexplore.ieee.org
Computer vision systems face significant challenges in human-human interaction
recognition in dynamic environments, particularly with cluttered backgrounds. In this paper …

Diverse and admissible trajectory forecasting through multimodal context understanding

SH Park, G Lee, J Seo, M Bhat, M Kang… - Computer Vision–ECCV …, 2020 - Springer
Multi-agent trajectory forecasting in autonomous driving requires an agent to accurately
anticipate the behaviors of the surrounding vehicles and pedestrians, for safe and reliable …

Anticipating human actions by correlating past with the future with jaccard similarity measures

B Fernando, S Herath - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
We propose a framework for early action recognition and anticipation by correlating past
features with the future using three novel similarity measures called Jaccard vector similarity …

Learning to anticipate egocentric actions by imagination

Y Wu, L Zhu, X Wang, Y Yang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Anticipating actions before they are executed is crucial for a wide range of practical
applications, including autonomous driving and robotics. In this paper, we study the …

WHITE STAG model: Wise human interaction tracking and estimation (WHITE) using spatio-temporal and angular-geometric (STAG) descriptors

M Mahmood, A Jalal, K Kim - Multimedia tools and applications, 2020 - Springer
To understand human to human dealing accurately, human interaction recognition (HIR)
systems require robust feature extraction and selection methods based on vision sensors. In …

Latency matters: Real-time action forecasting transformer

H Girase, N Agarwal, C Choi… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present RAFTformer, a real-time action forecasting transformer for latency aware real-
world action forecasting applications. RAFTformer is a two-stage fully transformer based …