A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org
The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

Diffusion models for video prediction and infilling

T Höppe, A Mehrjou, S Bauer, D Nielsen… - arxiv preprint arxiv …, 2022 - arxiv.org
Predicting and anticipating future outcomes or reasoning about missing information in a
sequence are critical skills for agents to be able to make intelligent decisions. This requires …

Eidetic 3D LSTM: A model for video prediction and beyond

Y Wang, L Jiang, MH Yang, LJ Li, M Long… - International …, 2018 - openreview.net
Spatiotemporal predictive learning, though long considered to be a promising self-
supervised feature learning method, seldom shows its effectiveness beyond future video …

Real-time online video detection with temporal smoothing transformers

Y Zhao, P Krähenbühl - European Conference on Computer Vision, 2022 - Springer
Streaming video recognition reasons about objects and their actions in every frame of a
video. A good streaming recognition model captures both long-term dynamics and short …

Rolling-unrolling lstms for action anticipation from first-person video

A Furnari, GM Farinella - IEEE transactions on pattern analysis …, 2020 - ieeexplore.ieee.org
In this paper, we tackle the problem of egocentric action anticipation, ie, predicting what
actions the camera wearer will perform in the near future and which objects they will interact …

What would you expect? anticipating egocentric actions with rolling-unrolling lstms and modality attention

A Furnari, GM Farinella - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Egocentric action anticipation consists in understanding which objects the camera wearer
will interact with in the near future and which actions they will perform. We tackle the …

Video action understanding

MS Hutchinson, VN Gadepally - IEEE Access, 2021 - ieeexplore.ieee.org
Many believe that the successes of deep learning on image understanding problems can be
replicated in the realm of video understanding. However, due to the scale and temporal …

When will you do what?-anticipating temporal occurrences of activities

Y Abu Farha, A Richard, J Gall - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Analyzing human actions in videos has gained increased attention recently. While most
works focus on classifying and labeling observed video frames or anticipating the very …

Imitation learning for human pose prediction

B Wang, E Adeli, H Chiu, DA Huang… - Proceedings of the …, 2019 - openaccess.thecvf.com
Modeling and prediction of human motion dynamics has long been a challenging problem in
computer vision, and most existing methods rely on the end-to-end supervised training of …

Anticipating human actions by correlating past with the future with jaccard similarity measures

B Fernando, S Herath - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
We propose a framework for early action recognition and anticipation by correlating past
features with the future using three novel similarity measures called Jaccard vector similarity …