A dynamic multi-scale voxel flow network for video prediction

X Hu, Z Huang, A Huang, J Xu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The performance of video prediction has been greatly boosted by advanced deep neural
networks. However, most of the current methods suffer from large model sizes and require …

Overcoming limitations of mixture density networks: A sampling and fitting framework for multimodal future prediction

O Makansi, E Ilg, O Cicek… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Future prediction is a fundamental principle of intelligence that helps plan actions and avoid
possible dangers. As the future is uncertain to a large extent, modeling the uncertainty and …

Exploring spatial-temporal multi-frequency analysis for high-fidelity and temporal-consistency video prediction

B **, Y Hu, Q Tang, J Niu, Z Shi… - Proceedings of the …, 2020 - openaccess.thecvf.com
Video prediction is a pixel-wise dense prediction task to infer future frames based on past
frames. Missing appearance details and motion blur are still two major problems for current …

Future video synthesis with object motion prediction

Y Wu, R Gao, J Park, Q Chen - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
We present an approach to predict future video frames given a sequence of continuous
video frames in the past. Instead of synthesizing images directly, our approach is designed …

Optimizing video prediction via video frame interpolation

Y Wu, Q Wen, Q Chen - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Video prediction is an extrapolation task that predicts future frames given past frames, and
video frame interpolation is an interpolation task that estimates intermediate frames between …

Iso-dream: Isolating and leveraging noncontrollable visual dynamics in world models

M Pan, X Zhu, Y Wang, X Yang - Advances in neural …, 2022 - proceedings.neurips.cc
World models learn the consequences of actions in vision-based interactive systems.
However, in practical scenarios such as autonomous driving, there commonly exists …

Dynamic motion representation for human action recognition

S Asghari-Esfeden, M Sznaier… - Proceedings of the …, 2020 - openaccess.thecvf.com
Despite the advances in Human Activity Recognition, the ability to exploit the dynamics of
human body motion in videos has yet to be achieved. In numerous recent works …

Deep learning for vision-based prediction: A survey

A Rasouli - arxiv preprint arxiv:2007.00095, 2020 - arxiv.org
Vision-based prediction algorithms have a wide range of applications including autonomous
driving, surveillance, human-robot interaction, weather prediction. The objective of this …

Enhanced surveillance video compression with dual reference frames generation

L Zhao, S Wang, S Wang, Y Ye, S Ma… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this paper, we improve the inter coding performance of surveillance videos by
simultaneously investigating the distinct characteristics of background and foreground …

Intermediate fused network with multiple timescales for anomaly detection

W Wang, F Chang, H Mi - Neurocomputing, 2021 - Elsevier
This paper proposes an intermediate fused network with multiple timescales to predict future
video segments for video anomaly detection. Video prediction technique for anomaly …