Mcvd-masked conditional video diffusion for prediction, generation, and interpolation

V Voleti, A Jolicoeur-Martineau… - Advances in neural …, 2022 - proceedings.neurips.cc
Video prediction is a challenging task. The quality of video frames from current state-of-the-
art (SOTA) generative models tends to be poor and generalization beyond the training data …

Neural video compression with diverse contexts

J Li, B Li, Y Lu - Proceedings of the IEEE/CVF Conference …, 2023 - openaccess.thecvf.com
For any video codecs, the coding efficiency highly relies on whether the current signal to be
encoded can find the relevant contexts from the previous reconstructed signals. Traditional …

Diffusion probabilistic modeling for video generation

R Yang, P Srivastava, S Mandt - Entropy, 2023 - mdpi.com
Denoising diffusion probabilistic models are a promising new class of generative models
that mark a milestone in high-quality image generation. This paper showcases their ability to …

FVC: A new framework towards deep video compression in feature space

Z Hu, G Lu, D Xu - … of the IEEE/CVF Conference on …, 2021 - openaccess.thecvf.com
Learning based video compression attracts increasing attention in the past few years. The
previous hybrid coding approaches rely on pixel space operations to reduce spatial and …

Hybrid spatial-temporal entropy modelling for neural video compression

J Li, B Li, Y Lu - Proceedings of the 30th ACM International Conference …, 2022 - dl.acm.org
For neural video codec, it is critical, yet challenging, to design an efficient entropy model
which can accurately predict the probability distribution of the quantized latent …

An introduction to neural data compression

Y Yang, S Mandt, L Theis - Foundations and Trends® in …, 2023 - nowpublishers.com
Neural compression is the application of neural networks and other machine learning
methods to data compression. Recent advances in statistical machine learning have opened …

Vct: A video compression transformer

F Mentzer, G Toderici, D Minnen, SJ Hwang… - arxiv preprint arxiv …, 2022 - arxiv.org
We show how transformers can be used to vastly simplify neural video compression.
Previous methods have been relying on an increasing number of architectural biases and …

Coarse-to-fine deep video coding with hyperprior-guided mode prediction

Z Hu, G Lu, J Guo, S Liu, W Jiang… - Proceedings of the …, 2022 - openaccess.thecvf.com
The previous deep video compression approaches only use the single scale motion
compensation strategy and rarely adopt the mode prediction technique from the traditional …

Nonlinear transform coding

J Ballé, PA Chou, D Minnen, S Singh… - IEEE Journal of …, 2020 - ieeexplore.ieee.org
We review a class of methods that can be collected under the name nonlinear transform
coding (NTC), which over the past few years have become competitive with the best linear …

Temporal context mining for learned video compression

X Sheng, J Li, B Li, L Li, D Liu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Applying deep learning to video compression has attracted increasing attention in recent
few years. In this work, we address end-to-end learned video compression with a special …