Deep learning-based video coding: A review and a case study

D Liu, Y Li, J Lin, H Li, F Wu - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
The past decade has witnessed the great success of deep learning in many disciplines,
especially in computer vision and image processing. However, deep learning-based video …

Deep architectures for image compression: a critical review

D Mishra, SK Singh, RK Singh - Signal Processing, 2022 - Elsevier
Deep learning architectures are now pervasive and fill almost all applications in
image processing, computer vision, and biometrics. The attractive property of feature …

Palette: Image-to-image diffusion models

C Saharia, W Chan, H Chang, C Lee, J Ho… - ACM SIGGRAPH 2022 …, 2022 - dl.acm.org
This paper develops a unified framework for image-to-image translation based on
conditional diffusion models and evaluates this framework on four challenging image-to …

Towards real-world blind face restoration with generative facial prior

X Wang, Y Li, H Zhang, Y Shan - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Blind face restoration usually relies on facial priors, such as facial geometry prior or
reference prior, to restore realistic and faithful details. However, very low-quality inputs …

High-fidelity generative image compression

F Mentzer, GD Toderici… - Advances in neural …, 2020 - proceedings.neurips.cc
We extensively study how to combine Generative Adversarial Networks and learned
compression to obtain a state-of-the-art generative lossy compression system. In particular …

Generative adversarial networks for extreme learned image compression

E Agustsson, M Tschannen, F Mentzer… - Proceedings of the …, 2019 - openaccess.thecvf.com
We present a learned image compression system based on GANs, operating at extremely
low bitrates. Our proposed framework combines an encoder, decoder/generator and a multi …

Remote heart rate measurement from highly compressed facial videos: an end-to-end deep learning solution with video enhancement

Z Yu, W Peng, X Li, X Hong… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Remote photoplethysmography (rPPG), which aims at measuring heart activities without any
contact, has great potential in many applications (e.g., remote healthcare). Existing rPPG …

Generative adversarial networks for image and video synthesis: Algorithms and applications

MY Liu, X Huang, J Yu, TC Wang… - Proceedings of the …, 2021 - ieeexplore.ieee.org
The generative adversarial network (GAN) framework has emerged as a powerful tool for
various image and video synthesis tasks, allowing the synthesis of visual content in an …

Rethinking lossy compression: The rate-distortion-perception tradeoff

Y Blau, T Michaeli - International Conference on Machine …, 2019 - proceedings.mlr.press
Lossy compression algorithms are typically designed and analyzed through the lens of
Shannon's rate-distortion theory, where the goal is to achieve the lowest possible distortion …

Video coding for machines: A paradigm of collaborative compression and intelligent analytics

L Duan, J Liu, W Yang, T Huang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Video coding, which aims to compress and reconstruct the whole frame, and feature
compression, which only preserves and transmits the most critical information, stand at two …