MISC: Ultra-low bitrate image semantic compression driven by large multimodal model

C Li, G Lu, D Feng, H Wu, Z Zhang, X Liu… - … on Image Processing, 2024 - ieeexplore.ieee.org
With the evolution of storage and communication protocols, ultra-low bitrate image
compression has become a highly demanding topic. However, all existing compression …

What makes an image realistic?

L Theis - arXiv preprint arXiv:2403.04493, 2024 - arxiv.org
The last decade has seen tremendous progress in our ability to generate realistic-looking
data, be it images, text, audio, or video. Here, we discuss the closely related problem of …

Generative Latent Coding for Ultra-Low Bitrate Image Compression

Z Jia, J Li, B Li, H Li, Y Lu - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Most existing image compression approaches perform transform coding in the pixel space to
reduce its spatial redundancy. However, they encounter difficulties in achieving both high …

On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models

TB Ifriqi, P Astolfi, M Hall, R Askari-Hemmat… - arXiv preprint arXiv …, 2024 - arxiv.org
Large-scale training of latent diffusion models (LDMs) has enabled unprecedented quality in
image generation. However, the key components of the best performing LDM training …

Semantically-Guided Image Compression for Enhanced Perceptual Quality at Extremely Low Bitrates

S Iwai, T Miyazaki, S Omachi - IEEE Access, 2024 - ieeexplore.ieee.org
Image compression methods based on machine learning have achieved high rate-distortion
performance. However, the reconstructions they produce suffer from blurring at extremely …

Progressive compression with universally quantized diffusion models

Y Yang, JC Will, S Mandt - arXiv preprint arXiv:2412.10935, 2024 - arxiv.org
Diffusion probabilistic models have achieved mainstream success in many generative
modeling tasks, from image generation to inverse problem solving. A distinct feature of these …

Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task

J Liu, Y Wei, J Lin, S Zhao, H Sun, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
While learned image compression methods have achieved impressive results in either
human visual perception or machine vision tasks, they are often specialized only for one …

Linearly transformed color guide for low-bitrate diffusion based image compression

T Bordin, T Maugey - IEEE Transactions on Image Processing, 2024 - ieeexplore.ieee.org
This study addresses the challenge of controlling the global color aspect of images
generated by a diffusion model without training or fine-tuning. We rewrite the guidance …

Consistency-diversity-realism Pareto fronts of conditional image generative models

P Astolfi, M Careil, M Hall, O Mañas, M Muckley… - arXiv preprint arXiv …, 2024 - arxiv.org
Building world models that accurately and comprehensively represent the real world is the
utmost aspiration for conditional image generative models as it would enable their use as …

Generative Refinement for Low Bitrate Image Coding Using Vector Quantized Residual

Y Kong, M Lu, Z Ma - IEEE Journal on Emerging and Selected …, 2024 - ieeexplore.ieee.org
Despite the significant progress in recent deep learning-based image compression, the
reconstructed visual quality still suffers at low bitrates due to the lack of high-frequency …