Misc: Ultra-low bitrate image semantic compression driven by large multimodal model
With the evolution of storage and communication protocols, ultra-low bitrate image
compression has become a highly demanding topic. However, all existing compression …
compression has become a highly demanding topic. However, all existing compression …
What makes an image realistic?
L Theis - arxiv preprint arxiv:2403.04493, 2024 - arxiv.org
The last decade has seen tremendous progress in our ability to generate realistic-looking
data, be it images, text, audio, or video. Here, we discuss the closely related problem of …
data, be it images, text, audio, or video. Here, we discuss the closely related problem of …
Generative Latent Coding for Ultra-Low Bitrate Image Compression
Most existing image compression approaches perform transform coding in the pixel space to
reduce its spatial redundancy. However they encounter difficulties in achieving both high …
reduce its spatial redundancy. However they encounter difficulties in achieving both high …
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
Large-scale training of latent diffusion models (LDMs) has enabled unprecedented quality in
image generation. However, the key components of the best performing LDM training …
image generation. However, the key components of the best performing LDM training …
Semantically-Guided Image Compression for Enhanced Perceptual Quality at Extremely Low Bitrates
Image compression methods based on machine learning have achieved high rate-distortion
performance. However, the reconstructions they produce suffer from blurring at extremely …
performance. However, the reconstructions they produce suffer from blurring at extremely …
Progressive compression with universally quantized diffusion models
Diffusion probabilistic models have achieved mainstream success in many generative
modeling tasks, from image generation to inverse problem solving. A distinct feature of these …
modeling tasks, from image generation to inverse problem solving. A distinct feature of these …
Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task
While learned image compression methods have achieved impressive results in either
human visual perception or machine vision tasks, they are often specialized only for one …
human visual perception or machine vision tasks, they are often specialized only for one …
Linearly transformed color guide for low-bitrate diffusion based image compression
T Bordin, T Maugey - IEEE Transactions on Image Processing, 2024 - ieeexplore.ieee.org
This study addresses the challenge of controlling the global color aspect of images
generated by a diffusion model without training or fine-tuning. We rewrite the guidance …
generated by a diffusion model without training or fine-tuning. We rewrite the guidance …
Consistency-diversity-realism Pareto fronts of conditional image generative models
Building world models that accurately and comprehensively represent the real world is the
utmost aspiration for conditional image generative models as it would enable their use as …
utmost aspiration for conditional image generative models as it would enable their use as …
Generative Refinement for Low Bitrate Image Coding Using Vector Quantized Residual
Despite the significant progress in recent deep learning-based image compression, the
reconstructed visual quality still suffers at low bitrates due to the lack of high-frequency …
reconstructed visual quality still suffers at low bitrates due to the lack of high-frequency …