Counterfactual explanations and how to find them: literature review and benchmarking
R Guidotti - Data Mining and Knowledge Discovery, 2024 - Springer
Interpretable machine learning aims at unveiling the reasons behind predictions returned by
uninterpretable classifiers. One of the most valuable types of explanation consists of …
uninterpretable classifiers. One of the most valuable types of explanation consists of …
A review on generative adversarial networks: Algorithms, theory, and applications
Generative adversarial networks (GANs) have recently become a hot research topic;
however, they have been studied since 2014, and a large number of algorithms have been …
however, they have been studied since 2014, and a large number of algorithms have been …
Adding conditional control to text-to-image diffusion models
We present ControlNet, a neural network architecture to add spatial conditioning controls to
large, pretrained text-to-image diffusion models. ControlNet locks the production-ready large …
large, pretrained text-to-image diffusion models. ControlNet locks the production-ready large …
Palette: Image-to-image diffusion models
This paper develops a unified framework for image-to-image translation based on
conditional diffusion models and evaluates this framework on four challenging image-to …
conditional diffusion models and evaluates this framework on four challenging image-to …
Zero-shot image-to-image translation
Large-scale text-to-image generative models have shown their remarkable ability to
synthesize diverse, high-quality images. However, directly applying these models for real …
synthesize diverse, high-quality images. However, directly applying these models for real …
Diffusionclip: Text-guided diffusion models for robust image manipulation
Recently, GAN inversion methods combined with Contrastive Language-Image Pretraining
(CLIP) enables zero-shot image manipulation guided by text prompts. However, their …
(CLIP) enables zero-shot image manipulation guided by text prompts. However, their …
Contrastive learning for unpaired image-to-image translation
In image-to-image translation, each patch in the output should reflect the content of the
corresponding patch in the input, independent of domain. We propose a straightforward …
corresponding patch in the input, independent of domain. We propose a straightforward …
Giraffe: Representing scenes as compositional generative neural feature fields
Deep generative models allow for photorealistic image synthesis at high resolutions. But for
many applications, this is not enough: content creation also needs to be controllable. While …
many applications, this is not enough: content creation also needs to be controllable. While …
Gligen: Open-set grounded text-to-image generation
Large-scale text-to-image diffusion models have made amazing advances. However, the
status quo is to use text input alone, which can impede controllability. In this work, we …
status quo is to use text input alone, which can impede controllability. In this work, we …
Image fusion in the loop of high-level vision tasks: A semantic-aware real-time infrared and visible image fusion network
Infrared and visible image fusion aims to synthesize a single fused image that not only
contains salient targets and abundant texture details but also facilitates high-level vision …
contains salient targets and abundant texture details but also facilitates high-level vision …