Advances in diffusion models for image data augmentation: A review of methods, models, evaluation metrics and future research directions

P Alimisis, I Mademlis, P Radoglou-Grammatikis… - Artificial Intelligence …, 2025 - Springer
Image data augmentation constitutes a critical methodology in modern computer vision
tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; …

Omg: Occlusion-friendly personalized multi-concept generation in diffusion models

Z Kong, Y Zhang, T Yang, T Wang, K Zhang… - … on Computer Vision, 2024 - Springer
Personalization is an important topic in text-to-image generation, especially the challenging
multi-concept personalization. Current multi-concept methods are struggling with identity …

Implicit style-content separation using b-lora

Y Frenkel, Y Vinker, A Shamir, D Cohen-Or - European Conference on …, 2024 - Springer
Image stylization involves manipulating the visual appearance and texture (style) of an
image while preserving its underlying objects, structures, and concepts (content). The …

The chosen one: Consistent characters in text-to-image diffusion models

O Avrahami, A Hertz, Y Vinker, M Arar… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
Recent advances in text-to-image generation models have unlocked vast potential for visual
creativity. However, the users that use these models struggle with the generation of …

Magic clothing: Controllable garment-driven image synthesis

W Chen, T Gu, Y Xu, A Chen - … of the 32nd ACM International Conference …, 2024 - dl.acm.org
We propose Magic Clothing, a latent diffusion model (LDM)-based network architecture for
an unexplored garment-driven image synthesis task. Aiming at generating customized …

Designprompt: Using multimodal interaction for design exploration with generative ai

X Peng, J Koch, WE Mackay - Proceedings of the 2024 ACM Designing …, 2024 - dl.acm.org
Visually oriented designers often struggle to create effective generative AI (GenAI) prompts.
A preliminary study identified specific issues in composing and fine-tuning prompts, as well …

Conceptlab: Creative generation using diffusion prior constraints

E Richardson, K Goldberg, Y Alaluf… - arxiv preprint arxiv …, 2023 - arxiv.org
Recent text-to-image generative models have enabled us to transform our words into
vibrant, captivating imagery. The surge of personalization techniques that has followed has …

PALP: prompt aligned personalization of text-to-image models

M Arar, A Voynov, A Hertz, O Avrahami… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
Content creators often aim to create personalized images using personal subjects that go
beyond the capabilities of conventional text-to-image models. Additionally, they may want …

Colorpeel: Color prompt learning with diffusion models via color and shape disentanglement

MA Butt, K Wang, J Vazquez-Corral… - European Conference on …, 2024 - Springer
Abstract Text-to-Image (T2I) generation has made significant advancements with the advent
of diffusion models. These models exhibit remarkable abilities to produce images based on …

Can AI be as creative as humans?

H Wang, J Zou, M Mozer, A Goyal, A Lamb… - arxiv preprint arxiv …, 2024 - arxiv.org
Creativity serves as a cornerstone for societal progress and innovation. With the rise of
advanced generative AI models capable of tasks once reserved for human creativity, the …