Advances in diffusion models for image data augmentation: A review of methods, models, evaluation metrics and future research directions

P Alimisis, I Mademlis, P Radoglou-Grammatikis… - Artificial Intelligence …, 2025 - Springer
Image data augmentation constitutes a critical methodology in modern computer vision
tasks, since it can help enhance the diversity and quality of training datasets; …

PortraitBooth: A versatile portrait model for fast identity-preserved personalization

X Peng, J Zhu, B Jiang, Y Tai, D Luo… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advancements in personalized image generation using diffusion models have been
noteworthy. However, existing methods suffer from inefficiencies due to the requirement for …

Deepfake generation and detection: A benchmark and survey

G Pei, J Zhang, M Hu, Z Zhang, C Wang, Y Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Deepfake is a technology dedicated to creating highly realistic facial images and videos
under specific conditions, which has significant application potential in fields such as …

MIGC: Multi-instance generation controller for text-to-image synthesis

D Zhou, Y Li, F Ma, X Zhang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present a Multi-Instance Generation (MIG) task simultaneously generating
multiple instances with diverse controls in one image. Given a set of predefined coordinates …

Arc2Face: A foundation model for ID-consistent human faces

FP Papantoniou, A Lattas, S Moschoglou… - … on Computer Vision, 2024 - Springer
This paper presents Arc2Face, an identity-conditioned face foundation model,
which, given the ArcFace embedding of a person, can generate diverse photo-realistic …

DiffusionAvatars: Deferred diffusion for high-fidelity 3D head avatars

T Kirschstein, S Giebenhain… - Proceedings of the …, 2024 - openaccess.thecvf.com
DiffusionAvatars synthesizes a high-fidelity 3D head avatar of a person offering intuitive
control over both pose and expression. We propose a diffusion-based neural renderer that …

DiLightNet: Fine-grained lighting control for diffusion-based image generation

C Zeng, Y Dong, P Peers, Y Kong, H Wu… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
This paper presents a novel method for exerting fine-grained lighting control during text-
driven diffusion-based image generation. While existing diffusion models already have the …

AIGC for various data modalities: A survey

LG Foo, H Rahmani, J Liu - arXiv preprint arXiv:2308.14177, 2023 - arxiv.org
AI-generated content (AIGC) methods aim to produce text, images, videos, 3D assets, and
other media using AI algorithms. Due to its wide range of applications and the demonstrated …

MIGC++: Advanced multi-instance generation controller for image synthesis

D Zhou, Y Li, F Ma, Z Yang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
We introduce the Multi-Instance Generation (MIG) task, which focuses on generating
multiple instances within a single image, each accurately placed at predefined positions with …

CapHuman: Capture your moments in parallel universes

C Liang, F Ma, L Zhu, Y Deng… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We concentrate on a novel human-centric image synthesis task, that is, given only one
reference facial photograph, it is expected to generate specific individual images with diverse …