Object-level Visual Prompts for Compositional Image Generation

G Parmar, O Patashnik, KC Wang, D Ostashev… - arxiv preprint arxiv …, 2025 - arxiv.org
We introduce a method for composing object-level visual prompts within a text-to-image
diffusion model. Our approach addresses the task of generating semantically coherent …

Difflora: Generating personalized low-rank adaptation weights with diffusion

Y Wu, Y Shi, J Wei, C Sun, Y Zhou, Y Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
Personalized text-to-image generation has gained significant attention for its capability to
generate high-fidelity portraits of specific identities conditioned on user-defined prompts …

CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization

F Wu, Y Pang, J Zhang, L Pang, J Yin, B Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in text-to-image personalization have enabled high-quality and
controllable image synthesis for user-provided concepts. However, existing methods still …

Personalization Toolkit: Training Free Personalization of Large Vision Language Models

S Seifi, V Dorovatas, DO Reino, R Aljundi - arxiv preprint arxiv …, 2025 - arxiv.org
Large Vision Language Models (LVLMs) have significant potential to deliver personalized
assistance by adapting to individual users' unique needs and preferences. Personalization …

One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

T Liu, K Wang, S Li, J van de Weijer, FS Khan… - arxiv preprint arxiv …, 2025 - arxiv.org
Text-to-image generation models can create high-quality images from input prompts.
However, they struggle to support the consistent generation of identity-preserving …

LoRA. rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation

D Shenaj, O Bohdal, M Ozay, P Zanuttigh… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advancements in image generation models have enabled personalized image
creation with both user-defined subjects (content) and styles. Prior works achieved …

Controllable Human Image Generation with Personalized Multi-Garments

Y Choi, S Kwak, S Yu, H Choi, J Shin - arxiv preprint arxiv:2411.16801, 2024 - arxiv.org
We present BootComp, a novel framework based on text-to-image diffusion models for
controllable human image generation with multiple reference garments. Here, the main …

PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation

Q Huang, L Chan, J Liu, W He, H Jiang, M Song… - arxiv preprint arxiv …, 2024 - arxiv.org
Finetuning-free personalized image generation can synthesize customized images without
test-time finetuning, attracting wide research interest owing to its high efficiency. Current …

Towards Identity-Aware Cross-Modal Retrieval: a Dataset and a Baseline

N Messina, L Vadicamo, L Maltese… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advancements in deep learning have significantly enhanced content-based retrieval
methods, notably through models like CLIP that map images and texts into a shared …