Object-level Visual Prompts for Compositional Image Generation
We introduce a method for composing object-level visual prompts within a text-to-image
diffusion model. Our approach addresses the task of generating semantically coherent …
diffusion model. Our approach addresses the task of generating semantically coherent …
Difflora: Generating personalized low-rank adaptation weights with diffusion
Personalized text-to-image generation has gained significant attention for its capability to
generate high-fidelity portraits of specific identities conditioned on user-defined prompts …
generate high-fidelity portraits of specific identities conditioned on user-defined prompts …
CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization
Recent advances in text-to-image personalization have enabled high-quality and
controllable image synthesis for user-provided concepts. However, existing methods still …
controllable image synthesis for user-provided concepts. However, existing methods still …
Personalization Toolkit: Training Free Personalization of Large Vision Language Models
Large Vision Language Models (LVLMs) have significant potential to deliver personalized
assistance by adapting to individual users' unique needs and preferences. Personalization …
assistance by adapting to individual users' unique needs and preferences. Personalization …
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
Text-to-image generation models can create high-quality images from input prompts.
However, they struggle to support the consistent generation of identity-preserving …
However, they struggle to support the consistent generation of identity-preserving …
LoRA. rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation
Recent advancements in image generation models have enabled personalized image
creation with both user-defined subjects (content) and styles. Prior works achieved …
creation with both user-defined subjects (content) and styles. Prior works achieved …
Controllable Human Image Generation with Personalized Multi-Garments
We present BootComp, a novel framework based on text-to-image diffusion models for
controllable human image generation with multiple reference garments. Here, the main …
controllable human image generation with multiple reference garments. Here, the main …
PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation
Finetuning-free personalized image generation can synthesize customized images without
test-time finetuning, attracting wide research interest owing to its high efficiency. Current …
test-time finetuning, attracting wide research interest owing to its high efficiency. Current …
Towards Identity-Aware Cross-Modal Retrieval: a Dataset and a Baseline
Recent advancements in deep learning have significantly enhanced content-based retrieval
methods, notably through models like CLIP that map images and texts into a shared …
methods, notably through models like CLIP that map images and texts into a shared …