State of the art on diffusion models for visual computing
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …
NExT-GPT: Any-to-any multimodal LLM
While recently Multimodal Large Language Models (MM-LLMs) have made exciting strides,
they mostly fall prey to the limitation of only input-side multimodal understanding, without the …
DynamiCrafter: Animating open-domain images with video diffusion priors
Animating a still image offers an engaging visual experience. Traditional image animation
techniques mainly focus on animating natural scenes with stochastic dynamics (e.g., clouds …
SVDiff: Compact parameter space for diffusion fine-tuning
Recently, diffusion models have achieved remarkable success in text-to-image generation,
enabling the creation of high-quality images from text prompts and various conditions …
InstantBooth: Personalized text-to-image generation without test-time finetuning
Recent advances in personalized image generation have enabled pre-trained text-to-image
models to learn new concepts from specific image sets. However, these methods often …
HyperDreamBooth: Hypernetworks for fast personalization of text-to-image models
Personalization has emerged as a prominent aspect within the field of generative AI,
enabling the synthesis of individuals in diverse contexts and styles while retaining high …
PhotoMaker: Customizing realistic human photos via stacked ID embedding
Recent advances in text-to-image generation have made remarkable progress in
synthesizing realistic human photos conditioned on given text prompts. However, existing …
Break-a-scene: Extracting multiple concepts from a single image
Text-to-image model personalization aims to introduce a user-provided concept to the
model, allowing its synthesis in diverse contexts. However, current methods primarily focus …
Subject-diffusion: Open domain personalized text-to-image generation without test-time fine-tuning
Recent progress in personalized image generation using diffusion models has been
significant. However, development in the area of open-domain and test-time fine-tuning-free …
Mix-of-show: Decentralized low-rank adaptation for multi-concept customization of diffusion models
Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained
significant attention from the community. These models can be easily customized for new …