Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers
This paper for the first time explores text-to-image diffusion models for Zero-Shot Sketch-
based Image Retrieval (ZS-SBIR). We highlight a pivotal discovery: the capacity of text-to …
It's All About Your Sketch: Democratising Sketch Control in Diffusion Models
This paper unravels the potential of sketches for diffusion models addressing the deceptive
promise of direct sketch control in generative AI. We importantly democratise the process …
Controllable generation with text-to-image diffusion models: A survey
In the rapidly advancing realm of visual generation, diffusion models have revolutionized the
landscape, marking a significant shift in capabilities with their impressive text-guided …
S2TD-Face: Reconstruct a Detailed 3D Face with Controllable Texture from a Single Sketch
3D textured face reconstruction from sketches applicable in many scenarios such as
animation, 3D avatars, artistic design, missing people search, etc., is a highly promising but …
A Survey on Personalized Content Synthesis with Diffusion Models
Recent advancements in generative models have significantly impacted content creation,
leading to the emergence of Personalized Content Synthesis (PCS). With a small set of user …
PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation
Audio-driven talking face generation is a challenging task in digital communication. Despite
significant progress in the area, most existing methods concentrate on audio-lip …