Portraitbooth: A versatile portrait model for fast identity-preserved personalization
Recent advancements in personalized image generation using diffusion models have been
noteworthy. However existing methods suffer from inefficiencies due to the requirement for …
noteworthy. However existing methods suffer from inefficiencies due to the requirement for …
Dreamtalk: When expressive talking head generation meets diffusion probabilistic models
Diffusion models have shown remarkable success in a variety of downstream generative
tasks, yet remain under-explored in the important and challenging expressive talking head …
tasks, yet remain under-explored in the important and challenging expressive talking head …
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
In this paper we abstract the process of people hearing speech extracting meaningful cues
and creating various dynamically audio-consistent talking faces termed Listening and …
and creating various dynamically audio-consistent talking faces termed Listening and …
Texdreamer: Towards zero-shot high-fidelity 3d human texture generation
Texturing 3D humans with semantic UV maps remains a challenge due to the difficulty of
acquiring reasonably unfolded UV. Despite recent text-to-3D advancements in supervising …
acquiring reasonably unfolded UV. Despite recent text-to-3D advancements in supervising …
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance Head-pose and Facial Expression Features
The task of face reenactment is to transfer the head motion and facial expressions from a
driving video to the appearance of a source image which may be of a different person (cross …
driving video to the appearance of a source image which may be of a different person (cross …
EarSE: Bringing Robust Speech Enhancement to COTS Headphones
Speech enhancement is regarded as the key to the quality of digital communication and is
gaining increasing attention in the research field of audio processing. In this paper, we …
gaining increasing attention in the research field of audio processing. In this paper, we …
Dream-talk: diffusion-based realistic emotional audio-driven method for single image talking face generation
The generation of emotional talking faces from a single portrait image remains a significant
challenge. The simultaneous achievement of expressive emotional talking and accurate lip …
challenge. The simultaneous achievement of expressive emotional talking and accurate lip …
MegActor-: Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer
Diffusion models have demonstrated superior performance in the field of portrait animation.
However, current approaches relied on either visual or audio modality to control character …
However, current approaches relied on either visual or audio modality to control character …