- Academic Search

A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Cited by 227 · All 6 versions

Deep learning for visual speech analysis: A survey

C Sheng, G Kuang, L Bai, C Hou, Y Guo… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual speech, referring to the visual domain of speech, has attracted increasing attention
due to its wide applications, such as public security, medical treatment, military defense, and …

Cited by 46 · All 9 versions

Speech driven talking face generation from a single image and an emotion condition

SE Eskimez, Y Zhang, Z Duan - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Visual emotion expression plays an important role in audiovisual speech communication. In
this work, we propose a novel approach to rendering visual emotion expression in speech …

Cited by 96 · All 6 versions

Talking human face generation: A survey

M Toshpulatov, W Lee, S Lee - Expert Systems with Applications, 2023 - Elsevier

Talking human face generation aims at synthesizing a natural human face that talks in
correspondence to the given text or audio series. Implementing the recently developed …

Cited by 25 · All 3 versions

Speech driven video editing via an audio-conditioned diffusion model

D Bigioi, S Basak, M Stypułkowski, M Zieba… - Image and Vision …, 2024 - Elsevier

Taking inspiration from recent developments in visual generative tasks using diffusion
models, we propose a method for end-to-end speech-driven video editing using a denoising …

Cited by 30 · All 5 versions

Deep person generation: A survey from the perspective of face, pose, and cloth synthesis

T Sha, W Zhang, T Shen, Z Li, T Mei - ACM Computing Surveys, 2023 - dl.acm.org

Deep person generation has attracted extensive research attention due to its wide
applications in virtual agents, video conferencing, online shopping, and art/movie …

Cited by 40 · All 3 versions

Expression-tailored talking face generation with adaptive cross-modal weighting

D Zeng, S Zhao, J Zhang, H Liu, K Li - Neurocomputing, 2022 - Elsevier

The key of talking face generation is to synthesize the identity-preserving natural facial
expressions with accurate audio-lip synchronization. To accomplish this, it requires to …

Cited by 10 · All 2 versions

Talking head generation with audio and speech related facial action units

S Chen, Z Liu, J Liu, Z Yan, L Wang - arXiv preprint arXiv:2110.09951, 2021 - arxiv.org

The task of talking head generation is to synthesize a lip synchronized talking head video by
inputting an arbitrary face image and audio clips. Most existing methods ignore the local …

Cited by 19 · All 5 versions

Speech2video: Cross-modal distillation for speech to video generation

S Si, J Wang, X Qu, N Cheng, W Wei, X Zhu… - arXiv preprint arXiv …, 2021 - arxiv.org

This paper investigates a novel task of talking face video generation solely from speeches.
The speech-to-video generation technique can spark interesting applications in …

Cited by 18 · All 6 versions

Talking face generation via facial anatomy

S Liu, H Wang - ACM Transactions on Multimedia Computing …, 2023 - dl.acm.org

To generate the corresponding talking face from a speech audio and a face image, it is
essential to match the variations in the facial appearance with the speech audio in subtle …

Cited by 12

End-to-end generation of talking faces from noisy speech