Video rewrite: Driving visual speech with audio

C Bregler, M Covell, M Slaney - Seminal Graphics Papers: Pushing the …, 2023 - dl.acm.org
Video Rewrite uses existing footage to create automatically new video of a person mouthing
words that she did not speak in the original footage. This technique is useful in movie …

Voice puppetry

M Brand - Proceedings of the 26th annual conference on …, 1999 - dl.acm.org
We introduce a method for predicting a control signal from another related signal, and apply
it to voice puppetry: Generating full facial animation from expressive information in an audio …

[HTML][HTML] Face and 2-D mesh animation in MPEG-4

AM Tekalp, J Ostermann - Signal Processing: Image Communication, 2000 - Elsevier
This paper presents an overview of some of the synthetic visual objects supported by MPEG-
4 version-1, namely animated faces and animated arbitrary 2D uniform and Delaunay …

Lip movement synthesis from speech based on Hidden Markov Models

E Yamamoto, S Nakamura, K Shikano - Speech Communication, 1998 - Elsevier
Speech intelligibility can be improved by adding lip images to the speech signal. Thus lip
movement synthesis plays an important role to realize a natural human-like face of computer …

Real-time speech-driven face animation with expressions using neural networks

P Hong, Z Wen, TS Huang - IEEE Transactions on neural …, 2002 - ieeexplore.ieee.org
A real-time speech-driven synthetic talking face provides an effective multimodal
communication interface in distributed collaboration environments. Nonverbal gestures such …

Converting speech into lip movements: A multimedia telephone for hard of hearing people

F Lavagetto - IEEE Transactions on Rehabilitation Engineering, 1995 - ieeexplore.ieee.org
Presents the latest results of a research activity oriented to the development of a multimedia
telephone for hard of hearing persons, based on the conversion of speech into graphic …

Emotional expressions in audiovisual human computer interaction

LS Chen, TS Huang - … Proceedings. Latest Advances in the Fast …, 2000 - ieeexplore.ieee.org
Visual and auditory modalities are two of the most commonly used media in interactions
between humans. The authors describe a system to continuously monitor the user's voice …

Sample-based synthesis of photo-realistic talking heads

E Cosatto, HP Graf - Proceedings Computer Animation'98 (Cat …, 1998 - ieeexplore.ieee.org
The paper describes a system that generates photo-realistic video animations of talking
heads. First the system derives head models from existing video footage using image …

Generating human-like behaviors using joint, speech-driven models for conversational agents

S Mariooryad, C Busso - IEEE Transactions on Audio, Speech …, 2012 - ieeexplore.ieee.org
During human communication, every spoken message is intrinsically modulated within
different verbal and nonverbal cues that are externalized through various aspects of speech …

Method and apparatus for analyzing facial configurations and components

SR Marquardt - US Patent 5,659,625, 1997 - Google Patents
BACKGROUND The face is the most important part of the human body for interpersonal
communication, emotional expression, and most Other forms of social interaction. The face …