Audiovisual speech synthesis: An overview of the state-of-the-art

W Mattheyses, W Verhelst - Speech Communication, 2015 - Elsevier
We live in a world where there are countless interactions with computer systems in everyday
situations. In the most ideal case, this interaction feels as familiar and as natural as the …

Current state of text-to-speech system ARTIC: a decade of research on the field of speech technologies

D Tihelka, Z Hanzlíček, M Jůzová, J Vít… - Text, Speech, and …, 2018 - Springer
This paper provides a survey of the current state of ARTIC–the modern Czech concatenative
corpus-based text-to-speech system. Through more than a decade of research & …

Recognition of isolated words using Zernike and MFCC features for audio visual speech recognition

P Borde, A Varpe, R Manza, P Yannawar - International journal of speech …, 2015 - Springer
Automatic speech recognition by machine is an attractive research topic in the signal processing
domain and has attracted many researchers to contribute in this area. In recent years, there …

D64: A corpus of richly recorded conversational interaction

C Oertel, F Cummins, J Edlund, P Wagner… - Journal on Multimodal …, 2013 - Springer
In recent years there has been a substantial debate about the need for increasingly
spontaneous, conversational corpora of spoken interaction that are not controlled or task …

The future of multimodal corpora

D Knight - Revista brasileira de linguística aplicada, 2011 - SciELO Brasil
This paper takes stock of the current state-of-the-art in multimodal corpus linguistics, and
proposes some projections of future developments in this field. It provides a critical overview …

Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments

R Thézé, MA Gadiri, L Albert, A Provost, AL Giraud… - Scientific reports, 2020 - nature.com
Natural speech is processed in the brain as a mixture of auditory and visual features. An
example of the importance of visual speech is the McGurk effect and related perceptual …

Automatic technologies for processing spoken sign languages

A Karpov, I Kipyatkova, M Zelezny - Procedia Computer Science, 2016 - Elsevier
Sign languages are known as a natural means of verbal communication for deaf and
hard of hearing people. There is no universal sign language, and almost every country has …

Multimodality and active listenership

D Knight - 2011 - torrossa.com
Current methodologies in corpus linguistics have revolutionized the way we look at
language. They allow us to make objective observations about written and spoken …

SynFace—speech-driven facial animation for virtual speech-reading support

G Salvi, J Beskow, S Al Moubayed… - EURASIP journal on audio …, 2009 - Springer
This paper describes SynFace, a supportive technology that aims at enhancing audio-based
spoken communication in adverse acoustic conditions by providing the missing visual …

Information enquiry kiosk with multimodal user interface

AA Karpov, AL Ronzhin - Pattern Recognition and Image Analysis, 2009 - Springer
A multimodal interactive dialogue automaton (kiosk) for self-service is presented in the
paper. The multimodal user interface allows people to interact with the kiosk by natural speech …