Silent speech interfaces for speech restoration: A review

JA Gonzalez-Lopez, A Gomez-Alanis… - IEEE …, 2020 - ieeexplore.ieee.org
This review summarises the status of silent speech interface (SSI) research. SSIs rely on non-
acoustic biosignals generated by the human body during speech production to enable …

Recent advances in the automatic recognition of audiovisual speech

G Potamianos, C Neti, G Gravier, A Garg… - Proceedings of the …, 2003 - ieeexplore.ieee.org
Visual speech information from the speaker's mouth region has been successfully shown to
improve noise robustness of automatic speech recognizers, thus promising to extend their …

Combining residual networks with LSTMs for lipreading

T Stafylakis, G Tzimiropoulos - arxiv preprint arxiv:1703.04105, 2017 - arxiv.org
We propose an end-to-end deep learning architecture for word-level visual speech
recognition. The system is a combination of spatiotemporal convolutional, residual and …

Biosignal-based spoken communication: A survey

T Schultz, M Wand, T Hueber… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org
Speech is a complex process involving a wide range of biosignals, including but not limited
to acoustics. These biosignals-stemming from the articulators, the articulator muscle …

Lipreading with long short-term memory

M Wand, J Koutník… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Lipreading, ie speech recognition from visual-only recordings of a speaker's face, can be
achieved with a processing pipeline based solely on neural networks, yielding significantly …

Learning from the master: Distilling cross-modal advanced knowledge for lip reading

S Ren, Y Du, J Lv, G Han, S He - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Lip reading aims to predict the spoken sentences from silent lip videos. Due to the fact that
such a vision task usually performs worse than its counterpart speech recognition, one …

Lipreading with local spatiotemporal descriptors

G Zhao, M Barnard… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Visual speech information plays an important role in lipreading under noisy conditions or for
listeners with a hearing impairment. In this paper, we present local spatiotemporal …

[PDF][PDF] Audio-visual automatic speech recognition: An overview

G Potamianos, C Neti, J Luettin… - Issues in visual and audio …, 2004 - academia.edu
We have made significant progress in automatic speech recognition (ASR) for well-defined
applications like dictation and medium vocabulary transaction processing tasks in relatively …

CUAVE: A new audio-visual database for multimodal human-computer interface research

EK Patterson, S Gurbuz, Z Tufekci… - 2002 IEEE International …, 2002 - ieeexplore.ieee.org
Multimodal signal processing has become an important topic of research for overcoming
certain problems of audio-only speech processing. Audio-visual speech recognition is one …

Driver drowsiness monitoring based on yawning detection

S Abtahi, B Hariri… - 2011 IEEE international …, 2011 - ieeexplore.ieee.org
Fatigue and drowsiness of drivers are amongst the significant causes of road accidents. In
this paper, we discuss a method for detecting drivers' drowsiness and subsequently alerting …