LiRA: Learning visual speech representations from audio through self-supervision

P Ma, R Mira, S Petridis, BW Schuller… - arXiv preprint arXiv …, 2021 - arxiv.org
The large amount of audiovisual content being shared online today has drawn substantial
attention to the prospect of audiovisual self-supervised learning. Recent works have focused …

[PDF] Deep Audio-visual Speech Recognition

P Ma - 2022 - core.ac.uk
Decades of research in acoustic speech recognition have led to systems that we use in our
everyday life. However, even the most advanced speech recognition systems fail in the …

[PDF] Leveraging Audio-visual Speech Effectively via Deep Learning

RSC de Mira - 2022 - core.ac.uk
The rising popularity of neural networks, combined with the recent proliferation of online
audiovisual media, has led to a revolution in the way machines encode, recognize, and …