SottoVoce: An ultrasound imaging-based silent speech interaction using deep neural networks

N Kimura, M Kono, J Rekimoto - … of the 2019 CHI Conference on Human …, 2019 - dl.acm.org
The availability of digital devices operated by voice is expanding rapidly. However, the
applications of voice interfaces are still restricted. For example, speaking in public places …

A systematic review of the application of machine learning techniques to ultrasound tongue imaging analysis

Z ** with WaveGlow speech synthesis
TG Csapó, C Zainkó, L Tóth, G Gosztolya… - ar** using deep neural networks, typically spectral and
excitation parameters of vocoders have been used as the training targets. However …

SilentSpeller: Towards mobile, hands-free, silent speech text entry using electropalatography

N Kimura, T Gemicioglu, J Womack, R Li… - Proceedings of the …, 2022 - dl.acm.org
Speech is inappropriate in many situations, limiting when voice control can be used. Most
unvoiced speech text entry systems can not be used while on-the-go due to movement …

DNN-based acoustic-to-articulatory inversion using ultrasound tongue imaging

D Porras, A Sepúlveda-Sepúlveda… - 2019 International Joint …, 2019 - ieeexplore.ieee.org
Speech sounds are produced as the coordinated movement of the speaking organs. There
are several available methods to model the relation of articulatory movements and the …

[PDF][PDF] Multi-Task Learning of Speech Recognition and Speech Synthesis Parameters for Ultrasound-based Silent Speech Interfaces.

L Tóth, G Gosztolya, T Grósz, A Markó, TG Csapó - INTERSPEECH, 2018 - inf.u-szeged.hu
Abstract Silent Speech Interface systems apply two different strategies to solve the
articulatory-to-acoustic conversion task. The recognition-and-synthesis approach applies …

Optimizing the ultrasound tongue image representation for residual network-based articulatory-to-acoustic map**

TG Csapó, G Gosztolya, L Tóth, AH Shandiz, A Markó - Sensors, 2022 - mdpi.com
Within speech processing, articulatory-to-acoustic map** (AAM) methods can apply
ultrasound tongue imaging (UTI) as an input.(Micro) convex transducers are mostly used …

TieLent: A Casual Neck-Mounted Mouth Capturing Device for Silent Speech Interaction

N Kimura, K Hayashi, J Rekimoto - Proceedings of the International …, 2020 - dl.acm.org
With the increased use of smart speakers, silent speech interaction (SSI) is attracting
attention. Unfortunately, traditional silent speech interaction methods require the addition of …

Ultrasound-based silent speech interface built on a continuous vocoder

TG Csapó, MS Al-Radhi, G Németh… - arxiv preprint arxiv …, 2019 - arxiv.org
Recently it was shown that within the Silent Speech Interface (SSI) field, the prediction of F0
is possible from Ultrasound Tongue Images (UTI) as the articulatory input, using Deep …

TaLNet: Voice reconstruction from tongue and lip articulation with transfer learning from text-to-speech synthesis

JX Zhang, K Richmond, ZH Ling, L Dai - Proceedings of the AAAI …, 2021 - ojs.aaai.org
This paper presents TaLNet, a model for voice reconstruction with ultrasound tongue and
optical lip videos as inputs. TaLNet is based on an encoder-decoder architecture. Separate …