Silent speech interfaces for speech restoration: A review

JA Gonzalez-Lopez, A Gomez-Alanis… - IEEE …, 2020 - ieeexplore.ieee.org
This review summarises the status of silent speech interface (SSI) research. SSIs rely on non-
acoustic biosignals generated by the human body during speech production to enable …

Biosignal sensors and deep learning-based speech recognition: A review

W Lee, JJ Seong, B Ozlu, BS Shim, A Marakhimov… - Sensors, 2021 - mdpi.com
Voice is one of the essential mechanisms for communicating and expressing one's
intentions as a human being. There are several causes of voice inability, including disease …

Improving image autoencoder embeddings with perceptual loss

GG Pihlgren, F Sandin, M Liwicki - 2020 International Joint …, 2020 - ieeexplore.ieee.org
Autoencoders are commonly trained using element-wise loss. However, element-wise loss
disregards high-level structures in the image which can lead to embeddings that disregard …

Ultrasound-based articulatory-to-acoustic map** with WaveGlow speech synthesis

TG Csapó, C Zainkó, L Tóth, G Gosztolya… - ar** using deep neural networks, typically spectral and
excitation parameters of vocoders have been used as the training targets. However …

Exploring silent speech interfaces based on frequency-modulated continuous-wave radar

D Ferreira, S Silva, F Curado, A Teixeira - Sensors, 2022 - mdpi.com
Speech is our most natural and efficient form of communication and offers a strong potential
to improve how we interact with machines. However, speech communication can sometimes …

[HTML][HTML] Optimizing the ultrasound tongue image representation for residual network-based articulatory-to-acoustic map**

TG Csapó, G Gosztolya, L Tóth, AH Shandiz, A Markó - Sensors, 2022 - mdpi.com
Within speech processing, articulatory-to-acoustic map** (AAM) methods can apply
ultrasound tongue imaging (UTI) as an input.(Micro) convex transducers are mostly used …

[HTML][HTML] Deep learning classification of reading disability with regional brain volume features

F Joshi, JZ Wang, KI Vaden Jr, MA Eckert - NeuroImage, 2023 - Elsevier
Developmental reading disability is a prevalent and often enduring problem with varied
mechanisms that contribute to its phenotypic heterogeneity. This mechanistic and …

Autoencoding improves pre-trained word embeddings

M Kaneko, D Bollegala - arxiv preprint arxiv:2010.13094, 2020 - arxiv.org
Prior work investigating the geometry of pre-trained word embeddings have shown that word
embeddings to be distributed in a narrow cone and by centering and projecting using …

A systematic review of the application of machine learning techniques to ultrasound tongue imaging analysis

Z **a, R Yuan, Y Cao, T Sun, Y **ong… - The Journal of the …, 2024 - pubs.aip.org
B-mode ultrasound has emerged as a prevalent tool for observing tongue motion in speech
production, gaining traction in speech therapy applications. However, the effective analysis …

Speech synthesis from three-axis accelerometer signals using conformer-based deep neural network

J Kwon, J Hwang, JE Sung, CH Im - Computers in Biology and Medicine, 2024 - Elsevier
Silent speech interfaces (SSIs) have emerged as innovative non-acoustic communication
methods, and our previous study demonstrated the significant potential of three-axis …