A high-performance speech neuroprosthesis

FR Willett, EM Kunz, C Fan, DT Avansino, GH Wilson… - Nature, 2023 - nature.com
Speech brain–computer interfaces (BCIs) have the potential to restore rapid communication
to people with paralysis by decoding neural activity evoked by attempted speech into text, or …

A review of data collection practices using electromagnetic articulography

T Rebernik, J Jacobi, R Jonkers, A Noiray… - Laboratory …, 2021 - research.rug.nl
This paper reviews data collection practices in electromagnetic articulography (EMA)
studies, with a focus on sensor placement. It consists of three parts: in the first part, we …

Statistics in phonetics

S Tavakoli, B Matteo, D Pigoli, E Chodroff… - Annual Review of …, 2024 - annualreviews.org
Phonetics is the scientific field concerned with the study of how speech is produced, heard,
and perceived. It abounds with data, such as acoustic speech recordings, neuroimaging …

The secret source: Incorporating source features to improve acoustic-to-articulatory speech inversion

YM Siriwardena, C Espy-Wilson - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
In this work, we incorporated acoustically derived source features, aperiodicity, periodicity
and pitch as additional targets to an acoustic-to-articulatory speech inversion (SI) system …

SSDM: Scalable speech dysfluency modeling

J Lian, X Zhou, Z Ezzes, J Vonk… - Advances in neural …, 2025 - proceedings.neurips.cc
Speech dysfluency modeling is the core module for spoken language learning, and speech
therapy. However, there are three challenges. First, current state-of-the-art solutions …

Speech driven tongue animation

S Medina, D Tome, C Stoll, M Tiede… - Proceedings of the …, 2022 - openaccess.thecvf.com
Advances in speech driven animation techniques allow the creation of convincing
animations for virtual characters solely from audio data. Many existing approaches focus on …

Audio–visual deepfake detection using articulatory representation learning

Y Wang, H Huang - Computer Vision and Image Understanding, 2024 - Elsevier
Advancements in generative artificial intelligence have made it easier to manipulate auditory
and visual elements, highlighting the critical need for robust audio–visual deepfake …

A dual mechanism for intrinsic f0

WR Chen, DH Whalen, MK Tiede - Journal of Phonetics, 2021 - Elsevier
Vowel-intrinsic fundamental frequency (IF0), the phenomenon that high vowels tend to have
a higher fundamental frequency (f0) than low vowels, has been studied for over a century …

Speaker adaptation on articulation and acoustics for articulation-to-speech synthesis

B Cao, A Wisler, J Wang - Sensors, 2022 - mdpi.com
Silent speech interfaces (SSIs) convert non-audio bio-signals, such as articulatory
movement, to speech. This technology has the potential to recover the speech ability of …

Unsupervised speaker adaptation for speaker independent acoustic to articulatory speech inversion

G Sivaraman, V Mitra, H Nam, M Tiede… - The Journal of the …, 2019 - pubs.aip.org
Speech inversion is a well-known ill-posed problem and addition of speaker differences
typically makes it even harder. Normalizing the speaker differences is essential to effectively …