Speech production knowledge in automatic speech recognition

S King, J Frankel, K Livescu, E McDermott… - The Journal of the …, 2007 - pubs.aip.org
Although much is known about how speech is produced, and research into speech
production has resulted in measured articulatory data, feature systems of different kinds, and …

Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory

T Toda, AW Black, K Tokuda - IEEE Transactions on Audio …, 2007 - ieeexplore.ieee.org
In this paper, we describe a novel spectral conversion method for voice conversion (VC). A
Gaussian mixture model (GMM) of the joint probability density of source and target features …

[HTML][HTML] Uncertainty estimation with deep learning for rainfall–runoff modeling

D Klotz, F Kratzert, M Gauch… - Hydrology and Earth …, 2022 - hess.copernicus.org
Deep learning is becoming an increasingly important way to produce accurate hydrological
predictions across a wide range of spatial and temporal scales. Uncertainty estimations are …

The TORGO database of acoustic and articulatory speech from speakers with dysarthria

F Rudzicz, AK Namasivayam, T Wolff - Language resources and …, 2012 - Springer
This paper describes the acquisition of a new database of dysarthric speech in terms of
aligned acoustics and articulatory data. This database currently includes data from seven …

Statistical map** between articulatory movements and acoustic spectrum using a Gaussian mixture model

T Toda, AW Black, K Tokuda - Speech communication, 2008 - Elsevier
In this paper, we describe a statistical approach to both an articulatory-to-acoustic map**
and an acoustic-to-articulatory inversion map** without using phonetic information. The …

A deep recurrent approach for acoustic-to-articulatory inversion

P Liu, Q Yu, Z Wu, S Kang, H Meng… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
To solve the acoustic-to-articulatory inversion problem, this paper proposes a deep
bidirectional long short term memory recurrent neural network and a deep recurrent mixture …

An artificial neural network approach to automatic speech processing

SM Siniscalchi, T Svendsen, CH Lee - Neurocomputing, 2014 - Elsevier
An artificial neural network (ANN) is a powerful mathematical framework used to either
model complex relationships between inputs and outputs or find patterns in data. It is based …

A trajectory mixture density network for the acoustic-articulatory inversion map**

K Richmond - … 2006-ICSLP Ninth International Conference on …, 2006 - research.ed.ac.uk
This paper proposes a trajectory model which is based on a mixture density network trained
with target features augmented with dynamic features together with an algorithm for …

[HTML][HTML] Unsupervised speaker adaptation for speaker independent acoustic to articulatory speech inversion

G Sivaraman, V Mitra, H Nam, M Tiede… - The Journal of the …, 2019 - pubs.aip.org
Speech inversion is a well-known ill-posed problem and addition of speaker differences
typically makes it even harder. Normalizing the speaker differences is essential to effectively …

Deep architectures for articulatory inversion

B Uria, I Murray, S Renals… - INTERSPEECH 2012 13th …, 2012 - research.ed.ac.uk
We implement two deep architectures for the acoustic-articulatory inversion map**
problem: a deep neural network and a deep trajectory mixture density network. We find that …