Robust automatic speech recognition using wavelet-based adaptive wavelet thresholding: a review

M Shanthamallappa, K Puttegowda… - SN Computer …, 2024 - Springer
Automatic speech recognition (ASR) is one of the most fascinating fields of research and the
performance of ASR systems is most promising in a closed environment having negligible or …

The subspace Gaussian mixture model—A structured model for speech recognition

D Povey, L Burget, M Agarwal, P Akyazi, F Kai… - Computer Speech & …, 2011 - Elsevier
We describe a new approach to speech recognition, in which all Hidden Markov Model
(HMM) states share the same Gaussian Mixture Model (GMM) structure with the same …

Subspace Gaussian mixture models for speech recognition

D Povey, L Burget, M Agarwal, P Akyazi… - … , Speech and Signal …, 2010 - ieeexplore.ieee.org
We describe an acoustic modeling approach in which all phonetic states share a common
Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of …

Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models

L Burget, P Schwarz, M Agarwal… - … on acoustics, speech …, 2010 - ieeexplore.ieee.org
Although research has previously been done on multilingual speech recognition, it has been
found to be very difficult to improve over separately trained systems. The usual approach …

[PDF][PDF] Using Gaussian mixtures for Hindi speech recognition system

RK Aggarwal, M Dave - International Journal of Signal Processing …, 2011 - researchgate.net
The goal of automatic speech recognition (ASR) system is to accurately and efficiently
convert a speech signal into a text message independent of the device, speaker or the …

Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition

L Lu, A Ghoshal, S Renals - 2013 IEEE Workshop on Automatic …, 2013 - ieeexplore.ieee.org
Speech recognition systems normally use handcrafted pronunciation lexicons designed by
linguistic experts. Building and maintaining such a lexicon is expensive and time …

Acoustic modeling problem for automatic speech recognition system: conventional methods (Part I)

RK Aggarwal, M Dave - International Journal of Speech Technology, 2011 - Springer
In automatic speech recognition (ASR) systems, the speech signal is captured and
parameterized at front end and evaluated at back end using the statistical framework of …

UT-VOCAL EFFORT II: Analysis and constrained-lexicon recognition of whispered speech

S Ghaffarzadegan, H Bořil… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
This study focuses on acoustic variations in speech introduced by whispering, and proposes
several strategies to improve robustness of automatic speech recognition of whispered …

Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings

MS Ribeiro, G Comini, J Lorenzo-Trueba - arxiv preprint arxiv:2307.16643, 2023 - arxiv.org
The Grapheme-to-Phoneme (G2P) task aims to convert orthographic input into a discrete
phonetic representation. G2P conversion is beneficial to various speech processing …

Cross-lingual subspace gaussian mixture models for low-resource speech recognition

L Lu, A Ghoshal, S Renals - IEEE/ACM transactions on audio …, 2013 - ieeexplore.ieee.org
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian
mixture models (SGMMs). SGMMs factorize the acoustic model parameters into a set that is …