Robust automatic speech recognition using wavelet-based adaptive wavelet thresholding: a review
M Shanthamallappa, K Puttegowda… - SN Computer …, 2024 - Springer
Automatic speech recognition (ASR) is one of the most fascinating fields of research and the
performance of ASR systems is most promising in a closed environment having negligible or …
performance of ASR systems is most promising in a closed environment having negligible or …
The subspace Gaussian mixture model—A structured model for speech recognition
We describe a new approach to speech recognition, in which all Hidden Markov Model
(HMM) states share the same Gaussian Mixture Model (GMM) structure with the same …
(HMM) states share the same Gaussian Mixture Model (GMM) structure with the same …
Subspace Gaussian mixture models for speech recognition
We describe an acoustic modeling approach in which all phonetic states share a common
Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of …
Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of …
Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models
Although research has previously been done on multilingual speech recognition, it has been
found to be very difficult to improve over separately trained systems. The usual approach …
found to be very difficult to improve over separately trained systems. The usual approach …
[PDF][PDF] Using Gaussian mixtures for Hindi speech recognition system
The goal of automatic speech recognition (ASR) system is to accurately and efficiently
convert a speech signal into a text message independent of the device, speaker or the …
convert a speech signal into a text message independent of the device, speaker or the …
Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition
Speech recognition systems normally use handcrafted pronunciation lexicons designed by
linguistic experts. Building and maintaining such a lexicon is expensive and time …
linguistic experts. Building and maintaining such a lexicon is expensive and time …
Acoustic modeling problem for automatic speech recognition system: conventional methods (Part I)
In automatic speech recognition (ASR) systems, the speech signal is captured and
parameterized at front end and evaluated at back end using the statistical framework of …
parameterized at front end and evaluated at back end using the statistical framework of …
UT-VOCAL EFFORT II: Analysis and constrained-lexicon recognition of whispered speech
This study focuses on acoustic variations in speech introduced by whispering, and proposes
several strategies to improve robustness of automatic speech recognition of whispered …
several strategies to improve robustness of automatic speech recognition of whispered …
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings
The Grapheme-to-Phoneme (G2P) task aims to convert orthographic input into a discrete
phonetic representation. G2P conversion is beneficial to various speech processing …
phonetic representation. G2P conversion is beneficial to various speech processing …
Cross-lingual subspace gaussian mixture models for low-resource speech recognition
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian
mixture models (SGMMs). SGMMs factorize the acoustic model parameters into a set that is …
mixture models (SGMMs). SGMMs factorize the acoustic model parameters into a set that is …