Bayesian neural network language modeling for speech recognition
State-of-the-art neural network language models (NNLMs) represented by long short term
memory recurrent neural networks (LSTM-RNNs) and Transformers are becoming highly …
memory recurrent neural networks (LSTM-RNNs) and Transformers are becoming highly …
Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus
Early diagnosis of Neurocognitive Disorder (NCD) is crucial in facilitating preventive care
and timely treatment to delay further progression. This paper presents the development of a …
and timely treatment to delay further progression. This paper presents the development of a …
Bayesian learning for deep neural network adaptation
A key task for speech recognition systems is to reduce the mismatch between training and
evaluation data that is often attributable to speaker differences. Speaker adaptation …
evaluation data that is often attributable to speaker differences. Speaker adaptation …
Bayesian learning of LF-MMI trained time delay neural networks for speech recognition
Discriminative training techniques define state-of-the-art performance for automatic speech
recognition systems. However, they are inherently prone to overfitting, leading to poor …
recognition systems. However, they are inherently prone to overfitting, leading to poor …
[PDF][PDF] Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.
Automatic speech recognition (ASR) for disordered speech is a challenging task. People
with speech disorders such as dysarthria often have physical disabilities, leading to severe …
with speech disorders such as dysarthria often have physical disabilities, leading to severe …
[PDF][PDF] On the Use of Pitch Features for Disordered Speech Recognition.
Pitch features have long been known to be useful for recognition of normal speech.
However, for disordered speech, the significant degradation of voice quality renders the …
However, for disordered speech, the significant degradation of voice quality renders the …
Bayesian x-vector: Bayesian neural network based x-vector system for speaker verification
Speaker verification systems usually suffer from the mismatch problem between training and
evaluation data, such as speaker population mismatch, the channel and environment …
evaluation data, such as speaker population mismatch, the channel and environment …
[HTML][HTML] Mandarin Recognition Based on Self-Attention Mechanism with Deep Convolutional Neural Network (DCNN)-Gated Recurrent Unit (GRU)
X Chen, C Wang, C Hu, Q Wang - Big Data and Cognitive Computing, 2024 - mdpi.com
Speech recognition technology is an important branch in the field of artificial intelligence,
aiming to transform human speech into computer-readable text information. However …
aiming to transform human speech into computer-readable text information. However …
Transient anomaly detection using gaussian process depth analysis
J Baranowski, A Dudek… - 2021 25th International …, 2021 - ieeexplore.ieee.org
Effective and reliable monitoring and diagnostics of process control installations is of utmost
importance, as they are important part of world's economy. Detection of faulty behavior fits …
importance, as they are important part of world's economy. Detection of faulty behavior fits …
[PDF][PDF] LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.
Discriminative training techniques define state-of-the-art performance for deep neural
networks (DNNs) based speech recognition systems across a wide range of tasks …
networks (DNNs) based speech recognition systems across a wide range of tasks …