Bayesian neural network language modeling for speech recognition

B Xue, S Hu, J Xu, M Geng, X Liu… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
State-of-the-art neural network language models (NNLMs) represented by long short term
memory recurrent neural networks (LSTM-RNNs) and Transformers are becoming highly …

Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus

Z Ye, S Hu, J Li, X **e, M Geng, J Yu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Early diagnosis of Neurocognitive Disorder (NCD) is crucial in facilitating preventive care
and timely treatment to delay further progression. This paper presents the development of a …

Bayesian learning for deep neural network adaptation

X **e, X Liu, T Lee, L Wang - IEEE/ACM Transactions on Audio …, 2021 - ieeexplore.ieee.org
A key task for speech recognition systems is to reduce the mismatch between training and
evaluation data that is often attributable to speaker differences. Speaker adaptation …

Bayesian learning of LF-MMI trained time delay neural networks for speech recognition

S Hu, X **e, S Liu, J Yu, Z Ye, M Geng… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Discriminative training techniques define state-of-the-art performance for automatic speech
recognition systems. However, they are inherently prone to overfitting, leading to poor …

[PDF][PDF] Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.

S Liu, S Hu, Y Wang, J Yu, R Su, X Liu, H Meng - INTERSPEECH, 2019 - isca-archive.org
Automatic speech recognition (ASR) for disordered speech is a challenging task. People
with speech disorders such as dysarthria often have physical disabilities, leading to severe …

[PDF][PDF] On the Use of Pitch Features for Disordered Speech Recognition.

S Liu, S Hu, X Liu, H Meng - Interspeech, 2019 - researchgate.net
Pitch features have long been known to be useful for recognition of normal speech.
However, for disordered speech, the significant degradation of voice quality renders the …

Bayesian x-vector: Bayesian neural network based x-vector system for speaker verification

X Li, J Zhong, J Yu, S Hu, X Wu, X Liu… - arxiv preprint arxiv …, 2020 - arxiv.org
Speaker verification systems usually suffer from the mismatch problem between training and
evaluation data, such as speaker population mismatch, the channel and environment …

[HTML][HTML] Mandarin Recognition Based on Self-Attention Mechanism with Deep Convolutional Neural Network (DCNN)-Gated Recurrent Unit (GRU)

X Chen, C Wang, C Hu, Q Wang - Big Data and Cognitive Computing, 2024 - mdpi.com
Speech recognition technology is an important branch in the field of artificial intelligence,
aiming to transform human speech into computer-readable text information. However …

Transient anomaly detection using gaussian process depth analysis

J Baranowski, A Dudek… - 2021 25th International …, 2021 - ieeexplore.ieee.org
Effective and reliable monitoring and diagnostics of process control installations is of utmost
importance, as they are important part of world's economy. Detection of faulty behavior fits …

[PDF][PDF] LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.

S Hu, X **e, S Liu, MWY Lam, J Yu, X Wu, X Liu… - Interspeech, 2019 - isca-archive.org
Discriminative training techniques define state-of-the-art performance for deep neural
networks (DNNs) based speech recognition systems across a wide range of tasks …