The Microsoft 2017 conversational speech recognition system
We describe the latest version of Microsoft's conversational speech recognition system for
the Switchboard and CallHome domains. The system adds a CNN-BLSTM acoustic model to …
the Switchboard and CallHome domains. The system adds a CNN-BLSTM acoustic model to …
Achieving human parity in conversational speech recognition
Conversational speech recognition has served as a flagship speech recognition task since
the release of the Switchboard corpus in the 1990s. In this paper, we measure the human …
the release of the Switchboard corpus in the 1990s. In this paper, we measure the human …
Toward human parity in conversational speech recognition
Conversational speech recognition has served as a flagship speech recognition task since
the release of the Switchboard corpus in the 1990s. In this paper, we measure a human …
the release of the Switchboard corpus in the 1990s. In this paper, we measure a human …
[PDF][PDF] Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization.
Training neural network acoustic models with sequencediscriminative criteria, such as state-
level minimum Bayes risk (sMBR), been shown to produce large improvements in …
level minimum Bayes risk (sMBR), been shown to produce large improvements in …
Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
We investigate back-propagation based sequence training of Context-Dependent Deep-
Neural-Network HMMs, or CD-DNN-HMMs, for conversational speech transcription …
Neural-Network HMMs, or CD-DNN-HMMs, for conversational speech transcription …
Optimizing a multi-layer perceptron based on an improved gray wolf algorithm to identify plant diseases
C Bi, Q Tian, H Chen, X Meng, H Wang, W Liu, J Jiang - Mathematics, 2023 - mdpi.com
Metaheuristic optimization algorithms play a crucial role in optimization problems. However,
the traditional identification methods have the following problems:(1) difficulties in nonlinear …
the traditional identification methods have the following problems:(1) difficulties in nonlinear …
ASR error detection using recurrent neural network language model and complementary ASR
Detecting automatic speech recognition (ASR) errors can play an important role for effective
human-computer spoken dialogue system, as recognition errors can hinder accurate system …
human-computer spoken dialogue system, as recognition errors can hinder accurate system …
Discriminative method for recurrent neural network language models
A recurrent neural network language model (RNN-LM) can use a long word context more
than can an n-gram language model, and its effective has recently been shown in its …
than can an n-gram language model, and its effective has recently been shown in its …
[PDF][PDF] Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models
We describe a simple but effective way of using multi-frame targets to improve the accuracy
of Artificial Neural Network-Hidden Markov Model (ANN-HMM) hybrid systems. In this …
of Artificial Neural Network-Hidden Markov Model (ANN-HMM) hybrid systems. In this …