The Microsoft 2017 conversational speech recognition system

W **ong, L Wu, F Alleva, J Droppo… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
We describe the latest version of Microsoft's conversational speech recognition system for
the Switchboard and CallHome domains. The system adds a CNN-BLSTM acoustic model to …

Achieving human parity in conversational speech recognition

W **ong, J Droppo, X Huang, F Seide, M Seltzer… - arxiv preprint arxiv …, 2016 - arxiv.org
Conversational speech recognition has served as a flagship speech recognition task since
the release of the Switchboard corpus in the 1990s. In this paper, we measure the human …

Toward human parity in conversational speech recognition

W **ong, J Droppo, X Huang, F Seide… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org
Conversational speech recognition has served as a flagship speech recognition task since
the release of the Switchboard corpus in the 1990s. In this paper, we measure a human …

[PDF][PDF] Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization.

B Kingsbury, TN Sainath, H Soltau - Interspeech, 2012 - isca-archive.org
Training neural network acoustic models with sequencediscriminative criteria, such as state-
level minimum Bayes risk (sMBR), been shown to produce large improvements in …

Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription

H Su, G Li, D Yu, F Seide - 2013 IEEE International Conference …, 2013 - ieeexplore.ieee.org
We investigate back-propagation based sequence training of Context-Dependent Deep-
Neural-Network HMMs, or CD-DNN-HMMs, for conversational speech transcription …

Optimizing a multi-layer perceptron based on an improved gray wolf algorithm to identify plant diseases

C Bi, Q Tian, H Chen, X Meng, H Wang, W Liu, J Jiang - Mathematics, 2023 - mdpi.com
Metaheuristic optimization algorithms play a crucial role in optimization problems. However,
the traditional identification methods have the following problems:(1) difficulties in nonlinear …

ASR error detection using recurrent neural network language model and complementary ASR

YC Tam, Y Lei, J Zheng, W Wang - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
Detecting automatic speech recognition (ASR) errors can play an important role for effective
human-computer spoken dialogue system, as recognition errors can hinder accurate system …

Discriminative method for recurrent neural network language models

Y Tachioka, S Watanabe - 2015 IEEE International Conference …, 2015 - ieeexplore.ieee.org
A recurrent neural network language model (RNN-LM) can use a long word context more
than can an n-gram language model, and its effective has recently been shown in its …

[PDF][PDF] Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

N Jaitly, V Vanhoucke, G Hinton - Fifteenth Annual Conference of …, 2014 - isca-archive.org
We describe a simple but effective way of using multi-frame targets to improve the accuracy
of Artificial Neural Network-Hidden Markov Model (ANN-HMM) hybrid systems. In this …

[인용][C] Sequence-discriminative training of deep neural networks.

K Veselý, A Ghoshal, L Burget, D Povey - Interspeech, 2013