[HTML][HTML] A survey on the application of recurrent neural networks to statistical language modeling
In this paper, we present a survey on the application of recurrent neural networks to the task
of statistical language modeling. Although it has been shown that these models obtain good …
of statistical language modeling. Although it has been shown that these models obtain good …
Large-vocabulary continuous speech recognition systems: A look at some recent advances
Over the past decade or so, several advances have been made to the design of modern
large vocabulary continuous speech recognition (LVCSR) systems to the point where their …
large vocabulary continuous speech recognition (LVCSR) systems to the point where their …
Convolutional, long short-term memory, fully connected deep neural networks
Both Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) have
shown improvements over Deep Neural Networks (DNNs) across a wide variety of speech …
shown improvements over Deep Neural Networks (DNNs) across a wide variety of speech …
[PDF][PDF] Learning the speech front-end with raw waveform CLDNNs.
Learning an acoustic model directly from the raw waveform has been an active area of
research. However, waveformbased models have not yet matched the performance of …
research. However, waveformbased models have not yet matched the performance of …
Deep convolutional neural networks for large-scale speech tasks
Abstract Convolutional Neural Networks (CNNs) are an alternative type of neural network
that can be used to reduce spectral variations and model spectral correlations which exist in …
that can be used to reduce spectral variations and model spectral correlations which exist in …
Improving deep neural networks for LVCSR using rectified linear units and dropout
Recently, pre-trained deep neural networks (DNNs) have outperformed traditional acoustic
models based on Gaussian mixture models (GMMs) on a variety of large vocabulary speech …
models based on Gaussian mixture models (GMMs) on a variety of large vocabulary speech …
Data augmentation for deep neural network acoustic modeling
This paper investigates data augmentation for deep neural network acoustic modeling
based on label-preserving transformations to deal with data sparsity. Two data …
based on label-preserving transformations to deal with data sparsity. Two data …
Low-rank matrix factorization for deep neural network training with high-dimensional output targets
While Deep Neural Networks (DNNs) have achieved tremendous success for large
vocabulary continuous speech recognition (LVCSR) tasks, training of these networks is …
vocabulary continuous speech recognition (LVCSR) tasks, training of these networks is …
Strategies for training large scale neural network language models
We describe how to effectively train neural network based language models on large data
sets. Fast convergence during training and better overall performance is observed when the …
sets. Fast convergence during training and better overall performance is observed when the …
The IBM 2015 English conversational telephone speech recognition system
We describe the latest improvements to the IBM English conversational telephone speech
recognition system. Some of the techniques that were found beneficial are: maxout networks …
recognition system. Some of the techniques that were found beneficial are: maxout networks …