[HTML][HTML] A survey on the application of recurrent neural networks to statistical language modeling

W De Mulder, S Bethard, MF Moens - Computer Speech & Language, 2015 - Elsevier
In this paper, we present a survey on the application of recurrent neural networks to the task
of statistical language modeling. Although it has been shown that these models obtain good …

Large-vocabulary continuous speech recognition systems: A look at some recent advances

G Saon, JT Chien - IEEE signal processing magazine, 2012 - ieeexplore.ieee.org
Over the past decade or so, several advances have been made to the design of modern
large vocabulary continuous speech recognition (LVCSR) systems to the point where their …

Convolutional, long short-term memory, fully connected deep neural networks

TN Sainath, O Vinyals, A Senior… - 2015 IEEE international …, 2015 - ieeexplore.ieee.org
Both Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) have
shown improvements over Deep Neural Networks (DNNs) across a wide variety of speech …

[PDF][PDF] Learning the speech front-end with raw waveform CLDNNs.

TN Sainath, RJ Weiss, AW Senior, KW Wilson… - Interspeech, 2015 - isca-archive.org
Learning an acoustic model directly from the raw waveform has been an active area of
research. However, waveformbased models have not yet matched the performance of …

Deep convolutional neural networks for large-scale speech tasks

TN Sainath, B Kingsbury, G Saon, H Soltau… - Neural networks, 2015 - Elsevier
Abstract Convolutional Neural Networks (CNNs) are an alternative type of neural network
that can be used to reduce spectral variations and model spectral correlations which exist in …

Improving deep neural networks for LVCSR using rectified linear units and dropout

GE Dahl, TN Sainath, GE Hinton - 2013 IEEE international …, 2013 - ieeexplore.ieee.org
Recently, pre-trained deep neural networks (DNNs) have outperformed traditional acoustic
models based on Gaussian mixture models (GMMs) on a variety of large vocabulary speech …

Data augmentation for deep neural network acoustic modeling

X Cui, V Goel, B Kingsbury - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
This paper investigates data augmentation for deep neural network acoustic modeling
based on label-preserving transformations to deal with data sparsity. Two data …

Low-rank matrix factorization for deep neural network training with high-dimensional output targets

TN Sainath, B Kingsbury, V Sindhwani… - … on acoustics, speech …, 2013 - ieeexplore.ieee.org
While Deep Neural Networks (DNNs) have achieved tremendous success for large
vocabulary continuous speech recognition (LVCSR) tasks, training of these networks is …

Strategies for training large scale neural network language models

T Mikolov, A Deoras, D Povey, L Burget… - 2011 IEEE Workshop …, 2011 - ieeexplore.ieee.org
We describe how to effectively train neural network based language models on large data
sets. Fast convergence during training and better overall performance is observed when the …

The IBM 2015 English conversational telephone speech recognition system

G Saon, HKJ Kuo, S Rennie, M Picheny - arxiv preprint arxiv:1505.05899, 2015 - arxiv.org
We describe the latest improvements to the IBM English conversational telephone speech
recognition system. Some of the techniques that were found beneficial are: maxout networks …