New types of deep neural network learning for speech recognition and related applications: An overview

L Deng, G Hinton, B Kingsbury - 2013 IEEE international …, 2013 - ieeexplore.ieee.org
In this paper, we provide an overview of the invited and contributed papers presented at the
special session at ICASSP-2013, entitled “New Types of Deep Neural Network Learning for …

A tutorial survey of architectures, algorithms, and applications for deep learning

L Deng - APSIPA transactions on Signal and Information …, 2014 - cambridge.org
In this invited paper, my overview material on the same topic as presented in the plenary
overview session of APSIPA-2011 and the tutorial material presented in the same …

[PDF][PDF] Deep Speech: Scaling up end-to-end speech recognition

A Hannun - arxiv preprint arxiv:1412.5567, 2014 - research.baidu.com
We present a state-of-the-art speech recognition system developed using end-to-end deep
learning. Our architecture is significantly simpler than traditional speech systems, which rely …

[图书][B] Automatic speech recognition

D Yu, L Deng - 2016 - Springer
Automatic Speech Recognition (ASR), which is aimed to enable natural human–machine
interaction, has been an intensive research area for decades. Many core technologies, such …

[PDF][PDF] 1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs.

F Seide, H Fu, J Droppo, G Li, D Yu - Interspeech, 2014 - isca-archive.org
We show empirically that in SGD training of deep neural networks, one can, at no or nearly
no loss of accuracy, quantize the gradients aggressively—to but one bit per value—if the …

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com
This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

[PDF][PDF] Rectifier nonlinearities improve neural network acoustic models

AL Maas, AY Hannun, AY Ng - Proc. icml, 2013 - awnihannun.com
Deep neural network acoustic models produce substantial gains in large vocabulary
continuous speech recognition systems. Emerging work with rectified linear (ReL) hidden …

Deep convolutional neural networks for large-scale speech tasks

TN Sainath, B Kingsbury, G Saon, H Soltau… - Neural networks, 2015 - Elsevier
Abstract Convolutional Neural Networks (CNNs) are an alternative type of neural network
that can be used to reduce spectral variations and model spectral correlations which exist in …

Using recurrent neural networks for slot filling in spoken language understanding

G Mesnil, Y Dauphin, K Yao, Y Bengio… - … on Audio, Speech …, 2014 - ieeexplore.ieee.org
Semantic slot filling is one of the most challenging problems in spoken language
understanding (SLU). In this paper, we propose to use recurrent neural networks (RNNs) for …

Improving deep neural networks for LVCSR using rectified linear units and dropout

GE Dahl, TN Sainath, GE Hinton - 2013 IEEE international …, 2013 - ieeexplore.ieee.org
Recently, pre-trained deep neural networks (DNNs) have outperformed traditional acoustic
models based on Gaussian mixture models (GMMs) on a variety of large vocabulary speech …