New types of deep neural network learning for speech recognition and related applications: An overview
In this paper, we provide an overview of the invited and contributed papers presented at the
special session at ICASSP-2013, entitled “New Types of Deep Neural Network Learning for …
special session at ICASSP-2013, entitled “New Types of Deep Neural Network Learning for …
A tutorial survey of architectures, algorithms, and applications for deep learning
L Deng - APSIPA transactions on Signal and Information …, 2014 - cambridge.org
In this invited paper, my overview material on the same topic as presented in the plenary
overview session of APSIPA-2011 and the tutorial material presented in the same …
overview session of APSIPA-2011 and the tutorial material presented in the same …
[PDF][PDF] Deep Speech: Scaling up end-to-end speech recognition
A Hannun - arxiv preprint arxiv:1412.5567, 2014 - research.baidu.com
We present a state-of-the-art speech recognition system developed using end-to-end deep
learning. Our architecture is significantly simpler than traditional speech systems, which rely …
learning. Our architecture is significantly simpler than traditional speech systems, which rely …
[图书][B] Automatic speech recognition
Automatic Speech Recognition (ASR), which is aimed to enable natural human–machine
interaction, has been an intensive research area for decades. Many core technologies, such …
interaction, has been an intensive research area for decades. Many core technologies, such …
[PDF][PDF] 1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs.
We show empirically that in SGD training of deep neural networks, one can, at no or nearly
no loss of accuracy, quantize the gradients aggressively—to but one bit per value—if the …
no loss of accuracy, quantize the gradients aggressively—to but one bit per value—if the …
Deep learning: methods and applications
This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …
applications to a variety of signal and information processing tasks. The application areas …
[PDF][PDF] Rectifier nonlinearities improve neural network acoustic models
Deep neural network acoustic models produce substantial gains in large vocabulary
continuous speech recognition systems. Emerging work with rectified linear (ReL) hidden …
continuous speech recognition systems. Emerging work with rectified linear (ReL) hidden …
Deep convolutional neural networks for large-scale speech tasks
Abstract Convolutional Neural Networks (CNNs) are an alternative type of neural network
that can be used to reduce spectral variations and model spectral correlations which exist in …
that can be used to reduce spectral variations and model spectral correlations which exist in …
Using recurrent neural networks for slot filling in spoken language understanding
Semantic slot filling is one of the most challenging problems in spoken language
understanding (SLU). In this paper, we propose to use recurrent neural networks (RNNs) for …
understanding (SLU). In this paper, we propose to use recurrent neural networks (RNNs) for …
Improving deep neural networks for LVCSR using rectified linear units and dropout
Recently, pre-trained deep neural networks (DNNs) have outperformed traditional acoustic
models based on Gaussian mixture models (GMMs) on a variety of large vocabulary speech …
models based on Gaussian mixture models (GMMs) on a variety of large vocabulary speech …