Machine learning paradigms for speech recognition: An overview

L Deng, X Li - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) has historically been a driving force behind many
machine learning (ML) techniques, including the ubiquitously used hidden Markov model …

[HTML][HTML] A survey on the application of recurrent neural networks to statistical language modeling

W De Mulder, S Bethard, MF Moens - Computer Speech & Language, 2015 - Elsevier
In this paper, we present a survey on the application of recurrent neural networks to the task
of statistical language modeling. Although it has been shown that these models obtain good …

[HTML][HTML] Deep speech 2: End-to-end speech recognition in english and mandarin

D Amodei, S Ananthanarayanan… - International …, 2016 - proceedings.mlr.press
We show that an end-to-end deep learning approach can be used to recognize either
English or Mandarin Chinese speech–two vastly different languages. Because it replaces …

[PDF][PDF] Deep Speech: Scaling up end-to-end speech recognition

A Hannun - arxiv preprint arxiv:1412.5567, 2014 - research.baidu.com
We present a state-of-the-art speech recognition system developed using end-to-end deep
learning. Our architecture is significantly simpler than traditional speech systems, which rely …

Audio-visual speech recognition using deep learning

K Noda, Y Yamaguchi, K Nakadai, HG Okuno… - Applied intelligence, 2015 - Springer
Audio-visual speech recognition (AVSR) system is thought to be one of the most promising
solutions for reliable speech recognition, particularly when the audio is corrupted by noise …

[BUCH][B] Supervised sequence labelling

A Graves, A Graves - 2012 - Springer
This chapter provides the background material and literature review for supervised
sequence labelling. Section 2.1 briefly reviews supervised learning in general. Section 2.2 …

Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition

GE Dahl, D Yu, L Deng, A Acero - IEEE Transactions on audio …, 2011 - ieeexplore.ieee.org
We propose a novel context-dependent (CD) model for large-vocabulary speech recognition
(LVSR) that leverages recent advances in using deep belief networks for phone recognition …

An investigation of deep neural networks for noise robust speech recognition

ML Seltzer, D Yu, Y Wang - 2013 IEEE international conference …, 2013 - ieeexplore.ieee.org
Recently, a new acoustic model based on deep neural networks (DNN) has been
introduced. While the DNN has generated significant improvements over GMM-based …

[PDF][PDF] Conversational speech transcription using context-dependent deep neural networks.

F Seide, G Li, D Yu - Interspeech, 2011 - microsoft.com
We apply the recently proposed Context-Dependent Deep-Neural-Network HMMs, or CD-
DNN-HMMs, to speech-to-text transcription. For single-pass speaker-independent …

[BUCH][B] Biological sequence analysis: probabilistic models of proteins and nucleic acids

R Durbin, SR Eddy, A Krogh, G Mitchison - 1998 - books.google.com
Probabilistic models are becoming increasingly important in analysing the huge amount of
data being produced by large-scale DNA-sequencing efforts such as the Human Genome …