Machine learning paradigms for speech recognition: An overview
L Deng, X Li - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) has historically been a driving force behind many
machine learning (ML) techniques, including the ubiquitously used hidden Markov model …
machine learning (ML) techniques, including the ubiquitously used hidden Markov model …
[HTML][HTML] A survey on the application of recurrent neural networks to statistical language modeling
In this paper, we present a survey on the application of recurrent neural networks to the task
of statistical language modeling. Although it has been shown that these models obtain good …
of statistical language modeling. Although it has been shown that these models obtain good …
[HTML][HTML] Deep speech 2: End-to-end speech recognition in english and mandarin
We show that an end-to-end deep learning approach can be used to recognize either
English or Mandarin Chinese speech–two vastly different languages. Because it replaces …
English or Mandarin Chinese speech–two vastly different languages. Because it replaces …
[PDF][PDF] Deep Speech: Scaling up end-to-end speech recognition
A Hannun - arxiv preprint arxiv:1412.5567, 2014 - research.baidu.com
We present a state-of-the-art speech recognition system developed using end-to-end deep
learning. Our architecture is significantly simpler than traditional speech systems, which rely …
learning. Our architecture is significantly simpler than traditional speech systems, which rely …
Audio-visual speech recognition using deep learning
Audio-visual speech recognition (AVSR) system is thought to be one of the most promising
solutions for reliable speech recognition, particularly when the audio is corrupted by noise …
solutions for reliable speech recognition, particularly when the audio is corrupted by noise …
[BUCH][B] Supervised sequence labelling
A Graves, A Graves - 2012 - Springer
This chapter provides the background material and literature review for supervised
sequence labelling. Section 2.1 briefly reviews supervised learning in general. Section 2.2 …
sequence labelling. Section 2.1 briefly reviews supervised learning in general. Section 2.2 …
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
We propose a novel context-dependent (CD) model for large-vocabulary speech recognition
(LVSR) that leverages recent advances in using deep belief networks for phone recognition …
(LVSR) that leverages recent advances in using deep belief networks for phone recognition …
An investigation of deep neural networks for noise robust speech recognition
Recently, a new acoustic model based on deep neural networks (DNN) has been
introduced. While the DNN has generated significant improvements over GMM-based …
introduced. While the DNN has generated significant improvements over GMM-based …
[PDF][PDF] Conversational speech transcription using context-dependent deep neural networks.
We apply the recently proposed Context-Dependent Deep-Neural-Network HMMs, or CD-
DNN-HMMs, to speech-to-text transcription. For single-pass speaker-independent …
DNN-HMMs, to speech-to-text transcription. For single-pass speaker-independent …
[BUCH][B] Biological sequence analysis: probabilistic models of proteins and nucleic acids
Probabilistic models are becoming increasingly important in analysing the huge amount of
data being produced by large-scale DNA-sequencing efforts such as the Human Genome …
data being produced by large-scale DNA-sequencing efforts such as the Human Genome …