Survey of deep learning paradigms for speech processing

KB Bhangale, M Kothandaraman - Wireless Personal Communications, 2022 - Springer
Over the past decades, a particular focus is given to research on machine learning
techniques for speech processing applications. However, in the past few years, research …

An overview of noise-robust automatic speech recognition

J Li, L Deng, Y Gong… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org
New waves of consumer-centric applications, such as voice search and voice interaction
with mobile devices and home entertainment systems, increasingly require automatic …

The application of hidden Markov models in speech recognition

M Gales, S Young - Foundations and Trends® in Signal …, 2008 - nowpublishers.com
The Application of Hidden Markov Models in Speech Recognition Page 1 The Application of
Hidden Markov Models in Speech Recognition Full text available at: http://dx.doi.org/10.1561/2000000004 …

The IBM 2015 English conversational telephone speech recognition system

G Saon, HKJ Kuo, S Rennie, M Picheny - arxiv preprint arxiv:1505.05899, 2015 - arxiv.org
We describe the latest improvements to the IBM English conversational telephone speech
recognition system. Some of the techniques that were found beneficial are: maxout networks …

Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition

T Yoshioka, A Sehr, M Delcroix… - IEEE Signal …, 2012 - ieeexplore.ieee.org
Speech recognition technology has left the research laboratory and is increasingly coming
into practical use, enabling a wide spectrum of innovative and exciting voice-driven …

[KÖNYV][B] Distant speech recognition

M Wölfel, J McDonough - 2009 - books.google.com
A complete overview of distant automatic speech recognition The performance of
conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon …

Speech recognition and keyword spotting for low-resource languages: Babel project research at cued

MJF Gales, KM Knill, A Ragni… - … workshop on spoken …, 2014 - eprints.whiterose.ac.uk
Recently there has been increased interest in Automatic Speech Recognition (ASR) and
Key Word Spotting (KWS) systems for low resource languages. One of the driving forces for …

Boosted MMI for model and feature-space discriminative training

D Povey, D Kanevsky, B Kingsbury… - … , Speech and Signal …, 2008 - ieeexplore.ieee.org
We present a modified form of the maximum mutual information (MMI) objective function
which gives improved results for discriminative training. The modification consists of …

Developments and directions in speech recognition and understanding, Part 1 [DSP Education]

JM Baker, L Deng, J Glass, S Khudanpur… - IEEE Signal …, 2009 - ieeexplore.ieee.org
To advance research, it is important to identify promising future research directions,
especially those that have not been adequately pursued or funded in the past. The working …

Audio-visual deep learning for noise robust speech recognition

J Huang, B Kingsbury - 2013 IEEE international conference on …, 2013 - ieeexplore.ieee.org
Deep belief networks (DBN) have shown impressive improvements over Gaussian mixture
models for automatic speech recognition. In this work we use DBNs for audio-visual speech …