Survey of deep learning paradigms for speech processing

KB Bhangale, M Kothandaraman - Wireless Personal Communications, 2022 - Springer
Over the past decades, particular focus has been given to research on machine learning
techniques for speech processing applications. However, in the past few years, research …

An overview of noise-robust automatic speech recognition

J Li, L Deng, Y Gong… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org
New waves of consumer-centric applications, such as voice search and voice interaction
with mobile devices and home entertainment systems, increasingly require automatic …

Conv-TasNet: Surpassing ideal time–frequency magnitude masking for speech separation

Y Luo, N Mesgarani - IEEE/ACM transactions on audio, speech …, 2019 - ieeexplore.ieee.org
Single-channel, speaker-independent speech separation methods have recently seen great
progress. However, the accuracy, latency, and computational cost of such methods remain …

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

Multichannel audio source separation with deep neural networks

AA Nugraha, A Liutkus, E Vincent - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
This article addresses the problem of multichannel audio source separation. We propose a
framework where deep neural networks (DNNs) are used to model the source spectra and …

[BOOK][B] Microphone array signal processing

J Benesty, J Chen, Y Huang - 2008 - books.google.com
In the past few years we have written and edited several books in the area of
acoustic and speech signal processing. The reason behind this endeavor is that there were …

The diverse environments multi-channel acoustic noise database (DEMAND): A database of multichannel environmental noise recordings

J Thiemann, N Ito, E Vincent - Proceedings of Meetings on Acoustics, 2013 - pubs.aip.org
Background: In audio recordings outside of controlled studio setups, the presence of
acoustic background noise is a simple fact of life. As a result, there is continued interest in …

Far-field automatic speech recognition

R Haeb-Umbach, J Heymann, L Drude… - Proceedings of the …, 2020 - ieeexplore.ieee.org
The machine recognition of speech spoken at a distance from the microphones, known as
far-field automatic speech recognition (ASR), has received a significant increase in attention …