An overview of noise-robust automatic speech recognition

J Li, L Deng, Y Gong… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org
New waves of consumer-centric applications, such as voice search and voice interaction
with mobile devices and home entertainment systems, increasingly require automatic …

PLACES: Prompting language models for social conversation synthesis

M Chen, A Papangelis, C Tao, S Kim… - arxiv preprint arxiv …, 2023 - arxiv.org
Collecting high quality conversational data can be very expensive for most applications and
infeasible for others due to privacy, ethical, or similar concerns. A promising direction to …

A regression approach to speech enhancement based on deep neural networks

Y Xu, J Du, LR Dai, CH Lee - IEEE/ACM transactions on audio …, 2014 - ieeexplore.ieee.org
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction
techniques, we propose a supervised method to enhance speech by means of finding a …

Detection and classification of acoustic scenes and events: Outcome of the DCASE 2016 challenge

A Mesaros, T Heittola, E Benetos… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org
Public evaluation campaigns and datasets promote active development in target research
areas, allowing direct comparison of algorithms. The second edition of the challenge on …

Towards scaling up classification-based speech separation

Y Wang, DL Wang - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org
Formulating speech separation as a binary classification problem has been shown to be
effective. While good separation performance is achieved in matched test conditions using …

Noise robust automatic speech recognition: review and analysis

M Dua, Akanksha, S Dua - International Journal of Speech Technology, 2023 - Springer
Abstract Automatic Speech Recognition (ASR) system is an emerging technology used in
various fields such as robotics, traffic controls, and healthcare, etc. The leading cause of …

Enabling reproducible research in sensor-based transportation mode recognition with the Sussex-Huawei dataset

L Wang, H Gjoreski, M Ciliberto, S Mekki… - IEEE …, 2019 - ieeexplore.ieee.org
Transportation and locomotion mode recognition from multimodal smartphone sensors is
useful for providing just-in-time context-aware assistance. However, the field is currently …

The PASCAL CHiME speech separation and recognition challenge

J Barker, E Vincent, N Ma, H Christensen… - Computer Speech & …, 2013 - Elsevier
Distant microphone speech recognition systems that operate with human-like robustness
remain a distant goal. The key difficulty is that operating in everyday listening conditions …

The signal separation evaluation campaign (2007–2010): Achievements and remaining challenges

E Vincent, S Araki, F Theis, G Nolte, P Bofill, H Sawada… - Signal Processing, 2012 - Elsevier
We present the outcomes of three recent evaluation campaigns in the field of audio and
biomedical source separation. These campaigns have witnessed a boom in the range of …

Coherent-to-diffuse power ratio estimation for dereverberation

A Schwarz, W Kellermann - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
The estimation of the time-and frequency-dependent coherent-to-diffuse power ratio (CDR)
from the measured spatial coherence between two omnidirectional microphones is …