Dense CNN with self-attention for time-domain speech enhancement

A Pandey, DL Wang - IEEE/ACM transactions on audio, speech …, 2021 - ieeexplore.ieee.org
Speech enhancement in the time domain is becoming increasingly popular in recent years,
due to its capability to jointly enhance both the magnitude and the phase of speech. In this …

U-shaped transformer with frequency-band aware attention for speech enhancement

Y Li, Y Sun, W Wang, SM Naqvi - IEEE/ACM transactions on …, 2023 - ieeexplore.ieee.org
Recently, Transformer shows the potential to exploit the long-range sequence dependency
in speech with self-attention. It has been introduced in single channel speech enhancement …

Dual application of speech enhancement for automatic speech recognition

A Pandey, C Liu, Y Wang… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org
In this work, we exploit speech enhancement for improving a recurrent neural network
transducer (RNN-T) based ASR system. We employ a dense convolutional recurrent …

Assessing the generalization gap of learning-based speech enhancement systems in noisy and reverberant environments

P Gonzalez, TS Alstrøm, T May - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
The acoustic variability of noisy and reverberant speech mixtures is influenced by multiple
factors, such as the spectro-temporal characteristics of the target speaker and the interfering …

PACDNN: A phase-aware composite deep neural network for speech enhancement

M Hasannezhad, H Yu, WP Zhu, B Champagne - Speech Communication, 2022 - Elsevier
Most of the current approaches for speech enhancement (SE) using deep neural network
(DNN) face a number of limitations: they do not exploit information contained in the phase …

A simple RNN model for lightweight, low-compute and low-latency multichannel speech enhancement in the time domain

A Pandey, K Tan, B Xu - INTERSPEECH, 2023 - isca-archive.org
Deep learning has led to unprecedented advances in speech enhancement. However, deep
neural networks (DNNs) typically require large amount of computation, memory, signal …

Self-attending RNN for speech enhancement to improve cross-corpus generalization

A Pandey, DL Wang - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
Deep neural networks (DNNs) represent the mainstream methodology for supervised
speech enhancement, primarily due to their capability to model complex functions using …

Progress made in the efficacy and viability of deep-learning-based noise reduction

EW Healy, EM Johnson, A Pandey… - The Journal of the …, 2023 - pubs.aip.org
Recent years have brought considerable advances to our ability to increase intelligibility
through deep-learning-based noise reduction, especially for hearing-impaired (HI) listeners …

Attentive training: A new training framework for speech enhancement

A Pandey, DL Wang - IEEE/ACM transactions on audio, speech …, 2023 - ieeexplore.ieee.org
Dealing with speech interference in a speech enhancement system requires either speaker
separation or target speaker extraction. Speaker separation has multiple output streams with …

Dual-path self-attention RNN for real-time speech enhancement

A Pandey, DL Wang - arXiv preprint arXiv:2010.12713, 2020 - arxiv.org
We propose a dual-path self-attention recurrent neural network (DP-SARNN) for time-domain
speech enhancement. We improve dual-path RNN (DP-RNN) by augmenting inter …