Generative adversarial networks for speech processing: A review

A Wali, Z Alamgir, S Karim, A Fawaz, MB Ali… - Computer Speech & …, 2022 - Elsevier
Generative adversarial networks (GANs) have seen remarkable progress in recent years.
They are used as generative models for all kinds of data such as text, images, audio, music …

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer
Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

CochleaNet: A robust language-independent audio-visual model for real-time speech enhancement

M Gogate, K Dashtipour, A Adeel, A Hussain - Information Fusion, 2020 - Elsevier
Noisy situations cause huge problems for the hearing-impaired, as hearing aids often make
speech more audible but do not always restore intelligibility. In noisy settings, humans …

A multi-module generative adversarial network augmented with adaptive decoupling strategy for intelligent fault diagnosis of machines with small sample

K Zhang, Q Chen, J Chen, S He, F Li, Z Zhou - Knowledge-Based Systems, 2022 - Elsevier
In actual industrial environment, intelligent diagnosis method requires a sufficient number of
samples to ensure application effect. However, once industrial system fails, it usually stops …

Training neural audio classifiers with few data

J Pons, J Serrà, X Serra - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
We investigate supervised learning strategies that improve the training of neural network
audio classifiers on small annotated collections. In particular, we study whether (i) a naive …

Efficient speech enhancement using recurrent convolution encoder and decoder

A Karthik, JL MazherIqbal - Wireless Personal Communications, 2021 - Springer
The accuracy of voice or speech recognition is affected due to the presence of various
background noises present in the surroundings. Automatic Speech Recognition …

Domain adaptation and autoencoder-based unsupervised speech enhancement

Y Li, Y Sun, K Horoshenkov… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
As a category of transfer learning, domain adaptation plays an important role in generalizing
the model trained in one task and applying it to other similar tasks or settings. In speech …

Towards generalized speech enhancement with generative adversarial networks

S Pascual, J Serrà, A Bonafonte - arxiv preprint arxiv:1904.03418, 2019 - arxiv.org
The speech enhancement task usually consists of removing additive noise or reverberation
that partially mask spoken utterances, affecting their intelligibility. However, little attention is …

On generative-adversarial-network-based underwater acoustic noise modeling

M Zhou, J Wang, X Feng, H Sun, J Li… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Noise fitting plays a key role in underwater acoustic communications. Traditional
approximate models can fit global heavy-tail distribution of the impulsive noise with fixed …

Improving generative adversarial networks for speech enhancement through regularization of latent representations

F Yang, Z Wang, J Li, R **a, Y Yan - Speech Communication, 2020 - Elsevier
Speech enhancement aims to improve the quality and intelligibility of speech signals, which
is a challenging task in adverse environments. Speech enhancement generative adversarial …