PAAPLoss: A phonetic-aligned acoustic parameter loss for speech enhancement

M Yang, J Konan, D Bick, Y Zeng, S Han… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Despite rapid advancement in recent years, current speech enhancement models often
produce speech that differs in perceptual quality from real clean speech. We propose a …

The effect of spoken language on speech enhancement using self-supervised speech representation loss functions

G Close, T Hain, S Goetze - … of Signal Processing to Audio and …, 2023 - ieeexplore.ieee.org
Recent work in the field of speech enhancement (SE) has involved the use of self-
supervised speech representations (SSSRs) as feature transformations in loss functions …

An empirical study on speech restoration guided by self-supervised speech representation

J Byun, Y Ji, SW Chung, S Choe… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Enhancing speech quality is an indispensable yet difficult task as it is often complicated by a
range of degradation factors. In addition to additive noise, reverberation, clip**, and …

Boosting Speech Enhancement with Clean Self-Supervised Features Via Conditional Variational Autoencoders

Y Lee, K Jung - … 2024-2024 IEEE International Conference on …, 2024 - ieeexplore.ieee.org
Recently, Self-Supervised Features (SSF) trained on extensive speech datasets have shown
significant performance gains across various speech processing tasks. Nevertheless, their …

Enhancing TTS Stability in Hebrew using Discrete Semantic Units

E Zeldes, O Tal, Y Adi - arxiv preprint arxiv:2410.21502, 2024 - arxiv.org
This study introduces a refined approach to Text-to-Speech (TTS) generation that
significantly enhances sampling stability across languages, with a particular focus on …

A Closer Look at Wav2vec2 Embeddings for On-Device Single-Channel Speech Enhancement

R Shankar, K Tan, B Xu, A Kumar - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Self-supervised learned models have been found to be very effective for tasks such as
automatic speech recognition, speaker identification, and others. However, their utility in …