A review of deep learning techniques for speech processing
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …
learning. The use of multiple processing layers has enabled the creation of models capable …
Styletts-vc: One-shot voice conversion by knowledge transfer from style-based tts models
One-shot voice conversion (VC) aims to convert speech from any source speaker to an
arbitrary target speaker with only a few seconds of reference speech from the target speaker …
arbitrary target speaker with only a few seconds of reference speech from the target speaker …
Zero-shot voice conversion based on feature disentanglement
Voice conversion (VC) aims to convert the voice from a source speaker to a target speaker
without modifying the linguistic content. Zero-shot voice conversion has attracted significant …
without modifying the linguistic content. Zero-shot voice conversion has attracted significant …
Robust speaker personalisation using generalized low-rank adaptation for automatic speech recognition
A Baby, G Joseph, S Singh - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
For voice assistant systems, personalizing automated speech recognition (ASR) to a
customer is the proverbial holy grail. Careful selection of hyper-parameters will be …
customer is the proverbial holy grail. Careful selection of hyper-parameters will be …