FitHuBERT: Going thinner and deeper for knowledge distillation of speech self-supervised learning

Y Lee, K Jang, J Goo, Y Jung, H Kim - ar…

… human in the loop of drone-assisted inspection

Y Li, A Parsan, B Wang, P Dong, S Yao… - Engineering Applications of …, 2023 - Elsevier
Audio commands are a preferred communication medium to keep inspectors in the loop of
civil infrastructure inspection performed by a semi-autonomous drone. To understand job …

Adapting TTS models for new speakers using transfer learning

P Neekhara, J Li, B Ginsburg - arXiv preprint arXiv:2110.05798, 2021 - arxiv.org
Training neural text-to-speech (TTS) models for a new speaker typically requires several
hours of high quality speech data. Prior works on voice cloning attempt to address this …

Automatic Fluency Assessment Method for Spontaneous Speech without Reference Text

J Liu, A Wumaier, C Fan, S Guo - Electronics, 2023 - mdpi.com
The automatic fluency assessment of spontaneous speech without reference text is a
challenging task that heavily depends on the accuracy of automatic speech recognition …

One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification

J Heo, C Lim, J Kim, H Shin, HJ Yu - arXiv preprint arXiv:2305.17394, 2023 - arxiv.org
The application of speech self-supervised learning (SSL) models has achieved remarkable
performance in speaker verification (SV). However, there is a computational cost hurdle in …

Multi-task wav2vec2 serving as a pronunciation training system for children

Y Getman, R Al-Ghezi, T Grosz… - 9th Workshop on Speech …, 2023 - research.aalto.fi
Computer-assisted learning tools (CAPT) are increasingly reliant on AI tools. Recent studies
demonstrated how neural systems pre-trained in a self-supervised fashion, such as …

SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning

L Zampierin, GB Hacene, B Nguyen… - arXiv preprint arXiv …, 2024 - arxiv.org
Self-supervised learning (SSL) has achieved remarkable success across various
speech-processing tasks. To enhance its efficiency, previous works often leverage the use of …

SelfVC: Voice Conversion With Iterative Refinement using Self Transformations

P Neekhara, S Hussain, R Valle, B Ginsburg… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose SelfVC, a training strategy to iteratively improve a voice conversion model with
self-synthesized examples. Previous efforts on voice conversion focus on explicitly …