A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Less forgetting for better generalization: Exploring continual-learning fine-tuning methods for speech self-supervised representations

S Zaiem, T Parcollet, S Essid - arxiv preprint arxiv:2407.00756, 2024 - arxiv.org
Despite being trained on massive and diverse datasets, speech self-supervised encoders
are generally used for downstream purposes as mere frozen feature extractors or model …

Parameter-Efficient Tuning with Adaptive Bottlenecks for Automatic Speech Recognition

G Vanderreydt, A Prasad, D Khalil… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Transfer learning from large multilingual pretrained models, like XLSR, has become the new
paradigm for Automatic Speech Recognition (ASR). Considering their ever-increasing size …

Confidence-based ensembles of end-to-end speech recognition models

I Gitman, V Lavrukhin, A Laptev, B Ginsburg - arxiv preprint arxiv …, 2023 - arxiv.org
The number of end-to-end speech recognition models grows every year. These models are
often adapted to new domains or languages resulting in a proliferation of expert systems that …

[PDF][PDF] Leveraging Adapter for Parameter-Efficient ASR Encoder

K Shim, J Lee, H Kim - Proc. Interspeech 2024, 2024 - isca-archive.org
The expansion of speech models emphasizes the importance of parameter efficiency in
practical automatic speech recognition (ASR) systems. Parameter sharing, which reuses the …

[PDF][PDF] Adapter Integration: Mitigating Catastrophic Forgetting in Multi-Language and Multi-Accent Whisper ASR Model Fine-tuning

Z Huang, H **ng, M Liu - researchgate.net
Recently, Whisper, an extra-large end-to-end ASR model, was released and demonstrated
powerful capability in multilingual ASR. Fine-tuning Whisper, however, can lead to …