A review of deep learning techniques for speech processing
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …
learning. The use of multiple processing layers has enabled the creation of models capable …
Less forgetting for better generalization: Exploring continual-learning fine-tuning methods for speech self-supervised representations
Despite being trained on massive and diverse datasets, speech self-supervised encoders
are generally used for downstream purposes as mere frozen feature extractors or model …
are generally used for downstream purposes as mere frozen feature extractors or model …
Parameter-Efficient Tuning with Adaptive Bottlenecks for Automatic Speech Recognition
G Vanderreydt, A Prasad, D Khalil… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Transfer learning from large multilingual pretrained models, like XLSR, has become the new
paradigm for Automatic Speech Recognition (ASR). Considering their ever-increasing size …
paradigm for Automatic Speech Recognition (ASR). Considering their ever-increasing size …
Confidence-based ensembles of end-to-end speech recognition models
The number of end-to-end speech recognition models grows every year. These models are
often adapted to new domains or languages resulting in a proliferation of expert systems that …
often adapted to new domains or languages resulting in a proliferation of expert systems that …
[PDF][PDF] Leveraging Adapter for Parameter-Efficient ASR Encoder
K Shim, J Lee, H Kim - Proc. Interspeech 2024, 2024 - isca-archive.org
The expansion of speech models emphasizes the importance of parameter efficiency in
practical automatic speech recognition (ASR) systems. Parameter sharing, which reuses the …
practical automatic speech recognition (ASR) systems. Parameter sharing, which reuses the …
[PDF][PDF] Adapter Integration: Mitigating Catastrophic Forgetting in Multi-Language and Multi-Accent Whisper ASR Model Fine-tuning
Z Huang, H **ng, M Liu - researchgate.net
Recently, Whisper, an extra-large end-to-end ASR model, was released and demonstrated
powerful capability in multilingual ASR. Fine-tuning Whisper, however, can lead to …
powerful capability in multilingual ASR. Fine-tuning Whisper, however, can lead to …