- Academic Search

H Wang, WQ Zhang - IEEE Journal of Selected Topics in Signal …, 2024 - ieeexplore.ieee.org

Self-supervised pre-trained speech models require significant memory and computational
resources, limiting their applicability to many speech tasks. Unstructured pruning is a …

Speichern Zitieren Zitiert von: 3 Ähnliche Artikel Alle 2 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models

A Fernandez-Lopez, S Liu, L Yin, S Petridis… - arxiv preprint arxiv …, 2024 - arxiv.org

This paper investigates the under-explored area of low-rank weight training for large-scale
Conformer-based speech recognition models from scratch. Our study demonstrates the …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen HTML-Version

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Sla-former: conformer using shifted linear attention for audio-visual speech recognition

Y **ao, J Huang, X Liu, A Zhu - Complex & Intelligent Systems, 2024 - Springer

Conformer-based models have proven highly effective in Audio-visual Speech Recognition,
integrating auditory and visual inputs to significantly enhance speech recognition accuracy …

Speichern Zitieren Zitiert von: 1 Ähnliche Artikel

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Enhancing Quantised End-to-End ASR Models Via Personalisation

Q Zhao, G Sun, C Zhang, M Xu… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

Recent end-to-end automatic speech recognition (ASR) models have become increasingly
larger, making them particularly challenging to be deployed on resource-constrained …

Speichern Zitieren Zitiert von: 2 Ähnliche Artikel Alle 3 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] isca-archive.org

[PDF][PDF] Leveraging Adapter for Parameter-Efficient ASR Encoder

K Shim, J Lee, H Kim - Proc. Interspeech 2024, 2024 - isca-archive.org

The expansion of speech models emphasizes the importance of parameter efficiency in
practical automatic speech recognition (ASR) systems. Parameter sharing, which reuses the …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen HTML-Version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Speaker Adaptation for Quantised End-to-End ASR Models

Q Zhao, G Sun, C Zhang, M Xu, TF Zheng - arxiv preprint arxiv …, 2024 - arxiv.org

End-to-end models have shown superior performance for automatic speech recognition
(ASR). However, such models are often very large in size and thus challenging to deploy on …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen HTML-Version

Conformer-Based Audio Visual Speech Recognition with Taylor Attention

Y **ao, J Huang, X Liu, A Zhu - International Conference on Pattern …, 2024 - Springer

Abstract Audio Visual Speech Recognition (AVSR) has witnessed significant advancements
by deploying sophisticated neural networks, particularly Convolutional Neural Networks …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Residualtransformer: Residual Low-Rank Learning With Weight-Sharing For Transformer Layers

Y Wang, J Li - … 2024-2024 IEEE International Conference on …, 2024 - ieeexplore.ieee.org

Memory constraint of always-on devices is one of the major concerns when deploying
speech processing models on these devices. While larger models trained with sufficiently …

Speichern Zitieren Zitiert von: 4 Ähnliche Artikel Alle 3 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] vcu.edu

WiFi Sensing at the Edge Towards Scalable On-Device Wireless Sensing Systems

SM Hernandez - 2023 - scholarscompass.vcu.edu

WiFi sensing offers a powerful method for tracking physical activities using the radio-
frequency signals already found throughout our homes and offices. This novel sensing …

Speichern Zitieren Ähnliche Artikel Alle 3 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Sharing low rank conformer weights for tiny always-on ambient speech recognition models

Unstructured Pruning and Low Rank Factorisation of Self-Supervised Pre-Trained Speech Models

Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models

Sla-former: conformer using shifted linear attention for audio-visual speech recognition

Enhancing Quantised End-to-End ASR Models Via Personalisation

[PDF][PDF] Leveraging Adapter for Parameter-Efficient ASR Encoder

Speaker Adaptation for Quantised End-to-End ASR Models

Conformer-Based Audio Visual Speech Recognition with Taylor Attention

Residualtransformer: Residual Low-Rank Learning With Weight-Sharing For Transformer Layers

WiFi Sensing at the Edge Towards Scalable On-Device Wireless Sensing Systems