- Academic Search

Y Wang, Z Chen, C Zheng, Y Zhang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

We propose a novel method to accelerate training and inference process of recurrent neural
network transducer (RNN-T) based on the guidance from a co-trained connectionist …

Save Cite Cited by 24 Related articles All 3 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Adapting large language model with speech for fully formatted end-to-end speech recognition

S Ling, Y Hu, S Qian, G Ye, Y Qian… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

Most end-to-end (E2E) speech recognition models are composed of encoder and decoder
blocks that perform acoustic and language modeling functions. Pretrained large language …

Save Cite Cited by 12 Related articles All 3 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Decoder-only architecture for speech recognition with ctc prompts and text data augmentation

E Tsunoo, H Futami, Y Kashiwagi, S Arora… - ar** in neural transducer

Y Yang, X Yang, L Guo, Z Yao, W Kang… - ar**: Highly Efficient Decoding for Transducers

V Bataev, H Xu, D Galvez, V Lavrukhin… - arxiv preprint arxiv …, 2024 - arxiv.org

This paper introduces a highly efficient greedy decoding algorithm for Transducer inference.
We propose a novel data structure using CUDA tensors to represent partial hypotheses in a …

Save Cite Cited by 1 Related articles All 2 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Fsr: Accelerating the inference process of transducer-based models by applying fast-skip...

Accelerating rnn-t training and inference using ctc guidance

Adapting large language model with speech for fully formatted end-to-end speech recognition

Decoder-only architecture for speech recognition with ctc prompts and text data augmentation