- Academic Search

CIJ Lai, Y Zhang, AH Liu, S Chang… - Advances in …, 2021 - proceedings.neurips.cc

Self-supervised speech representation learning (speech SSL) has demonstrated the benefit
of scale in learning rich representations for Automatic Speech Recognition (ASR) with …

Save Cite Cited by 73 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Tied & reduced rnn-t decoder

R Botros, TN Sainath, R David, E Guzman, W Li… - ar** in neural transducer

Y Yang, X Yang, L Guo, Z Yao, W Kang… - arxiv preprint arxiv …, 2023 - arxiv.org

Neural Transducer and connectionist temporal classification (CTC) are popular end-to-end
automatic speech recognition systems. Due to their frame-synchronous design, blank …

Save Cite Cited by 9 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Efficient conformer with prob-sparse attention mechanism for end-to-endspeech recognition

X Wang, S Sun, L **e, L Ma - arxiv preprint arxiv:2106.09236, 2021 - arxiv.org

End-to-end models are favored in automatic speech recognition (ASR) because of their
simplified system structure and superior performance. Among these models, Transformer …

Save Cite Cited by 22 Related articles All 6 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Tiny transducer: A highly-efficient speech recognition model on edge devices

Parp: Prune, adjust and re-prune for self-supervised speech recognition

Tied & reduced rnn-t decoder

Efficient conformer with prob-sparse attention mechanism for end-to-endspeech recognition