Parp: Prune, adjust and re-prune for self-supervised speech recognition

CIJ Lai, Y Zhang, AH Liu, S Chang… - Advances in …, 2021 - proceedings.neurips.cc
Self-supervised speech representation learning (speech SSL) has demonstrated the benefit
of scale in learning rich representations for Automatic Speech Recognition (ASR) with …

Tied & reduced rnn-t decoder

R Botros, TN Sainath, R David, E Guzman, W Li… - ar** in neural transducer
Y Yang, X Yang, L Guo, Z Yao, W Kang… - arxiv preprint arxiv …, 2023 - arxiv.org
Neural Transducer and connectionist temporal classification (CTC) are popular end-to-end
automatic speech recognition systems. Due to their frame-synchronous design, blank …

Efficient conformer with prob-sparse attention mechanism for end-to-endspeech recognition

X Wang, S Sun, L **e, L Ma - arxiv preprint arxiv:2106.09236, 2021 - arxiv.org
End-to-end models are favored in automatic speech recognition (ASR) because of their
simplified system structure and superior performance. Among these models, Transformer …