Parp: Prune, adjust and re-prune for self-supervised speech recognition
Self-supervised speech representation learning (speech SSL) has demonstrated the benefit
of scale in learning rich representations for Automatic Speech Recognition (ASR) with …
Tied & reduced rnn-t decoder
R Botros, TN Sainath, R David, E Guzman, W Li… - ar…
… in neural transducer
Neural Transducer and connectionist temporal classification (CTC) are popular end-to-end
automatic speech recognition systems. Due to their frame-synchronous design, blank …
Efficient conformer with prob-sparse attention mechanism for end-to-end speech recognition
End-to-end models are favored in automatic speech recognition (ASR) because of their
simplified system structure and superior performance. Among these models, Transformer …