Quartznet: Deep automatic speech recognition with 1d time-channel separable convolutions S Kriman, S Beliaev, B Ginsburg, J Huang, O Kuchaiev, V Lavrukhin, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 370 | 2020 |
Nemo: a toolkit for building ai applications using neural modules O Kuchaiev, J Li, H Nguyen, O Hrinchuk, R Leary, B Ginsburg, S Kriman, ... arXiv preprint arXiv:1909.09577, 2019 | 295 | 2019 |
RULER: What's the Real Context Size of Your Long-Context Language Models? CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, Y Zhang, ... arXiv preprint arXiv:2404.06654, 2024 | 106 | 2024 |
Fast conformer with linearly scalable attention for efficient speech recognition D Rekesh, NR Koluguri, S Kriman, S Majumdar, V Noroozi, H Huang, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 80 | 2023 |
Nemo: A toolkit for building ai applications using neural modules.(2019) O Kuchaiev, J Li, H Nguyen, O Hrinchuk, R Leary, B Ginsburg, S Kriman, ... arXiv preprint arXiv:1909.09577, 1909 | 10 | 1909 |
Ruler: What’s the real context size of your long-context language models?, 2024 CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, Y Zhang, ... URL https://arxiv. org/abs/2404.06654, 0 | 10 | |
Investigating End-to-End ASR Architectures for Long Form Audio Transcription NR Koluguri, S Kriman, G Zelenfroind, S Majumdar, D Rekesh, V Noroozi, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 6 | 2024 |
Accidental learners: Spoken language identification in multilingual self-supervised models TM Bartley, F Jia, KC Puvvada, S Kriman, B Ginsburg ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |