Obserwuj
Jay Mahadeokar
Jay Mahadeokar
Facebook AI
Zweryfikowany adres z fb.com
Tytuł
Cytowane przez
Cytowane przez
Rok
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
24472024
Transformer-based acoustic modeling for hybrid speech recognition
Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2782020
Voicebox: Text-guided multilingual universal speech generation at scale
M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ...
Advances in neural information processing systems 36, 2024
2532024
Torchaudio: Building blocks for audio and speech processing
YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
2182022
Transformer-transducer: End-to-end speech recognition with self-attention
CF Yeh, J Mahadeokar, K Kalgaonkar, Y Wang, D Le, M Jain, K Schubert, ...
arXiv preprint arXiv:1910.12977, 2019
1832019
Prompting large language models with speech recognition abilities
Y Fathullah, C Wu, E Lakomkin, J Jia, Y Shangguan, K Li, J Guo, W Xiong, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
1102024
The llama 3 herd of models, 2024
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
URL https://arxiv. org/abs/2407.21783 2407, 21783, 0
106
Contextual RNN-T for open domain ASR
M Jain, G Keren, J Mahadeokar, G Zweig, F Metze, Y Saraf
arXiv preprint arXiv:2006.03411, 2020
1042020
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion
D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ...
arXiv preprint arXiv:2104.02194, 2021
882021
Deep shallow fusion for RNN-T personalization
D Le, G Keren, J Chan, J Mahadeokar, C Fuegen, ML Seltzer
2021 IEEE Spoken Language Technology Workshop (SLT), 251-257, 2021
832021
The llama 3 herd of models
A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ...
arXiv e-prints, arXiv: 2407.21783, 2024
772024
Alignment restricted streaming recurrent neural network transducer
J Mahadeokar, Y Shangguan, D Le, G Keren, H Su, T Le, CF Yeh, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 52-59, 2021
712021
RNN-T for latency controlled ASR with improved beam search
M Jain, K Schubert, J Mahadeokar, CF Yeh, K Kalgaonkar, A Sriram, ...
arXiv preprint arXiv:1911.01629, 2019
452019
Improved neural language model fusion for streaming recurrent neural network transducer
S Kim, Y Shangguan, J Mahadeokar, A Bruguier, C Fuegen, ML Seltzer, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
312021
Dissecting user-perceived latency of on-device E2E speech recognition
Y Shangguan, R Prabhavalkar, H Su, J Mahadeokar, Y Shi, J Zhou, C Wu, ...
arXiv preprint arXiv:2104.02207, 2021
292021
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
Y Fathullah, C Wu, E Lakomkin, K Li, J Jia, Y Shangguan, J Mahadeokar, ...
Proceedings of the 2024 Conference of the North American Chapter of the …, 2024
242024
Computerized system and method for automatically identifying and providing digital content based on physical geographic location data
V Mahadevan, SS Farfade, JK Mahadeokar, A Arasu, VKR Barakam, ...
US Patent 11,194,856, 2021
202021
Streaming transformer transducer based speech recognition using non-causal convolution
Y Shi, C Wu, D Wang, A Xiao, J Mahadeokar, X Zhang, C Liu, K Li, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
172022
Dynamic encoder transducer: A flexible solution for trading off accuracy for latency
Y Shi, V Nagaraja, C Wu, J Mahadeokar, D Le, R Prabhavalkar, A Xiao, ...
arXiv preprint arXiv:2104.02176, 2021
172021
Streaming parallel transducer beam search with fast-slow cascaded encoders
J Mahadeokar, Y Shi, K Li, D Le, J Zhu, V Chandra, O Kalinli, ML Seltzer
arXiv preprint arXiv:2203.15773, 2022
152022
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20