REDAT: Accent-invariant representation for end-to-end ASR by domain adversarial training with relabeling H Hu, X Yang, Z Raeesy, J Guo, G Keskin, H Arsikere, A Rastrow, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 31 | 2021 |
Streaming end-to-end bilingual ASR systems with joint language identification S Punjabi, H Arsikere, Z Raeesy, C Chandak, N Bhave, A Bansal, ... arXiv preprint arXiv:2007.03900, 2020 | 26 | 2020 |
Automatic segmentation of vocal tract MR images Z Raeesy, S Rueda, JK Udupa, J Coleman 2013 IEEE 10th International Symposium on Biomedical Imaging, 1328-1331, 2013 | 25 | 2013 |
Joint ASR and language identification using RNN-T: An efficient approach to dynamic language switching S Punjabi, H Arsikere, Z Raeesy, C Chandak, N Bhave, A Bansal, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 24 | 2021 |
LSTM-based whisper detection Z Raeesy, K Gillespie, C Ma, T Drugman, J Gu, R Maas, A Rastrow, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 139-144, 2018 | 23 | 2018 |
Integrating summarization and retrieval for enhanced personalization via large language models C Richardson, Y Zhang, K Gillespie, S Kar, A Singh, Z Raeesy, OZ Khan, ... arXiv preprint arXiv:2310.20081, 2023 | 22 | 2023 |
Streaming language identification using combination of acoustic representations and ASR hypotheses C Chandak, Z Raeesy, A Rastrow, Y Liu, X Huang, S Wang, DK Joo, ... arXiv preprint arXiv:2006.00703, 2020 | 13 | 2020 |
Whisper to Alexa, and She’ll Whisper Back Z Raeesy Amazon Science. Accessed September 26, 2018, 2018 | 8* | 2018 |
Multimodal context carryover P Wanigasekara, N Gupta, F Yang, E Barut, Z Raeesy, K Qin, S Rawls, ... Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 3 | 2022 |
Learning to retrieve engaging follow-up queries C Richardson, S Kar, A Kumar, A Ramachandran, OZ Khan, Z Raeesy, ... arXiv preprint arXiv:2302.10978, 2023 | 2 | 2023 |
Parametrising Degree of Articulator Movement from Dynamic MRI Data Z Raeesy, L Baghai-Ravary, J Coleman Twelfth Annual Conference of the International Speech Communication Association, 2011 | 1 | 2011 |
Generating contextual images for long-form text A Mitra, N Gupta, C Naik, A Sethy, K Bice, Z Raeesy Proceedings of the 2024 Joint International Conference on Computational …, 2024 | | 2024 |
[Industry] Unified Contextual Query Rewriting Y Zhou, J Hao, M Rungta, Y Liu, E Cho, X Fan, Y Lu, V Vasudevan, ... The 61st Annual Meeting of the Association for Computational Linguistics, 2023 | | 2023 |
Speaker-specific Typical Vocal Tract Shapes Obtained Using Dynamic MRI. Z Raeesy ICPhS, 1658-1661, 2011 | | 2011 |