The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 2794 | 2024 |
Multi-modal sensor based emotion recognition and emotional interface O Kalinli-Akbacak US Patent 9,031,293, 2015 | 274 | 2015 |
Adaptive displays using gaze tracking O Kalinli US Patent 8,493,390, 2013 | 138 | 2013 |
A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech. O Kalinli, SS Narayanan Interspeech 2007, 1941-1944, 2007 | 132 | 2007 |
Prompting large language models with speech recognition abilities Y Fathullah, C Wu, E Lakomkin, J Jia, Y Shangguan, K Li, J Guo, W Xiong, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 113 | 2024 |
Noise adaptive training for robust automatic speech recognition O Kalinli, ML Seltzer, J Droppo, A Acero IEEE Transactions on Audio, Speech, and Language Processing 18 (8), 1889-1901, 2010 | 103 | 2010 |
Interface using eye tracking contact lenses R Chen, O Kalinli US Patent 8,632,182, 2014 | 97 | 2014 |
The llama 3 herd of models A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ... arXiv e-prints, arXiv: 2407.21783, 2024 | 91 | 2024 |
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ... arXiv preprint arXiv:2104.02194, 2021 | 90 | 2021 |
Prominence detection using auditory attention cues and task-dependent high level information O Kalinli, S Narayanan IEEE Transactions on audio, Speech, and language processing 17 (5), 1009-1024, 2009 | 83 | 2009 |
Apparatus and method for determining relevance of input speech O Kalinli US Patent App. 13/083,356, 2012 | 81 | 2012 |
Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition O Kalinli, ML Seltzer, A Acero 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 72 | 2009 |
Emotion recognition using auditory attention cues extracted from users voice O Kalinli-Akbacak US Patent 9,020,822, 2015 | 47 | 2015 |
Combining auditory attention cues with phoneme posterior scores for phone/vowel/syllable boundary detection O Kalinli-Akbacak US Patent 9,672,811, 2017 | 44 | 2017 |
Saliency-driven unstructured acoustic scene classification using latent perceptual indexing O Kalinli, S Sundaram, S Narayanan 2009 IEEE International Workshop on Multimedia Signal Processing, 1-6, 2009 | 42 | 2009 |
Speech syllable/vowel/phone boundary detection using auditory attention cues O Kalinli, R Chen US Patent 8,756,061, 2014 | 36 | 2014 |
Semantic distance: A new metric for asr performance analysis towards spoken language understanding S Kim, A Arora, D Le, CF Yeh, C Fuegen, O Kalinli, ML Seltzer arXiv preprint arXiv:2104.02138, 2021 | 35 | 2021 |
Method for tone/intonation recognition using auditory attention cues O Kalinli US Patent 8,676,574, 2014 | 35 | 2014 |
Dissecting user-perceived latency of on-device E2E speech recognition Y Shangguan, R Prabhavalkar, H Su, J Mahadeokar, Y Shi, J Zhou, C Wu, ... arXiv preprint arXiv:2104.02207, 2021 | 29 | 2021 |
Evaluating user perception of speech recognition system quality with semantic distance metric S Kim, D Le, W Zheng, T Singh, A Arora, X Zhai, C Fuegen, O Kalinli, ... arXiv preprint arXiv:2110.05376, 2021 | 28 | 2021 |