Khe Chai Sim
Title
Cited by
Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Gemini Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 1120 | Year: 2024
Streaming end-to-end speech recognition for mobile devices
Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 768 | Year: 2019
Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems.
B Li, KC Sim
Interspeech 10, 526-529, 2010
Cited by: 212 | Year: 2010
Acoustic Modeling for Google Home.
B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ...
Interspeech, 399-403, 2017
Cited by: 208 | Year: 2017
BigSSL: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition
Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022
Cited by: 205 | Year: 2022
Multi-dialect speech recognition with a single sequence-to-sequence model
B Li, TN Sainath, KC Sim, M Bacchiani, E Weinstein, P Nguyen, Z Chen, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 156 | Year: 2018
Consensus network decoding for statistical machine translation system combination
KC Sim, WJ Byrne, MJF Gales, H Sahbi, PC Woodland
2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007
Cited by: 138 | Year: 2007
The NUS sung and spoken lyrics corpus: A quantitative comparison of singing and speech
Z Duan, H Fang, B Li, KC Sim, Y Wang
2013 Asia-Pacific Signal and Information Processing Association Annual …, 2013
Cited by: 131 | Year: 2013
Toward domain-invariant speech recognition via large scale training
A Narayanan, A Misra, KC Sim, G Pundak, A Tripathi, M Elfeky, P Haghani, ...
2018 IEEE Spoken Language Technology Workshop (SLT), 441-447, 2018
Cited by: 122 | Year: 2018
Personalization of end-to-end speech recognition on mobile devices for named entities
KC Sim, F Beaufays, A Benard, D Guliani, A Kabel, N Khare, T Lucassen, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 23-30, 2019
Cited by: 80 | Year: 2019
Factorized hidden layer adaptation for deep neural network based acoustic modeling
L Samarakoon, KC Sim
IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (12 …, 2016
Cited by: 76 | Year: 2016
A lattice-based approach to query-by-example spoken document retrieval
TK Chia, KC Sim, H Li, HT Ng
Proceedings of the 31st annual international ACM SIGIR conference on …, 2008
Cited by: 72 | Year: 2008
An investigation into on-device personalization of end-to-end automatic speech recognition models
KC Sim, P Zadrazil, F Beaufays
arXiv preprint arXiv:1909.06678, 2019
Cited by: 71 | Year: 2019
Semantic transliteration of personal names
H Li, KC Sim, JS Kuo, M Dong
Proceedings of the 45th Annual Meeting of the Association of Computational …, 2007
Cited by: 67 | Year: 2007
Improving the interpretability of deep neural networks with stimulated learning
S Tan, KC Sim, M Gales
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
Cited by: 64 | Year: 2015
A spectral masking approach to noise-robust speech recognition using deep neural networks
B Li, KC Sim
IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (8), 1296 …, 2014
Cited by: 63 | Year: 2014
Joint unsupervised and supervised training for multilingual ASR
J Bai, B Li, Y Zhang, A Bapna, N Siddhartha, KC Sim, TN Sainath
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 59 | Year: 2022
An investigation of augmenting speaker representations to improve speaker normalisation for DNN-based speech recognition
H Huang, KC Sim
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 57 | Year: 2015
Improving interpretability and regularization in deep learning
C Wu, MJF Gales, A Ragni, P Karanasou, KC Sim
IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (2), 256-265, 2017
Cited by: 51 | Year: 2017
Large-scale ASR domain adaptation using self- and semi-supervised learning
D Hwang, A Misra, Z Huo, N Siddhartha, S Garg, D Qiu, KC Sim, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 50 | Year: 2022