Recent progress in the CUHK dysarthric speech recognition system S Liu, M Geng, S Hu, X Xie, M Cui, J Yu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2267-2281, 2021 | 78 | 2021 |
Neural architecture search for LF-MMI trained time delay neural networks S Hu, X Xie, M Cui, J Deng, S Liu, J Yu, M Geng, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1093-1107, 2022 | 33 | 2022 |
Exploring self-supervised pre-trained asr models for dysarthric and elderly speech recognition S Hu, X Xie, Z Jin, M Geng, Y Wang, M Cui, J Deng, X Liu, H Meng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 31 | 2023 |
Conformer based elderly speech recognition system for Alzheimer's disease detection T Wang, J Deng, M Geng, Z Ye, S Hu, Y Wang, M Cui, Z Jin, X Liu, ... arXiv preprint arXiv:2206.13232, 2022 | 23 | 2022 |
Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition S Hu, S Liu, X Xie, M Geng, T Wang, S Hu, M Cui, X Liu, H Meng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 18 | 2022 |
A sidecar separator can convert a single-talker speech recognition system to a multi-talker one L Meng, J Kang, M Cui, Y Wang, X Wu, H Meng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 16 | 2023 |
Audio-visual end-to-end multi-channel speech separation, dereverberation and recognition G Li, J Deng, M Geng, Z Jin, T Wang, S Hu, M Cui, H Meng, X Liu IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 15 | 2023 |
Confidence score based conformer speaker adaptation for speech recognition J Deng, X Xie, T Wang, M Cui, B Xue, Z Jin, M Geng, G Li, X Liu, H Meng arXiv preprint arXiv:2206.12045, 2022 | 15 | 2022 |
Topicrefine: Joint topic prediction and dialogue response generation for multi-turn end-to-end dialogue system H Wang, M Cui, Z Zhou, GPC Fung, KF Wong arXiv preprint arXiv:2109.05187, 2021 | 14 | 2021 |
Use of speech impairment severity for dysarthric speech recognition M Geng, Z Jin, T Wang, S Hu, J Deng, M Cui, G Li, J Yu, X Xie, X Liu arXiv preprint arXiv:2305.10659, 2023 | 12 | 2023 |
Unified modeling of multi-talker overlapped speech recognition and diarization with a sidecar separator L Meng, J Kang, M Cui, H Wu, X Wu, H Meng arXiv preprint arXiv:2305.16263, 2023 | 11 | 2023 |
Confidence score based speaker adaptation of conformer speech recognition systems J Deng, X Xie, T Wang, M Cui, B Xue, Z Jin, G Li, S Hu, X Liu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1175-1190, 2023 | 11 | 2023 |
Two-pass decoding and cross-adaptation based system combination of end-to-end conformer and hybrid tdnn asr systems M Cui, J Deng, S Hu, X Xie, T Wang, S Hu, M Geng, B Xue, X Liu, H Meng arXiv preprint arXiv:2206.11596, 2022 | 11 | 2022 |
Exploiting cross-domain and cross-lingual ultrasound tongue imaging features for elderly and dysarthric speech recognition S Hu, X Xie, M Geng, M Cui, J Deng, G Li, T Wang, X Liu, H Meng arXiv preprint arXiv:2206.07327, 2022 | 10 | 2022 |
Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems M Cui, J Kang, J Deng, X Yin, Y Xie, X Chen, X Liu Proc. Interspeech 2023, 2023 | 8 | 2023 |
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement Y Yang, Z Song, J Zhuo, M Cui, J Li, B Yang, Y Du, Z Ma, X Liu, Z Wang, ... arXiv preprint arXiv:2406.11546, 2024 | 5 | 2024 |
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition J Kang, L Meng, M Cui, H Guo, X Wu, X Liu, H Meng ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Self-supervised asr models and features for dysarthric and elderly speech recognition S Hu, X Xie, M Geng, Z Jin, J Deng, G Li, Y Wang, M Cui, T Wang, H Meng, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 3 | 2024 |
A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models D Wang, M Cui, D Yang, X Chen, H Meng arXiv preprint arXiv:2411.08742, 2024 | 1 | 2024 |
Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models D Han, M Cui, J Kang, X Wu, X Liu, H Meng 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024 | 1 | 2024 |