Follow
Mingyu Cui
Mingyu Cui
Verified email at se.cuhk.edu.hk - Homepage
Title
Cited by
Cited by
Year
Recent progress in the CUHK dysarthric speech recognition system
S Liu, M Geng, S Hu, X Xie, M Cui, J Yu, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2267-2281, 2021
782021
Neural architecture search for LF-MMI trained time delay neural networks
S Hu, X Xie, M Cui, J Deng, S Liu, J Yu, M Geng, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1093-1107, 2022
332022
Exploring self-supervised pre-trained asr models for dysarthric and elderly speech recognition
S Hu, X Xie, Z Jin, M Geng, Y Wang, M Cui, J Deng, X Liu, H Meng
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
312023
Conformer based elderly speech recognition system for Alzheimer's disease detection
T Wang, J Deng, M Geng, Z Ye, S Hu, Y Wang, M Cui, Z Jin, X Liu, ...
arXiv preprint arXiv:2206.13232, 2022
232022
Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition
S Hu, S Liu, X Xie, M Geng, T Wang, S Hu, M Cui, X Liu, H Meng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
182022
A sidecar separator can convert a single-talker speech recognition system to a multi-talker one
L Meng, J Kang, M Cui, Y Wang, X Wu, H Meng
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
162023
Audio-visual end-to-end multi-channel speech separation, dereverberation and recognition
G Li, J Deng, M Geng, Z Jin, T Wang, S Hu, M Cui, H Meng, X Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
152023
Confidence score based conformer speaker adaptation for speech recognition
J Deng, X Xie, T Wang, M Cui, B Xue, Z Jin, M Geng, G Li, X Liu, H Meng
arXiv preprint arXiv:2206.12045, 2022
152022
Topicrefine: Joint topic prediction and dialogue response generation for multi-turn end-to-end dialogue system
H Wang, M Cui, Z Zhou, GPC Fung, KF Wong
arXiv preprint arXiv:2109.05187, 2021
142021
Use of speech impairment severity for dysarthric speech recognition
M Geng, Z Jin, T Wang, S Hu, J Deng, M Cui, G Li, J Yu, X Xie, X Liu
arXiv preprint arXiv:2305.10659, 2023
122023
Unified modeling of multi-talker overlapped speech recognition and diarization with a sidecar separator
L Meng, J Kang, M Cui, H Wu, X Wu, H Meng
arXiv preprint arXiv:2305.16263, 2023
112023
Confidence score based speaker adaptation of conformer speech recognition systems
J Deng, X Xie, T Wang, M Cui, B Xue, Z Jin, G Li, S Hu, X Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1175-1190, 2023
112023
Two-pass decoding and cross-adaptation based system combination of end-to-end conformer and hybrid tdnn asr systems
M Cui, J Deng, S Hu, X Xie, T Wang, S Hu, M Geng, B Xue, X Liu, H Meng
arXiv preprint arXiv:2206.11596, 2022
112022
Exploiting cross-domain and cross-lingual ultrasound tongue imaging features for elderly and dysarthric speech recognition
S Hu, X Xie, M Geng, M Cui, J Deng, G Li, T Wang, X Liu, H Meng
arXiv preprint arXiv:2206.07327, 2022
102022
Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems
M Cui, J Kang, J Deng, X Yin, Y Xie, X Chen, X Liu
Proc. Interspeech 2023, 2023
82023
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement
Y Yang, Z Song, J Zhuo, M Cui, J Li, B Yang, Y Du, Z Ma, X Liu, Z Wang, ...
arXiv preprint arXiv:2406.11546, 2024
52024
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition
J Kang, L Meng, M Cui, H Guo, X Wu, X Liu, H Meng
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Self-supervised asr models and features for dysarthric and elderly speech recognition
S Hu, X Xie, M Geng, Z Jin, J Deng, G Li, Y Wang, M Cui, T Wang, H Meng, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
32024
A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models
D Wang, M Cui, D Yang, X Chen, H Meng
arXiv preprint arXiv:2411.08742, 2024
12024
Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models
D Han, M Cui, J Kang, X Wu, X Liu, H Meng
2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20