Speaker adaptation using spectro-temporal deep features for dysarthric and elderly speech recognition M Geng, X Xie, Z Ye, T Wang, G Li, S Hu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2597-2611, 2022 | 36 | 2022 |
Personalized adversarial data augmentation for dysarthric and elderly speech recognition Z Jin, M Geng, J Deng, T Wang, S Hu, G Li, X Liu IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 25 | 2023 |
Adversarial data augmentation using VAE-GAN for disordered speech recognition Z Jin, X Xie, M Geng, T Wang, S Hu, J Deng, G Li, X Liu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 21 | 2023 |
Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation H Wang, Z Jin, M Geng, S Hu, G Li, T Wang, H Xu, X Liu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 19 | 2024 |
Audio-visual end-to-end multi-channel speech separation, dereverberation and recognition G Li, J Deng, M Geng, Z Jin, T Wang, S Hu, M Cui, H Meng, X Liu IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 15 | 2023 |
Confidence score based conformer speaker adaptation for speech recognition J Deng, X Xie, T Wang, M Cui, B Xue, Z Jin, M Geng, G Li, X Liu, H Meng arXiv preprint arXiv:2206.12045, 2022 | 15 | 2022 |
Use of Speech Impairment Severity for Dysarthric Speech Recognition M Geng, Z Jin, T Wang, S Hu, J Deng, M Cui, G Li, J Yu, X Xie, X Liu arXiv preprint arXiv:2305.10659, 2023 | 12 | 2023 |
Confidence score based speaker adaptation of conformer speech recognition systems J Deng, X Xie, T Wang, M Cui, B Xue, Z Jin, G Li, S Hu, X Liu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1175-1190, 2023 | 11 | 2023 |
Audio-visual multi-channel speech separation, dereverberation and recognition G Li, J Yu, J Deng, X Liu, H Meng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 11 | 2022 |
Exploiting cross-domain and cross-lingual ultrasound tongue imaging features for elderly and dysarthric speech recognition S Hu, X Xie, M Geng, M Cui, J Deng, G Li, T Wang, X Liu, H Meng arXiv preprint arXiv:2206.07327, 2022 | 10 | 2022 |
SAILoc: A novel acoustic single array system for indoor localization G Li, L Zhang, F Lin, M Chen, Z Wang 2017 9th international conference on wireless communications and signal …, 2017 | 8 | 2017 |
The design and implementation of a smartphone-based acoustic array system for DOA estimation G Li, X Bao, Z Wang 2017 36th Chinese Control Conference (CCC), 5416-5423, 2017 | 5 | 2017 |
Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition S Hu, X Xie, M Geng, Z Jin, J Deng, G Li, Y Wang, M Cui, T Wang, H Meng, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 3 | 2024 |
Towards Automatic Data Augmentation for Disordered Speech Recognition Z Jin, X Xie, T Wang, M Geng, J Deng, G Li, S Hu, X Liu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems J Deng, G Li, X Xie, Z Jin, M Cui, T Wang, S Hu, M Geng, X Liu arXiv preprint arXiv:2306.14608, 2023 | 1 | 2023 |
Effective and Efficient Mixed Precision Quantization of Speech Foundation Models H Xu, Z Li, Z Jin, H Wang, Y Chen, G Li, M Geng, S Hu, J Deng, X Liu arXiv preprint arXiv:2501.03643, 2025 | | 2025 |
Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition S Hu, X Xie, M Geng, J Deng, Z Jin, T Wang, M Cui, G Li, Z Li, H Meng, ... arXiv preprint arXiv:2412.18832, 2024 | | 2024 |
Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation M Geng, X Xie, J Deng, Z Jin, G Li, T Wang, S Hu, Z Li, H Meng, X Liu arXiv preprint arXiv:2407.06310, 2024 | | 2024 |
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition G Li, J Deng, Y Chen, M Geng, S Hu, Z Li, Z Jin, T Wang, X Xie, H Meng, ... arXiv preprint arXiv:2406.10152, 2024 | | 2024 |
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask T Wang, X Xie, Z Li, S Hu, Z Jin, J Deng, M Cui, S Hu, M Geng, G Li, ... arXiv preprint arXiv:2406.10034, 2024 | | 2024 |