Dilated residual network with multi-head self-attention for speech emotion recognition R Li, Z Wu, J Jia, S Zhao, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 94 | 2019 |
Learning discriminative features from spectrograms using center loss for speech emotion recognition D Dai, Z Wu, R Li, X Wu, J Jia, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 65 | 2019 |
One-shot voice conversion with global speaker embeddings. H Lu, Z Wu, D Dai, R Li, S Kang, J Jia, H Meng Interspeech, 669-673, 2019 | 53 | 2019 |
Towards Discriminative Representation Learning for Speech Emotion Recognition. R Li, Z Wu, J Jia, Y Bu, S Zhao, H Meng IJCAI 2019, 5060-5066, 2019 | 52 | 2019 |
Multi-task deep learning for user intention understanding in speech interaction systems Y Ning, J Jia, Z Wu, R Li, Y An, Y Wang, H Meng Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017 | 27 | 2017 |
Inferring user emotive state changes in realistic human-computer conversational dialogs R Li, Z Wu, J Jia, J Li, W Chen, H Meng Proceedings of the 26th ACM International Conference on Multimedia, 136-144, 2018 | 25 | 2018 |
StableFace: Analyzing and improving motion stability for talking face generation J Ling, X Tan, L Chen, R Li, Y Zhang, S Zhao, L Song IEEE Journal of Selected Topics in Signal Processing 17 (6), 1232-1247, 2023 | 20 | 2023 |
Applying multitask learning to acoustic-phonemic model for mispronunciation detection and diagnosis in L2 English speech S Mao, Z Wu, R Li, X Li, H Meng, L Cai 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 20 | 2018 |
HiFace: High-fidelity 3D face reconstruction by learning static and dynamic details Z Chai, T Zhang, T He, X Tan, T Baltrusaitis, HT Wu, R Li, S Zhao, C Yuan, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 19 | 2023 |
Transformer-S2A: Robust and efficient speech-to-animation L Chen, Z Wu, J Ling, R Li, X Tan, S Zhao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 19 | 2022 |
Memories are one-to-many mapping alleviators in talking face generation A Tang, T He, X Tan, J Ling, R Li, S Zhao, J Bian, L Song IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 18 | 2024 |
Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection. Z Zhu, Z Wu, R Li, H Meng, L Cai Interspeech, 102-106, 2018 | 18 | 2018 |
Learning cross-lingual knowledge with multilingual BLSTM for emphasis detection with limited training data Y Ning, Z Wu, R Li, J Jia, M Xu, H Meng, L Cai 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 18 | 2017 |
Multi-task learning of structured output layer bidirectional LSTMs for speech synthesis R Li, Z Wu, X Liu, H Meng, L Cai 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 17 | 2017 |
Knowledge-Based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis. J Li, Z Wu, R Li, P Zhi, S Yang, H Meng INTERSPEECH, 4494-4498, 2019 | 16 | 2019 |
A compact framework for voice conversion using WaveNet conditioned on phonetic posteriorgrams H Lu, Z Wu, R Li, S Kang, J Jia, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 14 | 2019 |
Emphatic speech generation with conditioned input layer and bidirectional LSTMs for expressive speech synthesis R Li, Z Wu, Y Huang, J Jia, H Meng, L Cai 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 12 | 2018 |
Integrating articulatory features into acoustic-phonemic model for mispronunciation detection and diagnosis in L2 English speech S Mao, Z Wu, X Li, R Li, X Wu, H Meng 2018 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2018 | 11 | 2018 |
ERA-Solver: Error-robust Adams solver for fast sampling of diffusion probabilistic models S Li, L Liu, Z Chai, R Li, X Tan arXiv preprint arXiv:2301.12935, 2023 | 9 | 2023 |
Multi-Task Learning for Prosodic Structure Generation Using BLSTM RNN with Structured Output Layer. Y Huang, Z Wu, R Li, H Meng, L Cai INTERSPEECH, 779-783, 2017 | 8 | 2017 |