Speech emotion recognition using capsule networks X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 139 | 2019 |
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition X Cai, D Dai, Z Wu, X Li, J Li, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 85 | 2021 |
Learning discriminative features from spectrograms using center loss for speech emotion recognition D Dai, Z Wu, R Li, X Wu, J Jia, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 65 | 2019 |
One-shot voice conversion with global speaker embeddings. H Lu, Z Wu, D Dai, R Li, S Kang, J Jia, H Meng Interspeech, 669-673, 2019 | 52 | 2019 |
Disambiguation of chinese polyphones in an end-to-end framework with semantic features extracted by pre-trained bert D Dai, Z Wu, S Kang, X Wu, J Jia, D Su, D Yu, H Meng arXiv preprint arXiv:2501.01102, 2025 | 25 | 2025 |
Unsupervised cross-lingual speech emotion recognition using domain adversarial neural network X Cai, Z Wu, K Zhong, B Su, D Dai, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 17 | 2021 |
Noise robust tts for low resource speakers using pre-trained model and speech enhancement D Dai, L Chen, Y Wang, M Wang, R Xia, X Song, Z Wu, Y Wang arXiv preprint arXiv:2005.12531, 2020 | 12 | 2020 |
Cloning one’s voice using very limited data in the wild D Dai, Y Chen, L Chen, M Tu, L Liu, R Xia, Q Tian, Y Wang, Y Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 11 | 2022 |
Speaker independent and multilingual/mixlingual speech-driven talking head generation using phonetic posteriorgrams H Huang, Z Wu, S Kang, D Dai, J Jia, T Fu, D Tuo, G Lei, P Liu, D Su, ... 2021 Asia-Pacific Signal and Information Processing Association Annual …, 2021 | 8 | 2021 |
Multi-modal adversarial training for zero-shot voice cloning J Janiczek, D Chong, D Dai, A Faria, C Wang, T Wang, Y Liu arXiv preprint arXiv:2408.15916, 2024 | 1 | 2024 |
RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction P Liu, D Dai, Z Wu arXiv preprint arXiv:2403.05010, 2024 | 1 | 2024 |
Speech synthesis method and apparatus, electronic device, and readable storage medium D Dai, Y Chen, L Chen, Y Wang, Q Tian, M Tu, R Xia, Y Wang US Patent App. 18/568,261, 2024 | | 2024 |
NRAdapt: Noise-Robust Adaptive Text to Speech Using Untranscribed Data M Cheng, S Lei, D Dai, Z Wu, D Chong 2024 International Joint Conference on Neural Networks (IJCNN), 1-8, 2024 | | 2024 |