关注
Rui Liu (刘 瑞)
Rui Liu (刘 瑞)
在 mail.imu.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Seen and unseen emotional style transfer for voice conversion with a new emotional speech dataset
K Zhou, B Sisman, R Liu, H Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2422021
Emotional voice conversion: Theory, databases and ESD
K Zhou, B Sisman, R Liu, H Li
Speech Communication 137, 1-18, 2022
1872022
Expressive TTS training with frame and style reconstruction loss
R Liu, B Sisman, G Gao, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1806-1818, 2021
932021
Teacher-student training for robust tacotron-based tts
R Liu, B Sisman, J Li, F Bao, G Gao, H Li
ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020
682020
Reinforcement learning for emotional text-to-speech synthesis with improved emotion discriminability
R Liu, B Sisman, H Li
arXiv preprint arXiv:2104.01408, 2021
482021
Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities
H Zuo, R Liu, J Zhao, G Gao, H Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
402023
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis
R Liu, B Sisman, H Li
IEEE ICASSP 2021. IEEE International Conference on Acoustics, Speech and …, 2021
352021
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis
R Liu, B Sisman, F Bao, J Yang, G Gao, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 274-285, 2021
312021
Mongolian text-to-speech system based on deep neural network
R Liu, F Bao, G Gao, Y Wang
Man-Machine Speech Communication: 14th National Conference, NCMMSC 2017 …, 2018
312018
Modeling prosodic phrasing with multi-task learning in tacotron-based TTS
R Liu, B Sisman, F Bao, G Gao, H Li
IEEE Signal Processing Letters 27, 1470-1474, 2020
292020
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
R Liu, F Bao, G Gao, H Zhang, Y Wang
Interspeech, 57-61, 2018
242018
Text-to-speech for low-resource agglutinative language with morphology-aware language model pre-training
R Liu, Y Hu, H Zuo, Z Luo, L Wang, G Gao
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
212024
Mer 2024: Semi-supervised learning, noise robustness, and open-vocabulary multimodal emotion recognition
Z Lian, H Sun, L Sun, Z Wen, S Zhang, S Chen, H Gu, J Zhao, Z Ma, ...
Proceedings of the 2nd International Workshop on Multimodal and Responsible …, 2024
202024
Multistage deep transfer learning for EmIoT-Enabled Human–Computer interaction
R Liu, Q Liu, H Zhu, H Cao
IEEE Internet of Things Journal 9 (16), 15128-15137, 2022
192022
Fasttalker: A neural text-to-speech architecture with shallow and group autoregression
R Liu, B Sisman, Y Lin, H Li
Neural Networks 141, 306-314, 2021
192021
Visualtts: Tts with accurate lip-speech synchronization for automatic voice over
J Lu, B Sisman, R Liu, M Zhang, H Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
182022
Wavetts: Tacotron-based tts with joint time-frequency domain loss
R Liu, B Sisman, F Bao, G Gao, H Li
arXiv preprint arXiv:2002.00417, 2020
182020
Emotion rendering for conversational speech synthesis with heterogeneous graph-based context modeling
R Liu, Y Hu, Y Ren, X Yin, H Li
Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18698 …, 2024
172024
Contrastive Learning based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition with Missing Modalities
R Liu, H Zuo, Z Lian, BW Schuller, H Li
IEEE Transactions on Affective Computing, 2024
172024
Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering
R Liu, B Sisman, G Gao, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
152024
系统目前无法执行此操作,请稍后再试。
文章 1–20