AudioGPT: Understanding and generating speech, music, sound, and talking head R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024 | 175 | 2024 |
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer Y Wang, J Bai, R Huang, R Li, Z Hong, Z Zhao arXiv preprint arXiv:2309.07566, 2023 | 8 | 2023 |
Multi-level spatial-temporal adaptation network for motor imagery classification W Xu, J Wang, Z Jia, Z Hong, Y Li, Y Lin ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
UniSinger: Unified end-to-end singing voice synthesis with cross-modality information matching Z Hong, C Cui, R Huang, L Zhang, J Liu, J He, Z Zhao Proceedings of the 31st ACM International Conference on Multimedia, 7569-7579, 2023 | 7 | 2023 |
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head, April 2023 R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ... URL http://arxiv.org/abs/2304.12995, 0 | 6 | |
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt Y Wang, R Hu, R Huang, Z Hong, R Li, W Liu, F You, T Jin, Z Zhao arXiv preprint arXiv:2403.11780, 2024 | 4 | 2024 |
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment H Zhiqing, H Rongjie, C Xize, W Yongqi, L Ruiqi, Y Fuming, Z Zhou, ... arXiv preprint arXiv:2404.09313, 2024 | 3 | 2024 |
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation R Huang, Y Wang, R Hu, X Xu, Z Hong, D Yang, X Cheng, Z Wang, ... Proceedings of the 32nd ACM International Conference on Multimedia, 10630-10639, 2024 | 2 | 2024 |
GTSinger: A global multi-technique singing corpus with realistic music scores for all singing tasks Y Zhang, C Pan, W Guo, R Li, Z Zhu, J Wang, W Xu, J Lu, Z Hong, ... arXiv preprint arXiv:2409.13832, 2024 | 2 | 2024 |
Accompanied Singing Voice Synthesis with Fully Text-controlled Melody R Li, Z Hong, Y Wang, L Zhang, R Huang, S Zheng, Z Zhao arXiv preprint arXiv:2407.02049, 2024 | 2 | 2024 |
Robust Singing Voice Transcription Serves Synthesis R Li, Y Zhang, Y Wang, Z Hong, R Huang, Z Zhao arXiv preprint arXiv:2405.09940, 2024 | 2 | 2024 |
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion R Li, R Huang, Y Wang, Z Hong, Z Zhao arXiv preprint arXiv:2406.02429, 2024 | 1 | 2024 |
AudioVSR: Enhancing Video Speech Recognition with Audio Data X Yang, X Cheng, J Duan, H Qiu, M Hong, M Fang, S Ji, J Zuo, Z Hong, ... Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | | 2024 |
MultiBand: Multi-Task Song Generation with Personalized Prompt-Based Control Y Zhang, W Guo, C Pan, R Li, Z Zhu, R Huang, R Zhang, Z Hong, Z Jiang, ... | | |