Unicats: A unified context-aware text-to-speech framework with contextual vq-diffusion and vocoding C Du, Y Guo, F Shen, Z Liu, Z Liang, X Chen, S Wang, H Zhang, K Yu Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17924 …, 2024 | 47 | 2024 |
Towards universal speech discrete tokens: A case study for asr and tts Y Yang, F Shen, C Du, Z Ma, K Yu, D Povey, X Chen ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 27 | 2024 |
Fireredtts: A foundation text-to-speech framework for industry-level generative speech applications HH Guo, K Liu, FY Shen, YC Wu, FL Xie, K Xie, KT Xu arXiv preprint arXiv:2409.03283, 2024 | 12 | 2024 |
Acoustic bpe for speech generation with discrete tokens F Shen, Y Guo, C Du, X Chen, K Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 10 | 2024 |
Multi-speaker multi-lingual vqtts system for limmits 2023 challenge C Du, Y Guo, F Shen, K Yu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
On the Effectiveness of Acoustic BPE in Decoder-Only TTS B Li, F Shen, Y Guo, S Wang, X Chen, K Yu arXiv preprint arXiv:2407.03892, 2024 | 3 | 2024 |
Acoustic word embeddings for end-to-end speech synthesis F Shen, C Du, K Yu Applied Sciences 11 (19), 9010, 2021 | 3 | 2021 |