NaturalSpeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian arXiv preprint arXiv:2304.09116, 2023 | 212 | 2023 |
MedDialog: Large-scale medical dialogue datasets G Zeng, W Yang, Z Ju, Y Yang, S Wang, R Zhang, M Zhou, J Zeng, ... Proceedings of the 2020 conference on empirical methods in natural language …, 2020 | 189 | 2020 |
MusicBERT: Symbolic music understanding with large-scale pre-training M Zeng, X Tan, R Wang, Z Ju, T Qin, TY Liu arXiv preprint arXiv:2106.05630, 2021 | 145 | 2021 |
NaturalSpeech 3: Zero-shot speech synthesis with factorized codec and diffusion models Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ... arXiv preprint arXiv:2403.03100, 2024 | 124 | 2024 |
AUDIT: Audio editing by following instructions with latent diffusion models Y Wang, Z Ju, X Tan, L He, Z Wu, J Bian Advances in Neural Information Processing Systems 36, 71340-71357, 2023 | 52 | 2023 |
MedDialog: a large-scale medical dialogue dataset S Chen, Z Ju, X Dong, H Fang, S Wang, Y Yang, J Zeng, R Zhang, ... arXiv preprint arXiv:2004.03329 3, 2020 | 41 | 2020 |
PromptTTS 2: Describing and generating voices with text prompt Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ... arXiv preprint arXiv:2309.02285, 2023 | 39 | 2023 |
TeleMelody: Lyric-to-melody generation with a template-based two-stage method Z Ju, P Lu, X Tan, R Wang, C Zhang, S Wu, K Zhang, X Li, T Qin, TY Liu arXiv preprint arXiv:2109.09617, 2021 | 38 | 2021 |
On the generation of medical dialogues for COVID-19 W Yang, G Zeng, B Tan, Z Ju, S Chakravorty, X He, S Chen, X Yang, ... arXiv preprint arXiv:2005.05442, 2020 | 32 | 2020 |
On the generation of medical dialogs for COVID-19 M Zhou, Z Li, B Tan, G Zeng, W Yang, X He, Z Ju, S Chakravorty, S Chen, ... Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 25 | 2021 |
RALL-E: Robust codec language modeling with chain-of-thought prompting for text-to-speech synthesis D Xin, X Tan, K Shen, Z Ju, D Yang, Y Wang, S Takamichi, H Saruwatari, ... arXiv preprint arXiv:2404.03204, 2024 | 23 | 2024 |
MedDialog: Two large-scale medical dialogue datasets X He, S Chen, Z Ju, X Dong, H Fang, S Wang, Y Yang, J Zeng, R Zhang, ... arXiv preprint arXiv:2004.03329, 2020 | 17 | 2020 |
FlashSpeech: Efficient zero-shot speech synthesis Z Ye, Z Ju, H Liu, X Tan, J Chen, Y Lu, P Sun, J Pan, W Bian, S He, W Xue, ... Proceedings of the 32nd ACM International Conference on Multimedia, 6998-7007, 2024 | 12 | 2024 |
CovidDialog: Medical dialogue datasets about COVID-19 Z Ju, S Chakravorty, X He, S Chen, X Yang, P Xie | 8 | 2020 |
On the Generation of Medical Dialogues for COVID-19. CoRR abs/2005.05442 (2020) W Yang, G Zeng, B Tan, Z Ju, S Chakravorty, X He, S Chen, X Yang, ... arXiv preprint arXiv:2005.05442, 2020 | 2 | 2020 |