Chenpeng Du

Geciteerd door

	Alles	Sinds 2020
Citaties	503	503
h-index	13	13
i10-index	15	15

300

150

225

2020202120222023202420252 17 48 126 294 15

Openbare toegang

Alles bekijken

3 artikelen

1 artikel

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Medeauteurs

Kai Yu（俞凯）Shanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn
Xie ChenShanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn
Yiwei GuoShanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn
Feiyu ShenShanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn
Shuai WangSRIBDGeverifieerd e-mailadres voor sribd.cn
Yanmin QianProfessor, Shanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn
Qi ChenShanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn
Zhijun LiuThe Chinese University of Hong Kong, ShenzhenGeverifieerd e-mailadres voor link.cuhk.edu.cn
Zhuo ChenBytedance (formerly Microsoft, Columbia University)Geverifieerd e-mailadres voor columbia.edu

Volgen

Chenpeng Du

ByteDance

Geverifieerd e-mailadres voor bytedance.com - Homepage

Speech Synthesis Speech Recognition


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
VQTTS: High-fidelity text-to-speech synthesis with self-supervised VQ acoustic feature C Du, Y Guo, X Chen, K Yu Interspeech 2022, 1596-1600, 2022	68	2022
UniCATS: A unified context-aware text-to-speech framework with contextual vq-diffusion and vocoding C Du, Y Guo, F Shen, Z Liu, Z Liang, X Chen, S Wang, H Zhang, K Yu Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17924 …, 2024	47	2024
Speaker augmentation for low resource speech recognition C Du, K Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	44	2020
Dae-talker: High fidelity speech-driven talking face generation with diffusion autoencoder C Du, Q Chen, T He, X Tan, X Chen, K Yu, S Zhao, J Bian Proceedings of the 31st ACM International Conference on Multimedia, 4281-4289, 2023	41	2023
Emodiff: Intensity controllable emotional text-to-speech with soft-label guidance Y Guo, C Du, X Chen, K Yu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	37	2023
Data augmentation for end-to-end code-switching speech recognition C Du, H Li, Y Lu, L Wang, Y Qian 2021 IEEE Spoken Language Technology Workshop (SLT), 194-200, 2021	31	2021
Rich prosody diversity modelling with phone-level mixture density network C Du, K Yu Interspeech 2021, 3136-3140, 2021	28*	2021
Voiceflow: Efficient text-to-speech with rectified flow matching Y Guo, C Du, Z Ma, X Chen, K Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	27*	2024
Towards universal speech discrete tokens: A case study for asr and tts Y Yang, F Shen, C Du, Z Ma, K Yu, D Povey, X Chen ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	26*	2024
Phone-level prosody modelling with GMM-based MDN for diverse and controllable speech synthesis C Du, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 190-201, 2021	23*	2021
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech C Du, Y Guo, H Wang, Y Yang, Z Niu, S Wang, H Zhang, X Chen, K Yu arXiv preprint arXiv:2401.14321, 2024	19	2024
Towards data selection on tts data for children’s speech recognition W Wang, Z Zhou, Y Lu, H Wang, C Du, Y Qian ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	17	2021
Unsupervised word-level prosody tagging for controllable speech synthesis Y Guo, C Du, K Yu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	14	2022
Language Model Can Listen While Speaking Z Ma, Y Song, C Du, J Cong, Z Chen, Y Wang, Y Wang, X Chen arXiv preprint arXiv:2408.02622, 2024	13	2024
Synaug: Synthesis-based data augmentation for text-dependent speaker verification C Du, B Han, S Wang, Y Qian, K Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	10	2021
Speaker adaptive text-to-speech with timbre-normalized vector-quantized feature C Du, Y Guo, X Chen, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 3446-3456, 2023	9	2023
Acoustic bpe for speech generation with discrete tokens F Shen, Y Guo, C Du, X Chen, K Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	8	2024
DSE-TTS: dual speaker embedding for cross-lingual text-to-speech S Liu, Y Guo, C Du, X Chen, K Yu Interspeech 2023, 616-620, 2023	7	2023
Neural fusion for voice cloning B Chen, C Du, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1993-2001, 2022	7	2022
Anitalker: animate vivid and diverse talking faces through identity-decoupled facial motion encoding T Liu, F Chen, S Fan, C Du, Q Chen, X Chen, K Yu Proceedings of the 32nd ACM International Conference on Multimedia, 6696-6705, 2024	6	2024

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs