Yiwei Guo

Geciteerd door

	Alles	Sinds 2020
Citaties	293	293
h-index	9	9
i10-index	9	9

220

110

165

20222023202420255 62 206 19

Medeauteurs

Kai Yu（俞凯）Shanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn
Xie ChenShanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn
Chenpeng DuByteDanceGeverifieerd e-mailadres voor bytedance.com
Shuai WangSRIBDGeverifieerd e-mailadres voor sribd.cn
Feiyu ShenShanghai Jiao Tong UniversityGeverifieerd e-mailadres voor sjtu.edu.cn

Volgen

Yiwei Guo

Shanghai Jiao Tong University

Geverifieerd e-mailadres voor sjtu.edu.cn - Homepage

Speech and Audio Processing Speech Synthesis Text-to-speech Artificial Intelligence


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
VQTTS: High-fidelity text-to-speech synthesis with self-supervised VQ acoustic feature C Du, Y Guo, X Chen, K Yu Interspeech 2022, 1596-1600, 2022	68	2022
UniCATS: A unified context-aware text-to-speech framework with contextual vq-diffusion and vocoding C Du, Y Guo, F Shen, Z Liu, Z Liang, X Chen, S Wang, H Zhang, K Yu Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17924 …, 2024	47	2024
Emodiff: Intensity controllable emotional text-to-speech with soft-label guidance Y Guo, C Du, X Chen, K Yu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	37	2023
Diffvoice: Text-to-speech with latent diffusion Z Liu, Y Guo, K Yu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	28	2023
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech C Du, Y Guo, H Wang, Y Yang, Z Niu, S Wang, H Zhang, X Chen, K Yu arXiv preprint arXiv:2401.14321, 2024	19	2024
Leveraging speech ptm, text llm, and emotional tts for speech emotion recognition Z Ma, W Wu, Z Zheng, Y Guo, Q Chen, S Zhang, X Chen ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	16	2024
VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching Y Guo, C Du, Z Ma, X Chen, K Yu ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024	15	2024
SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention J Li, Y Guo, X Chen, K Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	15	2024
Unsupervised word-level prosody tagging for controllable speech synthesis Y Guo, C Du, K Yu ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022	14	2022
Speaker adaptive text-to-speech with timbre-normalized vector-quantized feature C Du, Y Guo, X Chen, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023	9	2023
Acoustic bpe for speech generation with discrete tokens F Shen, Y Guo, C Du, X Chen, K Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	8	2024
DSE-TTS: dual speaker embedding for cross-lingual text-to-speech S Liu, Y Guo, C Du, X Chen, K Yu Interspeech 2023, 616--620, 2023	7	2023
Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge C Du, Y Guo, F Shen, K Yu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	5	2023
On the Effectiveness of Acoustic BPE in Decoder-Only TTS B Li, F Shen, Y Guo, S Wang, X Chen, K Yu Interspeech 2024, 4134-4138, 2024	2	2024
vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders Y Guo, Z Li, J Li, C Du, H Wang, S Wang, X Chen, K Yu arXiv preprint arXiv:2409.01995, 2024	1	2024
Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech H Wang, C Du, Y Guo, S Wang, X Chen, K Yu IEEE Spoken Language Technology Workshop 2024, 2024	1	2024
GlobalWalk: Learning Global-aware Node Embeddings via Biased Sampling Z Xue, Z Guo, Y Guo arXiv preprint arXiv:2201.09882, 2022	1*	2022
Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective H Wang, H Wang, Y Guo, Z Li, C Du, X Chen, K Yu arXiv preprint arXiv:2412.17048, 2024		2024
Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding B Li, H Wang, S Zhang, Y Guo, K Yu arXiv preprint arXiv:2410.21951, 2024		2024
LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec Y Guo, Z Li, C Du, H Wang, X Chen, K Yu arXiv preprint arXiv:2410.15764, 2024		2024

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs