StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching

J Yao, Y Yan, Y Pan, Z Ning, J Ye, H Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org
Zero-shot voice conversion (VC) aims to transfer the timbre from the source speaker to an
arbitrary unseen speaker while preserving the original linguistic content. Despite recent …

CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching

Y Pan, Y Yang, J Yao, J Ye, H Zhou, L Ma… - arxiv preprint arxiv …, 2024 - arxiv.org
Zero-shot voice conversion (VC) aims to transform the timbre of a source speaker into any
previously unseen target speaker, while preserving the original linguistic content. Despite …