StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching
Zero-shot voice conversion (VC) aims to transfer the timbre from the source speaker to an
arbitrary unseen speaker while preserving the original linguistic content. Despite recent …
arbitrary unseen speaker while preserving the original linguistic content. Despite recent …
CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching
Zero-shot voice conversion (VC) aims to transform the timbre of a source speaker into any
previously unseen target speaker, while preserving the original linguistic content. Despite …
previously unseen target speaker, while preserving the original linguistic content. Despite …