Ppg-based singing voice conversion with adversarial representation learning
Singing voice conversion (SVC) aims to convert the voice of one singer to that of other
singers while kee** the singing content and melody. On top of recent voice conversion …
singers while kee** the singing content and melody. On top of recent voice conversion …
Tts-guided training for accent conversion without parallel data
Accent Conversion (AC) seeks to change the accent of speech from one (source) to another
(target) while preserving the speech content and speaker identity. However, many existing …
(target) while preserving the speech content and speaker identity. However, many existing …
Towards zero-shot multi-speaker multi-accent text-to-speech synthesis
This letter presents a framework towards multi-accent neural text-to-speech synthesis for
zero-shot multi-speaker, which employs an encoder-decoder architecture and an accent …
zero-shot multi-speaker, which employs an encoder-decoder architecture and an accent …
Convert and speak: Zero-shot accent conversion with minimum supervision
H Xue, X Peng, Y Lu - ACM Multimedia 2024, 2024 - openreview.net
Low resource of parallel data is the key challenge of accent conversion (AC) problem in
which both the pronunciation units and prosody pattern need to be converted. We propose a …
which both the pronunciation units and prosody pattern need to be converted. We propose a …
Zero-shot multi-speaker accent TTS with limited accent data
In this paper, we present a multi-speaker accent speech synthesis framework. It can
generate accented speech of unseen speakers using only a limited amount of accent …
generate accented speech of unseen speakers using only a limited amount of accent …
Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision
Low resource of parallel data is the key challenge of accent conversion (AC) problem in
which both the pronunciation units and prosody pattern need to be converted. We propose a …
which both the pronunciation units and prosody pattern need to be converted. We propose a …
Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows
Regional accents of the same language affect not only how words are pronounced (ie,
phonetic content), but also impact prosodic aspects of speech such as speaking rate and …
phonetic content), but also impact prosodic aspects of speech such as speaking rate and …
[HTML][HTML] Advancements in End-to-End Audio Style Transformation: A Differentiable Approach for Voice Conversion and Musical Style Transfer
S Aggarwal, S Uttam, S Garg, S Garg, K Jain… - AI, 2025 - mdpi.com
Introduction: This study introduces a fully differentiable, end-to-end audio transformation
network designed to overcome these limitations by operating directly on acoustic features …
network designed to overcome these limitations by operating directly on acoustic features …
Foreign Accent Conversion using Concentrated Attention
X Zang, F **e, F Weng - 2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org
Foreign accent conversion is an important and challenging problem due to significant
differences in the manner of articulation and the speech prosody of different regional …
differences in the manner of articulation and the speech prosody of different regional …
Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion
Accent conversion (AC) aims to alter the accent of spoken language while preserving the
original content and speaker characteristics. While any accent can be selected as a target …
original content and speaker characteristics. While any accent can be selected as a target …