PPG-based singing voice conversion with adversarial representation learning

Z Li, B Tang, X Yin, Y Wan, L Xu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Singing voice conversion (SVC) aims to convert the voice of one singer to that of other
singers while keeping the singing content and melody. On top of recent voice conversion …

TTS-guided training for accent conversion without parallel data

Y Zhou, Z Wu, M Zhang, X Tian… - IEEE Signal Processing …, 2023 - ieeexplore.ieee.org
Accent Conversion (AC) seeks to change the accent of speech from one (source) to another
(target) while preserving the speech content and speaker identity. However, many existing …

Towards zero-shot multi-speaker multi-accent text-to-speech synthesis

M Zhang, X Zhou, Z Wu, H Li - IEEE Signal Processing Letters, 2023 - ieeexplore.ieee.org
This letter presents a framework towards multi-accent neural text-to-speech synthesis for
zero-shot multi-speaker, which employs an encoder-decoder architecture and an accent …

Convert and speak: Zero-shot accent conversion with minimum supervision

H Xue, X Peng, Y Lu - ACM Multimedia 2024, 2024 - openreview.net
Low resource of parallel data is the key challenge of accent conversion (AC) problem in
which both the pronunciation units and prosody pattern need to be converted. We propose a …

Zero-shot multi-speaker accent TTS with limited accent data

M Zhang, Y Zhou, Z Wu, H Li - 2023 Asia Pacific Signal and …, 2023 - ieeexplore.ieee.org
In this paper, we present a multi-speaker accent speech synthesis framework. It can
generate accented speech of unseen speakers using only a limited amount of accent …

Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision

Z Jia, H Xue, X Peng, Y Lu - Proceedings of the 32nd ACM International …, 2024 - dl.acm.org
Low resource of parallel data is the key challenge of accent conversion (AC) problem in
which both the pronunciation units and prosody pattern need to be converted. We propose a …

Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows

A Ezzerg, T Merritt, K Yanagisawa… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
Regional accents of the same language affect not only how words are pronounced (i.e.,
phonetic content), but also impact prosodic aspects of speech such as speaking rate and …

Advancements in End-to-End Audio Style Transformation: A Differentiable Approach for Voice Conversion and Musical Style Transfer

S Aggarwal, S Uttam, S Garg, S Garg, K Jain… - AI, 2025 - mdpi.com
Introduction: This study introduces a fully differentiable, end-to-end audio transformation
network designed to overcome these limitations by operating directly on acoustic features …

Foreign Accent Conversion using Concentrated Attention

X Zang, F Xie, F Weng - 2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org
Foreign accent conversion is an important and challenging problem due to significant
differences in the manner of articulation and the speech prosody of different regional …

Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion

Q Bai, S Wang, Z Liu, M Zhang, W Rao… - 2024 IEEE 14th …, 2024 - ieeexplore.ieee.org
Accent conversion (AC) aims to alter the accent of spoken language while preserving the
original content and speaker characteristics. While any accent can be selected as a target …