- Academic Search

Z Li, B Tang, X Yin, Y Wan, L Xu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Singing voice conversion (SVC) aims to convert the voice of one singer to that of other
singers while kee** the singing content and melody. On top of recent voice conversion …

保存引用被引用次数：41 相关文章所有 3 个版本

[Free GPT-4]

[PDF] ieee.org

Tts-guided training for accent conversion without parallel data

Y Zhou, Z Wu, M Zhang, X Tian… - IEEE Signal Processing …, 2023 - ieeexplore.ieee.org

Accent Conversion (AC) seeks to change the accent of speech from one (source) to another
(target) while preserving the speech content and speaker identity. However, many existing …

保存引用被引用次数：10 相关文章所有 3 个版本

Towards zero-shot multi-speaker multi-accent text-to-speech synthesis

M Zhang, X Zhou, Z Wu, H Li - IEEE Signal Processing Letters, 2023 - ieeexplore.ieee.org

This letter presents a framework towards multi-accent neural text-to-speech synthesis for
zero-shot multi-speaker, which employs an encoder-decoder architecture and an accent …

保存引用被引用次数：4 相关文章所有 2 个版本

[Free GPT-4]

[PDF] openreview.net

Convert and speak: Zero-shot accent conversion with minimum supervision

H Xue, X Peng, Y Lu - ACM Multimedia 2024, 2024 - openreview.net

Low resource of parallel data is the key challenge of accent conversion (AC) problem in
which both the pronunciation units and prosody pattern need to be converted. We propose a …

保存引用被引用次数：2 相关文章 HTML 版

Zero-shot multi-speaker accent TTS with limited accent data

M Zhang, Y Zhou, Z Wu, H Li - 2023 Asia Pacific Signal and …, 2023 - ieeexplore.ieee.org

In this paper, we present a multi-speaker accent speech synthesis framework. It can
generate accented speech of unseen speakers using only a limited amount of accent …

保存引用被引用次数：2 相关文章

[Free GPT-4]

[PDF] arxiv.org

Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision

Z Jia, H Xue, X Peng, Y Lu - Proceedings of the 32nd ACM International …, 2024 - dl.acm.org

Low resource of parallel data is the key challenge of accent conversion (AC) problem in
which both the pronunciation units and prosody pattern need to be converted. We propose a …

保存引用相关文章所有 3 个版本

[Free GPT-4]

[PDF] arxiv.org

Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows

A Ezzerg, T Merritt, K Yanagisawa… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

Regional accents of the same language affect not only how words are pronounced (ie,
phonetic content), but also impact prosodic aspects of speech such as speaking rate and …

保存引用被引用次数：4 相关文章所有 4 个版本

[Free GPT-4]

[HTML] mdpi.com

[HTML][HTML] Advancements in End-to-End Audio Style Transformation: A Differentiable Approach for Voice Conversion and Musical Style Transfer

S Aggarwal, S Uttam, S Garg, S Garg, K Jain… - AI, 2025 - mdpi.com

Introduction: This study introduces a fully differentiable, end-to-end audio transformation
network designed to overcome these limitations by operating directly on acoustic features …

保存引用相关文章所有 5 个版本网页快照

Foreign Accent Conversion using Concentrated Attention

X Zang, F **e, F Weng - 2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org

Foreign accent conversion is an important and challenging problem due to significant
differences in the manner of articulation and the speech prosody of different regional …

保存引用被引用次数：4 相关文章所有 2 个版本

Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion

Q Bai, S Wang, Z Liu, M Zhang, W Rao… - 2024 IEEE 14th …, 2024 - ieeexplore.ieee.org

Accent conversion (AC) aims to alter the accent of spoken language while preserving the
original content and speaker characteristics. While any accent can be selected as a target …

保存引用相关文章

创建快讯

引用

高级搜索

已保存到“我的图书馆”

Improving accent conversion with reference encoder and end-to-end text-to-speech

Ppg-based singing voice conversion with adversarial representation learning

Tts-guided training for accent conversion without parallel data

Towards zero-shot multi-speaker multi-accent text-to-speech synthesis

Convert and speak: Zero-shot accent conversion with minimum supervision

Zero-shot multi-speaker accent TTS with limited accent data

Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision

Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows

[HTML][HTML] Advancements in End-to-End Audio Style Transformation: A Differentiable Approach for Voice Conversion and Musical Style Transfer

Foreign Accent Conversion using Concentrated Attention

Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion