Google Scholar

A review on generative adversarial networks: Algorithms, theory, and applications

J Gui, Z Sun, Y Wen, D Tao, J Ye - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Generative adversarial networks (GANs) have recently become a hot research topic;
however, they have been studied since 2014, and a large number of algorithms have been …

Cited by 1275 - All 14 versions

[PDF] ieee.org

An overview of voice conversion and its challenges: From statistical modeling to deep learning

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org

Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …

Cited by 420 - All 9 versions

[PDF] mlr.press

AutoVC: Zero-shot voice style transfer with only autoencoder loss

K Qian, Y Zhang, S Chang, X Yang… - International …, 2019 - proceedings.mlr.press

Despite the progress in voice conversion, many-to-many voice conversion trained on non-
parallel data, as well as zero-shot voice conversion, remains under-explored. Deep style …

Cited by 596 - All 8 versions

[PDF] sciencedirect.com

Emotional voice conversion: Theory, databases and ESD

K Zhou, B Sisman, R Liu, H Li - Speech Communication, 2022 - Elsevier

In this paper, we first provide a review of the state-of-the-art emotional voice conversion
research, and the existing emotional speech databases. We then motivate the development …

Cited by 193 - All 7 versions

[HTML] mdpi.com

[HTML] A review of synthetic image data and its use in computer vision

K Man, J Chahl - Journal of Imaging, 2022 - mdpi.com

Development of computer vision algorithms using convolutional neural networks and deep
learning has necessitated ever greater amounts of annotated and labelled data to produce …

Cited by 84 - All 6 versions

[PDF] arxiv.org

VQMIVC: Vector quantization and mutual information-based unsupervised speech representation disentanglement for one-shot voice conversion

D Wang, L Deng, YT Yeung, X Chen, X Liu… - ar…

… mapping from source to
target speech without relying on parallel data. This is an important task, but it has been …

Cited by 356 - All 7 versions

[PDF] ntt.co.jp

CycleGAN-VC: Non-parallel voice conversion using cycle-consistent adversarial networks

T Kaneko, H Kameoka - 2018 26th European Signal …, 2018 - ieeexplore.ieee.org

We propose a non-parallel voice-conversion (VC) method that can learn a mapping from
source to target speech without relying on parallel data. The proposed method is particularly …

Cited by 368 - All 7 versions

[PDF] arxiv.org

One-shot voice conversion by separating speaker and content representations with instance normalization

J Chou, C Yeh, H Lee - arXiv preprint arXiv:1904.05742, 2019 - arxiv.org

Recently, voice conversion (VC) without parallel data has been successfully adapted to multi-
target scenario in which a single model is trained to convert the input voice to many different …

Cited by 290 - All 9 versions

Saved in My library

Parallel-data-free voice conversion using cycle-consistent adversarial networks

A review on generative adversarial networks: Algorithms, theory, and applications
