- Academic Search

Voice conversion from unaligned corpora using variational autoencoding Wasserstein generative adversarial networks

CC Hsu, HT Hwang, YC Wu, Y Tsao… - arXiv preprint arXiv …, 2017 - arxiv.org

Building a voice conversion (VC) system from non-parallel speech corpora is challenging
but highly valuable in real application scenarios. In most situations, the source and the target …

Cited by 465 · All 11 versions

[PDF] arxiv.org

Voice conversion from non-parallel corpora using variational auto-encoder

CC Hsu, HT Hwang, YC Wu, Y Tsao… - 2016 Asia-Pacific …, 2016 - ieeexplore.ieee.org

We propose a flexible framework for spectral conversion (SC) that facilitates training with
unaligned corpora. Many SC frameworks require parallel corpora, phonetic alignments, or …

Cited by 378 · All 12 versions

[PDF] arxiv.org

High-quality nonparallel voice conversion based on cycle-consistent adversarial network

F Fang, J Yamagishi, I Echizen… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Although voice conversion (VC) algorithms have achieved remarkable success along with
the development of machine learning, superior performance is still difficult to achieve when …

Cited by 170 · All 9 versions

[PDF] arxiv.org

Non-parallel voice conversion with cyclic variational autoencoder

PL Tobing, YC Wu, T Hayashi, K Kobayashi… - arXiv preprint arXiv …, 2019 - arxiv.org

In this paper, we present a novel technique for a non-parallel voice conversion (VC) with the
use of cyclic variational autoencoder (CycleVAE)-based spectral modeling. In a variational …

Cited by 90 · All 4 versions

[PDF] usenix.org

Catch you and I can: Revealing source voiceprint against voice conversion

J Deng, Y Chen, Y Zhong, Q Miao, X Gong… - 32nd USENIX Security …, 2023 - usenix.org

Voice conversion (VC) techniques can be abused by malicious parties to transform their
audios to sound like a target speaker, making it hard for a human being or a speaker …

Cited by 14 · All 10 versions

[PDF] ieee.org

Non-parallel training in voice conversion using an adaptive restricted Boltzmann machine

T Nakashika, T Takiguchi… - IEEE/ACM Transactions …, 2016 - ieeexplore.ieee.org

In this paper, we present a voice conversion (VC) method that does not use any parallel data
while training the model. VC is a technique where only speaker-specific information in …

Cited by 80 · All 6 versions

[PDF] isca-archive.org

[PDF] One-Shot Voice Conversion with Disentangled Representations by Leveraging Phonetic Posteriorgrams.

SH Mohammadi, T Kim - Interspeech, 2019 - isca-archive.org

We propose voice conversion model from arbitrary source speaker to arbitrary target
speaker with disentangled representations. Voice conversion is a task to convert the voice of …

Cited by 22 · All 5 versions

[PDF] arxiv.org

Investigation of using disentangled and interpretable representations for one-shot cross-lingual voice conversion

SH Mohammadi, T Kim - arXiv preprint arXiv:1808.05294, 2018 - arxiv.org

We study the problem of cross-lingual voice conversion in non-parallel speech corpora and
one-shot learning setting. Most prior work require either parallel speech corpora or enough …

Cited by 19 · All 5 versions

[PDF] cmu.edu

Speech synthesis from found data

P Baljekar - 2018 - kilthub.cmu.edu

Text-to-speech synthesis (TTS) has progressed to such a stage that given a large, clean,
phonetically balanced dataset from a single speaker, it can produce intelligible, almost …

Cited by 16 · All 5 versions

[PDF] ieee.org

Many-to-many unsupervised speech conversion from nonparallel corpora

YK Lee, HW Kim, JG Park - IEEE Access, 2021 - ieeexplore.ieee.org

We address a nonparallel data-driven many-to-many speech modeling and multimodal style
conversion method. In this work, we train a speech conversion model for multiple domains …

Cited by 10 · All 2 versions

Non-parallel training for voice conversion based on adaptation method
