An overview of voice conversion and its challenges: From statistical modeling to deep learning

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while kee** the linguistic …

An overview of voice conversion systems

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

A regression approach to speech enhancement based on deep neural networks

Y Xu, J Du, LR Dai, CH Lee - IEEE/ACM transactions on audio …, 2014 - ieeexplore.ieee.org
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction
techniques, we propose a supervised method to enhance speech by means of finding a …

Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory

T Toda, AW Black, K Tokuda - IEEE Transactions on Audio …, 2007 - ieeexplore.ieee.org
In this paper, we describe a novel spectral conversion method for voice conversion (VC). A
Gaussian mixture model (GMM) of the joint probability density of source and target features …

F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder

K Qian, Z **, M Hasegawa-Johnson… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Non-parallel many-to-many voice conversion remains an interesting but challenging speech
processing task. Many style-transfer-inspired methods such as generative adversarial …

[PDF][PDF] WaveNet Vocoder with Limited Training Data for Voice Conversion.

LJ Liu, ZH Ling, Y Jiang, M Zhou, LR Dai - Interspeech, 2018 - isca-archive.org
This paper investigates the approaches of building WaveNet vocoders with limited training
data for voice conversion (VC). Current VC systems using statistical acoustic models always …

A speech parameter generation algorithm considering global variance for HMM-based speech synthesis

T Toda, K Tokuda - IEICE TRANSACTIONS on Information and …, 2007 - search.ieice.org
This paper describes a novel parameter generation algorithm for an HMM-based speech
synthesis technique. The conventional algorithm generates a parameter trajectory of static …

Voice conversion using artificial neural networks

S Desai, EV Raghavendra… - … , Speech and Signal …, 2009 - ieeexplore.ieee.org
In this paper, we propose to use artificial neural networks (ANN) for voice conversion. We
have exploited the map** abilities of ANN to perform map** of spectral features of a …

Prosody conversion from neutral speech to emotional speech

J Tao, Y Kang, A Li - IEEE transactions on Audio, Speech, and …, 2006 - ieeexplore.ieee.org
Emotion is an important element in expressive speech synthesis. Unlike traditional discrete
emotion simulations, this paper attempts to synthesize emotional speech by using" strong"," …

Voice transformation: a survey

Y Stylianou - 2009 IEEE International Conference on Acoustics …, 2009 - ieeexplore.ieee.org
Voice transformation refers to the various modifications one may apply to the sound
produced by a person, speaking or singing. Voice transformation is usually seen as an add …