An overview of voice conversion and its challenges: From statistical modeling to deep learning
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while kee** the linguistic …
conversion, we change the speaker identity from one to another, while kee** the linguistic …
An overview of voice conversion systems
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
A regression approach to speech enhancement based on deep neural networks
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction
techniques, we propose a supervised method to enhance speech by means of finding a …
techniques, we propose a supervised method to enhance speech by means of finding a …
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
In this paper, we describe a novel spectral conversion method for voice conversion (VC). A
Gaussian mixture model (GMM) of the joint probability density of source and target features …
Gaussian mixture model (GMM) of the joint probability density of source and target features …
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder
Non-parallel many-to-many voice conversion remains an interesting but challenging speech
processing task. Many style-transfer-inspired methods such as generative adversarial …
processing task. Many style-transfer-inspired methods such as generative adversarial …
[PDF][PDF] WaveNet Vocoder with Limited Training Data for Voice Conversion.
This paper investigates the approaches of building WaveNet vocoders with limited training
data for voice conversion (VC). Current VC systems using statistical acoustic models always …
data for voice conversion (VC). Current VC systems using statistical acoustic models always …
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
This paper describes a novel parameter generation algorithm for an HMM-based speech
synthesis technique. The conventional algorithm generates a parameter trajectory of static …
synthesis technique. The conventional algorithm generates a parameter trajectory of static …
Voice conversion using artificial neural networks
S Desai, EV Raghavendra… - … , Speech and Signal …, 2009 - ieeexplore.ieee.org
In this paper, we propose to use artificial neural networks (ANN) for voice conversion. We
have exploited the map** abilities of ANN to perform map** of spectral features of a …
have exploited the map** abilities of ANN to perform map** of spectral features of a …
Prosody conversion from neutral speech to emotional speech
J Tao, Y Kang, A Li - IEEE transactions on Audio, Speech, and …, 2006 - ieeexplore.ieee.org
Emotion is an important element in expressive speech synthesis. Unlike traditional discrete
emotion simulations, this paper attempts to synthesize emotional speech by using" strong"," …
emotion simulations, this paper attempts to synthesize emotional speech by using" strong"," …
Voice transformation: a survey
Y Stylianou - 2009 IEEE International Conference on Acoustics …, 2009 - ieeexplore.ieee.org
Voice transformation refers to the various modifications one may apply to the sound
produced by a person, speaking or singing. Voice transformation is usually seen as an add …
produced by a person, speaking or singing. Voice transformation is usually seen as an add …