An overview of voice conversion systems

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

Asynchronous FIR filters: towards a new digital processing chain

F Aeschlimann, E Allier, L Fesquet… - … Circuits and Systems …, 2004 - ieeexplore.ieee.org
This paper is a contribution to the definition of a new kind of digital signal processing chain.
It is focused on Finite-Impulse-Response filtering (FIR) applied to irregularly sampled …

Evaluation of expressive speech synthesis with voice conversion and copy resynthesis techniques

O Turk, M Schroder - IEEE Transactions on Audio, Speech, and …, 2010 - ieeexplore.ieee.org
Generating expressive synthetic voices requires carefully designed databases that contain
sufficient amount of expressive speech material. This paper investigates voice conversion …

Voice conversion based on non-negative matrix factorization using phoneme-categorized dictionary

R Aihara, T Nakashika, T Takiguchi… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
We present in this paper an exemplar-based voice conversion (VC) method using a
phoneme-categorized dictionary. Sparse representation-based VC using Non-negative …

[PDF][PDF] Voice Conversion Using GMM with Enhanced Global Variance.

H Benisty, D Malah - INTERSPEECH, 2011 - isca-archive.org
The goal of voice conversion is to transform a sentence said by one speaker, to sound as if
another speaker had said it. The classical conversion based on a Gaussian Mixture Model …

[PDF][PDF] Investigating the role of phoneme-level modifications in emotional speech resynthesis.

M Bulut, C Busso, S Yildirim, A Kazemzadeh, CM Lee… - Interspeech, 2005 - ecs.utdallas.edu
Recent studies in our lab show that emotions in speech are manifested as, besides supra-
segmental trends, distinct variations in phoneme-level prosodic and spectral parameters. In …

Emotion conversion based on prosodic unit selection

D Erro, E Navas, I Hernáez… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Voice conversion has been traditionally focused on spectrum. Current systems lack a solid
prosody conversion method suitable for different speaking styles. Recently, the unit selection …

[HTML][HTML] Intra-lingual and cross-lingual voice conversion using harmonic plus stochastic models

DE Eslava - 2008 - dialnet.unirioja.es
Dentro de las tecnologías del habla, la conversión de voz consiste en transformar la voz de
un hablante, llamado hablante origen, de tal modo que los oyentes la perciban como si …

Reconstruction of normal speech from whispered speech based on RBF neural network

Z Tao, XD Tan, T Han, JH Gu, YS Xu… - 2010 Third …, 2010 - ieeexplore.ieee.org
Restriction of normal speech from Chinese whispered speech based on radial basis function
neural network (RBF NN) is proposed in this paper. Firstly, capture the nonlinear map** of …

Voice conversion based on simultaneous modelling of spectrum and F0

K Yutani, Y Uto, Y Nankaku, A Lee… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
This paper proposes a simultaneous modeling of spectrum and F0 for voice conversion
based on MSD (multi-space probability distribution) models. As a conventional technique, a …