Parametric voice conversion based on bilinear frequency war** plus amplitude scaling
Voice conversion methods based on frequency war** followed by amplitude scaling have
been recently proposed. These methods modify the frequency axis of the source spectrum in …
been recently proposed. These methods modify the frequency axis of the source spectrum in …
Alaryngeal speech enhancement based on one-to-many eigenvoice conversion
H Doi, T Toda, K Nakamura… - … on Audio, Speech …, 2013 - ieeexplore.ieee.org
In this paper, we present novel speaking-aid systems based on one-to-many eigenvoice
conversion (EVC) to enhance three types of alaryngeal speech: esophageal speech …
conversion (EVC) to enhance three types of alaryngeal speech: esophageal speech …
On the use of i-vectors and average voice model for voice conversion without parallel data
Recently, deep and/or recurrent neural networks (DNNs/RNNs) have been employed for
voice conversion, and have significantly improved the performance of converted speech …
voice conversion, and have significantly improved the performance of converted speech …
Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system
The voice quality (identity) of singing voices is usually fixed in each singer. To overcome this
limitation and enable singers to freely change their voice quality using signal-processing …
limitation and enable singers to freely change their voice quality using signal-processing …
Many-to-many and completely parallel-data-free voice conversion based on eigenspace dnn
T Hashimoto, D Saito… - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
Media conversion of image, text, speech, etc., generally requires a large amount of parallel
data for training a conversion model. Recently, methods for training the model using no or a …
data for training a conversion model. Recently, methods for training the model using no or a …
Personalized spectral and prosody conversion using frame-based codeword distribution and adaptive CRF
This study proposes a voice conversion-based approach to personalized text-to-speech
(TTS) synthesis. The conversion functions, trained using a small parallel corpus with source …
(TTS) synthesis. The conversion functions, trained using a small parallel corpus with source …
[PDF][PDF] Parallel-Data-Free Many-to-Many Voice Conversion Based on DNN Integrated with Eigenspace Using a Non-Parallel Speech Corpus.
T Hashimoto, H Uchida, D Saito, N Minematsu - INTERSPEECH, 2017 - isca-archive.org
This paper proposes a novel approach to parallel-data-free and many-to-many voice
conversion (VC). As 1-to-1 conversion has less flexibility, researchers focus on many-to …
conversion (VC). As 1-to-1 conversion has less flexibility, researchers focus on many-to …
Arbitrary speaker conversion based on speaker space bases constructed by deep neural networks
T Hashimoto, D Saito… - 2016 Asia-Pacific Signal …, 2016 - ieeexplore.ieee.org
This paper proposes a novel approach to construct a Deep Neural Network (DNN) based
voice conversion (VC) system, where DNNs are integrated with speaker eigenspace. The …
voice conversion (VC) system, where DNNs are integrated with speaker eigenspace. The …
Automatic variation of the degree of articulation in new HMM-based voices
This paper focuses on the automatic modification of the degree of articulation (hypo and
hyperarticulation) of an existing standard neutral voice in the framework of HMM-based …
hyperarticulation) of an existing standard neutral voice in the framework of HMM-based …
Shape-adaptive image compression using lossy shape coding, SA-prediction, and SA-deblocking
LA Chen, JJ Ding, YC Lee - 2016 Asia-Pacific Signal and …, 2016 - ieeexplore.ieee.org
As the annoying blocking or ghost artifacts tend to appear in the conventional compression
approaches either in the JPEG or JPEG2000 standards at low bitrate, the concept of the …
approaches either in the JPEG or JPEG2000 standards at low bitrate, the concept of the …