An overview of voice conversion systems
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
[PDF][PDF] Classification-based detection of glottal closure instants from speech signals
In this paper a classification-based method for the automatic detection of glottal closure
instants (GCIs) from the speech signal is proposed. Peaks in the speech waveforms are …
instants (GCIs) from the speech signal is proposed. Peaks in the speech waveforms are …
[PDF][PDF] Voice conversion: A critical survey
Voice conversion is an emergent problem in voice and speech processing with increasing
commercial interest, due to applications such as Speech-to-Speech Translation (SST) and …
commercial interest, due to applications such as Speech-to-Speech Translation (SST) and …
On the detection of pitch marks using a robust multi-phase algorithm
A large number of methods for identifying glottal closure instants (GCIs) in voiced speech
have been proposed in recent years. In this paper, we propose to take advantage of both …
have been proposed in recent years. In this paper, we propose to take advantage of both …
[HTML][HTML] Intra-lingual and cross-lingual voice conversion using harmonic plus stochastic models
DE Eslava - 2008 - dialnet.unirioja.es
Dentro de las tecnologías del habla, la conversión de voz consiste en transformar la voz de
un hablante, llamado hablante origen, de tal modo que los oyentes la perciban como si …
un hablante, llamado hablante origen, de tal modo que los oyentes la perciban como si …
Pitch transformation in neural network based voice conversion
In voice conversion task, prosody conversion especially pitch conversion is a very
challenging research topic because of the discontinuity property of pitch. Conventionally …
challenging research topic because of the discontinuity property of pitch. Conventionally …
[PDF][PDF] Glottal Closure Instants Detection from Speech Signal by Deep Features Extracted from Raw Speech and Linear Prediction Residual.
Glottal closure instants (GCI) also called as instants of significant excitation occur during
abrupt closure of vocal folds is a well-studied problem for its many potential applications in …
abrupt closure of vocal folds is a well-studied problem for its many potential applications in …
Speaker intonation adaptation for transforming text-to-speech synthesis speaker identity
MSE Langarani, J Van Santen - 2015 IEEE Workshop on …, 2015 - ieeexplore.ieee.org
In this study, we propose a new intonation adaptation method to transform the perceived
identity of a Text-To-Speech system to that of a target speaker with a small amount of …
identity of a Text-To-Speech system to that of a target speaker with a small amount of …
[PDF][PDF] Studies on spectral modification in voice transformation
BP Nguyen - 2009 - dspace.jaist.ac.jp
This dissertation aims to propose spectral modelings and spectral modification algorithms to
improve the quality of modified speech in voice transformation. Voice transformation is a …
improve the quality of modified speech in voice transformation. Voice transformation is a …
DBLSTM-based multi-task learning for pitch transformation in voice conversion
While both spectral and prosody transformation are important for voice conversion (VC),
traditional methods have focused on the conversion of spectral features with less emphasis …
traditional methods have focused on the conversion of spectral features with less emphasis …