An overview of voice conversion and its challenges: From statistical modeling to deep learning
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while kee** the linguistic …
conversion, we change the speaker identity from one to another, while kee** the linguistic …
Spoofing and countermeasures for speaker verification: A survey
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …
the technology can be susceptible to malicious spoofing attacks. The research community …
Emotional voice conversion: Theory, databases and ESD
In this paper, we first provide a review of the state-of-the-art emotional voice conversion
research, and the existing emotional speech databases. We then motivate the development …
research, and the existing emotional speech databases. We then motivate the development …
Voice conversion from non-parallel corpora using variational auto-encoder
We propose a flexible framework for spectral conversion (SC) that facilitates training with
unaligned corpora. Many SC frameworks require parallel corpora, phonetic alignments, or …
unaligned corpora. Many SC frameworks require parallel corpora, phonetic alignments, or …
An overview of voice conversion systems
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends
Hidden Markov models (HMMs) and Gaussian mixture models (GMMs) are the two most
common types of acoustic models used in statistical parametric approaches for generating …
common types of acoustic models used in statistical parametric approaches for generating …
[PDF][PDF] The Voice Conversion Challenge 2016.
This paper describes the Voice Conversion Challenge 2016 devised by the authors to better
understand different voice conversion (VC) techniques by comparing their performance on a …
understand different voice conversion (VC) techniques by comparing their performance on a …
Exemplar-based sparse representation with residual compensation for voice conversion
We propose a nonparametric framework for voice conversion, that is, exemplar-based
sparse representation with residual compensation. In this framework, a spectrogram is …
sparse representation with residual compensation. In this framework, a spectrogram is …