An overview of voice conversion and its challenges: From statistical modeling to deep learning

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while kee** the linguistic …

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

Emotional voice conversion: Theory, databases and ESD

K Zhou, B Sisman, R Liu, H Li - Speech Communication, 2022 - Elsevier
In this paper, we first provide a review of the state-of-the-art emotional voice conversion
research, and the existing emotional speech databases. We then motivate the development …

Voice conversion from non-parallel corpora using variational auto-encoder

CC Hsu, HT Hwang, YC Wu, Y Tsao… - 2016 Asia-Pacific …, 2016 - ieeexplore.ieee.org
We propose a flexible framework for spectral conversion (SC) that facilitates training with
unaligned corpora. Many SC frameworks require parallel corpora, phonetic alignments, or …

An overview of voice conversion systems

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends

ZH Ling, SY Kang, H Zen, A Senior… - IEEE Signal …, 2015 - ieeexplore.ieee.org
Hidden Markov models (HMMs) and Gaussian mixture models (GMMs) are the two most
common types of acoustic models used in statistical parametric approaches for generating …

[PDF][PDF] The Voice Conversion Challenge 2016.

T Toda, LH Chen, D Saito, F Villavicencio, M Wester… - Interspeech, 2016 - isca-archive.org
This paper describes the Voice Conversion Challenge 2016 devised by the authors to better
understand different voice conversion (VC) techniques by comparing their performance on a …

Exemplar-based sparse representation with residual compensation for voice conversion

Z Wu, T Virtanen, ES Chng, H Li - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
We propose a nonparametric framework for voice conversion, that is, exemplar-based
sparse representation with residual compensation. In this framework, a spectrogram is …