Beyond correlation: acoustic transformation methods for the experimental study of emotional voice and speech

P Arias, L Rachman, M Liuni… - Emotion Review, 2021 - journals.sagepub.com
While acoustic analysis methods have become a commodity in voice emotion research,
experiments that attempt not only to describe but to computationally manipulate expressive …

A uniform phase representation for the harmonic model in speech synthesis applications

G Degottex, D Erro - EURASIP Journal on Audio, Speech, and Music …, 2014 - Springer
Feature-based vocoders, eg, STRAIGHT, offer a way to manipulate the perceived
characteristics of the speech signal in speech transformation and synthesis. For the …

Unsupervised music source separation using differentiable parametric source models

K Schulze-Forster, G Richard, L Kelley… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
Supervised deep learning approaches to underdetermined audio source separation achieve
state-of-the-art performance but require a dataset of mixtures along with their corresponding …

Analysis and synthesis of speech using an adaptive full-band harmonic model

G Degottex, Y Stylianou - IEEE Transactions on Audio, Speech …, 2013 - ieeexplore.ieee.org
Voice models often use frequency limits to split the speech spectrum into two or more
voiced/unvoiced frequency bands. However, from the voice production, the amplitude …

Neural vocoding for singing and speaking voices with the multi-band excited wavenet

A Roebel, F Bous - Information, 2022 - mdpi.com
The use of the mel spectrogram as a signal parameterization for voice generation is quite
recent and linked to the development of neural vocoders. These are deep neural networks …

A bottleneck auto-encoder for f0 transformations on speech and singing voice

F Bous, A Roebel - Information, 2022 - mdpi.com
In this publication, we present a deep learning-based method to transform the f 0 in speech
and singing voice recordings. f 0 transformation is performed by training an auto-encoder on …

A spectral glottal flow model for source-filter separation of speech

O Perrotin, I McLoughlin - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
The estimation of glottal flow from a speech waveform is an essential technique used in
speech analysis and parameterisation. Significant research effort has been addressed at …

Glottal spectral separation for speech synthesis

JP Cabral, K Richmond, J Yamagishi… - IEEE Journal of …, 2014 - ieeexplore.ieee.org
This paper proposes an analysis method to separate the glottal source and vocal tract
components of speech that is called Glottal Spectral Separation (GSS). This method can …

A log domain pulse model for parametric speech synthesis

G Degottex, P Lanchantin… - IEEE/ACM Transactions on …, 2017 - ieeexplore.ieee.org
Most of the degradation in current Statistical Parametric Speech Synthesis (SPSS) results
from the form of the vocoder. One of the main causes of degradation is the reconstruction of …

Glottal flow synthesis for whisper-to-speech conversion

O Perrotin, IV McLoughlin - IEEE/ACM Transactions on Audio …, 2020 - ieeexplore.ieee.org
Whisper-to-speech conversion is motivated by laryngeal disorders, in which malfunction of
the vocal folds leads to loss of voicing. Many patients with laryngeal disorders can still …