Synthesis and expressive transformation of singing voice

L Ardaillon - 2017 - hal.science
This thesis aimed at conducting research on the synthesis and expressive transformations of
the singing voice, towards the development of a high-quality synthesizer that can generate a …

Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity

T Nakano, K Yoshii, M Goto - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
This paper presents a vocal timbre analysis method based on topic modeling using latent
Dirichlet allocation (LDA). Although many works have focused on analyzing characteristics …

[PDF][PDF] Musical Note Estimation for F0 Trajectories of Singing Voices Based on a Bayesian Semi-Beat-Synchronous HMM.

R Nishikimi, E Nakamura, K Itoyama, K Yoshii - ISMIR, 2016 - m.mr-pc.org
This paper presents a statistical method that estimates a sequence of discrete musical notes
from a temporal trajectory of vocal F0s. Since considerable effort has been devoted to …

Many-to-many Singing Performance Style Transfer on Pitch and Energy Contours

YT Hsu, JY Wang, JSR Jang - IEEE Signal Processing Letters, 2024 - ieeexplore.ieee.org
Singing voice conversion (SVC) aims to convert the singer identity of a singing voice to that
of another singer. However, most existing SVC systems only perform the conversion of …

Expressive singing synthesis using local style token and dual-path pitch encoder

J Lee, HS Choi, K Lee - arxiv preprint arxiv:2204.03249, 2022 - arxiv.org
This paper proposes a controllable singing voice synthesis system capable of generating
expressive singing voice with two novel methodologies. First, a local style token module …

Sequential generation of singing f0 contours from musical note sequences based on wavenet

Y Wada, R Nishikimi, E Nakamura… - 2018 Asia-Pacific …, 2018 - ieeexplore.ieee.org
This paper describes a method that can generate a continuous F0 contour of a singing voice
from a monophonic sequence of musical notes (musical score) by using a deep neural …

VAE-SPACE: Deep generative model of voice fundamental frequency contours

K Tanaka, H Kameoka… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Modeling the speech generation process can provide flexible and interpretable ways to
generate intended synthetic speech. In this paper, we present a deep generative model of …

Mixture of Gaussian process experts for predicting sung melodic contour with expressive dynamic fluctuations

Y Ohishi, D Mochihashi, H Kameoka… - … on Acoustics, Speech …, 2014 - ieeexplore.ieee.org
We present a generative model for predicting the sung melodic contour, ie, F 0 contour, with
expressive dynamic fluctuations, such as vibrato and portamento, for a given musical score …

Transferring vocal expression of f0 contour using singing voice synthesizer

Y Ikemiya, K Itoyama, HG Okuno - … of Applied Intelligent Systems, IEA/AIE …, 2014 - Springer
A system for transferring vocal expressions separately from singing voices with
accompaniment to singing voice synthesizers is described. The expressions appear as …

Parametric model of spectral envelope to synthesize realistic intensity variations in singing voice

E Molina, I Barbancho, AM Barbancho… - … on Acoustics, Speech …, 2014 - ieeexplore.ieee.org
In this paper, we propose a method to synthesize the natural variations of spectral envelope
as intensity varies in singing voice. To this end, we propose a parametric model of spectral …