Glottal inverse filtering analysis of human voice production—A review of estimation and parameterization methods of the glottal excitation and their applications

P Alku - Sadhana, 2011 - Springer
Glottal inverse filtering (GIF) refers to methods of estimating the source of voiced speech, the
glottal volume velocity waveform. GIF is based on the idea of inversion, in which the effects …

[BOOK][B] Linear and nonlinear inverse problems with practical applications

JL Mueller, S Siltanen - 2012 - SIAM
Inverse problems arise from the need to interpret indirect and incomplete measurements. As
an area of contemporary mathematics, the field of inverse problems is strongly driven by …

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org
This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

[PDF][PDF] Concatenative speech synthesis: A review

RA Khan, JS Chitode - International Journal of Computer Applications, 2016 - Citeseer
The primary objective of this paper is to provide an overview of existing Concatenative Text-
To-Speech synthesis techniques. Concatenative speech synthesis can be broadly …

Evaluation of speaker verification security and detection of HMM-based synthetic speech

PL De Leon, M Pucher, J Yamagishi… - … on Audio, Speech …, 2012 - ieeexplore.ieee.org
In this paper, we evaluate the vulnerability of speaker verification (SV) systems to synthetic
speech. The SV systems are based on either the Gaussian mixture model–universal …

[HTML][HTML] A comparison of data augmentation methods in voice pathology detection

F Javanmardi, SR Kadiri, P Alku - Computer Speech & Language, 2024 - Elsevier
To distinguish pathological voices from healthy voices, automatic voice pathology detection
systems can be built using machine learning (ML) and deep learning (DL) techniques. To …

Quasi closed phase glottal inverse filtering analysis with weighted linear prediction

M Airaksinen, T Raitio, B Story… - IEEE/ACM Transactions …, 2013 - ieeexplore.ieee.org
This study presents a new glottal inverse filtering (GIF) technique based on closed phase
analysis over multiple fundamental periods. The proposed quasi closed phase (QCP) …

Glottal source processing: From analysis to applications

T Drugman, P Alku, A Alwan… - Computer Speech & …, 2014 - Elsevier
The great majority of current voice technology applications rely on acoustic features, such as
the widely used MFCC or LP parameters, which characterize the vocal tract response …

Harmonics plus noise model based vocoder for statistical parametric speech synthesis

D Erro, I Sainz, E Navas… - IEEE Journal of Selected …, 2013 - ieeexplore.ieee.org
This article explores the potential of the harmonics plus noise model of speech in the
development of a high-quality vocoder applicable in statistical frameworks, particularly in …