Detection of glottal closure instants from speech signals: A quantitative review

T Drugman, M Thomas, J Gudnason… - … on Audio, Speech …, 2011 - ieeexplore.ieee.org
The pseudo-periodicity of voiced speech can be exploited in several speech processing
applications. This requires however that the precise locations of the glottal closure instants …

Empirical mode decomposition for adaptive AM-FM analysis of speech: A review

R Sharma, L Vignolo, G Schlotthauer… - Speech …, 2017 - Elsevier
This work reviews the advancements in the non-conventional analysis of speech signals,
particularly from an AM-FM analysis point of view. The benefits of such an analysis, as …

COVAREP—A collaborative voice analysis repository for speech technologies

G Degottex, J Kane, T Drugman… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org
Speech processing algorithms are often developed demonstrating improvements over the
state-of-the-art, but sometimes at the cost of high complexity. This makes algorithm …

Glottal source processing: From analysis to applications

T Drugman, P Alku, A Alwan… - Computer Speech & …, 2014 - Elsevier
The great majority of current voice technology applications rely on acoustic features, such as
the widely used MFCC or LP parameters, which characterize the vocal tract response …

Epoch extraction based on integrated linear prediction residual using plosion index

AP Prathosh, TV Ananthapadmanabha… - … on Audio, Speech …, 2013 - ieeexplore.ieee.org
Epoch is defined as the instant of significant excitation within a pitch period of voiced
speech. Epoch extraction continues to attract the interest of researchers because of its …

A comparative study of glottal source estimation techniques

T Drugman, B Bozkurt, T Dutoit - Computer Speech & Language, 2012 - Elsevier
Source-tract decomposition (or glottal flow estimation) is one of the basic problems of
speech processing. For this, several techniques have been proposed in the literature …

The deterministic plus stochastic model of the residual signal and its applications

T Drugman, T Dutoit - IEEE Transactions on Audio, Speech …, 2011 - ieeexplore.ieee.org
The modeling of speech production often relies on a source-filter approach. Although
methods parameterizing the filter have nowadays reached a certain maturity, there is still a …

Wavelet maxima dispersion for breathy to tense voice discrimination

J Kane, C Gobl - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org
This paper proposes a new parameter, the Maxima Dispersion Quotient (MDQ), for
differentiating breathy to tense voice. Maxima derived following wavelet decomposition are …

A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis

T Drugman, G Wilfart, T Dutoit - arXiv preprint arXiv:2001.00842, 2019 - arxiv.org
Speech generated by parametric synthesizers generally suffers from a typical buzziness,
similar to what was encountered in old LPC-like vocoders. In order to alleviate this problem …

Traditional machine learning for pitch detection

T Drugman, G Huybrechts, V Klimkov… - IEEE Signal …, 2018 - ieeexplore.ieee.org
Pitch detection is a fundamental problem in speech processing as F0 is used in a large
number of applications. Recent papers have proposed deep learning for robust pitch …